Munich,  Germany  (1/06).

7.  “A  fault-tolerant  one-way  quantum  computer,”  R.  Raussendorf,  Max  Planck  Institute  for
Quantum  Optics,  Garching,  Germany  (1/06). 

8.  “Applications  of  Quantum  Filtering  and  Feedback,”  H.  Mabuchi,  Workshop  on  Optimal 
Control  of  Quantum  Dynamics,  Ringberg,  Germany  (12/6/05). 

9.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  B.  Lev,  Physics 
Colloquium,  Williams  College,  Williamstown,  Massachusetts  (11/05).

10.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  B.  Lev,  Physics  Seminar, 
Yale  University,  New  Haven,  Connecticut  (11/05).

11.  “Feedback  Stabilization  of  Asymmetric  States  Under  Symmetric  Measurement,”  (poster)
A.  E.  Miller,  Hertz  Foundation  Retreat,  Pt.  Reyes,  CA  (10/30/05).

12.  “Real-time  quantum  measurement  and  feedback  control”  (series  of  two  lectures),  H. 
Mabuchi,  Summer  School  on  Quantum  Information  Science,  Kochi,  Japan  (9/1-10/2005). 

13.  “Magnetoelectrostatic  Ring  Traps  for  Neutral  Atoms,”  (poster)  A.  Hopkins,  Summer 
School  on  Quantum  Information  Science,  Kochi,  Japan  (9/1-10/2005). 

14.  “Feedback  Stabilization  of  Asymmetric  States  Under  Symmetric  Measurement,”  (poster)
A.  E.  Miller,  Summer  School  on  Quantum  Information  Science,  Kochi,  Japan  (9/1-10/2005).

15.  “Real-time  quantum  measurement  and  feedback  control”  (series  of  two  lectures),  H. 
Mabuchi,  Quantum  Control  Summer  School,  Pasadena,  CA  (8/8-14/2005). 

16.  “Progress  in  Cavity  QED  with  Single  Trapped  Atoms,”  R.  Miller,  CLEO-PR/IQEC  2005, 
Tokyo,  Japan  (7/15/05). 

17.  “Real-time  quantum  measurement  and  feedback  control”  (series  of  three  lectures),  H. 
Mabuchi,  Enrico  Fermi  Summer  School  on  Quantum  Computers,  Algorithms  and  Chaos,
Varenna,  Italy  (7/5-15/2005).

18.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  (poster)  B.  Lev,  Atomic 
Physics  Gordon  Research  Conference,  Tilton,  New  Hampshire  (6/05). 

19.  “Quantum  filtering  and  nonlinear  dynamics,”  H.  Mabuchi,  Perimeter  Institute,  Waterloo,
Canada  (5/11/05).

20.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  Ben  Lev,  Caltech 
Seminar  Day,  Pasadena,  CA  (5/05). 

21.  “Long-range  entanglement  in  3D  noisy  cluster  states,”  R.  Raussendorf,  Murray  Hill  (Bell 
Labs),  NJ  (5/05). 

22.  “Cavity  QED  with  single  atoms  and  photons,”  T.  Northup,  JILA,  Boulder,  CO  (4/22/05). 

23.  “Quantum  Dynamics  with  Single  Atoms  and  Photons,”  H.  J.  Kimble,  Colloquium,  Welsh 
Lecture,  University  of  Toronto,  Toronto,  Canada  (4/21/05). 

24.  “The  New  Science  of  Quantum  Information:  From  Quantum  Computers  to  Teleportation  of 
Quantum  States,”  H.  J.  Kimble,  Public  Lecture,  Welsh  Lecture,  University  of  Toronto, 
Toronto,  Canada  (4/21/05). 

25.  “Cavity  QED  with  Single  Atoms  and  Photons,”  H.  J.  Kimble,  Seminar,  Universitat 
Autonoma  de  Barcelona,  Barcelona,  Spain  (4/8/05). 

26.  “The  Quantum  Optics  Circus  —  Flying  Photons,  Acrobatic  Atoms,  and  Teleported 
Tuataras,”  H.  J.  Kimble,  Year  of  Physics  Series,  CosmoCaixa  Science  Museum,  Barcelona, 
Spain  (4/7/05). 

27.  “The  Atom  Chip,”  B.  Lev,  Everhart  Lecture,  Caltech,  Pasadena,  CA  (4/05).

28.  “Exploring  the  Jaynes-Cummings  ladder:  experiments  in  cavity  QED,”  T.  Northup,  Physics 
Colloquium,  U.S.  Air  Force  Academy,  Colorado  Springs,  Colorado  (3/31/05). 
29.  “Measurement  and  control  in  quantum  information  science,”  H.  Mabuchi,  APS  March
Meeting,  Los  Angeles,  CA  (3/23/05). 

30.  “Cavity  QED  with  Single  Atoms  and  Photons,”  H.  J.  Kimble,  2005  APS  March  Meeting 
(invited),  Los  Angeles,  California  (3/21/05). 

31.  “Quantum  control  and  nano-photonics  in  quantum  information  science,”  H.  Mabuchi,
Cornell  University,  Ithaca,  NY  (3/17/05). 

32.  “Measurement  and  control  in  quantum  information  science,”  H.  Mabuchi,  Quantum 
Information  Science  Gordon  Research  Conference,  Ventura,  CA  (3/3/05). 

33.  “Progress  Toward  a  Physical  Implementation  of  the  Dolinar  Receiver,”  JM  Geremia, 
NASA  Jet  Propulsion  Laboratory  DRDF  Seminar,  Pasadena,  CA  (3/05). 

34.  “Long-range  entanglement  in  3D  noisy  cluster  states,”  R.  Raussendorf,  Obergurgl,  Austria 
(3/05). 

35.  “Quantum  cellular  Automata  using  boundary,”  R.  Raussendorf,  APS  March  Meeting,  Los 
Angeles,  CA  (3/05). 

36.  “Measurement  and  control  in  quantum  information  science,”  H.  Mabuchi,  Institute  for 
Systems  Research  Distinguished  Lecture  Series,  College  Park,  MD  (2/23/05). 

37.  “Storage  time  of  a  quantum  memory  in  an  ensemble  of  cold  atoms,”  C.  W.  Chou,  5th 
Annual  Southwest  Quantum  Information  and  Technology  (SQuInT)  Workshop,  Tucson, 
Arizona  (2/20/05). 

38.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  Ben  Lev,  Southwest 
Quantum  Information  and  Technology  (SQuInT’05)  Annual  Meeting,  Tucson,  AZ
(2/17-20/2005).

39.  “Optimal  Discrimination  Between  Two  Optical  Coherent  States,”  JM  Geremia,  Southwest 
Quantum  Information  and  Technology  (SQuInT’05)  Annual  Meeting,  Tucson,  AZ
(2/17-20/2005).

40.  “Real-time  feedback  control  of  quantum  state  reduction,”  H.  Mabuchi,  AAAS  Annual 
Meeting,  Washington,  D.C.  (2/20/05). 

41.  “Cavity  QED  —  from  Purcell  and  Casimir  to  the  Era  of  Strong  Coupling  for  Single  Atoms 
and  Photons,”  H.  J.  Kimble,  Physics  Colloquium,  University  of  British  Columbia, 
Vancouver,  Canada  (2/10/05). 

42.  “Quantum  Feedback  Control  and  Precision  Measurement  with  Cold  Atoms,”  JM  Geremia, 
University  of  New  Mexico  Physics  Seminar  (invited),  Albuquerque,  NM  (2/05). 

43.  “Nonlinear  Dynamics  in  Single-Atom  Cavity  QED,”  B.  Lev,  Photonics  West  2005,  San 
Jose,  CA  (1/25/05). 

44.  “Quantum  Information  Science  Enabled  by  Quantum  Optics,”  H.  J.  Kimble,  Complexity, 
Entropy  and  the  Physics  of  Information  (CEPI)  Seminar  Series,  Santa  Fe  Institute,  Santa 
Fe,  New  Mexico  (1/19/05). 

45.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  B.  Lev,  Physics 
Colloquium,  California  State  University  of  Long  Beach,  CA  (1/05). 

46.  “Using  Continuous  Measurement  and  Real-Time  Feedback  to  Manipulate  the  Quantum 
State  of  Cold  Atoms,”  JM  Geremia,  University  of  Wisconsin  Physics  Department  Seminar 
(invited),  Madison,  WI  (1/05). 

47.  “Caltech  MURI  Center  for  Quantum  Networks,”  H.  Mabuchi  and  H.  J.  Kimble,  DARPA 
Quantum  Information  Science  and  Technology  (QuIST)  Program  Review,  Scottsdale,  AZ 
(11/16-17/2004). 
48.  “Measurement  and  control  in  quantum  information  science,”  H.  Mabuchi,  Photonics
Asia/APOC  Beijing  2004,  Beijing,  China  (11/5-13/2004).

49.  “Cavity  QED  —  from  Purcell  and  Casimir  to  the  Era  of  Strong  Coupling  for  Single  Atoms 
and  Photons,”  H.  J.  Kimble,  Quantum  Information  &  Coherence  (QUIC)  Seminar  Series
(invited),  Naval  Research  Lab,  Washington  DC  (11/5/04).

50.  “Real-Time  Quantum  Feedback  Control,”  JM  Geremia,  Optical  Society  of  America:
Frontiers  in  Optics  and  Laser  Science  Meeting  (invited),  Rochester,  NY  (11/04).

51.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  B.  Lev,  Ion  Trapping
Group  Seminar,  NIST,  Boulder,  CO  (10/04). 

52.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  B.  Lev,  JILA  Seminar, 
University  of  Colorado  and  NIST,  Boulder,  CO  (10/04). 

53.  “Continuous  Quantum  Measurement  and  Control  with  Atomic  Spin  Ensembles,”  J. 
Stockton,  Hertz  Foundation  Retreat,  Point  Reyes,  CA  (9/24-26/2004). 

54.  “Cavity  QED  with  Single  Atoms  and  Photons,”  H.  J.  Kimble,  Workshop  on  Microcavities 
in  Quantum  Optics  (invited),  Lake  Tegernsee,  Bavaria,  Germany  (9/23/04).

55.  “Quantum  control,  information  and  metrology,”  Princeton  University,  Princeton,  NJ 
(9/16/04). 

56.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  B.  Lev,  Atomic, 
Molecular,  and  Optical  Sciences  Seminar,  University  of  California,  Berkeley,  CA  (9/04). 

57.  “Long-range  entanglement  in  3D  noisy  cluster  states,”  R.  Raussendorf,  Ann  Arbor,  MI 
(9/04). 

58.  “Control  and  Dynamics  Approach  to  Robust  Quantum  Computing,”  H.  Mabuchi,  2004 
Quantum  Computing  Program  Review  (QCPR),  Orlando,  FL  (8/15-17/2004). 

59.  “Converting  Photons  to  Decibels  —  Travels  with  John  Hall  in  Quantum  Optics  Land,”  H.  J. 
Kimble,  John  Hall  Symposium  (invited),  JILA,  University  of  Colorado,  Boulder  (8/14/04). 

60.  “Quantum  Aspects  of  Precision  Measurement,”  JM  Geremia,  Los  Alamos  QUEST  Annual 
Meeting,  Santa  Fe,  NM  (8/04). 

61.  “Optimal  Classical  Communication  at  the  Quantum  Noise  Limit,”  JM  Geremia,  Caltech 
Principles  and  Applications  of  Control  in  Quantum  Systems  (PRACQSYS’04)  Meeting, 
Pasadena,  CA  (8/04). 

62.  “Real-time  feedback  control  of  quantum  state  reduction,”  Hideo  Mabuchi,  XIX 
International  Conference  on  Atomic  Physics  (ICAP  2004),  Rio  de  Janeiro,  Brazil  (7/27/04). 

63.  “Cavity  QED  with  single  atoms  and  photons,”  H.  J.  Kimble,  XIX  International  Conference 
on  Atomic  Physics  (ICAP  2004)  (invited),  Rio  de  Janeiro,  Brazil  (7/26/04). 

64.  “Feedback  stabilization  of  quantum  entangled-state  preparation,”  Hideo  Mabuchi,  Fields 
Institute  Meeting  on  Quantum  Information  and  Quantum  Control,  Toronto,  Canada 
(7/22/04). 

65.  “Quantum  Computation,”  John  Preskill,  17th  International  Conference  on  General 
Relativity  and  Gravitation,  Dublin,  Ireland  (7/21/04). 

66.  “Feedback  Control  of  Continuous  Projective  Measurement,”  (poster)  J.  Stockton,  Quantum 
Information  and  Quantum  Control,  University  of  Toronto,  Toronto,  Canada  (7/19- 
23/2004). 

67.  “Feedback  control  for  quantum  and  classical  uncertainty  management,”  Hideo  Mabuchi, 
Connections:  50th  Birthday  Celebration  for  John  Doyle,  Pasadena,  CA  (7/15/04). 

68.  “Long-range  entanglement  in  3D  noisy  cluster  states,”  R.  Raussendorf,  Toronto,  Canada 
(7/04). 
69.  “Cavity  QED  with  single  atoms  and  photons,”  H.  J.  Kimble,  2004  IEEE/LEOS  Summer
Topical  Meetings  (invited),  San  Diego,  CA  (6/28/04). 

70.  “Continuous  Measurement  and  Control  of  Atomic  Spin  Ensembles,”  J.  Stockton,  Caltech 
Candidacy  Talk,  Pasadena,  CA  (6/17/04). 

71.  “Cavity  QED  with  single  atoms  and  photons,”  H.  J.  Kimble,  FOCUS  —  Building 
Computational  Devices  Using  Coherent  Control  (invited),  University  of  Michigan,  Ann 
Arbor,  MI  (6/9/04). 

72.  “Feedback  Control  of  Continuous  Projective  Measurement,”  (poster)  J.  Stockton,  FOCUS  —
Building  Computational  Devices  Using  Coherent  Control  (invited),  University  of
Michigan,  Ann  Arbor,  MI  (6/7-9/2004). 

73.  “Feedback  control  of  quantum  state  reduction,”  H.  Mabuchi,  FOCUS  —  Building 
Computational  Devices  Using  Coherent  Control  (invited),  University  of  Michigan,  Ann 
Arbor,  MI  (6/9/04). 

74.  “Topological  quantum  computing  for  beginners,”  John  Preskill,  KITP  Conference  on 
Exotic  Order,  Santa  Barbara,  CA  (6/7/04). 

75.  “Quantum  Measurement,  Entanglement  and  Feedback,”  J.  M.  Geremia,  Physics 
Department  Seminar  (invited),  California  State  University  Long  Beach,  Long  Beach,  CA 
(6/04). 

76.  “Quantum  measurement  and  feedback  with  atomic  spins,”  Hideo  Mabuchi,  APS  Division 
of  Atomic,  Molecular,  Optical,  and  Plasma  Physics,  Tucson,  AZ  (5/26/04). 

77.  “Cavity  QED  with  Trapped  Atoms  for  Quantum  Optics  and  Quantum  Information,”  J. 
McKeever,  Ph.D.  Defense,  Caltech,  Pasadena,  CA  (5/24/04). 

78.  “Cavity  QED  with  multiple  atomic  excited  states,”  K.  Birnbaum,  CLEO/IQEC  2004,  San
Francisco,  CA  (5/18/04). 

79.  “Single  photon  generation  ‘on  demand’  from  a  single  trapped  atom  strongly  coupled  to  an 
optical  cavity,”  J.  McKeever,  A.  Boca,  A.  D.  Boozer,  R.  Miller,  J.  R.  Buck,  A.  Kuzmich 
and  H.  J.  Kimble,  New  Focus/Bookham  Student  Award  Session,  CLEO/IQEC  2004,  San 
Francisco,  CA  (5/16-21/2004). 

80.  “Single  photon  generation  ‘on  demand’  from  a  single  trapped  atom  strongly  coupled  to  an 
optical  cavity,”  J.  McKeever,  A.  Boca,  A.  D.  Boozer,  R.  Miller,  J.  R.  Buck,  A.  Kuzmich 
and  H.  J.  Kimble,  Single  Photon  Symposium  I,  CLEO/IQEC  2004,  San  Francisco,  CA 
(5/16-21/2004). 

81.  “Nonclassical  Photon  Pairs  from  a  Cold  Atomic  Ensemble  for  Scalable  Quantum
Communication,”  S.  Polyakov,  CLEO/IQEC  2004  (invited),  San  Francisco,  CA  (5/16- 
21/2004). 

82.  “Recent  Progress  in  Quantum  Information  Science,”  John  Preskill,  Seven  Pines 
Conference,  Stillwater,  MN  (5/8/04). 

83.  “Feedback  Control  of  Quantum  State  Reduction,”  R.  van  Handel,  8th  Southern  California
Nonlinear  Control  Workshop,  UCSB,  Santa  Barbara,  CA  (5/8/04). 

84.  “Cavity  QED  —  From  Purcell  and  Casimir  to  the  Era  of  Strong  Coupling  for  Single  Atoms 
and  Photons,”  H.  J.  Kimble,  Lilienfeld  Prize  Talk,  APS  April  Meeting  2004,  Denver,  CO 
(5/2/04). 

85.  “Collective  Spin  State  Preparation  with  Quantum  Measurement  and  Control,”  J.  K. 
Stockton,  Workshop  on  Control  of  Quantum  Mechanical  Systems  (CONQUEST),  UC 
Berkeley,  Berkeley,  CA  (4/30/04). 


86.  “Cavity  Quantum  Electrodynamics,”  H.  J.  Kimble,  Workshop  on  Quantum  Information 
Science  and  Emerging  Technologies  (invited),  NIST,  Boulder,  CO  (4/30/04). 

87.  “Fault-tolerant  quantum  computation  using  graph  states,”  Robert  Raussendorf,  University
of  Innsbruck,  Austria  (4/28/04). 

88.  “Recent  Progress  toward  the  Realization  of  Quantum  Networks,”  H.  J.  Kimble  (invited), 
Harvard  University,  Cambridge,  MA  (4/19/04). 

89.  “Extending  and  Generalizing  the  Kochen-Specker  theorem,”  Ben  Toner,  Universite  de 
Montreal,  Parts  I  and  II,  Montreal,  Canada  (4/15-16/2004). 

90.  “Quantum  Teleportation  —  Fact  and  Fantasy,”  H.  J.  Kimble,  Biedenharn  Lecture  IV,
University  of  Texas  at  Austin,  Austin,  TX  (4/6/04). 

91.  “Quantum  Teleportation  —  Fact  and  Fantasy,”  H.  J.  Kimble,  Dean’s  Scholars  Lecture, 
University  of  Texas  at  Austin,  Austin,  TX  (4/2/04). 

92.  “Scalable  Quantum  Communication  Networks  with  Photon  Pairs  from  Atomic  Ensembles,” 
H.  J.  Kimble,  Biedenharn  Lecture  III,  University  of  Texas  at  Austin,  Austin,  TX  (4/1/04).

93.  “Deterministic  Generation  of  Single  Photons  from  One  Atom  Trapped  in  a  Cavity,”  J. 
McKeever,  A.  Boca,  A.  D.  Boozer,  R.  Miller,  J.  R.  Buck,  A.  Kuzmich  and  H.  J.  Kimble,
Seminar,  NIST  Ion  Storage  Group  (invited),  Boulder,  CO  (4/04). 

94.  “Deterministic  Generation  of  Single  Photons  from  One  Atom  Trapped  in  a  Cavity,”  J. 
McKeever,  A.  Boca,  A.  D.  Boozer,  R.  Miller,  J.  R.  Buck,  A.  Kuzmich  and  H.  J.  Kimble,
JILA  Seminar  (invited),  Boulder,  CO  (4/04). 

95.  “The  New  Science  of  Quantum  Information,”  H.  J.  Kimble,  Dean’s  Scholars  Lecture, 
University  of  Texas  at  Austin,  Austin,  TX  (3/26/04). 

96.  “More  Cavity  Quantum  Electrodynamics,”  H.  J.  Kimble,  Biedenharn  Lecture  II,  University
of  Texas  at  Austin,  Austin,  TX  (3/26/04). 

97.  “Measurement-based  quantum  computation  with  cluster  states,”  Robert  Raussendorf, 
University  of  New  Mexico,  Albuquerque,  NM  (3/26/04). 

98.  “Cavity  Quantum  Electrodynamics  —  from  Purcell  and  Casimir  to  the  Era  of  Strong 
Coupling  with  Single  Atoms  and  Photons,”  H.  J.  Kimble,  Department  of  Physics 
Colloquium,  Biedenharn  Lecture  I,  University  of  Texas  at  Austin,  Austin,  TX  (3/24/04).

99.  “Real-Time  Quantum  Feedback  Control  and  Precision  Measurement,”  JM  Geremia, 
University  of  Oregon  Physics  Department  Seminar  (invited)  (3/04). 

100.  “Quantum  error  correction  for  continuously  detected  errors,”  Charlene  Ahn,  IBM, 
Yorktown,  NY  (3/04). 

101.  “Robust  quantum  memory  with  local  controls,”  Jim  Harrington,  LANL  Quantum  Institute, 
Los  Alamos,  NM  (3/04). 

102.  “Robust  quantum  memory  with  local  controls,”  Jim  Harrington,  University  of  New 
Mexico,  Albuquerque,  NM  (3/04). 

103.  “Quantum  measurement  and  feedback  control  with  cold  atoms,”  Hideo  Mabuchi, 
Harvard/MIT  Center  for  Ultracold  Atoms  Seminar,  Cambridge,  MA  (3/2/04). 

104.  “Knowing  what  you  know:  estimation  and  control  in  nanoscale  systems,”  Hideo  Mabuchi, 
MIT  Center  for  Bits  and  Atoms  Colloquium,  Cambridge,  MA  (3/1/04). 

105.  “Superselection  Rules  and  Quantum  Protocols,”  John  Preskill,  Gordon  Research 
Conference  on  Quantum  Information  Science,  Ventura,  CA  (2/25/04). 

106.  “Nonlocal  games,”  Ben  Toner,  SQuInT  2004  (invited),  UC  San  Diego,  San  Diego,  CA 
(2/20-22/2004). 
107.  “Feedback  Control  of  Continuous  Projective  Measurement,”  (poster)  J.  Stockton,  SQuInT
2004  (invited),  UC  San  Diego,  San  Diego,  CA  (2/20-22/2004). 

108.  “Conditional  anti-bunching  of  photons  generated  in  a  cold  atomic  ensemble,”  S.  Polyakov, 
SQuInT  2004  (invited),  UC  San  Diego,  San  Diego,  CA  (2/20-22/2004). 

109.  “Real-Time  Quantum  Feedback  Control:  Deterministic  State  Reduction  in  Cold  Atoms,”  J. 
M.  Geremia,  SQuInT  2004  (invited),  UC  San  Diego,  San  Diego,  CA  (2/20/04). 

110.  “Deterministic  Generation  of  Single  Photons  from  One  Atom  Trapped  in  a  Cavity,”  J. 
McKeever,  A.  Boca,  A.  D.  Boozer,  R.  Miller,  J.  R.  Buck,  A.  Kuzmich  and  H.  J.  Kimble,
Sixth  Annual  Southwest  Quantum  Information  and  Technology  (SQuInT)  2004  Workshop 
(invited),  San  Diego,  CA  (2/19/04). 

111.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  B.  Lev,  IEEE  Nanoscale 
Devices  and  System  Integration  Conference,  Miami,  FL  (2/15/04). 

112.  “Putting  Weirdness  to  Work,”  John  Preskill,  AAAS  Annual  Meeting,  Seattle,  WA
(2/14/04). 

113.  “Teleportation  of  Quantum  States  —  Fact  and  Fantasy,”  H.  J.  Kimble,  Quantum  Weirdness
—  In  Nature,  In  the  Lab  Symposium,  AAAS  Annual  Meeting  (invited),  Seattle,  WA 
(2/14/04). 

114.  “Real-Time  Quantum  Feedback  Control  of  Atomic  Spin  Squeezing,”  JM  Geremia,  SQuInT
Annual  Meeting,  San  Diego  (2/04).

115.  “Encoding  a  qubit  into  many  oscillators,”  Jim  Harrington,  SQuInT  2004  (invited),  UC  San 
Diego,  San  Diego,  CA  (2/04). 

116.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  (poster)  B.  Lev, 
Southwest  Quantum  Information  and  Technology  Network  6th  Annual  Meeting,  UCSD, 
San  Diego,  California  (2/04). 

117.  “Continuous  Quantum  Nondemolition  Measurement  and  Feedback  with  Cold  Atoms,”  JM 
Geremia,  University  of  California  Berkeley  AMO  Physics  Seminar  (invited)  (2/04). 

118.  “Photonic  Quantum  Computation  through  Cavity  Assisted  Interaction,”  H.  J.  Kimble, 
Focused  Quantum  Systems  (FoQuS)  Workshop,  Falls  Church,  VA  (1/28/04). 

119.  “Generation  of  Nonclassical  Photon  Pairs  for  Scalable  Quantum  Communication  with
Atomic  Ensembles,”  Chin-wen  Chou,  34th  Winter  Colloquium  on  the  Physics  of  Quantum 
Electronics  (PQE)  (invited),  Snowbird,  UT  (1/8/04). 

120.  “A  One-Atom  Laser  in  the  Regime  of  Strong  Coupling,”  H.  J.  Kimble,  34th  Winter
Colloquium  on  the  Physics  of  Quantum  Electronics  (PQE)  (invited),  Snowbird,  UT 
(1/5/04). 

121.  “Real-Time  Quantum  Feedback  Control  of  Atomic  Spin  Squeezing,”  JM  Geremia, 
Deutsche  Forschungsgemeinschaft  (DFG)  Focused  Meeting  (invited),  Bad  Honnef,  Germany
(1/04). 

122.  “Protecting  quantum  states  through  feedback  control,”  Charlene  Ahn,  34th  Winter 
Colloquium  on  The  Physics  of  Quantum  Electronics  (PQE),  Snowbird,  UT  (1/04). 

123.  “Quantum  error  correction  for  continuously  detected  errors,”  Charlene  Ahn,  Perimeter 
Institute,  Waterloo,  Ontario,  Canada  (1/04). 

124.  “Real-time  quantum  feedback  control,”  Hideo  Mabuchi,  SAMSI  Workshop  on  Multiscale 
Modeling  and  Control  Design,  Research  Triangle  Park,  NC  (1/04). 

125.  “Deterministic  preparation  of  spin-squeezed  states  via  real-time  quantum  feedback,”  H. 
Mabuchi,  Stanford-ENS  Quantum  Entanglement  Symposium,  Stanford  (12/16/03). 
126.  “The  New  Science  of  Quantum  Information,”  H.  J.  Kimble,  Northwestern  University
(invited),  Evanston,  IL  (11/7/03).

127.  “An  Introduction  to  Control  Theory  from  Classical  to  Quantum,”  JM  Geremia,  Physics 
Winter  School,  University  of  Heidelberg,  Germany  (invited  lecturer)  (11/03).

128.  “Identification,  modeling,  and  control  of  quantum  and  bio-molecular  systems,”  H.
Mabuchi,  Workshop  on  New  Horizons  in  Molecular  Sciences  and  Systems:  An  Integrated
Approach,  Okinawa,  Japan  (10/17/03). 

129.  “Experimental  Realization  of  a  One-Atom  Laser  in  a  Regime  of  Strong  Coupling,”  J.
McKeever,  Quantum  Optics  Seminar,  University  of  Toronto  (invited),  Toronto,  Canada 
(10/14/03). 

130.  “A  One-Atom  Laser  in  a  Regime  of  Strong  Coupling,”  J.  McKeever,  A.  Boca,  A.  D.
Boozer,  J.  R.  Buck  and  H.  J.  Kimble,  87th  OSA  Annual  Meeting/Laser  Science  XIX 
(invited),  Tucson,  AZ  (10/8/03). 

131.  “Magnetic  Microtraps  for  Cavity  QED,  BECs,  and  Atom  Optics,”  B.  Lev,  Seminar,  Max
Planck  Institute  for  Quantum  Optics,  Garching,  Germany  (10/03).

132.  “Overview  of  Caltech  MURI  Center  for  Quantum  Networks,”  B.  Lev,  MURI  Kick-Off 
Meeting,  Stanford,  Palo  Alto  (10/03). 

133.  “Quantum  filtering  and  broadband  atomic  magnetometry,”  H.  Mabuchi,  EURESCO 
Conference  on  Quantum  Optics,  Granada,  Spain  (9/28/03). 

134.  “Continuous  observation  of  open  quantum  systems,”  H.  Mabuchi,  US-Japan  Joint
Workshop  on  Coherence  in  Quantum  Systems,  Yatsugatake,  Japan  (9/17/03). 

135.  “Coherence  in  broadband  atomic  magnetometry,”  H.  Mabuchi,  Tokyo  University,  Tokyo, 
Japan  (9/15/03). 

136.  “Continuous  observation  of  open  quantum  systems:  conditional  spin-squeezing  and 
broadband  atomic  magnetometry,”  H.  Mabuchi,  U.S.-Japan  Conference  on  Coherence  and
Quantum  Systems,  Hokkaido,  Japan  (9/12-18/2003). 

137.  “Teleportation  of  continuous  quantum  variables  using  squeezed-state  entanglement,”  K.  W.
Goh,  The  International  Symposium  on  Optical  Science  and  Technology,  SPIE’s  48th 
Annual  Meeting  (invited),  San  Diego,  CA  (8/6/03). 

138.  “Spin  Squeezing  in  Cold  Atoms  with  Continuous  Measurement,”  JM  Geremia,  Los  Alamos 
QUEST  Annual  Meeting,  Santa  Fe  (8/03).

139.  “Cavity  QED  by  the  Numbers,”  H.  J.  Kimble,  16th  International  Conference  on  Laser 
Spectroscopy  (ICOLS03)  (invited),  Palm  Cove,  Australia  (7/16/03).

140.  “Microcavities:  strong  coupling  of  atoms  and  photons,”  H.  Mabuchi,  Photonics 
Technologies  Advancement  Program  Workshop  on  Optical  Microcavities,  San  Diego,  CA
(7/15/03). 

141.  “Quantum  optics  and  quantum  information  science,”  H.  Mabuchi,  OSA  Topical  Meeting 
on  Optics  in  Computing,  Washington  D.C.  (6/20/03). 

142.  “Quantum  Control  Lecture  2,”  A.  C.  Doherty,  SQuInT  Student  Summer  Retreat,  Monterey, 
CA  (6/19/03). 

143.  “Quantum  Control  Lecture  1,”  A.  C.  Doherty,  SQuInT  Student  Summer  Retreat,  Monterey, 
CA  (6/18/03). 

144.  “Quantum  Optics  with  Cold  Atoms  —  Cavity  QED  &  Atomic  Ensembles,”  H.  J.  Kimble, 
Gordon  Research  Conference  -  Atomic  Physics  (invited),  Tilton,  NH  (6/17/03). 

145.  “Applications  for  Continuous  Quantum  Measurement  of  Spin  Ensembles,”  (poster)  J.  K.
Stockton,  Gordon  Research  Conference  on  Atomic  Physics,  Tilton,  NH  (6/15-20/2003).
146.  “The  New  Science  of  Quantum  Information,”  H.  J.  Kimble,  QELS  Plenary  Session
(invited),  CLEO/QELS  Conference  2003,  Baltimore,  MD  (6/4/03). 

147.  “Experiments  in  quantum  feedback,”  H.  Mabuchi,  SPIE  Conference  on  Fluctuations  and 
Noise,  Santa  Fe,  NM  (6/2/03). 

148.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  Niels  Bohr  Institute,  Copenhagen, 
Denmark  (5/30/03). 

149.  “Quantum  Optics  Enabling  Information  Science,”  H.  J.  Kimble,  Hertz  Memorial  Lecture 
(invited),  UC  Davis,  Davis,  CA  (5/28/03). 

150.  “Quantum  Optics  with  Single  Atoms  and  Photons,”  H.  J.  Kimble,  AMO  Student 
Symposium,  DAMOP  2003  Meeting  (invited),  Boulder,  CO  (5/20/03). 

151.  “Cavity  QED  -  From  Purcell  and  Casimir  to  the  Era  of  Strong  Coupling,”  H.  J.  Kimble, 
University  of  Washington  Colloquium  (invited),  Seattle,  WA  (5/12/03). 

152.  “Symmetric  extensions,  local  hidden  variables  and  modified  de  Finetti  theorems,”  A.  C. 
Doherty,  Information  Physics  Group  Seminar,  Department  of  Physics,  University  of  New 
Mexico,  Albuquerque,  NM  (4/30/03). 

153.  “Linear  matrix  inequalities  for  quantum  entanglement  and  control,”  A.  C.  Doherty, 
Department  of  Mechanical  and  Environmental  Engineering,  UC  Santa  Barbara,  Santa 
Barbara,  CA  (4/21/03). 

154.  “Physics  with  photons,  from  quantum  to  bio,”  H.  Mabuchi,  Physics  colloquium,  Temple 
University,  Philadelphia,  PA  (4/19-22/2003). 

155.  “Cavity  QED  —  The  Plumbing,”  H.  J.  Kimble,  Loeb  Lecture  Series  (invited),  Harvard 
University,  Boston,  MA  (4/15/03). 

156.  “Physics  with  photons:  from  quantum  to  bio,”  H.  Mabuchi,  Physics  colloquium,  University 
of  Arizona,  Tucson,  AZ  (4/11/03).

157.  “Quantum  Teleportation  —  Fact  and  Fantasy,”  H.  J.  Kimble,  Loeb  Lecture  Series  (invited), 
Harvard  University,  Boston,  MA  (4/10/03). 

158.  “An  Overview  of  Cavity  Quantum  Electrodynamics  —  from  Purcell  and  Casimir  to  the  Era 
of  Strong  Coupling,”  H.  J.  Kimble,  Loeb  Lecture  Series  (invited),  Harvard  University, 
Boston,  MA  (4/8/03). 

159.  “The  New  Science  of  Quantum  Information,”  H.  J.  Kimble,  Colloquium  (invited),  Loeb 
Lecture  Series,  Harvard  University,  Boston,  MA  (4/7/03). 

160.  “Physics  with  photons:  from  quantum  to  bio,”  H.  Mabuchi,  Optics  and  electronics  seminar, 
University  of  Stanford,  CA  (4/6-8/2003). 

161.  “The  Symmetric  Group  can  be  as  Entangled  as  Possible,  Almost,”  JM  Geremia,  Caltech 
Institute  for  Quantum  Information  Science  seminar  (invited)  April  2003. 

162.  “Quantum  Information  and  Quantum  Control,”  H.  Mabuchi,  Gordon  Research  Conference 
on  Quantum  Information  Science,  Ventura,  CA  (3/24-28/2003). 

163.  “Experiments  in  Quantum  Information  Science  —  What's  the  Point?”  H.  J.  Kimble,  Gordon 
Research  Conference,  Quantum  Information  Science  (invited),  Ventura,  CA  (3/23/03). 

164.  “Quantum  Entanglement  and  Non-locality:  Semidefinite  relaxations  for  problems  in 
Quantum  Information  Processing,”  A.  C.  Doherty,  Workshop  on  Robustness  Analysis 
Tools  with  Applications  to  the  Biological  and  Physical  Sciences,  Kavli  Institute  of 
Theoretical  Physics,  UC  Santa  Barbara,  Santa  Barbara,  CA  (3/21/03). 

165.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  Research  Seminar,  NIST  Gaithersburg, 
Maryland  (3/18/03). 
166.  “Putting  Quantum  Mechanics  to  Work,”  A.  C.  Doherty,  Department  of  Physics,  University
of  Oregon,  Eugene,  OR  (3/6/03). 

167.  “Quantum  Information  Science  with  Single  Atoms  and  Photons,”  H.  J.  Kimble,  APS 
Annual  Meeting  (invited),  Austin,  TX  (3/5/03). 

168.  “Experiments  in  Quantum  Feedback,”  H.  Mabuchi,  Quantum  information  sciences  seminar, 
MIT,  Cambridge,  MA  (3/3/03). 

169.  “Quantum  Physics  in  Light  of  Quantum  Engineering,”  A.  C.  Doherty,  Department  of 
Physics,  University  of  Michigan,  Ann  Arbor,  MI  (2/13/03). 

170.  “Cavity  QED  in  the  Present  and  Future,”  J.  Buck,  Communications  Research  Laboratory, 
Japan  (2/13/03). 

171.  “State-Insensitive  Trapping  of  Single  Atoms  in  Cavity  QED,”  J.  McKeever,  SQuInT 
Annual  Meeting,  Santa  Fe,  NM  (2/7/03). 

172.  “Quantum  Entanglement  in  the  Symmetric  Subspace,”  JM  Geremia,  SQuInT  Annual 
Meeting,  Santa  Fe,  NM,  February  2003. 

173.  “Cavity  QED  in  the  Present  and  Future,”  J.  Buck,  University  of  Tokyo,  Japan  (2/5/03). 

174.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  Weekly  Research  Seminar,  Ludwig- 
Maximilians  University,  Munich,  Germany  (1/30/03). 

175.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  Weekly  Research  Seminar,  University  of 
Heidelberg,  Germany  (1/20/03). 

176.  “Physics  with  photons:  from  quantum  to  bio,”  H.  Mabuchi,  Solid  State  Sciences  Seminar 
Series,  Caltech,  Pasadena,  CA  (1/15/03). 

177.  “State-Insensitive  Trapping  of  Single  Atoms  in  Cavity  QED,”  J.  McKeever,  Institute  for 
Applied  Physics,  University  of  Bonn,  Germany  (1/10/03). 

178.  “Quantum  and  classical  control:  theory  &  experiment,”  H.  Mabuchi,  University  of 
Heidelberg  Physics  of  Information  symposium,  Heidelberg,  Germany  (1/10/03). 

179.  “State-Insensitive  Trapping  of  Single  Atoms  in  Cavity  QED,”  J.  McKeever,  Institute  for 
Experimental  Physics,  University  of  Innsbruck,  Austria  (1/9/03). 

180.  “Tracking  and  trapping  single  atoms  for  quantum  logic  in  cavity  QED,”  T.  W.  Lynn,  5th 
Workshop  on  Laser  Cooling,  Awaji  Yumebutai,  Japan  (1/8/03). 

181.  “Quantum  Feedback  and  Measurement:  Arranging  for  Theory  to  Visit  the  Laboratory,”  A. 
C.  Doherty,  US-Australia  Workshop  on  Solid  State  and  Optical  Approaches  to  Quantum 
Information  Science  (invited),  Newport,  Australia  (1/7/03). 

182.  “Experiments  in  quantum  feedback,”  H.  Mabuchi,  Winter  Colloquium  on  The  Physics  of 
Quantum  Electronics,  Snowbird,  UT  (1/6/03). 

183.  “A  New  Physics/A  New  Control,”  Invited  presentation,  A.  C.  Doherty,  IEEE  2002 
Conference  on  Decision  and  Control,  Las  Vegas,  NV  (12/13/02). 

184.  “Physics  with  photons:  from  quantum  to  bio,”  H.  Mabuchi,  Joint  Atomic  Physics 
Colloquium,  Harvard  University,  Boston,  MA  (12/11/02).

185.  “Control  and  Systems  Theoretic  Approaches  to  Quantum  Physics,”  A.  C.  Doherty,  IEEE 
2002  Conference  on  Decision  and  Control,  Las  Vegas,  NV  (12/10-13/2002). 

186.  “Experiments  in  real-time  quantum  feedback,”  H.  Mabuchi,  IEEE  2002  Conference  on 
Decision  and  Control,  Las  Vegas,  NV  (12/10/02). 

187.  “Measurement  and  feedback  in  quantum  systems,”  H.  Mabuchi,  tutorial  workshop  on 
control  of  quantum  systems,  IEEE  2002  Conference  On  Decision  and  Control,  Las  Vegas, 
NV  (12/9/02).

188.  “Quantum  Information  Science  -  The  Promise,  the  Problems,  and  the  Plumbing,”  Invited
presentation,  H.  J.  Kimble,  Frontiers  of  Science  and  Technology:  Quantum  Computation 
and  Information,  Lawrence  Livermore  National  Laboratory,  Livermore,  CA  (12/9/02). 

189.  “Quantum  Control  and  Quantum  Information  Processing,”  A.  C.  Doherty,  Institute  for 
Quantum  Information  (IQI)  Review,  Caltech,  Pasadena,  CA  (12/5/02). 

190.  “Quantum  Measurement:  Arranging  for  Theory  to  Visit  the  Laboratory,”  Invited 
presentation,  A.  C.  Doherty,  Atomic  Physics  Seminar,  University  of  California,  Berkeley, 
CA  (11/20/02). 

191.  “Complexity  and  robustness  in  quantum  networks,”  Invited  presentation,  H.  Mabuchi, 
Vanguard  NextGens  Conference,  San  Diego,  CA  (11/18/02).

192.  “Local  Hidden  Variable  Theories  for  Quantum  States,”  Invited  presentation,  A.  C.  Doherty, 
Quantum  Information  and  Cryptography  Workshop,  MSRI,  Berkeley,  CA  (11/8/02).

193.  “Quantum  Dynamics  with  Single  Atoms  and  Photons,”  Invited  presentation,  H.  J.  Kimble, 
5th  Workshop  on  Nonlinear  Dynamics  and  Chaos,  Courant  Institute  of  Mathematical 
Sciences,  NYU,  New  York,  NY  (10/25/02). 

194.  “Quantum  Physics  in  Light  of  Quantum  Engineering,”  Invited  presentation,  A.  C.  Doherty, 
Department  of  Physics,  University  of  Oxford,  Oxford,  United  Kingdom  (10/21/02). 

195.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  MURI  Center  for  Quantum  Networks
Review,  Caltech,  Pasadena,  CA  (10/16/02). 

196.  “Experiments  in  quantum  feedback,”  H.  Mabuchi,  Mechanical  Engineering  Seminar, 
Caltech,  Pasadena,  CA  (10/15/02). 

197.  “Quantum  Information  Theory  for  Quantum  Networks,”  A.  C.  Doherty,  MURI  Center  for 
Quantum  Networks  Review,  Caltech,  Pasadena,  CA  (10/15/02). 

198.  “The  secret  life  of  photons  and  atoms,”  H.  Mabuchi,  MacArthur  Fellows  Reunion,  St.
Louis,  MO  (10/10-13/2002).

199.  “Strong  coupling  of  motion  and  light  in  cavity  QED,”  Invited  presentation,  H.  J.  Kimble, 
OSA  Annual  Meeting,  Orlando,  FL  (10/2/02). 

200.  “Tracking,  trapping,  and  training  atomic  motion  in  cavity  QED,”  Invited  presentation,  T.
W.  Lynn,  OSA  Annual  Meeting,  Orlando,  FL  (10/1/02).

201.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  Physics  Graduate  Student  Seminar, 
Caltech,  Pasadena,  CA  (10/1/02). 

202.  “Experimental  and  theoretical  foundations  of  quantum  and  biochemical  networks,”  H. 
Mabuchi,  IPAM  Workshop  on  Alternative  Computing,  UCLA,  Los  Angeles,  CA  (9/30/02).

203.  “Physics  with  photons:  from  quantum  to  bio,”  H.  Mabuchi,  Physics  Colloquium,  Yale,
New  Haven,  CT  (9/27/02).

204.  “Mathematics  in  nanoscale  science  and  engineering,”  H.  Mabuchi,  UCLA  IPAM
mini-retreat,  Malibu,  CA  (9/16/02).

205.  “Quantum  networks  based  on  cavity  QED,”  H.  Mabuchi,  DARPA  QuIST  PI’s  meeting,
Boston,  MA  (9/10-13/02). 

206.  “Experiments  on  quantum  feedback,”  H.  Mabuchi,  Decoherence  Workshop,  Ann  Arbor,
MI  (8/20-28/02).

207.  “Adaptive  Quantum  Metrology  and  Control,”  J.  Stockton,  Quantum  Enabled  Science  and 
Technology  (QUEST)  2002  Summer  Workshop,  Santa  Fe,  NM  (8/5-9/2002). 

208.  “Exact  and  Approximated  Performance  of  Concatenated  Quantum  Codes,”  B.  Rahn,  6th 
International  Conference  on  Quantum  Communication,  Measurement  and  Computing 
(QCMC’02),  Cambridge,  MA  (7/22/2002).

209.  “Quantum  Dynamics  with  Single  Atoms  and  Photons,”  Invited  presentation,  H.  J.  Kimble,
6th  International  Conference  on  Quantum  Communication,  Measurement  and  Computing 
(QCMC  ’02),  MIT,  Boston,  MA  (7/22-26/2002). 

210.  “Adaptive  quantum  measurement  and  quantum  feedback  control,”  Invited  presentation,  H. 
Mabuchi,  6th  International  Conference  on  Quantum  Communication,  Measurements,  and 
Computing  (QCMC’02),  Boston,  MA  (7/18-23/2002). 

211.  “Quantum  Measurements:  Setting  Up  a  Meeting  Between  Experiments  and  Theory,” 
Invited  presentation,  A.  C.  Doherty,  Physics  Department  Seminar,  Yale  University,  New 
Haven,  CT  (7/9/02). 

212.  “Distinguishing  Separable  and  Entangled  States,”  Invited  presentation,  A.  C.  Doherty, 
Department  of  Applied  Mathematics  and  Theoretical  Physics,  Cambridge  University, 
Cambridge,  United  Kingdom  (6/24/02). 

213.  “Quantum  Measurement  Theory  Visits  the  Laboratory,”  Invited  presentation,  A.  C. 
Doherty,  BEC  Group  Seminar,  Department  of  Physics,  University  of  Otago,  Dunedin,  New 
Zealand  (6/4/02). 

214.  “Cavity  QED  with  neutral  atoms,”  H.  Mabuchi,  NIST  Workshop  on  Neutral  Atom 
Quantum  Computing,  Gaithersburg,  MD  (6/3-4/2002). 

215.  “Cavity  QED  with  Strong  Coupling  —  Toward  the  Deterministic  Control  of  Quantum 
Dynamics,”  Invited  presentation,  H.  J.  Kimble,  Innsbruck,  Austria  (5/29/02). 

216.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  Laser  Seminar  at  the  Max-Planck  Institute
for  Quantum  Optics,  Garching,  Germany  (4/18/02). 

217.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  Seminar  at  the  University  of  Innsbruck, 
Innsbruck,  Austria  (4/12/02). 

218.  “Magnetic  Microtraps  for  Cavity  QED,”  B.  Lev,  Seminar  at  the  University  of  Trento, 
Trento,  Italy  (4/11/02). 

219.  “Quantum  information  processing  with  atoms  and  photons,”  Alex  Kuzmich,  Physics 
Department  Colloquium,  Georgia  Institute  of  Technology,  Atlanta,  GA  (3/29/02). 

220.  “A  Model  Reduction  Analysis  of  Concatenated  Quantum  Codes,”  B.  Rahn,  CIMMS 
Workshop,  California  Institute  of  Technology,  Pasadena,  CA  (3/22/02). 

221.  “Measurement,  feedback,  and  the  quantum-classical  transition,”  H.  Mabuchi,  John  Wheeler 
Symposium,  Princeton,  NJ  (3/16/02). 

222.  “Quantum  Dynamics  with  Single  Atoms  and  Photons,”  Invited  presentation,  H.  J.  Kimble, 
10th  JST  International  Symposium  on  Quantum  Computing,  Nano-Science  &  Technology 
for  Implementation  of  Quantum  Computers,  Tokyo,  Japan  (3/12-14/2002). 

223.  “Distinguishing  Separable  and  Entangled  States,”  A.  C.  Doherty,  SQuInT  ’02  Annual 
Meeting,  Boulder,  CO  (3/9/02). 

224.  “Quantum  Teleportation  of  Light  Beams,”  Tiancai  Zhang,  Kok  Win  Goh,  Chin-wen  Chou,
Peter  Lodahl,  and  H.  J.  Kimble,  poster  presentation  at  SQuInT  ’02  Annual  Meeting, 
Boulder,  CO  (3/7-10/2002). 

225.  “Designing  and  Characterizing  Photonic  Band  Gap  Materials  for  Cavity  QED,”  J.  M. 
Geremia  and  J.  Williams,  SQuInT  ’02  Annual  Meeting,  Boulder,  CO  (3/8/02). 

226.  “Exact  and  Approximate  Performance  of  Concatenated  Quantum  Codes,”  B.  Rahn,  poster 
presentation  at  SQuInT  ’02  Annual  Meeting,  Boulder,  CO  (3/7-10/2002). 

227.  “Single  Atom  Trapping  in  Cavity  QED,”  J.  McKeever,  poster  presentation  at  SQuInT  ’02 
Annual  Meeting,  Boulder,  CO  (3/7-10/2002).

228.  “Multiscale  science:  from  quantum  to  bio,”  H.  Mabuchi,  Physics  Colloquium,  JILA,
Boulder,  CO  (2/21/02). 

229.  “Control  theory  and  foundations  of  quantum  mechanics,”  H.  Mabuchi,  Mohammed  Dahleh 
Memorial  Symposium  at  University  of  California  at  Santa  Barbara  (2/8/02). 

230.  “Distinguishing  Separable  and  Entangled  States,”  A.  C.  Doherty,  IQI  Seminar,  Caltech, 
Pasadena,  CA  (1/15/02). 

231.  “Multiscale  science:  from  quantum  to  bio,”  H.  Mabuchi,  Physics  Research  Conference  at
the  California  Institute  of  Technology,  Pasadena,  CA  (1/10/02). 

232.  “Distributed  Compression  of  a  Quantum  Source,”  A.  C.  Doherty,  Australasian  Conference 
on  Optics,  Lasers  and  Spectroscopy  (invited),  Brisbane,  Australia  (12/3-6/01). 

233.  “Progress  toward  the  Realization  of  Quantum  Networks,”  H.  J.  Kimble,  the  Quantum 
Information  conference  at  UCSB  Institute  for  Theoretical  Physics,  Santa  Barbara,  CA 
(12/3-7/2001). 

234.  “The  quantum-classical  transition  on  trial:  is  the  whole  more  than  the  sum  of  the  parts?”  H. 
Mabuchi,  Caltech’s  Watson  Lecture  Series  (public  lecture),  Pasadena,  CA  (11/14/01).

235.  “Quantum  Measurements:  arranging  a  meeting  between  theory  and  experiment,”  A.  C. 
Doherty,  Atomic  Molecular  and  Optical  Physics  Group  Seminar,  University  of  Texas, 
Austin,  Austin,  TX  (11/9/01). 

236.  “Control  Methods  for  Analysis  of  Concatenated  Quantum  Codes,”  B.  Rahn,  Lloyd  Group 
Meeting,  Massachusetts  Institute  of  Technology  (invited),  Cambridge,  MA  (11/2/01).

237.  “Cooling  of  a  single  atom  in  an  optical  trap  inside  a  resonator,”  J.  McKeever,  poster 
presentation  at  the  Optical  Society  of  America  Annual  Meeting/  Interdisciplinary  Laser
Science  XVII  (ILS-XVII)  Conference,  Long  Beach,  CA  (10/18/01). 

238.  “Sensitivities  of  atom-cavity  microscopes,”  K.  Birnbaum,  poster  presentation  at  the  Optical 
Society  of  America  Annual  Meeting/  Interdisciplinary  Laser  Science  XVII  (ILS-XVII) 
Conference,  Long  Beach,  CA  (10/18/01). 

239.  “Dynamics  of  Single-Atom,  Single-Photon  Trapping  in  Cavity  QED,”  T.  Lynn,  poster 
presentation  at  the  Optical  Society  of  America  Annual  Meeting/  Interdisciplinary  Laser 
Science  XVII  (ILS-XVII)  Conference,  Long  Beach,  CA  (10/18/01).

240.  “Quantum  teleportation  of  quadrature  amplitudes,”  T.  Zhang,  Optical  Society  of  America 
Annual  Meeting/  Interdisciplinary  Laser  Science  XVII  (ILS-XVII)  Conference,  Long 
Beach,  CA  (10/17/01). 

241.  “Determining  optimal  microsphere  sizes  for  cavity  QED,”  J.  R.  Buck,  Optical  Society  of 
America  Annual  Meeting/  Interdisciplinary  Laser  Science  XVII  (ILS-XVII)  Conference,
Long  Beach,  CA  (10/17/01). 

242.  “Entangled  atoms  and  photons,”  A.  Kuzmich,  New  Laser  Scientists  Conference
(NLSC-2001)/  Optical  Society  of  America  Annual  Meeting  2001  (invited),  Long  Beach,  CA
(10/12/01). 

243.  “Quantum  Information  Science  -  The  Promise,  the  Problems,  and  the  Plumbing,”  H.  J. 
Kimble,  invited  presentation  at  Shanxi  University,  Taiyuan,  China  (10/9/01). 

244.  “Cavity  QED  with  Strong  Coupling,”  H.  J.  Kimble,  invited  presentation  at  Shanxi 
University,  Taiyuan,  China  (10/9/01). 

245.  “Experiments  in  quantum  feedback,”  H.  Mabuchi,  Physics  Colloquium  at  University  of 
California,  Berkeley,  Berkeley,  CA  (10/8/01). 

246.  “Micromagnetic  Traps  for  Cavity  QED,”  B.  Lev,  The  Quantum  Technology  Seminar  at  Los 
Alamos  National  Laboratory  (invited),  Los  Alamos,  New  Mexico  (9/19/01).

247.  “Quantum  measurement  and  real-time  feedback,”  H.  Mabuchi,  Nonlinear  Optics  and
Lasers  Gordon  Research  Conference,  New  London,  NH  (7/29/01). 

248.  “Quantum  measurement  and  feedback,”  H.  Mabuchi,  SIAM  Meeting  on  Control  and  its 
Applications  (plenary),  San  Diego,  CA  (7/12/01). 

249.  “Closed-Loop  Quantum  System  Identification,”  JM  Geremia,  Caltech  Institute  for 
Quantum  Information  Science  Seminar  (7/01). 

250.  “Real-time  feedback  for  quantum  measurement  and  control,”  H.  Mabuchi,  Atomic  Physics 
Gordon  Research  Conference,  Williamstown,  MA  (6/19/01). 

251.  “Dynamics  of  Single-Atom,  Single-Photon  Trapping  in  Cavity  QED,”  T.  Lynn,  poster
presentation  at  the  Eighth  Rochester  Conference  on  Coherence  and  Quantum  Optics 
(CQO8),  Rochester,  NY  (6/14/01).

252.  “Cavity  QED  with  Strong  Coupling,”  H.  J.  Kimble,  Eighth  Rochester  Conference  on 
Coherence  and  Quantum  Optics  (CQO8)  (invited),  Rochester,  NY  (6/12-17/2001).

253.  “Cavity  QED  with  Cold  Atoms,”  H.  J.  Kimble,  ICOLS-XV  Conference  (invited),  Snowbird,
UT  (6/10-12/2001). 

254.  “Real-time  feedback  for  quantum  measurement  and  control,”  H.  Mabuchi,  Canadian 
Institute  for  Advanced  Research  (CIAR)  Workshop  on  Quantum  Computation,  Toronto, 
Canada  (5/19-20/2001). 

255.  “Real-time  feedback  for  quantum  measurement  and  control,”  H.  Mabuchi,  1st  Summer 
School  and  Conference  on  Spintronics  and  Quantum  Information  (invited),  Maui,  HI 
(5/16/01). 

256.  “Cavity  QED  and  quantum  measurement,”  H.  Mabuchi,  1st  Summer  School  and 
Conference  on  Spintronics  and  Quantum  Information  (invited  tutorial),  Maui,  HI  (5/15/01). 

257.  “Quantum  feedback  and  adaptive  measurement,”  H.  Mabuchi,  Conference  on  Lasers  and 
Electro-Optics/Quantum  Electronics  and  Laser  Science  (CLEO/QELS),  Baltimore,  MD 
(5/10/01). 

258.  “Science  and  Technology  at  the  quantum-classical  interface,”  H.  Mabuchi,  Simon  Fraser 
University  (Physics  Colloquium),  Vancouver,  Canada  (3/16/01). 

259.  “Science  and  Technology  at  the  quantum-classical  interface,”  H.  Mabuchi,  University  of 
British  Columbia  (Physics  Colloquium),  Vancouver,  Canada  (3/15/01). 

260.  “Quantum  networks  based  on  cavity  QED,”  H.  Mabuchi,  NEDO  Workshop  on  Quantum 
Functional  Devices  (invited  guest  lecture),  Tokyo,  Japan  (3/8/01). 

261.  “Adaptive  measurement  of  quantum  phase,”  H.  Mabuchi,  Fourth  Annual  Meeting  of  the 
Southwest  Quantum  Information  and  Technology  Network  (invited),  Pasadena,  CA 
(3/4/01). 

262.  “Science  and  Technology  at  the  quantum-classical  interface,”  H.  Mabuchi,  UCSB  (Physics 
Colloquium),  Santa  Barbara,  CA  (1/23/01). 

263.  “Quantum  networks  based  on  cavity  QED,”  H.  Mabuchi,  First  International  Conference  on 
Experimental  Implementation  of  Quantum  Computation  (invited),  Sydney,  Australia 
(1/19/01). 

264.  “Robust  Control  in  the  Quantum  Domain,”  A.  C.  Doherty,  39th  IEEE  Conference  on 
Decision  and  Control,  Sydney,  Australia  (12/12-15/2000). 

265.  “Atoms  in  Optical  Cavities,”  H.  J.  Kimble,  The  Quantum  Theory  Centenary,  Berlin, 
Germany  (12/11/00). 

266.  “Real-time  measurement  and  feedback  in  quantum  systems,”  H.  Mabuchi,  Laboratory  for 
Information  and  Decision  Systems  Colloquium,  MIT  (11/28/00).

267.  “What  is  an  observable?”  John  Preskill,  IQI  Workshop  on  quantum  computation  and
information  (11/15/00).

268.  “Cavity  QED  with  Single  Atoms  and  Photons  -  Toward  Deterministic  Control  of  Quantum 
Dynamics,”  H.  J.  Kimble,  JILA/NIST  Colloquium,  Boulder,  CO  (11/8/00).

269.  “Quantum  Measurement,  feedback,  and  nonlinear  dynamics  in  cavity  QED,”  H.  Mabuchi, 
Northern  New  Mexico  Complexity,  Entropy  and  Physics  of  Information  (CEPI)  Seminar, 
Albuquerque,  NM  (11/1/00).

270.  “Photonic  crystal  microcavities  for  strong  coupling  between  an  atom  and  the  cavity  field”, 
J.  Vuckovic,  M.  Loncar,  H.  Mabuchi  and  A.  Scherer,  Proceedings  of  the  LEOS  2000,  pp. 
840-841,  Rio  Grande,  Puerto  Rico  (11/00).

271.  “Modal  analysis  of  waveguides  based  on  a  triangular  photonic  crystal  lattice,”  M.  Loncar,
J.  Vuckovic  and  A.  Scherer,  Proceedings  of  the  LEOS  2000,  Rio  Grande,  Puerto  Rico 
(11/00). 

272.  “Quantum  Teleportation”  and  “Cavity  QED,”  H.  J.  Kimble,  Chaos,  Decoherence  and 
Quantum  Entanglement  (Pan-American  Advanced  Study  Institute),  Ushuaia,  Argentina 
(10/9-20/2000). 

273.  “Quantum  clock  synchronization  and  quantum  error  correction,”  John  Preskill,  NASA- 
D.O.D.  Workshop  on  Quantum  Information  and  Clock  Synchronization  (9/25/00). 

274.  “Cavity  QED  with  Cold  Atoms,”  H.  J.  Kimble,  Mysteries,  Puzzles,  and  Paradoxes  in 
Quantum  Mechanics,  Garda  Lake,  Italy  (9/19/00). 

275.  “Trapping  and  Tracking  Single  Atoms  and  Single  Photons,”  A.  C.  Doherty,  U.S./Japan 
Joint  Seminar  “Coherent  Quantum  Systems”,  Newport,  Rhode  Island  (9/17-21/2000). 

276.  “Real-time  measurement  and  feedback  in  quantum  systems,”  H.  Mabuchi,  Optical  sciences 
colloquium,  University  of  Arizona,  Tucson,  AZ  (9/14/00). 

277.  “Cavity  QED  with  Trapped  Atoms,”  H.  J.  Kimble,  European  Conference  on  Lasers  and 
Electro-Optics/International  Quantum  Electronics  Conference  2000  (CLEO-Europe/IQEC
2000),  Nice  Acropolis,  France  (9/14/00). 

278.  “Information  and  the  quantum-classical  transition,”  H.  Mabuchi,  Physics  colloquium,
Harvey  Mudd  College,  Pomona,  CA  (9/12/00). 

279.  “Quantum  networks  based  on  cavity  QED,”  H.  Mabuchi,  Canadian  Institute  for  Advanced 
Research  Workshop  on  Quantum  Computation,  Calgary,  Canada  (9/1-5/2000). 

280.  “Quantum  error  correction  and  fault  tolerance,”  John  Preskill,  Quantum  Computing 
Symposium  sponsored  by  ARO,  Baltimore,  MD  (8/28/00). 

281.  “The  Quantum  Internet  -  Distributed  Quantum  Networks  for  Computation  &
Communication,”  H.  J.  Kimble,  Quantum  Computing  Symposium  sponsored  by  ARO, 
Baltimore,  MD  (8/28/00). 

282.  “Information  dynamics  in  cavity  QED,”  H.  Mabuchi,  American  Chemical  Society  National 
Meeting,  Washington  D.C.  (8/20-24/2000). 

283.  “Real-time  Tracking  and  Trapping  of  Single  Atoms  in  Cavity  QED,”  H.  J.  Kimble,  XVII 
International  Conference  on  Atomic  Physics  (ICAP),  Florence,  Italy  (6/00). 

284.  “Quantum  Communication  and  Memory  MURI  Kickoff  Meeting,”  H.  Mabuchi,  Program 
review  at  the  D.O.D.  in  Ft.  Monmouth,  NJ  (6/13-14/2000). 

285.  “Encoding  a  qubit  in  an  oscillator,”  John  Preskill,  MURI  Kickoff,  Fort  Monmouth,  NJ 
(6/13-14/2000). 

286.  “Real-Time  Tracking  and  Trapping  of  Single  Atoms  in  Cavity  QED,”  H.  J.  Kimble, 
Institute  of  Physics  and  Astronomy,  Aarhus  University,  Aarhus,  Denmark  (6/10/00).

287.  “Codes  for  continuous  quantum  variables,”  John  Preskill,  Aspen  2000  Workshop  -
Quantum  Information  and  Computation  (6/6/00). 

288.  “Real-time  measurement  and  feedback  in  cavity  QED,”  H.  Mabuchi,  Quantum  information 
sciences  colloquium,  UCSB,  Santa  Barbara,  CA  (6/2/00).

(2)  Scientific  Personnel 

Faculty,  supported  by  this  grant,  who  are  members  of  the  National  Academy:  H.  J. 
Kimble 

Number  of  Graduate  Students  Supported:  33 

Mark  L.  Adams,  Charlene  Ahn,  Michael  Armen,  John  Au,  Paul  Barclay,  David  N.  Barsic, 
Andreea  Boca,  A.  David  Boozer,  Joseph  R.  Buck,  Chin-wen  Chou,  Kok  Win  Goh,  Kovid 
Goyal,  Laurence  C.  Gunn,  Jim  Harrington,  Asa  Hopkins,  Ali  Husain,  Shwetank  Kumar, 
Andrew  Landahl,  Benjamin  Lev,  Marko  Loncar,  Jason  McKeever,  Russell  Miller,  Terrell 
D.  Neal,  Tracy  Northup,  Benjamin  Rahn,  Federico  Spedalieri,  Kartik  Srinivasan,  John 
Stockton,  Ben  Toner,  Ramon  van  Handel,  Jelena  Vuckovic,  Elizabeth  Wilcut  and 
Tomoyuki  Yoshie 

Number  of  Post  Doctorates  Supported:  15 

Takao  Aoki,  Warwick  Bowen,  Eyal  M.  Buks,  Andrew  C.  Doherty,  Luming  Duan,  J.  M. 
Geremia,  Alex  Kuzmich,  Chungsok  Lee,  Debbie  Leung,  Hanns-Christoph  Naegerl, 
Robert  Raussendorf,  Dan  Stamper-Kurn,  Steven  van  Enk,  Jon  Williams  and  Tiancai 
Zhang 

Number  of  Faculty  Supported:  6 

Hideo  Mabuchi,  H.  J.  Kimble,  John  Preskill,  Axel  Scherer,  Oskar  Painter,  Mladen  Barbic

Number  of  PhDs  Awarded:  16

Names  of  personnel  receiving  PhDs:  Mark  L.  Adams,  Charlene  Ahn,  David  N.  Barsic, 
Andreea  Boca,  A.  David  Boozer,  Joseph  R.  Buck,  Jim  Harrington,  Ali  Husain,  Andrew 
Landahl,  Benjamin  Lev,  Marko  Loncar,  Jason  McKeever,  Federico  Spedalieri,  Kartik 
Srinivasan,  Jelena  Vuckovic  and  Tomoyuki  Yoshie. 

Number  of  Undergraduate  Students  Supported:  6 

Wei  Lien  Dang,  Nathan  E.  Flowers-Jacobs,  Arjun  Menon,  Nicolay  M.  Tanushev, 
Chenyang  Wang,  Ernest  C.  Yeung 

Prizes,  Fellowships  and  Awards  - 

H.  J.  Kimble  was  awarded  the  Julius  Edgar  Lilienfeld  Prize  by  the  American  Physical 
Society. 

Hideo  Mabuchi,  Office  of  Naval  Research  Young  Investigator  Award,  2000  -  2003 
John  D.  and  Catherine  T.  MacArthur  Foundation  Fellowship,  awarded  June  2000 
Discover  magazine’s  “20  Scientists  to  Watch  in  the  Next  20  Years”,  October  2000 
Classroom  Teaching  Award  (Graduate  Student  Council,  Caltech)  2000 

(3)  Report  of  Inventions 

Number  of  Patents  Disclosed:  One 
Number  of  Patents  Awarded:  One 

“Photonic  crystal  microcavities  for  strong  coupling  between  an  atom  and  the
cavity  field  and  method  of  fabricating  the  same,”  A.  Scherer,  J.  Vuckovic,
M.  Loncar,  and  H.  Mabuchi  (US  Patent  #6,466,709  B1,  issued  15  October  2002)

(4)  Scientific  Progress  and  Accomplishments

During  the  MURI  performance  period  we  completed  a  diverse  portfolio  of  world-class  research 
projects.  We  made  remarkable  progress  in  all  major  topics  encompassed  by  the  defining  vision 
of  our  “Quantum  Networks”  MURI  Center:  demonstration  of  elementary  quantum  network 
protocols  via  cavity  QED,  development  of  integrated  quantum  nodes  based  on  atom  chips  and 
photonic  bandgap  structures,  and  advancing  quantum  information  theory.  What  follows  is  an 
executive  summary  of  our  main  accomplishments  in  core  areas  of  MURI  research. 

Quantum  network  protocols  via  cavity  QED 

The  results  described  in  this  section  are  mainly  attributed  to  the  group  of  Jeff  Kimble  and  to 
Luming  Duan;  see  our  cumulative  publication  list  for  details. 

A  long-standing  ambition  in  the  field  of  cavity  quantum  electrodynamics  (QED) 
has  been  to  trap  single  atoms  inside  high-Q  cavities  in  a  regime  of  strong  coupling.  Diverse 
avenues  have  been  pursued  for  creating  the  trapping  potential  for  atom  confinement,  including 
additional  far  off-resonant  trapping  beams,  near-resonant  light  with  mean  intracavity  photon 
number  ~1,  and  single  trapped  ions  in  high-finesse  optical  cavities.  A  critical  aspect  of  this
research  is  the  development  of  techniques  for  atom  localization  that  are  compatible  with  strong 
coupling  in  cavity  QED,  as  required  for  the  realization  of  various  quantum  network  protocols. 

Early  in  the  MURI  performance  period  we  reached  a  significant  milestone  in  our  quest  to 
trap  and  cool  single  atoms  in  cavity  QED:  we  achieved  extended  trapping  times  for  single  atoms 
in  a  cavity  while  still  maintaining  strong  coupling,  with  a  trapping  potential  for  the  center-of- 
mass  motion  that  is  largely  independent  of  the  internal  atomic  state,  and  demonstrated  a  scheme 
that  allows  continuous  observation  of  trapped  atoms  by  way  of  the  atom-field  coupling.  More
specifically,  we  recorded  trapping  times  up  to  3  s  for  single  Cs  atoms  stored  in  an  intracavity
far-off-resonance  trap  (FORT),  which  represented  a  100-fold  improvement  beyond  the  sole  previous
realization  of  trapping  in  cavity  QED  (also  by  Kimble’s  group,  in  1999).  We  also  continuously
monitored  trapped  atoms  by  way  of  strong  coupling  to  a  probe  beam,  including  observations  of
trap  loss  atom  by  atom  over  intervals  ~1  s.  These  measurements  incorporated  auxiliary  cooling
beams,  and  provided  the  first  realization  of  cooling  for  trapped  atoms  strongly  coupled  to  a 
cavity.  Our  protocols  are  enabled  by  the  choice  of  a  “magic”  wavelength  for  the  FORT,  for 
which  the  relevant  atomic  levels  are  shifted  almost  equally,  thereby  providing  significant 
advantages  for  coherent  state  manipulation  of  the  atom-cavity  system. 

Also  early  in  the  MURI  period,  we  proposed  a  method  to  implement  quantum  information 
processing  in  high-Q  cavities  with  a  single  trapped  but  non-localized  atom.  Our  method  is  based 
on  adiabatic  passage,  which  makes  the  relevant  dynamics  insensitive  to  the  randomness  of  the 
atom  position  with  an  appropriate  interaction  configuration.  We  validated  our  protocol  with  both 
approximate  analytical  calculations  and  exact  numerical  simulations. 

In  the  middle  years  of  the  MURI  we  exploited  the  new  technical  tools  described  above  in  a 
series  of  important  demonstrations  of  the  practical  utility  of  strong  coupling.  We  realized  a  one- 
atom  laser  in  the  regime  of  strong  coupling,  generated  single  photons 
“on-demand”  from  one  atom  trapped  in  a  cavity,  and  developed  a  protocol  for  real-time 
determination  of  the  number  of  trapped  atoms  in  our  cavity.  We  then  directed  our  attention  to 
the  development  of  a  new  set  of  technical  tools  for  cavity-QED-based  Quantum  Information 
Science.  In  particular  we  developed  non-invasive  methods  for  characterizing  and  tuning  the  local 
environment  of  a  trapped  intracavity  atom,  such  as  background  magnetic  fields,  trapping 
potential  fluctuations,  and  polarization  fluctuations  of  the  FORT  beam.  We  also  developed  “in 
situ”  Doppler  and  blue-detuned  Sisyphus  cooling  methods  for  reducing  the  radial  motion  of  a
trapped  intracavity  atom,  as  well  as  a  laser  system  for  driving  Raman  transitions  between 
hyperfine  ground  states  of  the  atom.  This  latter  capability  is  central  to  many  proposed  schemes 
for  quantum  information  processing  in  cavity  QED. 

In  the  final  years  of  our  MURI  effort  we  achieved  two  more  major  advances:  the
first  demonstration  of  a  complete  protocol  based  on  an  individual  intracavity  trapped  atom  and  a
demonstration  of  cavity-mediated  strong  coupling  between  flying  qubits.  The  Kimble  research
group  reported  observations  of  the  vacuum  Rabi  spectrum  for  one  trapped  atom,  which  marks  a 
major  milestone  for  cavity  QED.  Previous  experiments  on  the  vacuum  Rabi  spectrum  have 
required  integration  of  signals  collected  from  large  numbers  of  atoms  in  order  to  accumulate 
sufficient  signal-to-noise  ratio,  but  the  new  work  by  Kimble  et  al.  exploits  their  unique  capability 
for  non-invasive  atom  trapping.  Their  experiment  involved  a  sequence  of  trapping,  cooling, 
probing,  and  re-cooling  stages.  It  is  highly  significant  that  individual  trapped  atoms  could  be 
reset  into  a  “data-ready”  state  multiple  times  to  combat  decoherence.  The  Kimble  group  has  also 
recently  reported  the  observation  of  photon  blockade  in  the  transmission  of  light  through  a  cavity 
containing  one  strongly-coupled  atom.  This  effect,  which  is  analogous  to  the  Coulomb  blockade 
effect  in  microelectronics,  is  a  consequence  of  single-photon  optical  nonlinearity,  and  provides  a 
basis  for  devices  such  as  deterministic  single-photon  generators  and  possibly  quantum  logic 
gates  for  flying  qubits. 

Development  of  integrated  quantum  nodes 

The  results  described  in  this  section  are  mainly  attributable  to  the  groups  of  Axel  Scherer,  Oskar
Painter  and  Hideo  Mabuchi. 

During  the  early  years  of  our  MURI  effort,  our  work  in  the  area  of  integrated  quantum
nodes  proceeded  separately  along  two  main  fronts.  The  first  was  the  development  of  magnetic 
microtraps  suitable  for  confining  atoms  in  close  proximity  to  a  dielectric  microcavity;  the  second 
was  work  on  developing  dielectric  microcavities  with  sufficiently  high  Q-factor  for  cavity  QED 
with  resonance  frequencies  near  the  D2  line  of  atomic  Cs.  Much  of  our  early  work  on  magnetic 
microtraps  focused  on  improving  loading  protocols  for  transferring  Cs  atoms  from  macroscopic 
“mirror  MOT”  traps  into  lithographically  defined  microtraps.  This  necessitated  some  detailed 
work  on  the  characterization  of  light-induced  collisions  that  limit  the  transfer  efficiency.  We
found  excellent  agreement  between  our  measurements  and  an  atomic  physics  model  based  on 
values  for  scattering  cross-sections  that  were  previously  measured  by  researchers  interested  in 
Bose-Einstein  Condensation.  These  experimental  results  give  us  confidence  that  we  could  design 
optimal  loading  protocols  to  maximize  the  efficiency  and  phase-space  density  of  atom  transfer 
into  magnetic  microtraps.  Early  on  we  also  investigated  the  possibility  of  using  lithographically- 
patterned  permanent  magnets  for  atom  trapping,  which  has  great  appeal  because  such  traps 
would  not  need  to  dissipate  significant  heat  while  operating.  We  specifically  used  electron  beam 
lithography  to  pattern  the  surface  of  a  commercial  hard  disk  platter,  and  demonstrated  (utilizing  a 
mirror  MOT)  that  cold  atoms  bounce  off  of  its  magnetic  field  pattern  without  detectable 
perturbations.  We  later  went  on  to  propose  a  concrete  scheme  for  combining  such  an  atom  mirror 
with  metallic  pads  to  create  stable  magnetoelectrostatic  microtraps  for  chip-scale  cavity  QED.
We  are  currently  working  to  fabricate  and  demonstrate  such  microtraps.

Our  early  work  on  dielectric  microcavities  focused  on  defect  resonators  within  InGaAs 
photonic  crystals,  even  though  this  material  was  known  to  be  unsuitable  for  work  with  Cs  atoms, 
mainly  because  it  was  a  known  material  system  in  which  to  tackle  the  primary  question  of 
whether  the  Q-factors  of  photonic  crystal  cavities  could  ever  be  made  sufficiently  high  to  be  of
interest  for  cavity  QED  with  strong  coupling.  There  were  some  promising  advances  at  the  time, 
but  we  ultimately  had  to  switch  to  a  different  material  (SiNx)  in  order  to  achieve  compatibility 
with  atomic  Cs.  In  addition  to  working  to  raise  the  Q-factors  of  fabricated  microcavities,  we 
explored  the  inclusion  of  quantum  wells  and  quantum  dots  and  created  low-threshold  power 
photonic  crystal  lasers.  This  work  was  eventually  “spun  off”  from  the  MURI  program  and
continues  to  provide  the  basis  for  work  on  photonic-crystal  based  optical  information  processing 
and  for  ultra-sensitive  liquid  phase  sensor  devices. 

In  the  middle  years  of  our  MURI  effort,  we  turned  our  attention  to  formulating  a  proof- 
of-principle  demonstration  of  strong  coupling  between  magnetically  confined  neutral  atoms  and 
a  photonic  bandgap  defect  resonator.  This  has  proven  to  be  a  highly  challenging  undertaking 
which  has  only  now  come  close  to  fruition.  Initially  we  continued  to  pursue  two  main  lines  of 
work:  construction  of  an  appropriate  “atom  chip”  for  loading  and  guiding  of  cold  atoms  to  the 
photonic  bandgap  (PBG)  structure,  and  adaptation  of  the  PBG-resonator  designs  from  InGaAs  to 
GaAs  to  make  them  compatible  (or  so  we  thought)  with  the  resonant  optical  wavelengths  for 
atomic  Cs. 

We  successfully  fabricated  an  atom  chip  design  that  incorporates  all  the  features  necessary  for  an 
initial  experiment  on  coupling  magnetically-guided  atoms  to  a  PBG  cavity.  Small-scale  wires, 
used  in  the  final  stages  of  compression  and  guiding,  were  produced  by  lithographic  patterning  of 
gold on sapphire. Intermediate loading and compression were accomplished using medium-scale
wires  that  we  cut  into  a  copper  clad  Teflon  circuit  board  using  a  computer-controlled  mill.  Initial 
loading  from  a  background  Cs  vapor  required  the  transient  use  of  a  large  copper  wire  underneath 
the  circuit  board.  We  used  this  atom  chip  to  optimize  our  laser  cooling  protocols  for  transferring 
Cs  atoms  from  a  room-temperature  background  vapor  to  magnetostatic  guiding  elements  on  the 
chip,  which  we  planned  to  use  to  deliver  cold  atoms  to  the  vicinity  of  an  on-chip  PBG  cavity. 

On  the  PBG  side  of  our  effort  we  developed  a  novel  configuration  for  coupling  an  optical  fiber- 
taper  to  a  PBG  defect  cavity,  via  a  PBG  waveguide.  Using  such  a  setup,  we  successfully 
demonstrated  laser  coupling  to  and  spectroscopic  characterization  of  a  PBG  defect  cavity 
fabricated  in  InGaAs.  We  found  an  excellent  quality  factor  Q~48,000  and  achieved  an  input- 
output  coupling  efficiency  of  nearly  fifty  percent,  which  would  be  more  than  sufficient  for 
demonstration  experiments  with  magnetically  guided  atoms. 

In  order  to  transfer  these  results  to  a  material  system  suitable  for  cavity  QED  with  Cs  atoms, 
we  first  spent  over  a  year  attempting  to  achieve  comparably  high  Q-factors  with  AlGaAs 
photonic  bandgap  structures.  We  ultimately  realized  that  there  were  oxidation  and  impurity 
issues  endemic  to  AlGaAs,  such  that  it  would  never  be  a  good  material  to  use.  Fortunately,  we 
finally  found  the  right  material  system  in  SiNx  (Silicon  Nitride),  although  we  had  to  take  a  bit  of 
a  step  backward  to  using  microdisk  resonators  rather  than  PBG  defect  resonators  because  the 
complex  etching  protocols  required  for  fabricating  the  latter  geometry  had  not  yet  been 
optimized in SiNx. Our focus then turned to the task of functionally integrating our atom chip with
the  microcavity  and  optical-fiber  coupling  structures.  Doing  this  has  been  far  more  challenging 
than  we  at  first  could  appreciate,  largely  because  our  previous  work  on  the  photonic  bandgap 
tapered-fiber  couplers  had  not  been  done  under  vacuum.  We  first  successfully  debugged  several 
major  problems  involving  thermal  management  in  the  fiber  tapers  and  mechanical  stability  of  the 
couplers,  developing  along  the  way  a  novel  “micro-joint”  technique  for  rigidly  and  permanently 
attaching  the  fiber  taper  in  an  optimally  coupled  position.  We  are  now  using  this  technique  to 
achieve simultaneous coupling to arrays of ~10 microdisk resonators on a single chip, which we
believe will in the long run facilitate the creation of multi-cavity quantum nodes, and in the short
run  help  to  alleviate  an  unfortunate  Cs-exposure  issue  that  we  recently  discovered.  During  our 
first  attempt  to  run  a  complete  experiment  on  coupling  chip-guided  atoms  to  a  semiconductor 
microcavity, we found that exposure of the cavity to background Cs vapor resulted in a gradual but
strong  shift  of  its  resonance  frequency.  This  shift  seems  to  saturate  at  a  magnitude  of  about  0.2 
nm  and  is  not  accompanied  by  any  degradation  of  the  microcavity  Q-factor.  This  is  highly 
inconvenient  because  it  takes  the  microcavity  out  of  our  achievable  tuning  window  for  coupling 
to  the  Cs  D2  line,  and  prevented  our  first  attempt  at  the  experiment  from  succeeding.  At  this 
point  in  time  we  are  in  the  final  stages  of  preparing  a  second  attempt,  in  which  we  will  anticipate 
this  Cs-exposure  shift  when  we  trim  the  initial  resonance  frequencies  of  our  SiNx  microdisks, 
and  in  which  we  will  employ  an  array  of  ten  microdisks  (as  described  above)  with  a  range  of 
resonance  frequencies  just  in  case  the  saturation  shift  is  different  this  time  around.  We  hope  to 
be  able  to  report  a  successful  demonstration  of  chip-scale  cavity  QED  within  the  next  few 
months. 

Advancing  quantum  information  theory 

The  results  presented  in  this  section  are  mainly  attributable  to  John  Preskill  and  his  group,  MURI 
visitors,  Rob  Raussendorf,  and  Andrew  Doherty. 

Our  early  theoretical  activities  focused  on  the  accuracy  threshold  for  fault-tolerant  quantum 
computing  using  topological  codes  and  the  security  of  quantum  key  distribution  with  imperfect 
sources  and  detectors.  Topological  codes  are  well  suited  for  robust  storage  and  processing  of 
quantum  information  because  the  code's  check  operators  can  be  efficiently  measured  with  local 
quantum  gates.  We  studied  the  order-disorder  transition  for  these  codes,  which  corresponds  to 
the  accuracy  threshold.  It  was  shown  that  encoded  quantum  states  are  arbitrarily  well  protected 
in  the  limit  of  a  large  code  block  provided  that  the  probability  of  error  in  each  syndrome  bit 
measurement  is  below  three  percent. 

If  implemented  with  perfect  equipment,  quantum  key  distribution  is  provably  secure  against 
arbitrary  eavesdropping  attacks.  But  how  do  flaws  in  the  source  or  detector  affect  security?  We 
proved  that  security  is  robust  against  arbitrary  flaws  in  the  source  as  long  as  the  detector  is 
perfect  and  the  source  does  not  leak  to  the  eavesdropper  any  information  about  what  basis  is  used 
in  the  protocol.  The  proof,  which  uses  a  new  and  remarkably  simple  method,  also  applies  to  the 
case  where  the  source  is  perfect  and  the  detector  has  arbitrary  flaws  (a  case  treated  earlier  by 
Mayers).  We  showed  that  security  is  robust  against  flaws  (such  as  the  emission  of  weak 
coherent states instead of single-photon states) that reveal a little bit of information about the
basis. 

We  subsequently  developed  a  new  method  for  analyzing  security,  in  which  the  classical  coin 
flips  that  determine  the  basis  in  which  the  signals  are  sent  and  detected  are  treated  quantumly. 

The  advantage  of  this  new  viewpoint  is  that  the  basis  dependence  of  flaws  in  the  equipment  can 
be  characterized  according  to  how  much  the  adversary's  attack  disturbs  the  coins.  Using  the  new 
method,  unconditional  security  can  be  proven  for  generic  small  flaws  in  the  source  and  detector. 
In  related  work,  we  also  studied  the  impact  of  local  conservation  laws  (superselection  rules)  on 
quantum  games.  We  say  that  a  game  is  secure  if  a  cheater  who  breaks  the  rules  is  unable  to  alter 
the  outcome  of  the  game.  Naively,  it  seems  that  in  an  invariant  world  subject  to  a  superselection 
rule,  a  cheater  would  have  less  power  than  in  the  unrestricted  world,  not  subject  to  a 
superselection  rule.  But  on  the  contrary,  we  showed  that  any  cheating  strategy  in  the  unrestricted 
world can be accurately simulated in the invariant world. By explaining how the physics of the
invariant world (subject to the conservation law) can be simulated in the unrestricted world and
vice versa, we clarified the physical implications of superselection rules, which have a central
role in modern quantum field theory. We also analyzed how quantum protocols can be
composed  without  compromising  security.  We  showed  in  particular  that  classical  authentication 
can  be  securely  composed  with  quantum  key  distribution.  This  result  means  that  a  key  generated 
in  a  round  of  key  distribution  can  be  used  safely  to  authenticate  further  rounds.  Continuing  work 
now  focuses  on  formulating  more  flexible  definitions  of  security  that  will  allow  composability 
theory  to  be  applied  to  a  broader  family  of  quantum  protocols. 

More  recently,  we  have  shown  that  any  strong  quantum  coin  flipping  protocol  is  susceptible 
to same-sided bias: by cheating, one player or the other can force the coin to come up heads
with probability ~0.707. Student Carlos Mochon found a weak coin tossing protocol in which the
cheater  cannot  force  a  win  with  probability  above  0.692;  he  also  found  improved  bounds  on  the 
cheat  sensitivity  of  quantum  bit  commitment  protocols.  In  work  currently  in  progress,  the 
methods  used  to  prove  these  results  are  being  extended  to  a  general  theory  of  two-player 
quantum games. It is hoped that this theory will resolve the long-standing question of whether there
is  a  weak  quantum  coin  tossing  protocol  with  arbitrarily  small  bias.  Student  Ben  Toner,  together 
with  visiting  scholars  Hoyer,  Cleve,  and  Watrous,  studied  the  power  of  entanglement  in  two- 
party  cooperative  games.  They  showed  that  shared  entanglement  profoundly  alters  the  soundness 
of two-prover interactive proof systems, and formulated generalizations of Tsirelson's inequality,
which  provide  upper  bounds  on  quantum  nonlocality. 
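
As a concrete reference point for the Tsirelson-type bounds mentioned above, the following minimal sketch (Python with NumPy, not taken from the work described here) evaluates the CHSH combination for one standard choice of measurement settings. The operator names and settings are illustrative assumptions. The check shows only that the value 2*sqrt(2) is attained by these settings, compared with the classical limit of 2; it does not by itself prove that no other settings exceed it.

```python
import numpy as np

# Pauli matrices used as +/-1-valued observables (illustrative choice of settings).
X = np.array([[0.0, 1.0], [1.0, 0.0]])
Z = np.array([[1.0, 0.0], [0.0, -1.0]])


def chsh_operator(a0, a1, b0, b1):
    """Return the two-qubit CHSH operator A0(B0 + B1) + A1(B0 - B1)."""
    return (np.kron(a0, b0) + np.kron(a0, b1)
            + np.kron(a1, b0) - np.kron(a1, b1))


def max_chsh_value():
    """Largest eigenvalue of the CHSH operator for one standard set of settings."""
    a0, a1 = Z, X
    b0 = (Z + X) / np.sqrt(2.0)
    b1 = (Z - X) / np.sqrt(2.0)
    return float(np.linalg.eigvalsh(chsh_operator(a0, a1, b0, b1)).max())


if __name__ == "__main__":
    print("quantum CHSH value:", max_chsh_value())
    print("2*sqrt(2):         ", 2.0 * np.sqrt(2.0))
```

The maximal eigenvalue corresponds to a maximally entangled two-qubit state, which is the sense in which shared entanglement changes what two cooperating parties can achieve.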

Robert Raussendorf, Sergey Bravyi, and student Jim Harrington studied a quantum phase
transition  that  occurs  in  a  three-dimensional  cluster  state  subject  to  noise.  Using  topological 
encoding  methods,  they  estimated  the  (nonzero)  critical  “temperature”  at  which  the  entanglement 
length  changes  from  infinite  to  finite.  In  related  work,  Raussendorf  has  developed  protocols  for 
purification  of  cluster  states  that  can  be  used  to  achieve  fault  tolerance  in  the  one-way  quantum 
computer.  Student  Charlene  Ahn  studied  the  blowup  in  circuit  depth  that  occurs  when  an  ideal 
quantum  circuit  is  simulated  using  noisy  quantum  gates.  By  combining  topological  coding  with 
methods  for  studying  the  robustness  of  classical  cellular  automata,  she  showed  that  if  the  ideal 
circuit  has  size  L,  the  blowup  in  depth  can  be  a  factor  of  order  log(log(L));  this  is  a  big 
improvement  over  the  best  previously  known  result  (a  factor  polynomial  in  log(L)).  Student  Jim 
Harrington developed fault-tolerant schemes for the case in which all gates (both quantum and
classical)  are  local  in  space.  He  obtained  analytic  and  numerical  estimates  of  the  accuracy 
threshold  for  this  case,  which  had  never  been  carefully  analyzed  before.  The  analytic  result 
established a critical noise rate of order 10^-9, but the numerical results indicate that the actual error
threshold  is  better  by  many  orders  of  magnitude. 

Andrew  Doherty,  in  joint  work  with  Pablo  Parrilo  and  student  Federico  Spedalieri,  developed 
a new approach to characterizing entanglement using the theory of semi-definite programming
relaxations. This ties the quantum information-theoretic topic of separability testing to a host of
other  NP-hard  problems  such  as  minimization,  number  partitioning  and  set  non-inclusion. 

(5) Technology Transfer: None.

Suppression of Spin Projection Noise in Broadband Atomic Magnetometry


JM  Geremia,*  John  K.  Stockton,  and  Hideo  Mabuchi 
Physics and Control & Dynamical Systems, California Institute of Technology, Pasadena CA 91125

(Dated:  February  15,  2005) 

We  demonstrate  that  quantum  nondemolition  (QND)  measurement,  combined  with  a  suitable 
parameter  estimation  procedure,  can  improve  the  sensitivity  of  a  broadband  atomic  magnetometer 
by reducing uncertainty due to spin projection noise. Furthermore, we provide evidence that
real-time quantum feedback control offers robustness to classical uncertainties, including shot-to-shot
atom  number  fluctuations,  that  would  otherwise  prevent  quantum-limited  performance. 

PACS  numbers:  07.55.Ge,  03.65.Ta,  42.50.Lc,  02.30.Yy 


Atomic magnetometers estimate the magnitude of an external magnetic field by observing
Larmor precession in a spin-polarized atomic sample [1]. A canonical procedure for
“detecting” a magnetic field oriented along the y-axis (in a laboratory-fixed coordinate
system) operates by aligning the magnetic moments of N spin-f atoms along the x-axis.
The resulting polarized atomic state is characterized by its net magnetization,
$\mathbf{F} = \langle \hat{\mathbf{F}} \rangle$, where the quantum operator,
$\hat{\mathbf{F}}$, corresponds to the total angular momentum of the collective atomic system.

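Under the influence of the field, $\mathbf{b} = B\,\hat{\mathbf{y}}$, the atomic
magnetization precesses from its initial value,

$$ d\mathbf{F}(t) = -\gamma\,(\mathbf{F}(t) \times \mathbf{b})\,dt, \qquad \mathbf{F}(0) = \hbar F\,\hat{\mathbf{x}}, \qquad (1) $$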


where F = Nf for N spin-f atoms and the Larmor frequency, $\omega_L = \gamma B$, is
determined by the gyromagnetic ratio, $\gamma$. These dynamics confine the mean spin vector
to the xz-plane such that, in the small-time (and small-decoherence) limit appropriate for
discussing detection thresholds, the z-component of the atomic magnetization
is given by $F_z(t) = \gamma B F t$, $t \ll \omega_L^{-1}$.
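
The small-angle limit quoted above can be checked numerically. The following is a minimal sketch, assuming units in which gamma = B = F = 1 (arbitrary illustrative values) and a fixed-step Runge-Kutta integrator that is not part of the original work. Because the sign of the z-component depends on the sign convention adopted for the gyromagnetic ratio, the comparison is made on magnitudes only.

```python
import numpy as np


def precess(f0, b, gamma, t_final, n_steps=2000):
    """Integrate dF/dt = -gamma * (F x b) with a fixed-step RK4 scheme."""
    f = np.array(f0, dtype=float)
    b = np.array(b, dtype=float)
    dt = t_final / n_steps

    def rhs(v):
        return -gamma * np.cross(v, b)

    for _ in range(n_steps):
        k1 = rhs(f)
        k2 = rhs(f + 0.5 * dt * k1)
        k3 = rhs(f + 0.5 * dt * k2)
        k4 = rhs(f + dt * k3)
        f = f + dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return f


if __name__ == "__main__":
    gamma, field, length = 1.0, 1.0, 1.0       # illustrative units
    for t in (0.01, 0.05, 0.2):
        f = precess([length, 0.0, 0.0], [0.0, field, 0.0], gamma, t)
        linear = gamma * field * length * t    # small-angle prediction for |Fz|
        print(f"t = {t:4.2f}: |Fz| = {abs(f[2]):.6f}, gamma*B*F*t = {linear:.6f}")
```

The spin stays in the xz-plane throughout, and the linear approximation degrades only as the precession angle approaches order one.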

The magnetic field can thus be inferred from the slope
of $F_z$ during its small-angle Larmor precession,

$$ B = \frac{1}{\gamma F}\,\frac{dF_z(t)}{dt}, \qquad 0 \le t \le \tau. \qquad (2) $$


Uncertainty in the field estimation, $\Delta\tilde{B}$, results from various sources of
error that can be divided into three classes: (1) spin projection noise [2], or quantum
uncertainty in the initial orientation of F due to noncommutativity of the quantum
operators, $\hat{F}_x$, $\hat{F}_y$ and $\hat{F}_z$, (2) finite signal to noise in the
physical measurement used to determine $F_z$, and (3) classical parameter uncertainties
in Eq. (2), namely fluctuations in F that arise from shot-to-shot variance in the atom
number, N.

Here we demonstrate that, given a quantum nondemolition (QND) measurement of $F_z$ with a
finite signal to noise ratio, degradation of the field sensitivity due to projection noise
can be minimized by an estimation procedure [3, 4] that exploits the spin-squeezing
produced by the QND measurement [5-7]. However, we find that the simplest procedure for
suppressing spin projection noise is susceptible to classical parameter uncertainty.
Incorporating real-time quantum feedback control into the estimation procedure alleviates
this source of error.

We consider a QND measurement of $F_z$ performed by quantum-limited detection of an
optical field scattered by the atomic system [7, 8]. Such a measurement is described by
the continuous photocurrent,

$$ y(t) = \sqrt{M}\,F_z(t) + \zeta(t), \qquad (3) $$

where the $\zeta(t)$ are Gaussian stochastic increments that reflect detection (optical)
noise. The measurement strength, M, relates the mean value of the photocurrent
to the z-component of the collective atomic spin [7].

Our procedure [3] based on quantum Kalman filtering [9, 10] estimates the magnetic field
from the average slope, $y'(\tau)$, obtained by regressing the QND photocurrent over the
interval $0 \le t \le \tau$. Fig. 1 schematically illustrates this procedure. Beginning
from the spin-polarized state at t = 0, the measurement reveals both the slope due to
(small angle) Larmor precession and an offset due to the initial uncertainty in the
orientation of F. This spin projection noise offset is randomly distributed with variance
$\Delta F_z^2(0) \approx F/2$ in an ensemble of measurement trajectories, according to the
Heisenberg-Robertson relation,
$\Delta F_y^2\,\Delta F_z^2 \ge \tfrac{1}{4}\hbar^2 |\langle \hat{F}_x \rangle|^2$.
Freedom to absorb the non-zero value of $F_z(0)$ into the regression intercept rather than
the slope minimizes the impact of the quantum projection noise on the estimated field,
$\tilde{B}$.
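
The projection-noise variance of about F/2 and the saturation of the Heisenberg-Robertson relation can be verified directly for a small spin. The sketch below is an illustration only: it assumes units with hbar = 1, an ideal coherent spin state polarized along x, and an arbitrary integer spin j = 8 standing in for the (much larger) collective spin F.

```python
import numpy as np


def spin_matrices(j):
    """Return (Jx, Jy, Jz) for integer spin j in the basis m = j, ..., -j (hbar = 1)."""
    m = np.arange(j, -j - 1, -1, dtype=float)
    jz = np.diag(m)
    # <m+1| J+ |m> = sqrt(j(j+1) - m(m+1)), placed on the superdiagonal.
    jp = np.diag(np.sqrt(j * (j + 1) - m[1:] * (m[1:] + 1)), k=1)
    jx = (jp + jp.T) / 2.0
    jy = (jp - jp.T) / (2.0 * 1j)
    return jx, jy, jz


def coherent_state_variances(j):
    """Return (<Jx>, var(Jy), var(Jz)) for the state fully polarized along +x."""
    jx, jy, jz = spin_matrices(j)
    _, vecs = np.linalg.eigh(jx)
    psi = vecs[:, -1]                      # eigenvector with eigenvalue +j

    def expect(op):
        return float(np.real(np.vdot(psi, op @ psi)))

    var_y = expect(jy @ jy) - expect(jy) ** 2
    var_z = expect(jz @ jz) - expect(jz) ** 2
    return expect(jx), var_y, var_z


if __name__ == "__main__":
    mean_x, var_y, var_z = coherent_state_variances(8)
    print("var(Jz) =", var_z, " j/2 =", 8 / 2)
    print("var(Jy)*var(Jz) =", var_y * var_z, " |<Jx>|^2/4 =", mean_x ** 2 / 4)
```

For this state the two transverse variances are equal and their product sits exactly on the uncertainty bound, which is why a coherent spin state serves as the reference against which squeezing is judged.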


FIG.  1:  Atomic  magnetometry  based  on  continuous  QND 
measurement  and  quantum  filtering  enables  field  estimation 
procedures  that  suppress  projection  noise  of  the  initial  atomic 
state (simulated data).

FIG. 2: Schematic of our apparatus for broadband atomic
magnetometry based on continuous QND measurement and
real-time quantum feedback control.


Our single-shot magnetic field estimate is given by

$$ \tilde{B}(\tau) = \frac{y'_\tau}{\gamma F \sqrt{M}}, \qquad \tau \ll \omega_L^{-1}, \qquad (4) $$

where $y'_\tau$ is the photocurrent slope obtained by linear
regression of y(t) over the time interval, $0 \le t \le \tau$. In
principle, field uncertainty is limited only by statistical
regression error [3],

where $\Delta\zeta^2 = E\left[\int \zeta(t)\,dt\right]^2$ is the integrated noise
variance in a $1/\tau$ bandwidth. The unitless QND signal to noise ratio,
$\mathrm{SNR} = \sqrt{M/\Delta\zeta^2}$ (both M and $\Delta\zeta^2$ have units
proportional to frequency), is determined by experimental parameters, such as the optical
probe power and detuning, and the scattering interaction strength [7].

The uncertainty of our optimal estimator, Eq. (5), should be compared to that of a
procedure which cannot distinguish between Larmor precession and the initial
spin projection noise. Such is the case for steady-state
atomic magnetometers [11, 12] where the uncertainty,


=  ^7^10)  +  ^,  (6) 

retains a contribution from both $\Delta F_z^2(0) \approx F/2$ and the optical shotnoise.
In the limit of infinite signal to noise this expression saturates to the so-called
shotnoise magnetometry limit [11]. Eq. (6) corresponds to an estimation procedure that
averages the photocurrent,

B{T)  =  ^hnLvmdt'  (7) 


rather  than  determining  its  slope.  It  is  readily  shown  that 
steady-state  atomic  magnetometers  operate  in  a  manner 
logically  equivalent  to  this  type  of  direct  averaging. 
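
The difference between a slope-based estimate in the spirit of Eq. (4) and a direct-averaging estimate can be illustrated with a toy simulation. This is a sketch under stated assumptions, not the authors' analysis: the photocurrent is modeled as a ramp plus a random constant offset plus white noise, the symbol `k` stands in for the product of gyromagnetic ratio, spin length, and root measurement strength, the averaging estimator is normalized as twice the mean photocurrent divided by `k * tau` (an assumed normalization chosen so that it is unbiased for a pure ramp), and all numerical values are arbitrary.

```python
import numpy as np


def simulate_shots(b_true, k, tau, n_steps, offset_std, noise_std, n_shots, rng):
    """Toy photocurrents y(t) = k*B*t + offset + noise, one row per shot.

    The random offset models the projection-noise term sqrt(M)*Fz(0).
    """
    t = np.linspace(0.0, tau, n_steps)
    offset = rng.normal(0.0, offset_std, size=(n_shots, 1))
    noise = rng.normal(0.0, noise_std, size=(n_shots, n_steps))
    return t, k * b_true * t + offset + noise


def slope_estimates(t, y, k):
    """Field estimates from a linear fit; the offset is absorbed by the intercept."""
    slope, _intercept = np.polyfit(t, y.T, 1)   # one fit per column (per shot)
    return slope / k


def averaging_estimates(t, y, k):
    """Field estimates from the mean photocurrent (mean of k*B*t is k*B*tau/2)."""
    return 2.0 * y.mean(axis=1) / (k * t[-1])


def rms_errors(b_true=0.5, k=1.0, tau=1.0, n_steps=200, offset_std=1.0,
               noise_std=1.0, n_shots=500, seed=0):
    """Return (rms error of slope estimator, rms error of averaging estimator)."""
    rng = np.random.default_rng(seed)
    t, y = simulate_shots(b_true, k, tau, n_steps, offset_std, noise_std, n_shots, rng)

    def rms(est):
        return float(np.sqrt(np.mean((est - b_true) ** 2)))

    return rms(slope_estimates(t, y, k)), rms(averaging_estimates(t, y, k))


if __name__ == "__main__":
    slope_rms, avg_rms = rms_errors()
    print(f"slope estimator rms error:     {slope_rms:.3f}")
    print(f"averaging estimator rms error: {avg_rms:.3f}")
```

In this model the slope estimator is unaffected by the size of the offset, while the averaging estimator inherits it in full, which mirrors the distinction drawn in the text between regression and direct averaging.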

Our estimation procedure, which suppresses projection noise, requires precise knowledge
of the QND measurement sensitivity $F\sqrt{M}$. Shot-to-shot variation in N produces
fluctuations, $\Delta F$, in the length of F that directly propagate into the field
estimation as a proportional error,
$\Delta\tilde{B}_p = y'\,\Delta F/(\gamma F^2 \sqrt{M}) \approx B\,(\Delta F/F)$. A similar
argument applies to M. While relative parameter uncertainties introduce essentially no
error when B = 0, they can completely mask the improved resolution provided
by spin-squeezing when $B \neq 0$.

To reduce the effects of classical parameter uncertainty, our magnetometer is implemented
according to the closed-loop methodology [4] illustrated in Fig. 2. The QND photocurrent,
y(t), drives a precision y-axis magnet in negative feedback configuration to stabilize
$F_z$ to zero [8, 13]. In the presence of an external magnetic field, the controller
imposes a compensating field, $\mathbf{b}_c(t) \approx -B(t)\,\hat{\mathbf{y}}$, to prevent
the atomic magnetization from precessing out of the xy-plane. The magnetic field is
estimated from the time-averaged feedback signal,

$$ \tilde{B}(\tau) = -\frac{1}{\tau}\int_0^\tau B_c(t)\,dt, \qquad (8) $$

rather than the photocurrent. Since the magnetometer always operates with
$F_z \approx 0$, the closed-loop estimation
is reasonably immune to atom number fluctuations.
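
Why a feedback-based estimate is insensitive to the scale factor can be seen in a toy model. The sketch below is not the controller used in the experiment: it assumes a simple proportional feedback law, Euler integration, a photocurrent variable `x` standing in for the scaled z-component of the spin with zero initial offset (so projection noise is deliberately left out), a nominal scale factor of 1 known to the estimator, and arbitrary gain and noise values. The true scale factor `k_true` varies from shot to shot to mimic atom number fluctuations.

```python
import numpy as np


def run_shots(b_true, k_true, gain=50.0, tau=1.0, n_steps=1000,
              noise_std=0.1, seed=0):
    """Return (open-loop estimates, closed-loop estimates), one entry per shot.

    Model: dx/dt = k_true * (B + Bc), photocurrent y = x + noise, x(0) = 0.
    The estimator assumes the nominal scale factor k = 1 in both cases.
    """
    rng = np.random.default_rng(seed)
    k_true = np.asarray(k_true, dtype=float)
    dt = tau / n_steps
    t = np.arange(n_steps) * dt

    # Open loop: no feedback; regress the photocurrent and divide by nominal k = 1.
    y_open = (k_true[:, None] * b_true * t
              + rng.normal(0.0, noise_std, (k_true.size, n_steps)))
    open_est = np.polyfit(t, y_open.T, 1)[0]

    # Closed loop: proportional feedback Bc = -gain * y holds x near zero.
    x = np.zeros_like(k_true)
    bc_sum = np.zeros_like(k_true)
    for _ in range(n_steps):
        y = x + rng.normal(0.0, noise_std, k_true.size)
        bc = -gain * y
        x = x + k_true * (b_true + bc) * dt     # Euler step
        bc_sum += bc
    closed_est = -bc_sum / n_steps              # minus the time-averaged feedback field
    return open_est, closed_est


if __name__ == "__main__":
    scale = np.random.default_rng(1).uniform(0.8, 1.2, 200)   # +/-20% scale errors
    open_est, closed_est = run_shots(b_true=1.0, k_true=scale)
    print("open-loop rms error:  ", np.sqrt(np.mean((open_est - 1.0) ** 2)))
    print("closed-loop rms error:", np.sqrt(np.mean((closed_est - 1.0) ** 2)))
```

In the open-loop case the scale error multiplies the field directly, whereas in the closed-loop case the feedback field settles near the negative of the applied field regardless of the scale factor, leaving only a small residual set by the finite loop gain and the noise.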

We have recently demonstrated QND detection and real-time quantum-limited feedback control
with an apparatus similar to that in Fig. 2 [7, 13]. Our spin system is provided by the
$6^2S_{1/2}(F=4)$ ground state hyperfine manifold in $^{133}$Cs. We obtain samples with
$N \sim 10^{10}$ to $10^{11}$ atoms at a temperature of T = 10 μK via dark
spontaneous-force optical trapping. Shot-to-shot fluctuations in N are <20%. Spin
polarization along the x-axis is achieved by optical pumping on the
$6^2S_{1/2}(F=4) \rightarrow 6^2P_{3/2}(F'=4)$ hyperfine transition and
continuous QND measurement of $F_z$ is implemented by balanced polarimetric detection of
a laser detuned from the $4 \rightarrow 5$ transition by $\Delta = 550$ MHz.

Background magnetic fields are continually nulled using a combination of large (1 m)
external three-axis Helmholtz coils and smaller computer controlled trim coils. The
experiment is synchronized with respect to the 60 Hz line cycle, and we estimate the
resulting shot-to-shot field fluctuations in a 100 μs measurement window to be ~850 nG.
Atomic decoherence is <6% over the $\tau$ = 100 μs measurement trajectories we consider
[7]. Further characterization of our state preparation, atom number, transverse spin
relaxation, spin-squeezing, and quantum noise limited feedback performance can be
found in Ref. [13]. A detailed procedure for determining the degree of atomic polarization
and the QND signal to noise ratio can be found in Ref. [7].

FIG. 3: Open-loop magnetic field estimation error, $\tilde{B} - B$, as a function of measurement time for (a) B = 0 and (b) B = 10 μG.
Inset plots show the Faraday polarimeter photocurrent for each QND measurement trajectory. The shaded regions indicate the
single-shot regression uncertainty while the dotted curves reflect the ensemble error measured from 500 trajectories.


We began by operating our magnetometer with feedback disabled in order to characterize the
adverse effects of classical parameter uncertainty. Fig. 3 shows example open-loop field
estimations performed using the procedure in Eq. (4) for two different magnetic fields,
B = 0 and B = 10 μG. When the QND measurement is initiated at t = 0 by opening the probe
laser shutter [refer to Fig. 2] the photocurrent establishes an average offset
[inset of Fig. 3(a)] that is randomly distributed in an
ensemble of similar trajectories. Our ability to observe
this random offset reflects sufficient signal to noise in our
QND measurement to produce squeezing [3, 7, 13].

Since B = 0 in Fig. 3(a), the atoms do not undergo Larmor precession and the slope of y(t)
is, as expected, $y' \approx 0$. As described above, statistical fluctuations due to
optical noise require that this slope be obtained by regression, as filtering the
photocurrent reduces the stationary noise by time-averaging. The single-shot estimation
trajectory for B = 0 computed according to Eq. (4) is depicted by the dark solid line in
Fig. 3(a) while the light shaded region denotes the single-shot field uncertainty,
$\Delta\tilde{B}$, due to statistical error in the linear regression. Values for F and M
needed to evaluate Eq. (4) were obtained from full-scale atomic Larmor precession
according to the procedure detailed in Ref. [7].

FIG. 4: Comparison of the two field estimation procedures,
Eqs. (4) and (7) (circles and triangles, respectively) with B = 0
and B = 10 μG. The dotted line reflects the theoretical
sensitivity limit, Eq. (6), of a magnetometer with the same
signal to noise ratio that does not exploit spin-squeezing.

The dotted lines in Fig. 3(a) indicate the ensemble field variance, computed as
$E[(\tilde{B} - B)^2]^{1/2}$ from 500 QND trajectories, for the B = 0 field estimate. At
long times, this measure of the magnetometer performance saturates to the level of
shot-to-shot background magnetic field fluctuations in our experimental apparatus,
approximately 850 nG. However, prior to saturation, as depicted by the B = 0 curves in
Fig. 4, the regression estimation procedure (circles) outperforms the direct averaging
estimator (triangles) given by Eq. (7). Unlike direct averaging, the regression estimator
suppresses the uncertainty due to initial spin projection noise: the ensemble uncertainty
drops below the field uncertainty threshold given by Eq.
(6) [dotted line in Fig. 4].

It is important to note that the coherent state projection uncertainty (dotted line in
Fig. 4) was computed using an absolute calibration [7] of M, and the average value of F
inferred from full-amplitude Larmor precession measurements. Even though our optically
pumped atomic system did not likely begin from a true minimum-uncertainty state due to
imperfect pumping, sufficient QND spin noise reduction was achieved to allow the
magnetometer to outperform the projection noise uncertainty
corresponding to that of an actual coherent state.

FIG. 5: (a) Closed-loop photocurrent for a B = −300 μG field, (b) real-time feedback field and (c) the resulting closed-loop
field estimation error, $\tilde{B} - B$, as a function of measurement time.

In contrast, the B = 10 μG open loop estimation uncertainty fails to surpass the coherent
state threshold despite a clearly visible photocurrent offset [inset in Fig. 3(b)]
suggesting the presence of spin-squeezing. Evidently, the non-zero slope renders the open
loop estimation susceptible to classical parameter uncertainty in F and M. As such, the
long time estimation uncertainty for B = 10 μG saturates to a level much higher than that
of the ambient magnetic field fluctuations, as in Fig. 4.
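
A short worked calculation makes the scale of this effect concrete. It uses only the proportional-error relation stated in the text (field error roughly equal to the field times the fractional uncertainty in F) together with the quoted 10 μG field, the quoted upper bound of 20% on atom number fluctuations, and the roughly 850 nG ambient floor. The intermediate fractions of 5% and 10% are illustrative assumptions, and the helper name is hypothetical.

```python
AMBIENT_FLOOR_NG = 850.0   # shot-to-shot background fluctuation quoted in the text


def proportional_error_ng(field_ug, fractional_fluctuation):
    """Field error in nG from a relative error in F, using dB ~ B * (dF / F)."""
    return abs(field_ug) * 1000.0 * fractional_fluctuation


if __name__ == "__main__":
    for frac in (0.05, 0.10, 0.20):
        for field_ug in (0.0, 10.0):
            err = proportional_error_ng(field_ug, frac)
            side = "above" if err > AMBIENT_FLOOR_NG else "below"
            print(f"B = {field_ug:4.1f} uG, dF/F = {frac:.2f}: "
                  f"{err:7.1f} nG ({side} the 850 nG floor)")
```

At zero field the proportional error vanishes for any fluctuation level, whereas at 10 μG a fluctuation near the quoted upper bound produces an error of about 2 μG, well above the ambient floor.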

To alleviate the effects of classical parameter uncertainty, we next performed our
closed-loop estimation procedure by enabling the feedback loop for the entire duration of
each QND trajectory. The photocurrent in Fig. 5(a) displays no discernible slope despite
the presence of a B = −300 μG field as the feedback loop drives a cancellation field
[Fig. 5(b)], $B_c$, to maintain $F_z \approx 0$. The closed-loop field estimate, computed
according to Eq. (8) for $0 \le t \le \tau$, is seen to be robust to shot-to-shot
parameter fluctuations: it is evident from Fig. 5(c) that the ensemble uncertainty of the
closed-loop estimator for B = −300 μG (squares) achieves similar performance to the B = 0
open loop estimation (circles). Despite the large magnitude of the estimated field, the
closed-loop procedure is able to outperform the coherent state projection
noise threshold [dotted line in Fig. 5(c)].

It should also be pointed out that in closed loop configuration, where the estimation
uncertainty is due almost entirely to QND detection noise, the ensemble variance is an
overly conservative measure of the magnetometer performance. After all, ambient
fluctuations that produce the 850 nG sensitivity floor in Figs. 4 and 5(c) are real
magnetic fields sensed by the atoms. Where other contributions to the detection threshold
are well-controlled, the single-shot estimation error [Fig. 5(c) stars] more accurately
reflects the magnetometer’s performance. This single-shot closed-loop uncertainty
surpasses the coherent spin state threshold even at long times, in this case
prior to the onset of significant atomic decoherence.

These results highlight what we anticipate will become a central theme in quantum-limited
metrology. Feedback enables a precision measurement to achieve optimal insensitivity to
classical uncertainty without sacrificing resolution [4, 14]. Furthermore, our closed loop
methodology can be immediately extended to detection of non-stationary fields. Such an
approach is likely to be essential for obtaining acceptable performance in various
precision metrological applications including spin resonance measurements, atomic
frequency standards, and matter-wave gravimetry.

Abstract: Feedback control of quantum mechanical systems
must  take  into  account  the  probabilistic  nature  of  quantum 
measurement.  We  formulate  quantum  feedback  control  as  a 
problem  of  stochastic  nonlinear  control  by  considering  separately 
a  quantum  filtering  problem  and  a  state  feedback  control  problem 
for  the  filter.  We  explore  the  use  of  stochastic  Lyapunov  techniques 
for  the  design  of  feedback  controllers  for  quantum  spin  systems 
and  demonstrate  the  possibility  of  stabilizing  one  outcome  of  a 
quantum  measurement  with  unit  probability. 

Index  Terms — Lyapunov  functions,  quantum  filtering,  quantum 
mechanics,  quantum  probability,  stochastic  nonlinear  control. 


I.  Introduction 

It is a basic fact of nature that at small scales, at the level
of atoms and photons, observations are inherently probabilistic,
as described by the theory of quantum mechanics. The
traditional formulation of quantum mechanics is very different,
however, from the way stochastic processes are modeled. The
theory of quantum measurement is notoriously strange in that it
does not allow all quantum observables to be measured simultaneously.
As such there is yet much progress to be made in the
extension of control theory, particularly feedback control, to the
quantum domain.

One approach to quantum feedback control is to circumvent
measurement entirely by directly feeding back the physical
output from the system [1], [2]. In quantum optics, where the
system is observed by coupling it to a mode of the electromagnetic
field, this corresponds to all-optical feedback. Though this
is in many ways an attractive option it is clear that performing
a measurement allows greater flexibility in the control design,
enabling the use of sophisticated in-loop signal processing
and nonoptical feedback actuators. Moreover, it is known that
some quantum states obtained by measurement are not easily
prepared in other ways [3]-[5].

We take a different route to quantum feedback control, where
measurements play a central role. The key to this approach is
that quantum theory, despite its entirely different appearance, is
in fact very closely related to Kolmogorov's classical theory of
probability. The essential departure from classical probability
is the fact that in quantum theory observables need not commute,
which precludes their simultaneous measurement. Kolmogorov's
theory is not equipped to deal with such objects: One
can always obtain a joint probability distribution for random
variables on a probability space, implying that they can be
measured simultaneously. Formalizing these ideas leads naturally to
the rich field of noncommutative or quantum probability [6]-[8].
Classical probability is obtained as a special case if we consider
only commuting observables.

Let us briefly recall the setting of stochastic control theory.
The system dynamics and the observation process are usually
described by stochastic differential equations of the Ito type.
A generic approach to stochastic control [9], [10] separates the
problem into two parts. First one constructs a filter which
propagates our knowledge of the system state given all observations
up to the current time. Then one finds a state feedback law to
control the filtering equation. Stochastic control theory has
traditionally focused on linear systems, where the optimal [linear
quadratic Gaussian (LQG)] control problem can be solved
explicitly.

A  theory  of  quantum  feedback  control  with  measurement 
can  now  be  developed  simply  by  replacing  each  ingredient  of 
stochastic  control  theory  by  its  noncommutative  counterpart. 
In  this  framework,  the  system  and  observations  are  described 
by  quantum  stochastic  differential  equations.  The  next  step  is 
to obtain quantum filtering equations [11]-[14]. Remarkably,
the  filter  is  a  classical  Ito  equation  due  to  the  fact  that  the 
output  signal  of  a  laboratory  measuring  device  is  a  classical 
stochastic  process.  The  remaining  control  problem  now  reduces 
to  a  problem  of  classical  stochastic  nonlinear  control.  As  in 
the  classical  case,  the  optimal  control  problem  can  be  solved 
explicitly  for  quantum  systems  with  linear  dynamics. 

The  field  of  quantum  stochastic  control  was  pioneered  by 
V. P. Belavkin in a remarkable series of papers [11]-[13], [15]
in  which  the  quantum  counterparts  of  nonlinear  filtering  and 
LQG  control  were  developed.  The  advantage  of  the  quantum 
stochastic  approach  is  that  the  details  of  quantum  probability 
and  measurement  are  hidden  in  a  quantum  filtering  equation  and 
we  can  concentrate  our  efforts  on  the  classical  control  problem 
associated  with  this  equation.  Recently  the  quantum  filtering 
problem  was  reconsidered  by  Bouten  et  al.  [14]  and  quantum 
optimal control has received some attention in the physics
literature [16], [17].

The goal of this paper is twofold. We review the basic ingredients
of quantum stochastic control: Quantum probability, filtering,
and the associated geometric structures. We then demonstrate
the use of this framework in a nonlinear control problem.
To this end, we study in detail an example directly related to
our experimental apparatus [4]. As this is not a linear system,
the optimal control problem is intractable and we must resort
to methods of stochastic nonlinear control. We use stochastic
Lyapunov techniques to design stabilizing controllers,
demonstrating the feasibility of such an approach.

We are motivated in studying the quantum control problem
by recent developments in experimental quantum optics [4],
[18]-[20]. Technology has now matured to the point that
state-of-the-art experiments can monitor and manipulate
atomic and optical systems in real time at the quantum limit,
i.e., the sources of extraneous noise are sufficiently suppressed
that essentially all the noise is fundamental in nature. The
experimental implementation of quantum control systems is
thus within reach of current experiments, with important
applications in, e.g., precision metrology [20]-[23] and quantum
computing [24], [25]. Further development of quantum control
theory is an essential step in this direction.

This paper is organized as follows. In Section II, we give an
introduction to quantum probability and sketch a simple derivation
of quantum filtering equations. We also introduce the particular
physical system that we study in the remainder of this paper.
In Section III, we study the dynamical behavior of the filtering
equation and the underlying geometric structures. Finally,
Section IV is devoted to the design of stabilizing controllers using
stochastic Lyapunov methods.

II.  Quantum  Probability  and  Filtering 

The  purpose  of  this  section  is  to  clarify  the  connections 
between  quantum  mechanics  and  classical  probability  theory. 
The  emphasis  is  not  on  rigor  as  we  aim  for  a  brief  but  broad 
overview;  we  refer  to  the  references  for  a  complete  treatment. 

A. Finite-Dimensional Quantum Probability

We begin by reviewing some of the traditional elements of
quantum mechanics (e.g., [26]) with a probabilistic flavor.

An observable of a finite-dimensional quantum system is
represented by a self-adjoint linear operator $X = X^*$ on some
underlying finite-dimensional complex Hilbert space $\mathcal{H}$
($^*$ denotes Hermitian conjugation). Every self-adjoint operator
has a spectral decomposition

$$X = \sum_i \lambda_i P_i, \qquad \lambda_i \in \mathbb{R}, \qquad P_i = P_i^2 = P_i^* \tag{1}$$

where $\lambda_i$ are the eigenvalues of $X$ and $P_i$ are projectors
onto orthogonal eigenspaces in $\mathcal{H}$ such that
$\sum_i P_i = \mathrm{Id}_{\mathcal{H}}$.

If we were to measure $X$ we would obtain one of the values
$\lambda_i$ as the measurement outcome. The $P_i$ represent the events
that can be measured. To complete the picture we still need a
probability measure. This is provided by the density operator $\rho$,
which is a linear operator on $\mathcal{H}$ satisfying

$$\rho = \rho^*, \qquad \mathrm{Tr}\,\rho = 1, \qquad \rho \ge 0. \tag{2}$$

The probability of an event $P_i$ is given by

$$p_i = \mathrm{Tr}[\rho P_i]. \tag{3}$$

We can now easily find the expectation of $X$

$$\langle X \rangle = \sum_i \lambda_i \mathrm{Tr}[\rho P_i] = \mathrm{Tr}[\rho X]. \tag{4}$$

In quantum mechanics $\rho$ is also called the system state.

As in classical probability, it will be useful to formalize these
ideas into a mathematical theory of quantum probability [6]-[8].
The main ingredient of the theory is the quantum probability
space $(\mathcal{A}, \rho)$. Here, $\mathcal{A}$ is a *-algebra, i.e., an
algebra with involution $^*$ of linear operators on $\mathcal{H}$, and
$\rho$ is the associated state. An observable on $(\mathcal{A}, \rho)$ is a
sum of the form (1) with $P_i \in \mathcal{A}$. In the finite-dimensional
case this implies that every observable is a member of $\mathcal{A}$,
but we will see that this need not be the case in infinite dimensions.

$\mathcal{A}$ does not necessarily contain all self-adjoint operators on
$\mathcal{H}$. Of special importance is the case in which $\mathcal{A}$ is a
commutative algebra, i.e., all the elements of $\mathcal{A}$ commute
($[X, Y] = XY - YX = 0$ $\forall X, Y \in \mathcal{A}$). It is easily
verified that there is a one-to-one correspondence (up to isomorphism)
between commutative quantum probability spaces $(\mathcal{A}, \rho)$ and
classical probability spaces $(\Omega, \mathcal{F}, P)$ with
$\mathrm{card}\,\Omega = \dim \mathcal{H}$. As $\mathcal{A}$ is commutative
we may represent all its elements by diagonal matrices; the diagonals
are then interpreted as functions $f : \Omega \to \mathbb{R}$. The
projectors $P_i \in \mathcal{A}$ now correspond to indicator functions
$\chi_{A_i}$ on $\Omega$ and hence define the $\sigma$-algebra
$\mathcal{F} = \sigma\{A_i\}$. Finally, $P$ is defined by
$P[A_i] = \mathrm{Tr}[P_i \rho]$.

Clearly, classical probability is a special case of quantum
probability. However, noncommutative $\mathcal{A}$ are inherent to
quantum mechanical models. Suppose $A$, $B$ are two events
(projectors) that do not commute. Then, $A$ and $B$ cannot
be diagonalized simultaneously, and hence they cannot be
represented as events on a single classical probability space.
Suppose we wish to measure $A$ and $B$ simultaneously, i.e.,
we ask what is the probability of the event ($A$ and $B$)? In
the classical case this would be given by the joint probability
$P[A, B] = P[A \cap B] = E[\chi_A \chi_B]$. However, in
the noncommutative case this expression is ambiguous as
$\mathrm{Tr}[\rho AB] \ne \mathrm{Tr}[\rho BA]$. We conclude that
($A$ and $B$) is an invalid question and its probability is undefined.
In this case, the events $A$ and $B$ are said to be incompatible.
Similarly, two observables on $\mathcal{A}$ can be measured
simultaneously only if they commute.

We conclude this section with the important topic of conditional
expectation. A traditional element of the theory of
quantum measurement is the projection postulate, which can
be stated as follows. Suppose we measure an observable $X$ and
obtain the outcome $\lambda_i$. Then, the measurement causes the state
to collapse

$$\rho|_i = \frac{P_i \rho P_i}{\mathrm{Tr}[\rho P_i]}. \tag{5}$$

Suppose that we measure another observable $X'$ after measuring
$X$. Using (5), we write

$$P[X' = \lambda'_j \,|\, X = \lambda_i] = \mathrm{Tr}[P'_j \rho|_i] = \frac{\mathrm{Tr}[P_i P'_j P_i \rho]}{\mathrm{Tr}[P_i \rho]}. \tag{6}$$

Now, compare to the definition of conditional probability in
classical probability theory

$$P[B \,|\, A] = \frac{P[B \cap A]}{P[A]}, \qquad A, B \in \mathcal{F}. \tag{7}$$

Clearly, (6) and (7) are completely equivalent if $X$, $X'$ commute.
It is now straightforward to define the quantum analog of
conditional expectation

$$\mathcal{E}[X' \,|\, \mathcal{B}] = \sum_i \frac{\mathrm{Tr}[\rho P_i X' P_i]}{\mathrm{Tr}[\rho P_i]}\, P_i. \tag{8}$$

Here, $\mathcal{B}$ is the *-algebra generated by $X$, i.e., it is the
algebra whose smallest projectors are $P_i$. This definition also
coincides with the classical conditional expectation if $X$, $X'$ commute.

We obtain ambiguous results, however, when $X$, $X'$ do not
commute, as then the fundamental property
$\langle \mathcal{E}[X' \,|\, \mathcal{B}] \rangle = \langle X' \rangle$
is generally lost. This implies that if we measure an observable,
but "throw away" the measurement outcome, the expectation
of the observable may change. Clearly this is inconsistent with
the concept of conditional expectation, which only changes the
observer's state of knowledge about the system, but this is not
surprising: noncommuting $X$, $X'$ cannot be measured simultaneously,
so any attempt at statistical inference of $X'$ based on
a measurement of $X$ is likely to be ambiguous. To avoid this
problem we define the conditional expectation only for the case
that $X'$ commutes with every element of $\mathcal{B}$. The measurement
$\mathcal{B}$ is then said to be nondemolition [11] with respect to $X'$.
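The loss of the property $\langle \mathcal{E}[X'|\mathcal{B}] \rangle = \langle X' \rangle$ can be checked numerically on the smallest possible example. The following is a minimal sketch, not part of the original treatment: it assumes a two-level system, takes $X = \sigma_z$ as the measured observable, and compares a noncommuting $X' = \sigma_x$ with a commuting diagonal $X'$. The state `RHO`, the matrix `DIAGONAL`, and all helper names are hypothetical choices made only for illustration.

```python
# Sketch of Eqs. (3), (4) and (8) for an assumed two-level example.

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def expect(rho, x):
    """<X> = Tr[rho X], Eq. (4)."""
    return trace(matmul(rho, x)).real

def cond_expect(rho, projectors, x_prime):
    """E[X'|B] = sum_i Tr[rho P_i X' P_i] / Tr[rho P_i] * P_i, Eq. (8)."""
    n = len(rho)
    out = [[0j] * n for _ in range(n)]
    for p in projectors:
        weight = trace(matmul(rho, p))          # p_i = Tr[rho P_i], Eq. (3)
        if abs(weight) < 1e-12:
            continue                            # zero-probability outcome
        coeff = trace(matmul(rho, matmul(p, matmul(x_prime, p)))) / weight
        for i in range(n):
            for j in range(n):
                out[i][j] += coeff * p[i][j]
    return out

# Projectors of X = sigma_z onto its two eigenspaces.
P_UP = [[1, 0], [0, 0]]
P_DOWN = [[0, 0], [0, 1]]
SIGMA_X = [[0, 1], [1, 0]]       # does not commute with sigma_z
DIAGONAL = [[2, 0], [0, -3]]     # commutes with sigma_z (hypothetical values)

# Hypothetical state: the pure state (|up> + |down>)/sqrt(2).
RHO = [[0.5, 0.5], [0.5, 0.5]]

probabilities = [trace(matmul(RHO, p)).real for p in (P_UP, P_DOWN)]
for name, obs in (("sigma_x", SIGMA_X), ("diagonal", DIAGONAL)):
    before = expect(RHO, obs)
    after = expect(RHO, cond_expect(RHO, [P_UP, P_DOWN], obs))
    print(name, before, after)
```

For this choice of state, the expectation of $\sigma_x$ changes from 1 to 0 under conditioning, whereas the expectation of the commuting observable is unchanged at $-0.5$, which is the behavior the nondemolition requirement is designed to guarantee.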

The essence of the formalism we have outlined is that the
foundation of quantum theory is an extension of classical
probability theory. This point of view lies at the heart of quantum
stochastic control. The traditional formulation of quantum
mechanics can be directly recovered from this formalism. Even the
nondemolition requirement is not a restriction: We will show
that the collapse rule (5) emerges in a quantum filtering theory
that is based entirely on nondemolition measurements.


B.  Infinite-Dimensional  Quantum  Probability 

The theory of the previous section exhibits the main features
of quantum probability, but only allows for finite-state
random variables. A general theory which allows for continuous
random variables is developed along essentially the same
lines where linear algebra, the foundation of finite-dimensional
quantum mechanics, is replaced by functional analysis. We will
only briefly sketch the constructions here; a lucid introduction
to the general theory can be found in [6].

A quantum probability space $(\mathcal{A}, \rho)$ consists of a Von
Neumann algebra $\mathcal{A}$ and a state $\rho$. A Von Neumann algebra is
a *-algebra of bounded linear operators on a complex Hilbert space
$\mathcal{H}$ and $\rho : \mathcal{A} \to \mathbb{C}$ is a linear map such
that $\rho(\mathrm{Id}_{\mathcal{H}}) = 1$, $\rho(A^*A) \ge 0$
$\forall A \in \mathcal{A}$, and $\rho(A^*A) = 0$ iff $A = 0$. We gloss
over additional requirements related to limits of sequences of operators.
It is easily verified that the definition reduces in the
finite-dimensional case to the theory in the previous section, where the
density operator $\rho$ is identified with the map
$X \mapsto \mathrm{Tr}[\rho X]$. We always assume
$\mathrm{Id}_{\mathcal{H}} \in \mathcal{A}$.

As in the finite-dimensional case there is a correspondence
between classical probability spaces and commutative algebras.
Given the classical space $(\Omega, \mathcal{F}, P)$ the associated
quantum probability space is constructed as follows:

$$\mathcal{H} = L^2(\Omega; \mathbb{C}), \qquad \mathcal{A} = L^\infty(\Omega; \mathbb{C}), \qquad \rho : f \mapsto \int_\Omega f \, dP \tag{9}$$

where $\mathcal{A}$ acts on $\mathcal{H}$ by pointwise multiplication.
Conversely, every commutative quantum probability space corresponds to a
classical probability space. This fundamental result in the theory
of operator algebras is known as Gel'fand's theorem.

Observables are represented by linear operators that are
self-adjoint with respect to some dense domain of $\mathcal{H}$. The
spectral decomposition (1) is now replaced by the spectral theorem of
functional analysis, which states that every self-adjoint operator
$X$ can be represented as

$$X = \int_{\mathbb{R}} \lambda \, E(d\lambda), \qquad E : \mathcal{B}_{\mathbb{R}} \to \mathcal{P}(\mathcal{H}). \tag{10}$$

Here, $E$ is the spectral or projection-valued measure associated
to $X$, $\mathcal{P}(\mathcal{H})$ is the set of all projection operators
on $\mathcal{H}$, and $\mathcal{B}_{\mathbb{R}}$ is the Borel
$\sigma$-algebra on $\mathbb{R}$. $X$ is affiliated to $\mathcal{A}$ if
$E(A) \in \mathcal{A}$ $\forall A \in \mathcal{B}_{\mathbb{R}}$, replacing
the concept of measurability in classical probability theory. For $X$
affiliated to $\mathcal{A}$, the probability law and expectation are
given by

$$P[X \in A] = \rho(E(A)), \qquad \langle X \rangle = \int_{\mathbb{R}} \lambda \, \rho(E(d\lambda)). \tag{11}$$

Note that unlike in finite dimensions not all observables
affiliated to $\mathcal{A}$ are elements of $\mathcal{A}$; observables may
be unbounded operators, while $\mathcal{A}$ only contains bounded
operators.

It remains to generalize conditional expectations to the
infinite-dimensional setting, a task that is not entirely
straightforward even in the classical case. Let
$\mathcal{B} \subset \mathcal{A}$ be a commutative Von Neumann subalgebra.
As before, we will only define conditional expectations for observables
that are not demolished by $\mathcal{B}$, i.e., for observables affiliated
to the commutant
$\mathcal{B}' = \{A \in \mathcal{A} : [A, B] = 0 \ \forall B \in \mathcal{B}\}$.

Definition 1: The conditional expectation onto $\mathcal{B}$ is the linear
surjective map $\mathcal{E}[\,\cdot\,|\,\mathcal{B}] : \mathcal{B}' \to \mathcal{B}$
with the following properties, for all $A \in \mathcal{B}'$:

1) $\mathcal{E}[\mathrm{Id}_{\mathcal{H}} \,|\, \mathcal{B}] = \mathrm{Id}_{\mathcal{H}}$;

2) $\mathcal{E}[A \,|\, \mathcal{B}] \ge 0$ if $A \ge 0$;

3) $\mathcal{E}[B_1 A B_2 \,|\, \mathcal{B}] = B_1 \mathcal{E}[A \,|\, \mathcal{B}] B_2$ $\forall B_1, B_2 \in \mathcal{B}$;

4) $\rho(\mathcal{E}[A \,|\, \mathcal{B}]) = \rho(A)$.

The definition extends to any observable $X$ affiliated to $\mathcal{B}'$
by operating $\mathcal{E}[\,\cdot\,|\,\mathcal{B}]$ on the associated
spectral measure.

It is possible to prove (e.g., [14]) that the conditional
expectation exists and is unique.

C. Quantum Stochastic Calculus

Having extended probability theory to the quantum setting,
we now sketch the development of a quantum Ito calculus.

We must first find a quantum analog of the Wiener process.
Denote by $(\Omega, \mathcal{F}, P)$ the canonical Wiener space of a
classical Brownian motion. The analysis in the previous section suggests
that quantum Brownian motion will be represented by a set of
observables on the Hilbert space $\Gamma = L^2(\Omega; \mathbb{C})$.
Define the symmetric Fock space over $L^2(U)$ as

$$\Gamma_s(L^2(U)) = \mathbb{C} \oplus \bigoplus_{n=1}^{\infty} L^2(U; \mathbb{C})^{\odot n}, \qquad U \subset \mathbb{R}_+ \tag{12}$$

where $\odot$ denotes the symmetrized tensor product. It is well
known in stochastic analysis (e.g., [8]) that $\Gamma$ and
$\Gamma_s(L^2(\mathbb{R}_+))$ are isomorphic, as every $L^2$-functional on
$\Omega$ is associated to its Wiener chaos expansion. Now define the
operators

$$A_g k = \sum_{i=1}^{n} k_1 \odot \cdots \odot \hat{k}_i \odot \cdots \odot k_n \int_{\mathbb{R}_+} g^* k_i \, dt, \qquad A_g^* k = g \odot k_1 \odot \cdots \odot k_n \tag{13}$$

where $k = k_1 \odot \cdots \odot k_n$, $g, k_i \in L^2(\mathbb{R}_+)$ and
$\hat{k}_i$ means that the term $i$ is omitted. It is sufficient to
define the operators for such vectors as their linear span $\Gamma_0$
is dense in $\Gamma$. We get

$$[A_g, A_h] = [A_g^*, A_h^*] = 0, \qquad [A_g, A_h^*] = \int_{\mathbb{R}_+} g^* h \, dt \tag{14}$$

and indeed $(v, A_g w) = (A_g^* v, w)$ for $v, w \in \Gamma_0$.

We will construct Wiener processes from $A$ and $A^*$, but first
we must set up the quantum probability space. We take $\mathcal{A}$ to
contain all bounded linear operators on $\Gamma$. To construct $\rho$
consider the vector $\Delta = 1 \oplus 0 \in \Gamma_s(L^2(\mathbb{R}_+))$.
Then

$$\rho : \mathcal{A} \to \mathbb{C}, \qquad \rho(X) = (\Delta, X \Delta). \tag{15}$$

Now, consider the operator $A_g^* + A_g$. Using (14) and the
Baker-Campbell-Hausdorff lemma, we obtain

$$\langle e^{i(A_g^* + A_g)} \rangle = (\Delta, e^{iA_g^*} e^{-\frac{1}{2}\|g\|^2} e^{iA_g} \Delta) = e^{-\frac{1}{2}\|g\|^2} \tag{16}$$

where $\|g\|^2$ is the integral of $|g|^2$ over $\mathbb{R}_+$. However,
the characteristic functional of a classical Wiener process is

$$E\left[e^{i \int_0^\infty g(t) \, dW_t}\right] = e^{-\frac{1}{2} \int_0^\infty g(t)^2 \, dt} \tag{17}$$

where $g$ is a real function. Clearly, $A_g^* + A_g$ is equivalent in law
to a classical Wiener integral, and any $Q_t = A_{g_t}^* + A_{g_t}$ with
$g_t(s) = \chi_{[0,t]}(s) e^{i\varphi}$ is a quantum Wiener process.

It is easy to verify that $[Q_t, Q_s] = 0$ $\forall t, s$. This important
property allows us to represent all $Q_t$, $t \in \mathbb{R}_+$ on a single
classical probability space and, hence, $Q_t$ is entirely equivalent to a
classical Wiener process. Two such processes with different $\varphi$
do not commute, however, and are thus incompatible.

The Fock space (12) has the following factorization property:
for any sequence of times $t_1 < t_2 < \cdots < t_n \in \mathbb{R}_+$

$$\Gamma = \Gamma_{t_1]} \otimes \Gamma_{t_1,t_2} \otimes \Gamma_{t_2,t_3} \otimes \cdots \otimes \Gamma_{t_{n-1},t_n} \otimes \Gamma_{[t_n} \tag{18}$$

with $\Gamma_{s,t} = \Gamma_s(L^2([s,t]))$, $\Gamma_{t]} = \Gamma_{0,t}$, and
$\Gamma_{[t} = \Gamma_{t,\infty}$. Thus, $\Gamma$ can be formally considered
as a continuous tensor product over $\Gamma_s(L^2(\{t\}))$, a construction
often used implicitly in physics literature. A process $S_t$ is called
adapted if $S_t = S_{t]} \otimes \mathrm{Id}$ in
$\Gamma_{t]} \otimes \Gamma_{[t}$ for every $t \in \mathbb{R}_+$. $Q_t$ is
adapted for any $\varphi$.

It is customary to define the standard noises

$$A_t = A_{\chi_{[0,t]}}, \qquad A_t^* = A^*_{\chi_{[0,t]}}, \qquad t \in \mathbb{R}_+. \tag{19}$$

One can now define Ito integrals and calculus with respect to
$A_t$, $A_t^*$ in complete analogy to the classical case. We will only
describe the main results, due to Hudson and Parthasarathy [27],
and refer to [7], [8], and [27] for the full theory.

Let $\mathcal{H}$ be the Hilbert space of the system of interest; we
will assume that $\dim \mathcal{H} < \infty$. Now, let $\mathcal{A}$ be the
set of all bounded operators on $\mathcal{H} \otimes \Gamma$. The state
$\rho = \rho_{\mathcal{H}} \otimes \rho_\Gamma$ is given in terms of some
state $\rho_{\mathcal{H}}$ on $\mathcal{H}$ and $\rho_\Gamma$ as defined in
(15). The Hudson-Parthasarathy equation

$$U_{s,t} = \mathrm{Id} + \int_s^t \left[ L \, dA_\tau^* - L^* \, dA_\tau - \left( iH + \tfrac{1}{2} L^* L \right) d\tau \right] U_{s,\tau} \tag{20}$$

defines the flow $U_{s,t}$ of the noisy dynamics. Here, $L$ and $H$ are
operators of the form $L \otimes \mathrm{Id}$ on $\mathcal{H} \otimes \Gamma$
and $H$ is self-adjoint. It can be shown that $U_{s,t}$ is a unitary
transformation of $\mathcal{H} \otimes \Gamma_{s,t}$ and
$U_{s,t} = U_{\tau,t} U_{s,\tau}$. Given an observable $S$ at time 0, the
flow defines the associated process $S_t = U_{0,t}^* S U_{0,t}$.

Quantum stochastic differential equations are easily manipulated
using the following rules. The expectation of any integral
over $dA_t$ or $dA_t^*$ vanishes. The differentials $dA_t$, $dA_t^*$
commute with any adapted process. Finally, the quantum Ito rules
are $dA_t \, dA_t^* = dt$, $dA_t^2 = (dA_t^*)^2 = dA_t^* \, dA_t = 0$.

Let $X$ be any system observable on $\mathcal{H}$; its time evolution is
given by $j_t(X) = U_{0,t}^* (X \otimes \mathrm{Id}) U_{0,t}$. We easily
obtain

$$dj_t(X) = j_t(\mathcal{L}X) \, dt + j_t([L^*, X]) \, dA_t + j_t([X, L]) \, dA_t^* \tag{21}$$

where $\mathcal{L}X = i[H, X] + L^* X L - (1/2)(L^* L X + X L^* L)$. This
expression is the quantum analog of the classical Ito formula

$$dj_t(f) = j_t(\mathscr{L}f) \, dt + j_t(\Sigma f) \, dW_t \tag{22}$$

where $j_t(f) = f(x_t)$ with $dx_t = b(x_t) \, dt + \sigma(x_t) \, dW_t$,
$\mathscr{L}$ is the infinitesimal generator of $x_t$ and
$\Sigma f = \sigma^i \partial_i f$. Similarly, $\mathcal{L}$ is called the
generator of the quantum diffusion $U_{s,t}$.

In fact, the quantum theory is very similar to the classical
theory of stochastic flows [28], [29] with one notable exception:
the existence of incompatible observables does not allow for a
unique sample path interpretation ($x_t$ in the classical case) of the
underlying system. Hence the dynamics is necessarily expressed
in terms of observables, as in (21).
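The step from the Hudson-Parthasarathy equation (20) to the evolution equation (21) is a product-rule computation with the quantum Ito rules. The following sketch spells it out in differential shorthand; the abbreviations $U = U_{0,t}$ and $X$ for $X \otimes \mathrm{Id}$ are introduced here only for brevity and are not notation from the text.

```latex
\begin{aligned}
dU   &= \bigl[ L\,dA_t^* - L^*\,dA_t - (iH + \tfrac12 L^*L)\,dt \bigr]\,U, \\
dU^* &= U^*\bigl[ L^*\,dA_t - L\,dA_t^* + (iH - \tfrac12 L^*L)\,dt \bigr], \\
dj_t(X) &= dU^*\,X\,U + U^*X\,dU + dU^*\,X\,dU \\
        &= U^*\bigl[ (L^*X - XL^*)\,dA_t + (XL - LX)\,dA_t^* \bigr]U \\
        &\quad + U^*\bigl[ i(HX - XH) - \tfrac12 (L^*LX + XL^*L) + L^*XL \bigr]U\,dt .
\end{aligned}
```

The term $L^* X L \, dt$ arises from $dU^* X \, dU$ through $dA_t \, dA_t^* = dt$; every other second-order product vanishes. Reading off the coefficients of $dt$, $dA_t$, and $dA_t^*$ reproduces the generator and the two commutators in (21).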

D.  Measurements  and  Filtering 

We  now  complete  the  picture  by  introducing  observations  and 
conditioning  the  system  observables  on  the  observed  process. 
The  following  treatment  is  inspired  by  [12]  and  [13]. 

1) Classical Filtering: To set the stage for the quantum filtering
problem we first treat its classical counterpart. Suppose
the system dynamics (22) is observed as $y_t$ with

$$dy_t = j_t(h) \, dt + \kappa \, dV_t \tag{23}$$

for uncorrelated noise $V_t$ with strength $\kappa > 0$. We wish to
calculate the conditional expectation
$\pi_t(f) = E[j_t(f) \,|\, \mathcal{F}_t^y]$.

Recall the classical definition: $E[X \,|\, \mathcal{F}]$ is the
$\mathcal{F}$-measurable random variable such that
$E[E[X \,|\, \mathcal{F}] Y] = E[XY]$ for all $\mathcal{F}$-measurable $Y$.
Suppose $\mathcal{F}$ is generated by some random variable $F$. The
definition suggests that to prove $\hat{X} = E[X \,|\, \mathcal{F}]$
for some $\mathcal{F}$-measurable $\hat{X}$, it should be sufficient to
show that

$$E[\hat{X} e^{F\xi}] = E[X e^{F\xi}] \qquad \forall \xi \in \mathbb{R} \tag{24}$$

i.e., the conditional generating functions coincide.

We will apply this strategy in the continuous case. As $\pi_t(f)$
is an $\mathcal{F}_t^y$-semimartingale we introduce the ansatz

$$d\pi_t(f) = C_t \, dt + D_t \, dy_t \tag{25}$$

with $C_t$, $D_t$ $\mathcal{F}_t^y$-adapted. We will choose $C_t$, $D_t$
such that $E[e_t^g \pi_t(f)] = E[e_t^g j_t(f)]$ for all functions $g$,
where

$$e_t^g = e^{\int_0^t g(s) \, dy_s - \frac{1}{2} \kappa^2 \int_0^t g(s)^2 \, ds}, \qquad de_t^g = g(t) e_t^g \, dy_t. \tag{26}$$

The Ito correction term in the exponent was chosen for convenience
and does not otherwise affect the procedure.

Using Ito's rule and the usual properties of conditional
expectations, we easily obtain

$$\frac{dE[e_t^g j_t(f)]}{dt} = E\left[ e_t^g \pi_t(\mathscr{L}f) + g(t) e_t^g \pi_t(hf) \right] \tag{27}$$

$$\frac{dE[e_t^g \pi_t(f)]}{dt} = E\left[ e_t^g (C_t + \pi_t(h) D_t) + g(t) e_t^g (\kappa^2 D_t + \pi_t(h) \pi_t(f)) \right]. \tag{28}$$

Requiring these expressions to be identical for any $g$ gives

$$d\pi_t(f) = \pi_t(\mathscr{L}f) \, dt + \kappa^{-1} (\pi_t(hf) - \pi_t(h) \pi_t(f)) \, d\overline{W}_t \tag{29}$$

where the innovations process
$d\overline{W}_t = \kappa^{-1} (dy_t - \pi_t(h) \, dt)$
is a Wiener process. Equation (29) is the well-known
Kushner-Stratonovich equation of nonlinear filtering [30],
[31].
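The matching step between (27) and (28) can be made explicit. The sketch below writes $\kappa$ for the observation noise strength in (23) and $\mathscr{L}$ for the generator in (22). It compares the terms that do not multiply $g(t)$ and those that do; since $g$ is arbitrary, both sets must agree separately.

```latex
\begin{aligned}
\text{terms without } g(t): &\quad C_t + \pi_t(h)\,D_t = \pi_t(\mathscr{L}f), \\
\text{terms with } g(t):    &\quad \kappa^2 D_t + \pi_t(h)\,\pi_t(f) = \pi_t(hf), \\
\Longrightarrow\quad D_t &= \kappa^{-2}\bigl(\pi_t(hf) - \pi_t(h)\,\pi_t(f)\bigr), \qquad
C_t = \pi_t(\mathscr{L}f) - \pi_t(h)\,D_t, \\
d\pi_t(f) &= C_t\,dt + D_t\,dy_t
           = \pi_t(\mathscr{L}f)\,dt + D_t\bigl(dy_t - \pi_t(h)\,dt\bigr).
\end{aligned}
```

Writing the bracket in the last line as $\kappa$ times the innovations increment leaves one remaining factor $\kappa^{-1}$ in front of the conditional covariance $\pi_t(hf) - \pi_t(h)\pi_t(f)$, which is the form of the Kushner-Stratonovich equation.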

2) Quantum Filtering: The classical approach generalizes
directly to the quantum case. The main difficulty here is how
to define the observation (23) in a sensible way.

We approach the problem from a physical perspective [32].
The quantum noise represents an electromagnetic field coupled
to the system (e.g., an atom). Unlike classically, where any
observation is in principle admissible, a physical measurement is
performed by placing a detector in the field. Hence, the same
noise that drives the system is used for detection, placing a
physical restriction on the form of the observation.

We will consider the observation
$Y'_t = U_{0,t}^* (A_t^* + A_t) U_{0,t} + \kappa (B_t^* + B_t)$. Here, $B_t$
is a noise uncorrelated from $A_t$ that does not interact with the system
(the Hilbert space is $\mathcal{H} \otimes \Gamma \otimes \Gamma$, etc.).
Physically, we are measuring the field observable $A_t^* + A_t$ after
interaction with the system, corrupted by uncorrelated noise of
strength $\kappa \ge 0$. Using the Ito rule and (20) we get

$$dY'_t = j_t(L^* + L) \, dt + dA_t^* + dA_t + \kappa (dB_t^* + dB_t). \tag{30}$$

It is customary in physics to use a normalized observation $Y_t$
such that $dY_t^2 = dt$. We will use the standard notation

$$dY_t = \sqrt{\eta} \, (j_t(L^* + L) \, dt + dA_t^* + dA_t) + \sqrt{1 - \eta} \, dV_t \tag{31}$$

where $V_t = B_t^* + B_t$ and $\eta = (1 + \kappa^2)^{-1} \in (0, 1]$.
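The normalization behind (31) can be verified in a few lines. The sketch below assumes that $B_t$ obeys the same quantum Ito rules as $A_t$ and that all cross products between the two noises, and all products involving $dt$, vanish at second order.

```latex
\begin{aligned}
(dY'_t)^2 &= (dA_t^* + dA_t)^2 + \kappa^2\,(dB_t^* + dB_t)^2 = (1 + \kappa^2)\,dt, \\
dY_t &= \frac{dY'_t}{\sqrt{1 + \kappa^2}} = \sqrt{\eta}\;dY'_t
        \quad\Longrightarrow\quad (dY_t)^2 = dt, \\
\sqrt{\eta}\,\kappa &= \frac{\kappa}{\sqrt{1 + \kappa^2}} = \sqrt{1 - \eta}.
\end{aligned}
```

The first line uses $dA_t \, dA_t^* = dt$ with the remaining products equal to zero. The last line accounts for the coefficient of $dV_t$ in (31), and $\kappa = 0$ (no added noise) corresponds to $\eta = 1$.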

$Y'_t$ and $Y_t$ satisfy the following two crucial properties.

1) $Y'_t$ is self-nondemolition, i.e., $[Y'_t, Y'_s] = 0$ $\forall s < t$.
To see this, note that
$[Y'_t, Y'_s] = [U_{0,t}^* Q_t U_{0,t}, U_{0,s}^* Q_s U_{0,s}]$
with $Q_t = A_t^* + A_t$. But $U_{s,t}$ is a unitary transformation
of $\mathcal{H} \otimes \Gamma_{s,t}$ and
$Q_s = \mathrm{Id} \otimes Q_{s]} \otimes \mathrm{Id}$ on
$\mathcal{H} \otimes \Gamma_{s]} \otimes \Gamma_{[s}$; thus we get
$U_{s,t}^* Q_s U_{s,t} = Q_s U_{s,t}^* U_{s,t} = Q_s$, so
$U_{0,s}^* Q_s U_{0,s} = U_{0,t}^* Q_s U_{0,t}$. But then
$[Y'_t, Y'_s] = U_{0,t}^* [Q_t, Q_s] U_{0,t} = 0$ as we have already seen
that $Q_t$ is self-nondemolition.

2) $Y'_t$ is nondemolition, i.e., $[j_t(X), Y'_s] = 0$ $\forall s < t$ for
all system observables $X$ on $\mathcal{H}$. The proof is identical
to the proof of the self-nondemolition property.

These properties are essential in any sensible quantum filtering
theory: Self-nondemolition implies that the observation is a
classical stochastic process, whereas nondemolition is required
for the conditional expectations to exist. A general filtering
theory can be developed that allows any such observation [11],
[12]; we will restrict ourselves to our physically motivated $Y_t$.

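We wish to calculate $\pi_t(X) = \mathcal{E}[j_t(X) \,|\, \mathcal{B}_t]$
where $\mathcal{B}_t$ is the algebra generated by $Y_{s \le t}$. Introduce
the ansatz

$$d\pi_t(X) = C_t \, dt + D_t \, dY_t \tag{32}$$

where $C_t$, $D_t$ are affiliated to $\mathcal{B}_t$. Define

$$e_t^g = e^{\int_0^t g(s) \, dY_s - \frac{1}{2} \int_0^t g(s)^2 \, ds}, \qquad de_t^g = g(t) e_t^g \, dY_t. \tag{33}$$

Using the quantum Ito rule and Definition 1, we get

$$\frac{d\langle e_t^g j_t(X) \rangle}{dt} = \left\langle e_t^g \pi_t(\mathcal{L}X) + g(t) e_t^g \pi_t(XL + L^*X) \sqrt{\eta} \right\rangle \tag{34}$$

$$\frac{d\langle e_t^g \pi_t(X) \rangle}{dt} = \left\langle e_t^g (C_t + \pi_t(L^* + L) D_t \sqrt{\eta}) + g(t) e_t^g (D_t + \pi_t(L^* + L) \pi_t(X) \sqrt{\eta}) \right\rangle. \tag{35}$$

Requiring these expressions to be identical for any $g$ gives

$$d\pi_t(X) = \pi_t(\mathcal{L}X) \, dt + \sqrt{\eta} \, (\pi_t(XL + L^*X) - \pi_t(L^* + L) \pi_t(X)) (dY_t - \sqrt{\eta} \, \pi_t(L^* + L) \, dt) \tag{36}$$

which is the quantum analog of (29). It can be shown that the
innovations process $dW_t = dY_t - \sqrt{\eta} \, \pi_t(L^* + L) \, dt$ is a
martingale (e.g., [14]) and, hence, it is a Wiener process by Levy's
classical theorem.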

E.  The  Physical  Model 

Quantum  (or  classical)  probability  does  not  by  itself  describe 
any  particular  physical  system;  it  only  provides  the  mathematical 
framework  in  which  physical  systems  can  be  modeled.  The 
modeling  of  particular  systems  is  largely  the  physicist’s 
task  and  a  detailed  discussion  of  the  issues  involved  is 
beyond  the  scope  of  this  article;  we  limit  ourselves  to  a 
few  general  remarks.  The  main  goal  of  this  section  is  to 
introduce  a  prototypical  quantum  system  which  we  will  use 
in the remainder of this article.

The emergence of quantum models can be justified in different
ways.  The  traditional  approach  involves  “quantization”  of 
classical  mechanical  theories  using  an  empirical  quantization 
rule.  A  more  fundamental  theory  builds  quantum  models  as 
“statistical”  representations  of  mechanical  symmetry  groups 
[33],  [34].  Both  approaches  generally  lead  to  the  same 
theory. 

The model considered in this paper (Fig. 1) is prototypical
for experiments in quantum optics; in fact, it is very similar to
our laboratory apparatus [4]. The system consists of a cloud of
atoms, collectively labeled "spin", interacting with an optical
field (along z) produced by a laser. After interacting with the
system the optical field is detected using a photodetector
configuration known as a homodyne detector. A pair of magnetic
coils (along y) is used as the feedback actuator.

The optical and magnetic fields are configured so they only
interact, to good approximation, with the collective angular
momentum degrees of freedom of all the atoms [35]. Rotational
symmetry implies that observables of angular momentum must
form the rotation Lie algebra $\mathfrak{so}(3)$. If we impose
additionally that the total angular momentum is conserved, then it is
a standard result in quantum mechanics [26] that the angular
momentum observables form an irreducible representation of
$\mathfrak{so}(3)$. Such a system is called a spin.

We take $\mathcal{H}$ to be the spin Hilbert space. Any finite dimension
$2 \le \dim \mathcal{H} < \infty$ supports an irrep of $\mathfrak{so}(3)$;
the choice of $\dim \mathcal{H} = 2j + 1$ depends on the number of atoms
and their properties. We can choose an orthonormal basis
$\{\psi_m \in \mathcal{H} : m = -j, -j+1, \ldots, j\}$ such that the
observables $J_{x,y,z}$ of angular momentum around the x, y, z-axis are
defined by (see footnote 1)

$$J_x \psi_m = c_m \psi_{m+1} + c_{-m} \psi_{m-1}$$
$$J_y \psi_m = -i c_m \psi_{m+1} + i c_{-m} \psi_{m-1}$$
$$J_z \psi_m = m \psi_m \tag{37}$$

with $c_m = (1/2) \sqrt{(j - m)(j + m + 1)}$. It is easily verified
that $J_{x,y,z}$ indeed generate $\mathfrak{so}(3)$, e.g.,
$[J_x, J_y] = i J_z$.

Note that $J_{x,y,z}$ are discrete random variables; the fact
that angular momentum is "quantized," unlike in classical
mechanics, is one of the remarkable predictions of quantum
mechanics that give the theory its name. Another remarkable
nonclassical effect is that $J_{x,y,z}$ are incompatible observables.
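The claim that the spin operators generate $\mathfrak{so}(3)$ can be checked numerically. The sketch below is an illustration only: it builds $J_x$, $J_y$, $J_z$ as matrices in the basis $\psi_j, \psi_{j-1}, \ldots, \psi_{-j}$ with $c_m = (1/2)\sqrt{(j-m)(j+m+1)}$, fixing the phases of $J_y$ so that $[J_x, J_y] = iJ_z$ holds, and measures how far the three cyclic commutation relations are from being satisfied. The function names are hypothetical.

```python
import math

def spin_operators(j):
    """Return (Jx, Jy, Jz) as nested lists in the basis psi_j, ..., psi_{-j}."""
    dim = int(round(2 * j)) + 1
    m_values = [j - k for k in range(dim)]     # column k carries m = j - k

    def c(m):
        return 0.5 * math.sqrt((j - m) * (j + m + 1))

    jx = [[0j] * dim for _ in range(dim)]
    jy = [[0j] * dim for _ in range(dim)]
    jz = [[0j] * dim for _ in range(dim)]
    for col, m in enumerate(m_values):
        jz[col][col] = complex(m)
        if col > 0:                            # psi_{m+1} sits one row up
            jx[col - 1][col] = c(m)
            jy[col - 1][col] = -1j * c(m)
        if col < dim - 1:                      # psi_{m-1} sits one row down
            jx[col + 1][col] = c(-m)
            jy[col + 1][col] = 1j * c(-m)
    return jx, jy, jz

def matmul(a, b):
    n = len(a)
    return [[sum(a[r][k] * b[k][s] for k in range(n)) for s in range(n)]
            for r in range(n)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[r][s] - ba[r][s] for s in range(n)] for r in range(n)]

def so3_defect(j):
    """Largest entry of [Jx,Jy]-iJz, [Jy,Jz]-iJx and [Jz,Jx]-iJy."""
    jx, jy, jz = spin_operators(j)
    worst = 0.0
    for a, b, target in ((jx, jy, jz), (jy, jz, jx), (jz, jx, jy)):
        comm = commutator(a, b)
        for row_comm, row_target in zip(comm, target):
            for x, y in zip(row_comm, row_target):
                worst = max(worst, abs(x - 1j * y))
    return worst

for spin in (0.5, 1.0, 1.5):
    print(spin, so3_defect(spin))
```

The printed defects should be at the level of floating-point rounding for each spin value, and the diagonal of $J_z$ lists the discrete outcomes $m = j, j-1, \ldots, -j$ mentioned above.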

The  noise  in  our  model  and  its  interaction  with  the  atoms 
emerges  naturally  from  quantum  electrodynamics,  the  quantum 
theory  of  light  [36].  Physical  noise  is  not  white;  however, 
as  the  correlation  time  of  the  optical  noise  is  much  shorter 
than  the  time  scale  of  the  spin  dynamics,  a  quantum  analog 
of  the  classical  Wong-Zakai  procedure  [37],  [38]  can  be 
employed  to  approximate  the  dynamics  by  an  equation  of  the 
form (20). In fact, the term $-(1/2)L^*L$ in (20) is precisely
the  Wong-Zakai  correction  term  that  emerges  in  the  white 
noise  limit. 

We now state the details of our model without further physical
justification. The system is described by (20) with $L = \sqrt{M}J_z$
and $H = B(t)J_y$. Here $M > 0$ is the strength of the interaction
between the light and the atoms; it is regulated experimentally
by the optical cavity. $B(t)$ is the applied magnetic field and
serves as the control input. Finally, homodyne detection [32]
provides exactly the measurement² (31), where $\eta$ is determined
by the efficiency of the photodetectors.

¹Angular momentum is given in units of $\hbar \approx 1.055 \times 10^{-34}$ kg m² s⁻¹. To
simplify the notation we always work in units such that $\hbar = 1$.

Fig. 1. Schematic of an experiment for continuous quantum measurement
and control. The spin interacts with an optical mode, which is measured
continuously by homodyne detection. A magnetic field is used for feedback.

In  the  remainder  of  this  paper,  we  will  study  the  spin  system 
of  Fig.  1.  Before  we  devote  ourselves  entirely  to  this  situation, 
however,  we  mention  a  couple  of  other  common  scenarios. 

Often  L  is  not  self-adjoint;  in  this  case,  the  system  can  emit  or 
absorb  energy  through  interaction  with  the  field.  This  situation 
occurs  when  the  optical  frequency  of  the  cavity  field  is  resonant 
with  an  atomic  transition.  In  our  case  the  frequency  is  chosen 
to  be  far  off-resonant;  this  leads  to  self-adjoint  L  after  adiabatic 
elimination of the cavity dynamics (e.g., [16]). The filter dynamics
in this scenario, to be described later, is known as state
reduction.  The  sequence  of  approximations  that  is  used  for  our 
particular  model  is  described  in  [39]. 

Finally,  a  different  detector  configuration  may  be  chosen.  For 
example,  a  drastically  different  observation,  known  as  photon 
counting,  gives  rise  to  a  Poisson  (jump)  process.  We  refer  to 
[32]  for  a  full  account  of  the  quantum  stochastic  approach  to 
observations  in  quantum  optics. 

III.  Geometry  and  Dynamics  of  the  Filter 

In the previous section, we introduced our physical model. A
detailed analysis resulted in the filtering equation (36), where
$\pi_t(X)$ is the best estimate of the observable $X$ given the
observations $Y_{s \le t}$. We will now study this equation in detail.

Note that (36) is driven by the observation $Y_t$, which is a
classical stochastic process. Hence, (36) is entirely equivalent to a
classical Itô equation. This is an important point, as it means that
in the remainder of this article we only need classical stochastic
calculus.

²In practice one measures not $Y_t$ but its formal derivative $I(t) = dY_t/dt$.
As in classical stochastics we prefer to deal mathematically with the integrated
observation $Y_t$ rather than the singular “white noise” photocurrent $I(t)$.

A. The State-Space

We begin by investigating the state space on which the filter
evolves. Clearly (36) defines the time evolution of a map $\pi_t$; we
will show how this map can be represented efficiently.

The map $\pi_t$ associates to every observable $X$ on $\mathcal{H}$ a classical
stochastic process which represents the expectation of $X$
conditioned on the observations up to time $t$. It is easily verified that
$\pi_t$ is linear, identity preserving, and maps positive observables
to positive numbers: in fact, it acts exactly like the expectation
of $X$ with respect to some finite-dimensional state on $\mathcal{H}$. We
will denote this state by $\rho_t$, the conditional density at time $t$,
where by definition $\pi_t(X) = \mathrm{Tr}[\rho_t X]$.

It is straightforward to find an expression for $\rho_t$. We get

$$ d\rho_t = \mathcal{L}^*\rho_t\,dt + \sqrt{\eta}\,\bigl(L\rho_t + \rho_t L^* - \mathrm{Tr}[\rho_t(L + L^*)]\,\rho_t\bigr)\,dW_t \tag{38} $$

with the innovations $dW_t = dY_t - \sqrt{\eta}\,\mathrm{Tr}[\rho_t(L + L^*)]\,dt$ and
the adjoint generator $\mathcal{L}^*\rho = -i[H, \rho] + L\rho L^* - (1/2)(L^*L\rho + \rho L^*L)$.
In physics, this equation is also known as a quantum
trajectory equation or stochastic master equation.

Let $\dim\mathcal{H} = n$; as $n$ is finite, we can represent linear
operators on $\mathcal{H}$ by complex matrices. Thus, (38) is an ordinary,
finite-dimensional Itô equation. We saw in Section II-A that $\rho_t$
is a density matrix, i.e., it belongs to the space

$$ \mathcal{P} = \{\rho \in \mathbb{C}^{n \times n} : \rho = \rho^*,\ \mathrm{Tr}\,\rho = 1,\ \rho \ge 0\}. \tag{39} $$

By construction $\mathcal{P}$ is an invariant set of (38), and forms the
natural state space of the filter.
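To make the structure of (38) concrete, the following sketch evaluates its drift and diffusion terms and takes one Euler-Maruyama step driven by an observation increment. It is an illustration under stated assumptions, not the numerical scheme of the paper: the function names are hypothetical, and a plain Euler-Maruyama step preserves the trace and self-adjointness of $\rho_t$ but not necessarily its positivity.

```python
import numpy as np

def lindblad(rho, H, L):
    """Adjoint generator: -i[H, rho] + L rho L* - (1/2)(L*L rho + rho L*L)."""
    Ld = L.conj().T
    return (-1j * (H @ rho - rho @ H) + L @ rho @ Ld
            - 0.5 * (Ld @ L @ rho + rho @ Ld @ L))

def innovation_gain(rho, L, eta):
    """Diffusion term of (38): sqrt(eta) (L rho + rho L* - Tr[rho (L + L*)] rho)."""
    Ld = L.conj().T
    mean = np.trace(rho @ (L + Ld)).real
    return np.sqrt(eta) * (L @ rho + rho @ Ld - mean * rho)

def filter_step(rho, dY, dt, H, L, eta):
    """One Euler-Maruyama step of (38) driven by the observation increment dY."""
    Ld = L.conj().T
    # innovations increment dW = dY - sqrt(eta) Tr[rho (L + L*)] dt
    dW = dY - np.sqrt(eta) * np.trace(rho @ (L + Ld)).real * dt
    return rho + lindblad(rho, H, L) * dt + innovation_gain(rho, L, eta) * dW
```

Both the drift and the diffusion term are traceless whenever $\mathrm{Tr}\,\rho = 1$, which is why the step keeps the trace equal to one.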

B. Geometry of $\mathcal{P}$

The geometry of $\mathcal{P}$ is rather complicated [40]. To make the
space more manageable we will reparametrize $\mathcal{P}$ so it can be
expressed as a semialgebraic set.

Let us choose the matrix elements $\rho_{ij}$ of $\rho$ as follows. For
$i > j$ set $\rho_{ij} = \lambda_{ij} + i\mu_{ij}$ with $\lambda_{ij}, \mu_{ij} \in \mathbb{R}$. For $i < j$ set
$\rho_{ij} = \rho_{ji}^*$. Finally, choose an integer $k$ between 1 and $n$. For
$i \ne k$ set $\rho_{ii} = \nu_i$, $\nu_i \in \mathbb{R}$, and $\rho_{kk} = 1 - \sum_{i \ne k}\nu_i$. Collect
all $n^2 - 1$ numbers $\lambda_{ij}, \mu_{ij}, \nu_i$ into a vector $\Lambda$. Then, clearly,
the map $h : \Lambda \mapsto \rho$ is an isomorphism between $\mathbb{R}^{n^2-1}$ and
$\{\rho \in \mathbb{C}^{n \times n} : \rho = \rho^*,\ \mathrm{Tr}\,\rho = 1\}$.

It remains to find the subset $K \subset \mathbb{R}^{n^2-1}$ that corresponds to
positive semidefinite matrices. This is nontrivial, however, as it
requires us to express nonnegativity of the eigenvalues of $\rho$ as
constraints on $\rho_{ij}$. The problem was solved by Kimura [40] using
Descartes’ sign rule and the Newton-Girard identities for
symmetric polynomials; we quote the following result.

Proposition 1: Define $k_p(\rho)$, $p = 2, \ldots, n$ recursively by

$$ p\,k_p(\rho) = \sum_{q=1}^{p} (-1)^{q-1}\,\mathrm{Tr}[\rho^q]\,k_{p-q}(\rho) \tag{40} $$

with $k_0 = k_1 = 1$. Define the semialgebraic set

$$ K = \{\Lambda \in \mathbb{R}^{n^2-1} : k_p(h(\Lambda)) \ge 0,\ p = 2, \ldots, n\}. \tag{41} $$

Then, $h$ is an isomorphism between $K$ and $\mathcal{P}$.

Note that $2k_2 = 1 - \mathrm{Tr}[\rho^2] \ge 0$ implies
$\|\Lambda\|^2 = \sum_{i \ne k}\nu_i^2 + \sum_{i>j}(\lambda_{ij}^2 + \mu_{ij}^2) \le \mathrm{Tr}[h(\Lambda)^2] \le 1$.
Hence, $K$ is compact.

We work out explicitly the simplest case $n = 2$ (spin $j = 1/2$).
Set $\rho_{11} = \nu$, $\rho_{22} = 1 - \nu$, $\rho_{21} = \lambda + i\mu = \rho_{12}^*$. Then

$$ K_2 = \{\Lambda = (\lambda, \mu, \nu) \in \mathbb{R}^3 : \lambda^2 + \mu^2 + \nu(\nu - 1) \le 0\}. \tag{42} $$

This is just a solid sphere with radius $1/2$, centered at
$(0, 0, 1/2)$. The case $n = 2$ is deceptively simple, however: it
is the only case with a simple topology [41], [40].

We can also express (38) in terms of $\Lambda$. Specifically, we will
consider the spin system $L = \sqrt{M}J_z$, $H = B(t)J_y$ in the basis
$\psi_{1/2} = (1, 0)$, $\psi_{-1/2} = (0, 1)$ on $\mathbb{C}^2$. We obtain

$$ \begin{aligned}
d\lambda_t &= \bigl(B(t)(\nu_t - \tfrac{1}{2}) - \tfrac{1}{2}M\lambda_t\bigr)\,dt + \sqrt{M\eta}\,\lambda_t(1 - 2\nu_t)\,dW_t \\
d\mu_t &= -\tfrac{1}{2}M\mu_t\,dt + \sqrt{M\eta}\,\mu_t(1 - 2\nu_t)\,dW_t \\
d\nu_t &= -B(t)\lambda_t\,dt - 2\sqrt{M\eta}\,\nu_t(\nu_t - 1)\,dW_t.
\end{aligned} \tag{43} $$

By construction, $K_2$ is an invariant set for this system.

C. Convexity and Pure States

Just like its classical counterpart, the set of densities $\mathcal{P}$ is
convex. We have the following fundamental result.

Proposition 2: The set $\mathcal{P}$ is the convex hull of the set of pure
states $\mathcal{Q} = \{vv^* \in \mathbb{C}^{n \times n} : v \in \mathbb{C}^n,\ \|v\| = 1\} \subset \mathcal{P}$.

Proof: As any $\rho \in \mathcal{P}$ is self-adjoint it can be written as
$\rho = \sum_i \lambda_i v_i v_i^*$, where $v_i$ are orthonormal eigenvectors of $\rho$ and
$\lambda_i$ are the corresponding eigenvalues. However, $\mathrm{Tr}\,\rho = 1$, $\rho \ge 0$
imply that $\sum_i \lambda_i = 1$ and $\lambda_i \in [0, 1]$. Hence $\mathcal{P} \subset \mathrm{conv}\,\mathcal{Q}$.
Conversely, it is easily verified that $\mathrm{conv}\,\mathcal{Q} \subset \mathcal{P}$. ■

Pure states are the extremal elements of $\mathcal{P}$; they represent
quantum states of maximal information. Note that classically
extremal measures are deterministic, i.e., $P[A]$ is either 0 or 1
for any event $A$. This is not the case for pure states $\rho = vv^*$,
however: any event $A = ww^*$ with $0 < \|w^*v\| < 1$, $\|w\| = 1$
will have $0 < \mathrm{Tr}[\rho A] < 1$. Thus, no quantum state is
deterministic, unless we restrict to a commutative algebra $\mathcal{A}$.

Intuitively one would expect that if the output $Y_t$ is not
corrupted by independent noise, i.e., $\eta = 1$, then there is no loss of
information and, hence, an initially pure $\rho_0$ would remain pure
under (38). This is indeed the case. Define

$$ dv_t = \bigl[-iH - \tfrac{1}{2}L^*L + h_t L - \tfrac{1}{2}h_t^2\bigr]v_t\,dt + (L - h_t)\,v_t\,dW_t \tag{44} $$

where $h_t = (1/2)\,v_t^*(L^* + L)v_t$. Then, it is easily verified that
$\rho_t = v_t v_t^*$ obeys (38) with $\eta = 1$. It follows that if $\eta = 1$, $\mathcal{Q}$
is an invariant set of (38). In the concrete example (43) it is not
difficult to verify this property directly: when $\eta = 1$, the sphere
$\lambda^2 + \mu^2 + \nu(\nu - 1) = 0$ is invariant under (43).
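The invariance of the pure-state sphere under (43) can be verified by applying Itô's formula to $f = \lambda^2 + \mu^2 + \nu(\nu - 1)$. The sketch below evaluates the resulting drift and diffusion coefficients of $f$ at points of the sphere; the function names and the spherical-angle parametrization are illustrative choices, not part of the original analysis. On the sphere both coefficients vanish when $\eta = 1$, while for $\eta < 1$ the drift equals $-M(1 - \eta)\nu(1 - \nu) \le 0$, so the state moves into the interior of $K_2$.

```python
import numpy as np

def f_ito_coefficients(lam, mu, nu, B, M, eta):
    """Ito drift and diffusion of f = lam^2 + mu^2 + nu(nu - 1) under (43)."""
    s = np.sqrt(M * eta)
    # drift and diffusion coefficients of (43)
    b = np.array([B * (nu - 0.5) - 0.5 * M * lam, -0.5 * M * mu, -B * lam])
    sig = np.array([s * lam * (1 - 2 * nu), s * mu * (1 - 2 * nu),
                    -2 * s * nu * (nu - 1)])
    grad = np.array([2 * lam, 2 * mu, 2 * nu - 1])
    # the Hessian of f is twice the identity, so the Ito correction is sum(sig^2)
    drift = grad @ b + sig @ sig
    diffusion = grad @ sig
    return drift, diffusion

def point_on_sphere(theta, phi):
    """Point of the sphere of radius 1/2 centered at (0, 0, 1/2)."""
    return (0.5 * np.sin(theta) * np.cos(phi),
            0.5 * np.sin(theta) * np.sin(phi),
            0.5 + 0.5 * np.cos(theta))
```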


D.  Quantum  State  Reduction 

We now study the dynamics of the spin filtering equation
without feedback, $B(t) = 0$. We follow the approach of [42].
Consider the quantity $V_t = \pi_t(J_z^2) - \pi_t(J_z)^2$. We obtain

$$ \frac{d\,\mathbb{E}[V_t]}{dt} = -4M\eta\,\mathbb{E}[V_t^2]. \tag{45} $$

Clearly, $\mathbb{E}[V_t^2] \ge 0$, so $\mathbb{E}[V_t]$ decreases monotonically. But $V_t \ge 0$
and $\mathbb{E}[V_t^2] = 0$ iff $V_t = 0$ a.s. We conclude that

$$ \lim_{t \to \infty} \mathbb{E}[V_t] = 0 \tag{46} $$

and, hence, $V_t \to 0$ a.s. as $t \to \infty$. However, the only states
$\rho \in \mathcal{P}$ with $V = \mathrm{Tr}[J_z^2\rho] - \mathrm{Tr}[J_z\rho]^2 = 0$ are the eigenstates
$\psi_m\psi_m^*$ of $J_z$. Hence, in the long-time limit the conditional state
collapses onto one of the eigenstates of $J_z$, as predicted by (5)
for a “direct” measurement of $J_z$.

With what probability does the state collapse onto eigenstate
$m$? To study this, let us calculate $\pi_t(\psi_m\psi_m^*)$. We get

$$ d\pi_t(\psi_m\psi_m^*) = 2\sqrt{M\eta}\,\pi_t(\psi_m\psi_m^*)\,\bigl(m - \pi_t(J_z)\bigr)\,dW_t. \tag{47} $$

Clearly, $\pi_t(\psi_m\psi_m^*)$ is a martingale, so

$$ p_m = \mathbb{E}[\pi_\infty(\psi_m\psi_m^*)] = \pi_0(\psi_m\psi_m^*). \tag{48} $$

We have already shown that $\rho_\infty$ is one of $\psi_n\psi_n^*$, and as the
$\psi_n$ are orthonormal this implies that $\pi_\infty(\psi_m\psi_m^*) = \mathrm{Tr}[\rho_\infty\psi_m\psi_m^*]$
is 1 if $n = m$ and 0 otherwise. Thus, $p_m$ is just the probability
of collapsing onto the eigenstate $m$. However, note that
$\pi_0(\psi_m\psi_m^*) = \mathrm{Tr}[\rho_0\psi_m\psi_m^*]$, so (48) gives exactly the same
collapse probability as the “direct” measurement (3).
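Relations (45) and (47) follow from (38) by Itô's formula, and both can be checked pointwise for a given conditional state. The sketch below computes the drift and diffusion coefficients of $\pi_t(X) = \mathrm{Tr}[\rho_t X]$ under (38) with $H = 0$ and, from them, the drift of $V_t$; the function names are hypothetical. For $L = \sqrt{M}J_z$ the drift of $V_t$ should equal $-4M\eta V_t^2$, and the drift of $\pi_t(\psi_m\psi_m^*)$ should vanish.

```python
import numpy as np

def conditional_increments(rho, X, L, eta):
    """Return (drift, diffusion) of pi_t(X) = Tr[rho_t X] under (38) with H = 0."""
    Ld = L.conj().T
    drift_rho = L @ rho @ Ld - 0.5 * (Ld @ L @ rho + rho @ Ld @ L)
    mean = np.trace(rho @ (L + Ld)).real
    diff_rho = np.sqrt(eta) * (L @ rho + rho @ Ld - mean * rho)
    return np.trace(drift_rho @ X).real, np.trace(diff_rho @ X).real

def variance_drift(rho, Jz, M, eta):
    """Ito drift of V_t = pi_t(Jz^2) - pi_t(Jz)^2 for L = sqrt(M) Jz."""
    L = np.sqrt(M) * Jz
    a2, _ = conditional_increments(rho, Jz @ Jz, L, eta)
    a1, s1 = conditional_increments(rho, Jz, L, eta)
    mean = np.trace(rho @ Jz).real
    # d(pi(Jz)^2) = 2 pi(Jz) d pi(Jz) + (diffusion of pi(Jz))^2 dt
    return a2 - 2 * mean * a1 - s1 ** 2
```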

We  conclude  that  the  predictions  of  quantum  filtering  theory 
are  entirely  consistent  with  the  traditional  quantum  mechanics. 
A  continuous  reduction  process  replaces,  but  is  asymptotically 
equivalent  to,  the  instantaneous  state  collapse  of  Section  II-A. 
This phenomenon is known as quantum state reduction.³ We
emphasize that quantum filtering is purely a statistical inference
process and is obtained entirely through nondemolition
measurements. Note also that state reduction occurs because $L = \sqrt{M}J_z$
is self-adjoint; other cases are of equal physical interest, but we
will not consider them in this paper.

Physically, the filtering approach shows that realistic
measurements are not instantaneous but take some finite time. The
time scale of state reduction is of order $M^{-1}$, an experimentally
controlled  parameter.  A  carefully  designed  experiment  can  thus 
have  a  reduction  time  scale  of  an  order  attainable  by  modern 
digital  electronics  [43],  which  opens  the  door  to  both  measuring 
and  manipulating  the  process  in  real  time. 

IV.  Stabilization  of  Spin  State  Reduction 
A.  The  Control  Problem 

It is a standard idea in stochastic control that an output feedback
control problem can be converted into a state feedback
problem for the filter [9], [10]. This is shown schematically in
Fig. 2. The filtering equations (36) or (38) are driven by $Y_t$;
hence, at least in principle, the conditional state $\rho_t$ can be
calculated recursively in real time by a digital processor.

The filter describes optimally our knowledge of the system;
clearly, the extent of our knowledge of the system state limits the
precision with which it can be controlled. The best we can hope
to do is to control the system to the best of our knowledge, i.e.,
to control the filter. The latter is a well-posed problem, despite
the fact that we cannot predict the observations $Y_t$, because we
know the statistics of the innovations process $W_t$.

³The term state reduction is sometimes associated with quantum state
diffusion, an attempt to empirically modify the laws of quantum mechanics so
that state collapse becomes a dynamical property. The state diffusion equation,
which is postulated rather than derived, is exactly (44) with $L = L^*$. We use the
term state reduction as describing the reduction dynamics without any relation
to its interpretation. The analysis of [42] is presented in the context of quantum
state diffusion, but applies equally well to our case.

Fig. 2. Schematic of the feedback control strategy (block labels: System: Eq. (21);
Eq. (31); Filter: Eq. (38)). The output from the system
is used to propagate the conditional state of the filter. The feedback signal is of
state feedback form with respect to the conditional state.

For such a scheme to be successful the system dynamics (21)
must be known, as the optimal filter is matched to the system
dynamics. Designing controllers that perform well even when the
system dynamics is not known precisely is the subject of robust
control theory. Also, efficient signal processing algorithms and
hardware are necessary to propagate (38) in real time, which is
particularly problematic when $\dim\mathcal{H}$ is large. Neither of these
issues will be considered in this paper.

The state reduction dynamics discussed in the previous section
immediately suggests the following control problem: We
wish to find state feedback $B(t) = \Phi(\rho_t)$ so that one of the
eigenstates $\rho = \psi_m\psi_m^*$ is globally stabilized. The idea that a
quantum measurement can be engineered to collapse
deterministically onto an eigenstate of our choice is somewhat
remarkable from a traditional physics perspective, but clearly the
measurement scenario we have described provides us with this
opportunity. For additional motivation and numerical simulations
relating to this control problem, see [3].

B.  Stochastic  Stability 

In nonlinear control theory [44] stabilization of nonlinear systems
is usually performed using the powerful tools of Lyapunov
stability theory. In this section we will describe the stochastic
counterpart of deterministic Lyapunov theory, developed in the
1960s by Has'minskiĭ and others. We will not give proofs, for
which we refer to [45]-[48].

Let $W_t$ be a Wiener process on the canonical Wiener space
$(\Omega, \mathcal{F}, \mathbb{P})$. Consider an Itô equation on $\mathbb{R}^n$ of the form

$$ dx_t = b(x_t)\,dt + \sigma(x_t)\,dW_t \tag{49} $$

where $b, \sigma : \mathbb{R}^n \to \mathbb{R}^n$ satisfy the usual linear growth and local
Lipschitz conditions for existence and uniqueness of solutions
[49]. Let $x^*$ be a fixed point of (49), i.e., $b(x^*) = \sigma(x^*) = 0$.

Definition 2: The equilibrium solution $x_t = x^*$ of (49) is

1) stable in probability if

$$ \lim_{x_0 \to x^*} \mathbb{P}\Bigl[\sup_{t \ge 0} |x_t - x^*| \ge \varepsilon\Bigr] = 0 \quad \forall \varepsilon > 0; $$

2) asymptotically stable if it is stable in probability and

$$ \lim_{x_0 \to x^*} \mathbb{P}\Bigl[\lim_{t \to \infty} |x_t - x^*| = 0\Bigr] = 1; $$

3) globally stable if it is stable in probability and

$$ \mathbb{P}\Bigl[\lim_{t \to \infty} |x_t - x^*| = 0\Bigr] = 1. $$

Note that 1) and 2) are local properties, whereas 3) is a global
property of the system.

Recall that the infinitesimal generator of $x_t$ is given by

$$ \mathscr{L} = \sum_i b^i(x)\,\frac{\partial}{\partial x^i} + \frac{1}{2}\sum_{ij} \sigma^i(x)\sigma^j(x)\,\frac{\partial^2}{\partial x^i\,\partial x^j} \tag{50} $$

so $d\,\mathbb{E}[f(x_t)]/dt = \mathbb{E}[\mathscr{L}f(x_t)]$. We can now state the stochastic
equivalent of Lyapunov’s direct method [45]-[47].

Theorem 1: Define $U_h = \{x : |x - x^*| < h\}$. Suppose
there exists some $h > 0$ and a function $V : U_h \to \mathbb{R}_+$ that
is continuous and twice differentiable on $U_h \setminus \{x^*\}$, such that
$V(x^*) = 0$ and $V(x) > 0$ otherwise, and $\mathscr{L}V(x) \le 0$ on $U_h$.
Then the equilibrium solution $x_t = x^*$ is stable in probability.
If $\mathscr{L}V(x) < 0$ on $U_h \setminus \{x^*\}$, then $x^*$ is asymptotically stable.

Theorem  1  is  a  local  theorem;  to  prove  global  stability  we 
need  additional  methods.  When  dealing  with  quantum  filtering 
equations  a  useful  global  result  is  the  following  stochastic 
LaSalle-type  theorem  of  Mao  [48].  In  the  theorem  we  will 
assume  that  the  dynamics  of  (49)  are  confined  to  a  bounded 
invariant  set  G. 

Theorem 2: Let $G$ be a bounded invariant set with respect
to the solutions of (49) and $x_0 \in G$. Suppose there exists a
continuous, twice differentiable function $V : G \to \mathbb{R}_+$ such
that $\mathscr{L}V(x) \le 0$ $\forall x \in G$. Then $\lim_{t \to \infty} \mathscr{L}V(x_t) = 0$ a.s.

Finally, we will find it useful to prove that a particular fixed
point repels trajectories that do not originate on it. To this end,
we use the following theorem of Has'minskiĭ [45].

Theorem 3: Suppose there exists some $h > 0$ and a function
$V : U_h \to \mathbb{R}$ that is continuous and twice differentiable on
$U_h \setminus \{x^*\}$, such that

$$ \lim_{x \to x^*} V(x) = +\infty $$

and $\mathscr{L}V(x) < 0$ on $U_h \setminus \{x^*\}$. Then, the equilibrium solution
$x_t = x^*$ is not stable in probability and, moreover,

$$ \mathbb{P}\Bigl[\sup_{t \ge 0} |x_t - x^*| < h\Bigr] = 0 \quad \forall x_0 \in U_h \setminus \{x^*\}. \tag{51} $$
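All three theorems are stated in terms of the generator (50) applied to a candidate function. When a closed form is inconvenient, the generator can be approximated numerically. The following sketch assumes a scalar Wiener process, as in (49), and uses central differences along the drift and diffusion directions; the function name `generator` and the step size `h` are illustrative choices.

```python
import numpy as np

def generator(f, b, sigma, x, h=1e-4):
    """Approximate (L f)(x) for dx = b(x) dt + sigma(x) dW, W a scalar Wiener process.

    Uses L f = b . grad f + (1/2) sigma^T (Hess f) sigma, with both terms
    evaluated as central differences of f along the directions b(x) and sigma(x).
    """
    x = np.asarray(x, dtype=float)
    bx = np.asarray(b(x), dtype=float)
    sx = np.asarray(sigma(x), dtype=float)
    first = (f(x + h * bx) - f(x - h * bx)) / (2 * h)           # b . grad f
    second = (f(x + h * sx) - 2 * f(x) + f(x - h * sx)) / h ** 2  # sigma^T Hess f sigma
    return first + 0.5 * second
```

As a usage example, for $dx_t = -x_t\,dt + (1/2)x_t\,dW_t$ and $V(x) = x^2$ one finds $\mathscr{L}V(x) = -2x^2 + x^2/4 < 0$ for $x \ne 0$, so Theorem 1 gives asymptotic stability of $x^* = 0$.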

C.  A  Toy  Problem:  The  Disc  and  the  Circle 

We  treat  in  detail  an  important  toy  problem:  spin  j  =  (1/2). 
The  low  dimension  and  the  simple  topology  make  this  problem 
easy  to  visualize.  Nonetheless  we  will  see  that  the  stabilization 
problem  is  not  easy  to  solve  even  in  this  simple  case. 

We have already obtained the filter (43) on $K_2$ for this case.
Conveniently, the origin in $K_2$ is mapped to the lower eigenstate
$\psi_{-1/2}\psi_{-1/2}^*$; we will attempt to stabilize this state.

Note that the equations for $\lambda_t, \nu_t$ are decoupled from $\mu_t$.
Moreover, the only point in $K_2$ with $(\lambda, \nu) = (0, 0)$ has $\mu = 0$.
Hence, we can equivalently consider the control problem

$$ \begin{aligned}
d\lambda_t &= \bigl(B(t)(\nu_t - \tfrac{1}{2}) - \tfrac{1}{2}M\lambda_t\bigr)\,dt + \sqrt{M\eta}\,\lambda_t(1 - 2\nu_t)\,dW_t \\
d\nu_t &= -B(t)\lambda_t\,dt - 2\sqrt{M\eta}\,\nu_t(\nu_t - 1)\,dW_t
\end{aligned} \tag{52} $$

on the disc $B^2 = \{(\lambda, \nu) \in \mathbb{R}^2 : \lambda^2 + \nu(\nu - 1) \le 0\}$.
Controlling (52) is entirely equivalent to controlling (43), as globally
stabilizing $(\lambda, \nu) = (0, 0)$ guarantees that $\mu$ is attracted to zero
due to the geometry of $K_2$.

An even simpler toy problem is obtained as follows. Suppose
$\eta = 1$; we have seen that then the sphere $\lambda^2 + \mu^2 + \nu(\nu - 1) = 0$
is invariant under (43). Now, suppose that additionally $\mu_0 = 0$.
Then, clearly the circle $S^1 = \{(\lambda, \nu) \in \mathbb{R}^2 : \lambda^2 + \nu(\nu - 1) = 0\}$
is an invariant set. We find

$$ d\theta_t = \bigl(B(t) - \tfrac{1}{2}M\sin\theta_t\cos\theta_t\bigr)\,dt - \sqrt{M}\,\sin\theta_t\,dW_t \tag{53} $$

after a change of variables $(2\lambda_t, 2\nu_t) = (\sin\theta_t, 1 + \cos\theta_t)$.

System (52) could in principle be realized by performing the
experiment of Fig. 1 with a single atom. The reduced system
(53) is unrealistic, however; it would require perfect
photodetectors and perfect preparation of the initial state. Nonetheless,
it is instructive to study this case, as it provides intuition which
can be applied in more complicated scenarios. Note that (53) is
a special case of (52) where $\eta = 1$ and the dynamics is restricted
to the boundary of $B^2$.
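The change of variables leading to (53) can be checked by pushing the coefficients of (53) through Itô's formula and comparing with (52) at $\eta = 1$. The sketch below does this pointwise; the function names are illustrative.

```python
import numpy as np

def circle_to_disc_coefficients(theta, B, M):
    """Map the Ito coefficients of (53) through (2 lam, 2 nu) = (sin th, 1 + cos th)."""
    a = B - 0.5 * M * np.sin(theta) * np.cos(theta)   # drift of theta
    s = -np.sqrt(M) * np.sin(theta)                   # diffusion of theta
    # Ito's formula: d g(theta) = (g' a + g'' s^2 / 2) dt + g' s dW
    lam_drift = 0.5 * np.cos(theta) * a - 0.25 * np.sin(theta) * s ** 2
    lam_diff = 0.5 * np.cos(theta) * s
    nu_drift = -0.5 * np.sin(theta) * a - 0.25 * np.cos(theta) * s ** 2
    nu_diff = -0.5 * np.sin(theta) * s
    return lam_drift, lam_diff, nu_drift, nu_diff

def disc_coefficients(lam, nu, B, M, eta=1.0):
    """Ito coefficients of (52), in the same order as above."""
    r = np.sqrt(M * eta)
    return (B * (nu - 0.5) - 0.5 * M * lam, r * lam * (1 - 2 * nu),
            -B * lam, -2 * r * nu * (nu - 1))
```

Evaluating both functions on the circle, with $\lambda = \tfrac{1}{2}\sin\theta$ and $\nu = \tfrac{1}{2}(1 + \cos\theta)$, should give identical coefficients.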

D.  Almost  Global  Control  on  S1 

We wish to stabilize $(\lambda, \nu) = (0, 0)$, which corresponds to
$\theta = \pi$. Note that by (53) a positive magnetic field $B > 0$ causes
an increasing drift in $\theta$, i.e., a clockwise rotation on the circle.
Hence, a natural choice of controller is one which causes the
state to rotate in the direction nearest to $\theta = \pi$ from the current
position. This situation is sketched in Fig. 3(a).

A drawback of any such controller is that by symmetry, the
feedback must vanish not only on $\theta = \pi$ but also on $\theta = 0$;
hence, $\theta = 0$ remains a fixed point of the controlled system
and the system is not globally stable. We will show, however,
that under certain conditions such feedback renders the system
almost globally stable, in the sense that all paths that do not start
on $\theta = 0$ are attracted to $\theta = \pi$ a.s.

For simplicity, we choose a controller that is linear in $(\lambda, \nu)$

$$ B(t) = 2G\lambda_t = G\sin\theta_t, \quad G > 0. \tag{54} $$

Here, $G$ is the feedback gain. The generator of (53) is then

$$ \mathscr{L} = \bigl(G\sin\theta - \tfrac{1}{2}M\sin\theta\cos\theta\bigr)\frac{\partial}{\partial\theta} + \tfrac{1}{2}M\sin^2\theta\,\frac{\partial^2}{\partial\theta^2}. \tag{55} $$

As a first step we will show that the fixed point $\theta = \pi$ is
asymptotically stable and that the system is always attracted to one of
the fixed points (there are no limit cycles, etc.). To this end,
consider the Lyapunov function

$$ V(\theta) = 1 + \cos\theta, \quad V(\pi) = 0, \quad V(\theta \ne \pi) > 0. \tag{56} $$

We obtain

$$ \mathscr{L}V(\theta) = -G\sin^2\theta. \tag{57} $$

It follows from Theorem 1 that $\theta = \pi$ is asymptotically stable,
and from Theorem 2 that $\lim_{t \to \infty}\theta_t \in \{0, \pi\}$ a.s.

Fig. 3. Cartoons of the various control schemes; the arrows denote the rotation direction of the magnetic field. (a) Almost global control on the circle: The
magnetic field always rotates in the direction of least distance to $\theta = \pi$, but $\theta = 0$ remains a fixed point. (b) Global control on the circle: We intentionally
break the symmetry of the controller to remove the undesired fixed point. The graphs show a typical feedback law and Lyapunov design with $M = 1$,
$B(\theta) = (1/2)\sin\theta - (1/4)(1 + \cos\theta)$, $V(\theta) = ((5/2) + \sin\theta)(1 + \cos\theta)$. (c) A neighborhood of $(\lambda, \nu) = (0, 1)$ showing why the almost global control law fails
on the disc. The control vanishes on the line $\lambda = 0$; hence, points on this line are never repelled with unit probability, in violation of (51).


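What remains to be shown is that any trajectory which does
not start on $\theta = 0$ ends up at $\theta = \pi$ a.s. To prove this, consider

$$ \tilde{V}(\theta) = -\log(1 - \cos\theta), \qquad \lim_{\theta \to 0}\tilde{V}(\theta) = +\infty. \tag{58} $$

We easily find

$$ \mathscr{L}\tilde{V}(\theta) = \cos^2(\theta/2)\,(M + M\cos\theta - 2G). \tag{59} $$

Now, note that

$$ \mathscr{L}\tilde{V}(\theta) < 0 \quad \forall\theta \in (-\pi, \pi) \setminus \{0\} \quad \text{iff} \quad G \ge M. \tag{60} $$

Thus, by Theorem 3 we have

$$ \mathbb{P}\Bigl[\sup_{t \ge 0} |\theta_t| < \pi\Bigr] = 0 \quad \text{if } \theta_0 \in (-\pi, \pi) \setminus \{0\}. \tag{61} $$

However, as $\theta \in S^1$ this implies $\theta_t \to \pi$ a.s. if
$\theta_0 \in (-\pi, \pi) \setminus \{0\}$. We conclude that the control law (54) almost
globally stabilizes the system if we have sufficient gain $G \ge M$.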
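The two generator computations (57) and (59) are easy to confirm numerically. The sketch below applies the generator (55) to the functions (56) and (58) by central finite differences; the helper name `generator_on_circle` and the step size are illustrative, and the point $\theta = 0$, where (58) diverges, must be avoided.

```python
import numpy as np

def generator_on_circle(V, theta, G, M, h=1e-4):
    """Apply the generator (55) to V using central finite differences."""
    dV = (V(theta + h) - V(theta - h)) / (2 * h)
    d2V = (V(theta + h) - 2 * V(theta) + V(theta - h)) / h ** 2
    drift = G * np.sin(theta) - 0.5 * M * np.sin(theta) * np.cos(theta)
    return drift * dV + 0.5 * M * np.sin(theta) ** 2 * d2V

V_attract = lambda th: 1 + np.cos(th)            # Lyapunov function (56)
V_repel = lambda th: -np.log(1 - np.cos(th))     # function (58), diverges at theta = 0

theta = np.linspace(0.3, 3.0, 50)
G, M = 1.5, 1.0                                  # example values with G >= M
lhs_57 = generator_on_circle(V_attract, theta, G, M)   # compare with -G sin^2(theta)
lhs_59 = generator_on_circle(V_repel, theta, G, M)     # compare with the right side of (59)
```

Since $M + M\cos\theta < 2M$ for $\theta \ne 0$, the factor in (59) is negative on the whole punctured interval exactly when $2G \ge 2M$, which is the gain condition (60).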

E.  Global  Control  on  S1 

Any deterministic system on the circle is topologically
obstructed⁴ from having a globally stabilizing controller: A
continuous vector field on $S^1$ with a stable fixed point necessarily
has an unstable fixed point as well. In the stochastic setting,
however, this obstruction does not apply. Though the drift and
diffusion terms must each have two fixed points, we may design the
system in such a way that only the stable fixed points coincide.

To apply such a trick in our system we must break the natural
symmetry of the control law. This situation is shown in Fig. 3(b).
There is a region of the circle where the control rotates in the
direction with a longer distance to $\theta = \pi$; the advantage is that
$\theta = 0$ is no longer a fixed point.

The linear control law that has this property has the form

$$ B(t) = 2G\lambda_t + 2H\nu_t = G\sin\theta_t + H(1 + \cos\theta_t) \tag{62} $$

with $G > 0$. We can prove global stability by applying
Theorems 1 and 2 with a Lyapunov function of the form

$$ V(\theta) = (\alpha + \sin\theta)(1 + \cos\theta), \quad \alpha > 1. \tag{63} $$

⁴This is only the case for systems with continuous vector fields and
continuous, pure state feedback. The obstruction can be lifted if one considers
feedback laws that are discontinuous or that have explicit time dependence.


Unfortunately, it is not obvious from the analytic form of $\mathscr{L}V$
how $\alpha$ must be chosen to satisfy the Lyapunov condition. It is
however straightforward to plot $\mathscr{L}V$, so that in this simple case
it is not difficult to search for $\alpha$ by hand.

A  typical  design  for  a  particular  choice  of  parameters  is 
shown  in  Fig.  3(b).  The  conditions  of  Theorems  1  and  2  are 
clearly  satisfied,  proving  that  the  system  is  globally  stable.  Note 
that  when  the  symmetry  is  broken  we  no  longer  need  to  fight 
the  attraction  of  the  undesired  fixed  point;  hence,  there  is  no 
lower  bound  on  G.  In  fact,  in  Fig.  3(b)  we  have  G  <  M. 
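The "search by hand" can be mimicked by evaluating $\mathscr{L}V$ on a grid. The sketch below does so for the design quoted in the caption of Fig. 3(b), read here as $M = 1$, $G = 1/2$, $H = -1/4$, and $\alpha = 5/2$; the derivatives of (63) are written out analytically, and the function names are illustrative.

```python
import numpy as np

M, G, H, alpha = 1.0, 0.5, -0.25, 2.5   # values read from the caption of Fig. 3(b)

def LV(theta):
    """Generator of (53) with feedback (62), applied to V of (63)."""
    s, c = np.sin(theta), np.cos(theta)
    drift = G * s + H * (1 + c) - 0.5 * M * s * c
    dV = c * (1 + c) - (alpha + s) * s            # V'(theta)
    d2V = -s - 4 * s * c - alpha * c              # V''(theta)
    return drift * dV + 0.5 * M * s ** 2 * d2V

def V(theta):
    return (alpha + np.sin(theta)) * (1 + np.cos(theta))

# grid on the circle, leaving out a small neighborhood of the target theta = pi
theta = np.linspace(-np.pi + 0.05, np.pi - 0.05, 2001)
largest = LV(theta).max()                         # negative if the design is valid
```

A plot of `LV(theta)` against `theta` reproduces the kind of graph described for Fig. 3(b); in particular $\mathscr{L}V(0) = -1$, so $\theta = 0$ is no longer a fixed point of the controlled dynamics.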

F.  Almost  Global  Control  on  B2 

Unfortunately, the simple almost global control design on $S^1$
does not generalize to $B^2$. The problem is illustrated in Fig. 3(c).
The controller (54) vanishes at $\theta = 0$ and $\pi$, but we can prove
that $\theta = 0$ is repelling. On $B^2$, however, the control vanishes
on the entire line $\lambda = 0$ which becomes an invariant set of
(52). However, then it follows from (48) that any trajectory with
$\lambda_0 = 0$, $\nu_0 \notin \{0, 1\}$ has a nonzero probability of being attracted
to either fixed point.

Consider a neighborhood $U_h$ of the point $(\lambda, \nu) = (0, 1)$
that we wish to destabilize. For any $h > 0$, however small, $U_h$
contains points on the line $\lambda = 0$ for which $\nu < 1$, and we
have seen that trajectories starting at such points have a nonzero
probability of being attracted to $(0, 1)$. However, this violates
(51), so clearly the hypotheses of Theorem 3 cannot be satisfied on $B^2$.

One could attempt to prove that all points except those with
$\lambda = 0$ are attracted to the origin with unit probability. The
Lyapunov theory of Section IV-B is not equipped to handle such
a case, however, and new methods must be developed [50].
Instead, we will focus on the global control problem.

G.  Global  Control  on  B2  and  Semialgebraic  Geometry 

Once again we consider the asymmetric control law

$$ B(t) = 2G\lambda_t + 2H\nu_t, \quad G > 0 \tag{64} $$

and try to show that it globally stabilizes the system. Before
we can solve this problem, however, we must find a systematic
method for proving global stability. Searching “by hand” for
Lyapunov functions is clearly impractical in two dimensions,
and is essentially impossible in higher dimensions where the
state-space cannot be visualized.

In fact, even if we are given a Lyapunov function $V$, testing
whether $\mathscr{L}V \le 0$ on $K$ is highly nontrivial. The problem can
be reduced to the following question: Is the set
$\{\Lambda \in \mathbb{R}^{n^2-1} : \mathscr{L}V > 0,\ k_p(h(\Lambda)) \ge 0,\ p = 2, \ldots, n\}$ empty? Such problems
are notoriously difficult to solve and their solution is known to
be NP-hard in general [51].

The following result, due to Putinar [52], suggests one way
to proceed. Let $S$ be a semialgebraic set, i.e.,
$S = \{x \in \mathbb{R}^m : s_i(x) \ge 0,\ i = 1, \ldots, n\}$ with polynomial $s_i$. Suppose that for
some $i$ the set $\{x \in \mathbb{R}^m : s_i(x) \ge 0\}$ is compact. Then, any
polynomial $p$ that is strictly positive on $S$ is of the form

$$ p(x) = p_0(x) + \sum_{i=1}^{n} p_i(x)\,s_i(x), \qquad p_k(x) = \sum_j p_{kj}(x)^2 \tag{65} $$

where $p_{kj}$ are polynomials, i.e., $p$ is an affine combination of
the constraints $s_i$ and sum-of-squares polynomials $p_k$.

Fig. 4. Contour plot of $\mathscr{L}V$ for the control law $B(t) = 4\lambda_t - \nu_t$, with $M = 2$
and $\eta = 1/2$. The function $V$ was found by semidefinite programming.

Conversely,  it  is  easy  to  check  that  any  polynomial  of  the  form 
(65)  is  nonnegative  on  S.  We  may  thus  consider  the  following 
relaxation:  Instead  of  testing  nonnegativity  of  a  polynomial  on 
S,  we  may  test  whether  the  polynomial  can  be  represented  in  the 
form (65). Though it is not true that any nonnegative polynomial
on $S$ can be represented in this form, Putinar's result suggests
that the relaxation is not overly restrictive. The principal
advantage of this approach is that the relaxed problem can be solved
in polynomial time using semidefinite programming techniques
[53], [54].

The approach is easily adapted to our situation as $K$ is a
semialgebraic set, and we solve the relaxed problem of testing
whether $-\mathscr{L}V$ can be expressed in the form (65). In fact, the
semidefinite programming approach of [53] and [54] even
allows us to search for polynomial $V$ such that (65) is satisfied;
hence we can search numerically for a global stability proof
using a computer program. Such searches are easily
implemented using the Matlab toolbox SOSTOOLS [55].

A typical design for a particular choice of parameters is
shown in Fig. 4. After fixing the parameters $M = 2$, $\eta = 1/2$,
and the control law $B(t) = 4\lambda_t - \nu_t$, an SOSTOOLS search
found the Lyapunov function

$$ V(\lambda, \nu) = 21.8\,\nu - 5.73\,\lambda^2 + 10.4\,\lambda\nu - 5.63\,\nu^2 \tag{66} $$

where $-\mathscr{L}V$ is of the form (65). Hence, Theorems 1 and 2 are
satisfied, proving that the system is globally stable.
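A sum-of-squares certificate is a proof, but a quick numerical sanity check of the sign of $\mathscr{L}V$ on the disc is still useful when reading a quadratic such as (66). The sketch below samples $B^2$ on a grid and evaluates the generator of (52) with $B = 4\lambda - \nu$, $M = 2$, $\eta = 1/2$. It uses the coefficients of (66) as printed (rounded to three digits) and is only a grid check, not a substitute for the SOSTOOLS computation.

```python
import numpy as np

M, eta = 2.0, 0.5

def LV(lam, nu):
    """Generator of (52) with B = 4 lam - nu, applied to the quadratic V of (66)."""
    B = 4 * lam - nu
    r = np.sqrt(M * eta)
    b_lam, b_nu = B * (nu - 0.5) - 0.5 * M * lam, -B * lam
    s_lam, s_nu = r * lam * (1 - 2 * nu), -2 * r * nu * (nu - 1)
    # gradient and constant Hessian of V = 21.8 nu - 5.73 lam^2 + 10.4 lam nu - 5.63 nu^2
    V_lam, V_nu = -11.46 * lam + 10.4 * nu, 21.8 + 10.4 * lam - 11.26 * nu
    V_ll, V_ln, V_nn = -11.46, 10.4, -11.26
    return (b_lam * V_lam + b_nu * V_nu
            + 0.5 * (s_lam ** 2 * V_ll + 2 * s_lam * s_nu * V_ln + s_nu ** 2 * V_nn))

# sample the disc B^2 = {lam^2 + nu (nu - 1) <= 0}
lam, nu = np.meshgrid(np.linspace(-0.5, 0.5, 201), np.linspace(0.0, 1.0, 201))
inside = lam ** 2 + nu * (nu - 1) <= 0
worst = LV(lam[inside], nu[inside]).max()         # largest value of LV on the grid
```

On this grid the largest value is attained at the origin, where $\mathscr{L}V = 0$, while at the undesired eigenstate $(\lambda, \nu) = (0, 1)$ one finds $\mathscr{L}V = -5.2$.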

A  couple  of  technical  points  should  be  made  at  this 
point.  Note  that  formally  the  filtering  equation  (38)  and  its 
parametrizations  do  not  satisfy  the  linear  growth  condition. 
However,  as  the  filter  evolves  on  a  compact  invariant  set  K, 
we  could  modify  the  equations  smoothly  outside  K  to  be  of 
linear  growth  without  affecting  the  dynamics  in  K.  Hence,  the 
results  of  Section  IV-B  can  still  be  used.  Moreover,  it  is  also  not 
strictly  necessary  that  V  be  nonnegative,  as  adding  a  constant 
to  V  does  not  affect  LV.  Hence,  it  is  sufficient  to  search  for 
polynomial  V  using  SOSTOOLS. 


H.  Global  Control  for  Higher  Spin 

The approach for proving global stability described in the previous
section works for arbitrary spin j. To generalize our control
scheme we need to convert to the parametrization of Section
III-B, as we did for spin j = (1/2) in (52). We must also
propose a control law that works for general spin systems.

We do not explicitly convert to the parametrized form or generate
the constraints kp, as this procedure is easily automated
using Matlab's symbolic toolbox. Note that the parameter k determines
which eigenstate is mapped to the origin. This is convenient
for SOSTOOLS searches, as polynomials can be fixed
to vanish at the origin simply by removing the constant term.
We always wish to stabilize the origin in the parametrized coordinate
system.

To speed up computations we can eliminate all the parameters
$\mu_{ij}$ as was done in going from (43) to (52). The fact that the
remaining equations are decoupled from $\mu_{ij}$ is easily seen from
(38), as both $iJ_y$ and $J_z$ are real matrices. Moreover, it is easily
verified that, by convexity of K, the orthogonal projection of
any $\rho \in K$ onto $\{\mathbb{R}^{n^2-1} : \mu_{ij} = 0\ \forall i > j\}$ lies inside K.
Hence, we only need to consider the reduced control problem
with $\mu_{ij} = 0$.

In [3], we numerically studied two control laws for general
spin systems. The first law, $B_1(t) = \pi_t(J_x J_z + J_z J_x - 2 m_d J_x)$
($m_d$ is the eigenstate we wish to stabilize), reduces to our almost
global control law when $j = 1/2$. However, numerical
simulations suggest that for $j > 1/2$ this control law gives
a finite collapse probability onto $m \neq m_d$. The second law,
$B_2(t) = \pi_t(J_z) - m_d$, reduces to B2(t) =  in the case
$j = 1/2$, which is not locally stable. Our experience with
$j = 1/2$ suggests that a control law of the form

$$B(t) = G\,\pi_t(J_x J_z + J_z J_x - 2 m_d J_x) + H\,(\pi_t(J_z) - m_d) \tag{67}$$

should globally stabilize the eigenstate $m_d$ of a spin j system.
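
As a rough illustration of how a law of the form (67) is evaluated, the Python sketch below builds the spin operators for a given j and computes B for a given density matrix. It assumes that $\pi_t(X)$ is the conditional expectation $\mathrm{Tr}(\rho_t X)$, that the basis is ordered $m = j, j-1, \ldots, -j$, and that the density matrix is real symmetric. The function names are hypothetical and the code is a sketch, not the implementation used for the results reported here.

```python
import math

def spin_operators(j):
    """Return (Jx, Jz, ms) for spin j in the basis m = j, j-1, ..., -j."""
    dim = int(round(2 * j)) + 1
    ms = [j - k for k in range(dim)]
    Jz = [[ms[r] if r == c else 0.0 for c in range(dim)] for r in range(dim)]
    Jx = [[0.0] * dim for _ in range(dim)]
    for k in range(1, dim):
        m = ms[k]  # J+ maps |m> (index k) to |m+1> (index k-1)
        amp = 0.5 * math.sqrt(j * (j + 1) - m * (m + 1))
        Jx[k - 1][k] = amp
        Jx[k][k - 1] = amp
    return Jx, Jz, ms

def expectation(rho, X):
    """Tr(rho X) for square nested-list matrices."""
    n = len(rho)
    return sum(rho[r][c] * X[c][r] for r in range(n) for c in range(n))

def control_law(rho, j, m_d, G, H):
    """Evaluate B = G*pi(JxJz + JzJx - 2 m_d Jx) + H*(pi(Jz) - m_d)."""
    Jx, Jz, ms = spin_operators(j)
    dim = len(ms)
    # Jz is diagonal, so (JxJz + JzJx)[r][c] = Jx[r][c] * (m_r + m_c).
    A = [[Jx[r][c] * (ms[r] + ms[c] - 2 * m_d) for c in range(dim)]
         for r in range(dim)]
    return G * expectation(rho, A) + H * (expectation(rho, Jz) - m_d)

def projector(dim, k):
    """Density matrix of the basis state with index k."""
    return [[1.0 if r == c == k else 0.0 for c in range(dim)] for r in range(dim)]
```

For j = 1 and $m_d = 0$, `control_law(projector(3, 1), 1, 0, G=2, H=1)` evaluates the feedback at the target eigenstate, where both terms vanish, so the controller exerts no drive once the target is reached.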

We have verified global stability for a typical design with
$j = 1$, $M = 2$, $\eta = 1/2$, and $B(t) = 2\pi_t(J_x J_z + J_z J_x) + \pi_t(J_z)$
using SOSTOOLS. A Lyapunov function was indeed found that
guarantees global stability of the eigenstate $m_d = 0$.

Physically the case j > (1/2) is much more interesting
than j = (1/2). An experiment with j > (1/2) can be performed
with multiple atoms, in which case the control produces
statistical correlations between the atoms. Such correlations,
known as entanglement, are important in quantum computing.
The structure of the control problem is, however, essentially
the same for any j. We refer to [3] and [56] for details on
entanglement generation in spin systems.

V.  Conclusion 

In this paper, we have argued that quantum mechanical systems
that are subjected to measurement are naturally treated
within the framework of (albeit noncommutative) stochastic filtering
theory. The quantum control problem is then reduced to
a classical stochastic control problem for the filter. We have
demonstrated the viability of this approach by stabilizing state
reduction in simple quantum spin systems using techniques of
stochastic nonlinear control theory.

Unfortunately, the stabilization techniques of Section IV have
many drawbacks. We do not have a systematic procedure for
finding control laws: we postulate linear controllers and search
for corresponding Lyapunov functions. Even when the control
law is known, verifying global stability is nontrivial in the
simplest case. Our numerical approach, though very successful
in the examples we have shown, rapidly becomes intractable as
the dimension of the Hilbert space grows. Finally, our methods
do not allow us to make general statements; for example, though
it seems plausible that the control law (67) is globally stabilizing
for any $j$, $M$, $\eta$, $H \neq 0$, and $G > 0$, we have not yet succeeded
in proving such a statement.

Nonetheless, we believe that the general approach outlined
in this paper provides a useful framework for the control of
quantum systems. It is important in this context to develop
methods for the control of classical stochastic nonlinear systems
[57]-[60], as well as methods that exploit the specific
structure of quantum control problems. The design of realistic
control systems will also require efficient signal processing
algorithms for high-dimensional quantum filtering and methods
for robust quantum control [61].
Abstract

On the occasion of the hundredth anniversary of Albert Einstein's annus
mirabilis, we reflect on the development and current state of research in cavity
quantum  electrodynamics  in  the  optical  domain.  Cavity  QED  is  a  field  which 
undeniably  traces  its  origins  to  Einstein’s  seminal  work  on  the  statistical  theory 
of  light  and  the  nature  of  its  quantized  interaction  with  matter.  In  this  paper, 
we  emphasize  the  development  of  techniques  for  the  confinement  of  atoms 
strongly  coupled  to  high-finesse  resonators  and  the  experiments  which  these 
techniques  enable. 

(Some  figures  in  this  article  are  in  colour  only  in  the  electronic  version) 


1.  From  Einstein  to  cavity  QED 

In  the  years  prior  to  his  seminal  1905  papers,  Albert  Einstein  had  given  much  thought  to 
the  statistical  properties  of  electromagnetic  fields  [1],  especially  with  regard  to  the  theory  of 
black-body  radiation  developed  by  Max  Planck  [2].  Einstein  realized  that  the  quantization 
of  light — particularly  the  creation  and  annihilation  of  ‘light  quanta’ — is  something  more 
fundamental  than  a  tacit  consequence  of  the  assumption  that  the  total  energy  of  a  black-body 
is  discretely  distributed  between  a  set  of  microstates.  Beginning  in  1905  with  On  a  heuristic 
point  of  view  about  the  creation  and  conversion  of  light  [3]  and  in  four  subsequent  papers  on 
quantization  [4-7],  he  laid  the  foundations  of  the  ‘old  quantum  theory’  [8],  summarized  in 
what  is  commonly  referred  to  as  the  ‘light  quantization  hypothesis’: 

. . .  the  energy  of  a  light  ray  emitted  from  a  point  [is]  not  continuously  distributed 
over  an  ever  increasing  space,  but  consists  of  a  finite  number  of  energy  quanta  which 
are  localized  at  points  in  space,  which  move  without  dividing,  and  which  can  only  be 
produced  and  absorbed  as  complete  units  [3]. 
He argued that the existence of light quanta was essential to Planck's hypothesis. In his
treatment  of  the  problem,  he  developed  a  formalism  based  upon  the  inversion  of  Boltzmann’s 
law  to  describe  the  statistical  variances  in  the  energy  of  a  black  body.  Using  this  technique,  he 
was  able  to  separate  these  energy  fluctuations  into  the  sum  of  two  quantities,  one  representing 
the  fluctuation  of  a  number  of  particles  and  the  other  corresponding  to  the  variances  of  a 
classical  wave  [6,  9].  The  wave-particle  duality  to  which  Einstein  alluded  is,  of  course,  now 
one  of  the  tenets  of  modern  quantum  mechanics.  These  early  papers  also  began,  for  the  first 
time,  to  broach  phenomenological  problems  in  terms  of  quantized  light.  In  a  very  literal  sense, 
Einstein’s  descriptions  of  the  photoelectric  effect  and  the  ultraviolet  photoionization  of  gases 
[3]  constitute  the  first  applications  of  quantum  electrodynamics  as  well  as  the  first  microscopic 
(rather  than  statistical)  descriptions  of  quantized  electromagnetic  phenomena. 

While  the  same  could  be  said  of  vast  expanses  of  modern  physics,  much  of  the  intellectual 
underpinnings  of  cavity  quantum  electrodynamics  are  certainly  due  to  this  early  work  by 
Einstein.  Einstein  wrote 

The  wave  theory  of  light  which  operates  with  continuous  functions  in  space  has  been 
excellently  justified  for  the  representation  of  a  purely  optical  phenomena  and  it  is 
unlikely  ever  to  be  replaced  by  another  theory.  One  should,  however,  bear  in  mind 
that  optical  observations  refer  to  time  averages  and  not  to  instantaneous  values  and 
notwithstanding  the  complete  experimental  verification  of  the  theory  of  diffraction, 
reflection,  refraction,  dispersion  and  so  on,  it  is  quite  conceivable  that  a  theory  of 
light  involving  the  use  of  continuous  functions  in  space  will  lead  to  contradictions 
of  experience,  if  it  is  applied  to  the  phenomena  of  the  creation  and  conversion  of 
light  [3]. 

As  will  be  described  in  the  following  sections,  an  atom  strongly  coupled  to  the  mode 
of  a  resonant  cavity  is  precisely  one  such  situation.  In  fact  the  atom-cavity  system  is 
uniquely  well-suited  to  the  efficient  generation  of  single  photons,  a  phenomenon  wherein 
Einstein’s  ‘contradictions  of  experience’  certainly  dominate.  The  quantized  description  of 
light  is  manifestly  that  which  is  needed  to  describe  cavity  QED. 

In  what  follows,  we  introduce  cavity  quantum  electrodynamics  in  the  regime  of  strong 
coupling.  With  an  emphasis  on  the  current  state-of-the-art  experiments,  particularly  those 
carried out by our group at Caltech, involving atoms trapped within an optical cavity, we also
review  the  evolution  of  experiments  in  cavity  QED  and  discuss  future  research  directions. 

2.  Fundamentals  of  the  cavity  QED  system 

2.1. Introduction

Cavity  quantum  electrodynamics  explores  the  measurement  and  control  of  quantized 
electromagnetic  fields  and  atomic  systems  coherently  coupled  inside  an  electromagnetic 
resonator.  A  simple,  representative  model  of  the  cavity  QED  system,  as  illustrated  in  figure  1 , 
consists  of  exactly  one  two-state  atomic  system  at  rest  within  the  mode  of  a  resonator  formed 
by  two  spherical  mirrors.  The  dynamics  of  this  system  are  described  by  the  well-known 
Jaynes-Cummings  Hamiltonian  [10,  11]  which  is  composed  of  the  sum  of  separate  terms  for 
both  the  atom  and  field  as  well  as  a  third  term  which  arises  from  the  total  atom-field  dipole 
interaction: 

$$H = \tfrac{1}{2}\hbar\omega_a\hat{\sigma}_z + \hbar\omega_c\left(\hat{a}^\dagger\hat{a} + \tfrac{1}{2}\right) + \hbar g(\mathbf{r})\left(\hat{a}^\dagger\hat{\sigma} + \hat{a}\hat{\sigma}^\dagger\right). \tag{1}$$

The operators $(\hat{\sigma}_z, \hat{\sigma}^\dagger, \hat{\sigma})$ are the Pauli operators corresponding to inversion, raising and
lowering of the atomic state, while $(\hat{a}^\dagger, \hat{a})$ are the creation and annihilation operators for
photons in the coupled mode of the resonator. $(\omega_a, \omega_c)$ are the resonant frequencies of the
atom and cavity, respectively.

Figure 1. Characteristic parameters of atom, cavity, and environment: $\kappa$, the rate of decay of the
cavity field; $\gamma$, the rate at which the atomic dipole radiates into modes other than the cavity field
mode; $T$, the transit time of an atom through the cavity mode; and $g$, the rate of coherent atom-field
coupling. Strong coupling requires that $g/(\gamma, \kappa, T^{-1}) \gg 1$.

The magnitude of the coupling is a function of the atom's position, $\mathbf{r}$, within the standing
wave structure of the mode and is given by

$$g(\mathbf{r}) = \left(\frac{\mu^2\omega_c}{2\hbar\epsilon_0 V_m}\right)^{1/2} U(\mathbf{r}) = g_0 U(\mathbf{r}). \tag{2}$$

Here, $\mu$ is the atomic dipole matrix element, $V_m$ is the resonant mode volume, a geometric
parameter of the cavity, and $U(\mathbf{r})$ is the cavity mode function, defined such that
$V_m = \int |U(\mathbf{r})|^2\, d^3x$. The coupling coefficient $2g_0$ is known as the single-photon Rabi frequency and
represents the maximum rate at which one quantum of excitation is exchanged between atom
and field [12].

This idealized model captures the essential dynamics of the interaction but makes no
accounting of the dissipative processes which naturally accompany real cavities and atoms.
The dominant loss associated with the cavity results from the leakage of photons through the
mirrors at a rate $2\kappa$, where $\kappa$, the frequency half-width of the resonant mode, is specified by
the quality factor of the cavity,

$$Q = \frac{\omega_c}{2\kappa}. \tag{3}$$


A second dissipative channel is the result of spontaneous emission from the atom into field
modes other than that which is preferentially coupled to the resonator. In general, there are
two distinct atomic decay rates along the transverse and longitudinal directions of the cavity
$(\gamma_\perp, \gamma_\parallel)$, both of which are functions of the position of the atom, $\mathbf{r}$ [13]. In practice, with current
state-of-the-art optical Fabry-Perot cavities this position dependence is largely negligible. In
this case, the angle subtended by the mode of the cavity is small, and the transverse decay rate
is very closely approximated by the atomic free space decay rate, $\gamma_\perp = \gamma = \gamma_\parallel/2$ (this is not
generally true, however, for cavities of different geometries [14-16]). A third consideration
is the length of time, $T$, during which the atom resides within the cavity mode, after which no
coherent evolution is possible. While each of these three dissipative channels damps the desired
quantum evolution of the system, it should be noted that they do not necessarily stand on equal
footing. In particular, the rate $\kappa$ is characterized by emission of the cavity field into one,
well-defined spatial mode which facilitates efficient readout of the state of the coupled system.
The other two rates, $(\gamma, T^{-1})$, represent, in general, irreversible loss of information to the
environment. As photons are coupled into a continuum of experimentally inaccessible spatial
modes, and as atoms leave the cavity volume (though not necessarily so in the microwave
domain), it is difficult to extract useful information from the system.
The description above is generally applicable to cavities with resonant frequencies
ranging from the microwave [11] to optical regimes and which may be coupled to appropriate
two-level systems as diverse as Cooper pair boxes [17, 18] and Rydberg atoms [19, 20].
Whereas high-finesse Fabry-Perot cavities coupled to alkali atoms in the optical regime will
be the focus of this discussion, broad reviews of cavity QED in this and other regimes are
available [12, 21].


2.2.  Strong  coupling 

As we have seen, cavity QED is parametrized by four rates $(g_0, \kappa, \gamma, T^{-1})$. In order to
emphasize the coherent evolution of the system, it is useful to require that the coupling
coefficient dominate dissipation:

$$g_0/(\gamma, \kappa, T^{-1}) \gg 1. \tag{4}$$


This  condition  is  commonly  known  as  the  strong  coupling  criterion  [12]. 

It is instructive to explore strong coupling in terms of two dimensionless parameters
known as the critical photon and atom numbers. The critical (or saturation) photon number
describes the number of photons such that for a cavity of a given geometry the intracavity
optical intensity is sufficient to saturate the atomic response ($I_{\mathrm{sat}}$):

$$n_0 = \frac{\gamma^2}{2g_0^2}. \tag{5}$$

Similarly, the critical atom number describes the number of strongly coupled atoms necessary
to affect appreciably the intracavity field:

$$N_0 = \frac{2\kappa\gamma}{g_0^2}. \tag{6}$$

Many quantum optical systems (lasers, for instance, with $\sqrt{n_0} \sim 10^3$ to $10^4$) have large critical
parameters and therefore adding or removing one photon or atom does not significantly alter
the dynamics as a whole. In these systems, the coherent coupling parameter $g_0$ is scaled
away as processes approach the semi-classical regime. By contrast, the necessary (though not
sufficient) criteria for strong coupling are that $(n_0, N_0) \ll 1$. This means that in the regime of
strong  coupling,  single  quanta  dominate  the  dynamics  of  the  system  such  that  the  interaction 
between  atom  and  photon  can  be  manifestly  nonclassical  and  nonlinear  for  single  atoms  and 
photons.  Strong  coupling  thereby  provides  a  powerful  tool  for  the  study  of  quantum  optics  as 
well  as  the  interaction  of  the  quantized  electromagnetic  field  with  matter. 
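
A short numerical sketch makes the critical parameters concrete. The Python code below assumes the definitions $n_0 = \gamma^2/2g_0^2$ and $N_0 = 2\kappa\gamma/g_0^2$ for (5) and (6), and uses as an example the rates quoted in section 2.4 for the Caltech cavity, all in the same units (MHz). The function names and the `margin` parameter are hypothetical.

```python
def critical_numbers(g0, kappa, gamma):
    """Critical photon number n0 and critical atom number N0.

    Assumes n0 = gamma^2 / (2 g0^2) and N0 = 2 kappa gamma / g0^2,
    with all three rates expressed in the same units.
    """
    n0 = gamma ** 2 / (2.0 * g0 ** 2)
    N0 = 2.0 * kappa * gamma / g0 ** 2
    return n0, N0

def is_strongly_coupled(g0, kappa, gamma, transit_rate=0.0, margin=1.0):
    """True if g0 exceeds every dissipative rate by the factor `margin`."""
    return g0 > margin * max(kappa, gamma, transit_rate)

# Example with the rates quoted in section 2.4 (MHz).
n0, N0 = critical_numbers(g0=34.0, kappa=4.1, gamma=2.5)
```

With these rounded inputs the sketch gives $n_0$ of about 0.0027 and $N_0$ of about 0.018, both far below unity and close to the values quoted in section 2.4.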


2.3. The Jaynes-Cummings ladder

In the strong coupling regime, coherent interaction dominates dissipation and so the Jaynes-Cummings
Hamiltonian (1) provides a foundation for the description of the evolution of
the system. Allowing the cavity mode to be resonant with the atomic transition frequency
($\omega_c = \omega_a$) and diagonalizing (1) yields a set of eigenstates for the system,

$$|\pm\rangle_n = \frac{1}{\sqrt{2}}\left(|g, n\rangle \pm |e, n-1\rangle\right), \tag{7}$$

where $(g, e)$ denote the ground and excited states of the atom and there are $n$ quanta of
excitation in the system. These are the dressed states of the Jaynes-Cummings model and
represent the equal distribution of excitation between atom and field with corresponding energy
eigenvalues $E_\pm = n\hbar\omega \pm \sqrt{n}\,\hbar g(\mathbf{r})$.
Experimentally, as an atom enters the mode of the cavity the transmission and absorption
spectra in the weak-field limit will no longer exhibit a single, empty-cavity resonance at
$\omega = \omega_c$ but instead a two-peaked structure with maxima at $\omega = \omega_c \pm \sqrt{n}\,g(\mathbf{r})$, corresponding
to the energy eigenvalues of $|\pm\rangle_1$ for one quantum of excitation. This characteristic spectral
feature is known as the vacuum-Rabi splitting and serves as a hallmark of strong coupling.
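
The dressed-state energies can be checked with a few lines of Python. The sketch below diagonalizes the 2x2 block of the Jaynes-Cummings Hamiltonian (1) in the basis $\{|g, n\rangle, |e, n-1\rangle\}$ in closed form, in units where $\hbar = 1$. The function name is hypothetical, and allowing a nonzero atom-cavity detuning is an extension beyond the resonant case treated in the text.

```python
import math

def dressed_energies(n, omega_a, omega_c, g):
    """Eigenvalues (lower, upper) of the n-excitation block, with hbar = 1.

    Basis: |g, n> and |e, n-1>, coupled by the matrix element g*sqrt(n).
    """
    e_g = -0.5 * omega_a + omega_c * (n + 0.5)   # <g,n|H|g,n>
    e_e = 0.5 * omega_a + omega_c * (n - 0.5)    # <e,n-1|H|e,n-1>
    mean = 0.5 * (e_g + e_e)
    half_split = math.hypot(0.5 * (e_g - e_e), g * math.sqrt(n))
    return mean - half_split, mean + half_split
```

On resonance the two diagonal entries are equal and the eigenvalues reduce to $n\omega \pm \sqrt{n}\,g$, so the n = 1 doublet is separated by $2g$, which is the vacuum-Rabi splitting.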

2.4.  Cavities  in  the  laboratory 

While defining the criteria for strong coupling is a relatively simple matter ($g_0 \gg (\gamma, \kappa, T^{-1})$),
realizing cavities which in practice meet these criteria is a decidedly more complicated task.
Over the past 25 years, experiments in optical cavity QED have pushed progressively farther
into the regime of strong coupling such that typical values for state-of-the-art optical cavities
today are $n_0 \sim 10^{-3}$ to $10^{-4}$ photons and $N_0 \sim 10^{-2}$ to $10^{-3}$ atoms. These cavities are of Fabry-Perot
geometry and consist of two superpolished spherical mirror substrates which have been
coated with a highly reflective stack of dielectric layers ($R = 0.999\,9984$ is a representative
value for the reflectivity of one such state-of-the-art mirror) [22]. The reflectivity of these
mirrors is sufficiently large that the inter-mirror spacing (and therefore the mode volume, $V_m$)
can be made relatively small without $\kappa$ growing larger than $g_0 \propto V_m^{-1/2}$. Current experiments
underway in our group at Caltech involve atomic caesium coupled to a cavity of length
$L = 42.2\ \mu$m and with mirrors of radius of curvature $R = 20$ cm such that

$$(g_0, \kappa, \gamma) = (34, 4.1, 2.5)\ \mathrm{MHz}, \tag{8}$$

well into the regime of strong coupling. The finesse of this cavity at the D2 line in atomic
caesium ($\lambda = 852.4$ nm, made resonant with the TEM$_{00}$ mode of the cavity) is $\mathcal{F} \approx 4.2 \times 10^5$,
and the critical parameters are $n_0 \approx 0.0029$ and $N_0 = 0.018$. In future experiments, it may be
possible to achieve even higher finesse by coupling to the whispering gallery modes of quartz
microspheres [14-16] or microtoroidal resonators [23, 24] or to photonic bandgap resonators
[25, 26].
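
The quoted cavity figures can be cross-checked with a short Python sketch. It assumes that the rate $\kappa$ in (8) is a frequency half-width expressed in Hz (that is, $\kappa/2\pi$), takes the finesse to be the free spectral range $c/2L$ divided by the full linewidth $2\kappa$, and uses the frequency form of (3) for the quality factor. The function names are hypothetical.

```python
C = 299_792_458.0  # speed of light in m/s

def free_spectral_range(length_m):
    """Longitudinal mode spacing c / (2L) of a two-mirror cavity, in Hz."""
    return C / (2.0 * length_m)

def finesse(length_m, kappa_hz):
    """Free spectral range divided by the full linewidth 2*kappa."""
    return free_spectral_range(length_m) / (2.0 * kappa_hz)

def quality_factor(wavelength_m, kappa_hz):
    """Q = nu_c / (2 kappa), the frequency form of Q = omega_c / (2 kappa)."""
    return (C / wavelength_m) / (2.0 * kappa_hz)

# Example: L = 42.2 um, kappa = 4.1 MHz, wavelength = 852.4 nm.
F = finesse(42.2e-6, 4.1e6)
Q = quality_factor(852.4e-9, 4.1e6)
```

Under these assumptions the finesse comes out at a few times $10^5$, consistent with the quoted value to within the rounding of the inputs, and the quality factor is a few times $10^7$.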


3.  Experiments  in  the  regime  of  strong  coupling 

In  the  following  sections,  we  briefly  review  the  experimental  evolution  of  optical  cavity  QED 
in  the  regime  of  strong  coupling,  particularly  with  regard  to  those  techniques  developed 
by  our  group  at  Caltech.  The  goal  is  to  set  the  stage  for  a  description  of  the  new  set  of 
tools  made  available  by  the  recent  marriage  of  atom-cavity  systems  and  laser  cooling  and 
trapping.  A  common  theme  throughout  the  discussion  will  be  the  quest  to  localize  and  isolate 
a  single  atom  which  is  strongly  coupled  to  the  cavity  mode,  as  required  by,  for  instance, 
a  variety  of  schemes  for  implementing  quantum  computation  and  communication  protocols 
[27-30,25]. 

3.1.  Early  work 

The first observations of atoms strongly coupled to optical resonators came in the early
1990s with experiments involving atomic beam transits through the mode of a high-finesse
cavity [31, 32]. The average duration of each single atom transit was $T = 0.4\ \mu$s and
the flux of the beam was adjustable such that the average intracavity atom number $N \sim 1$.
By measuring the transmission of a weak probe of variable detuning about the caesium
($6S_{1/2}, F = 4 \rightarrow 6P_{3/2}, F' = 5'$) transition, the vacuum-Rabi splitting for one atom
(on average) was observed for the first time, albeit weighted over a range of values for
$g(\mathbf{r})$ and an ensemble of atoms (figure 2(a)) [32, 33]. Similar optical normal-mode spectra
have been obtained by the group of Feld using a beam of barium atoms [34], while direct
observation of atom-field Rabi nutation in the microwave domain was made by the group of
Haroche using a beam of rubidium atoms in Rydberg excited states [35].

Figure 2. Early one-atom vacuum-Rabi splitting measurements. (a) The vacuum-Rabi splitting
for $N = 1$ as measured with an atomic beam by Thompson et al [32]. Curve (i) represents
the theoretical transmission averaged over the relevant range of values for $g(\mathbf{r})$ and $N$, while (ii)
describes precisely one atom with maximal coupling $g_0$. (b) Vacuum-Rabi spectrum measured
using individual atomic transits from a MOT, as described by Hood et al [37]. The solid line is
the predicted spectrum based upon evolution of the master equation for maximally coupled atoms
while the dashed line represents couplings $g(\mathbf{r}) < g_0$.

The  brief  duration  of  each  transit  and  uncertainty  in  the  instantaneous  rate  of  atom-field 
coupling  presented  a  significant  limitation  on  the  amount  of  information  made  available  per 
atom  in  these  atomic  beam  experiments  [32-35].  Subsequent  experiments  instead  made  use 
of  a  cloud  of  cold  atoms  cooled  to  sub-Doppler  temperatures  in  a  magneto-optic  trap  (MOT) 
located  a  few  millimetres  above  the  cavity.  When  the  MOT  is  released,  some  fraction  of  the 
cold  atoms  fall  between  the  mirrors,  and  it  is  possible  to  observe  in  real  time  their  individual 
trajectories  and  the  durations  of  their  transits  [36].  Using  this  technique,  the  vacuum-Rabi 
spectrum  (figure  2(b))  was  extracted  from  the  transmission  of  a  weak  probe  interacting  with 
many individual transits (each of duration $T \sim 100\ \mu$s) on an atom-by-atom basis [37].
Likewise,  this  technique  has  also  enabled  measurements  of  the  nonlinear  optical  response  of 
individual  atoms  to  drive  fields  corresponding  to  n  1  intracavity  photon  [37].  It  should 
be  noted  that  while  each  individual  atom  contributes  significantly  more  information  to  the 
spectrum  in  this  experiment  than  in  those  with  atomic  beams,  the  data  in  figure  2(b)  is  still  the 
result  of  an  ensemble  average  over  many  atoms. 


3.2.  The  atom-cavity  microscope 

The marriage of laser-cooled atomic sources and cavity QED, discussed above, has
opened an exciting new regime in which the kinetic energy $K$ of atoms in transit through the cavity
is comparable to the energy $\hbar g(\mathbf{r})$ associated with the atom-field coupling. In this domain, the
presence of just one photon is sufficient to alter profoundly the atom's centre-of-mass motion.
Indeed if $\hbar g_0 > K$ and the cavity is slightly detuned from the bare atomic resonance, then the
dressed state $|-\rangle_1$, as discussed above, is confined by an attractive pseudopotential due to the
intracavity field [38-40]. The depth of the pseudopotential wells is determined by
the probe intensity and the atom-cavity detuning, and the wells have facilitated atomic trapping times
$T \sim 0.5$ ms.
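
The comparison between $\hbar g_0$ and the kinetic energy can be made concrete with a rough Python estimate. The sketch below assumes $g_0 = 2\pi \times 34$ MHz (so that $\hbar g_0 = h \times 34$ MHz), expresses energies in temperature units by dividing by Boltzmann's constant, and estimates the kinetic energy gained by a caesium atom in free fall from a hypothetical release height of 3 mm, chosen here only as an example of "a few millimetres". The function names are hypothetical.

```python
H_PLANCK = 6.62607015e-34          # Planck constant, J s
K_B = 1.380649e-23                 # Boltzmann constant, J/K
M_CS = 132.905 * 1.66053907e-27    # mass of a caesium-133 atom, kg
G_ACCEL = 9.80665                  # standard gravity, m/s^2

def coupling_energy_kelvin(g0_hz):
    """hbar*g0 in kelvin, for g0 given as g0/(2 pi) in Hz."""
    return H_PLANCK * g0_hz / K_B

def fall_energy_kelvin(height_m, mass_kg=M_CS):
    """Kinetic energy m*g*h gained in free fall, in kelvin."""
    return mass_kg * G_ACCEL * height_m / K_B

coupling = coupling_energy_kelvin(34e6)   # about 1.6 mK
kinetic = fall_energy_kelvin(3e-3)        # about 0.5 mK for a 3 mm drop
```

Under these assumptions the coupling energy exceeds the free-fall kinetic energy by a factor of a few, which is the condition $\hbar g_0 > K$ required for single-photon trapping.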
Figure 3. Diagram of the atom-cavity microscope. (a) Cold atoms are delivered to the cavity
mode by releasing the contents of a magneto-optic trap (MOT) a few millimetres above the cavity.
The transmission of a weak probe beam is recorded using balanced heterodyne detection [38].
(b) A reconstruction of the trajectory of a single atom bound to the quantized intracavity field
with n = 1 photon. Animations of transits reconstructed using this technique are available at
http://www.its.caltech.edu/~qoptics/atomorbits/.


This  technique  for  intracavity  confinement  enabled  the  so-called  atom-cavity  microscope, 
a  protocol  for  reconstructing  the  trajectories  of  single  atoms  based  upon  the  variations  in 
transmission  of  a  weak  probe  beam  (which  also  generates  the  trapping  pseudopotential). 
Experimentally,  as  a  caesium  atom  enters  the  mode  of  the  cavity  its  presence  is  detected  in  real 
time. The resulting signal is used to switch rapidly the intensity of the probe, which is detuned
to the red vacuum-Rabi sideband at $(\omega_a - g_0)$, in order to form the trap, pushing atoms
towards larger values of $g(\mathbf{r})$. The transmitted probe field is then monitored and recorded via
heterodyne  detection.  For  each  individual  atomic  transmission  profile,  which  is  indicative  of 
the  instantaneous  rate  of  coherent  coupling  to  field,  it  is  then  possible  to  extract  information 
about  the  orbit  in  which  the  confined  atoms  move  (such  as  that  shown  in  figure  3(b)).  This 
work  represents  a  first  step  towards  addressing  the  motion  of  an  atom  strongly  coupled  to 
a  cavity,  but  is  limited  in  the  sense  that  the  QED  interaction  and  the  trapping  potential  are 
intertwined,  precluding  the  ability  to  address  and  control  the  state  of  the  atom  while  confined 
by  the  trap. 


3.3. State-insensitive cooling and trapping


While  atomic  confinement  using  the  quantized  cavity  QED  field  offers  an  important  advance 
towards  the  realization  of  well-localized,  trapped  atoms,  it  remains  preferable  to  decouple 
the  trap  from  the  QED  interaction.  Towards  this  goal,  a  number  of  groups  [41-44]  have 
successfully  implemented  optical  dipole  force  traps  (also  known  as  far  off-resonant  traps  or 
FORTs)  consisting  of  a  far-detuned  optical  beam  able  to  induce  a  dissipative,  attractive  force 
on  an  atom  and  yet  only  weakly  drive  atomic  transitions. 

The principle of operation for a FORT, as applied to a simple two-level atomic model, is
rather straightforward [45]. The FORT beam, detuned by $\delta$ from the resonant frequency of
the transition, $\omega_a$, induces an ac-Stark shift,

$$\Delta E_{g,e} = \pm\frac{3\pi c^2\gamma}{\omega_a^3}\,\frac{I(\mathbf{r})}{\delta}, \tag{9}$$


of equal magnitude but opposite sign for each of the excited and ground states. For a red-detuned
field ($\delta < 0$), an atom in its ground state with kinetic energy $K < U_{\mathrm{dip}} = \Delta E_g(\mathbf{r})$
is confined in a potential which varies spatially as the intensity of the FORT beam (likewise,
an atom in its excited state will, rather undesirably, see a repulsive potential). This process
is readily extrapolated to more complicated, multilevel systems such as the alkali atoms
commonly used in cavity QED, though the overall functional form of (9) remains similar [46].

The earliest implementations of intracavity FORT fields for cavity QED used lasers at
frequencies an integer number of cavity free spectral ranges away from the probe field [44].
In so doing, a cavity which is length-stabilized to be resonant with the TEM$_{00}$ mode of
the probe will simultaneously support the TEM$_{00}$ mode of the FORT as well. The resultant
intracavity standing wave provides a series of potential wells into which sufficiently cold atoms
can be loaded. The first experiment of this sort involved trapping caesium in a FORT with
$\lambda_{\mathrm{FORT}} = 869$ nm, characterized by a trap lifetime $T \approx 28$ ms, which was evidently limited by
parametric heating and FM-to-AM noise conversion due to the relatively high finesse of the
cavity at the FORT wavelength.
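
The co-resonance condition can be sketched numerically. The Python code below treats an idealized two-mirror cavity in which a mode of order q satisfies $q\lambda/2 = L$, and lists the wavelengths of modes lying a given number of free spectral ranges below the probe mode in frequency. It ignores the wavelength-dependent phase shift of real mirror coatings, so the numbers are indicative only, and it cannot establish which mode orders any particular experiment used. The function names are hypothetical, and the example length and wavelength are those quoted in section 2.4.

```python
def mode_order(length_m, wavelength_m):
    """Number of half-wavelengths that fit in the cavity (ideal mirrors)."""
    return 2.0 * length_m / wavelength_m

def co_resonant_wavelengths(probe_wavelength_m, probe_order, offsets):
    """Wavelengths of modes `k` free spectral ranges below the probe mode.

    The cavity length is fixed by requiring the probe to sit exactly on
    the mode of order `probe_order`.
    """
    length = probe_order * probe_wavelength_m / 2.0
    return {k: 2.0 * length / (probe_order - k) for k in offsets}

q = round(mode_order(42.2e-6, 852.4e-9))            # nearest integer mode order
neighbours = co_resonant_wavelengths(852.4e-9, q, [1, 2])
```

For a probe at 852.4 nm in a cavity of roughly 42 micrometres, neighbouring longitudinal modes are separated by several nanometres, so a FORT laser a small integer number of mode orders away is detuned from the atomic resonance by many nanometres.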

These early difficulties were remedied by choosing a value $\lambda_{\mathrm{FORT}}$ for which the differential
ac-Stark shift between excited and ground states was very nearly zero and at which the cavity
mirrors were less reflective. Certain atomic lines, including the caesium D2 transition, have the
property that for a narrow range of wavelengths the ac-Stark shifts of the excited and ground
state manifolds are in the same (trapping) direction [47-49]. By considering couplings to the
full manifold of caesium excited states it is possible to recognize a nearly state-insensitive
trapping potential (i.e., with only small differential ac-Stark shifts) at $\lambda_{\mathrm{FORT}} = 935.6$ nm,
colloquially referred to as the 'magic' wavelength of caesium [50, 41]. Indeed, it has been
shown that an intracavity FORT at this wavelength with a depth corresponding to $U_0 \sim 3$ mK
allows for trap lifetimes $T \sim 3$ s 'in the dark' (i.e., in the absence of QED fields), likely limited
by the rate of collision with background gas, and $T \sim 1$ s even in the presence of a probe
field [41]. This technique constitutes a milestone in optical cavity QED, opening the door to
a myriad of experiments involving one-and-the-same atom. These exciting new experiments
are to be discussed in the next section.

The  use  of  optical  dipole  traps  for  localization  of  atoms  within  the  mode  of  a  high-finesse 
cavity  has  developed  rapidly  in  the  past  two  years.  In  addition  to  the  work  described  above,  the 
group  of  Rempe  at  the  Max  Planck  Institute  in  Garching  has  also  implemented  the  co-resonant 
FORT technique at λ_FORT = 785 nm for trapping rubidium [43]. The group of Chapman at
Georgia  Tech  has  developed  a  novel  scheme  whereby  an  optical  dipole  lattice  transverse  to 
the  cavity  axis  acts  as  a  ‘conveyor  belt’  [51],  transporting  rubidium  atoms  into  and  out  of  a 
region  of  interaction  with  a  weak  probe  field  which  is  used  to  monitor  the  state  of  the  system 
in  real-time  [42].  Beyond  the  use  of  optical  traps,  the  groups  of  Walther  and  Blatt  have  made 
tremendous  progress  towards  coupling  single,  trapped  atomic  ions  to  high-finesse  cavities 
[52,  53].  While  these  experiments  have  yet  to  enter  the  regime  of  strong  coupling,  practically 
limitless  trap  lifetimes  and  the  powerful  tools  for  coherent  control  which  have  already  been 
developed  within  the  ion  trap  community  [54-57]  hold  great  promise  for  future  integration 
with  optical  cavities. 


4.  Experiments  with  trapped  atoms 

The development of a state-insensitive trap for strongly coupled atoms has not only extended
the duration of individual transits far beyond the characteristic time scales for coherent
atom-field coupling (i.e., τ ~ 3 s whereas g0⁻¹ ~ 30 ns), but has also opened a domain wherein complex
experimental protocols can be performed using one-and-the-same atom. The cavity QED
group  at  Caltech,  working  with  atomic  caesium,  has  begun  exploring  this  domain  with  schemes 
for  the  generation  of  single  photons,  with  measurements  of  the  full  vacuum-Rabi  spectrum  for 
just  one  atom,  and  with  a  new  technique  for  in  situ  control  of  the  system. 

Figure 4. The one-atom laser. (a) Schematic diagram of the one-atom laser. Fields Ω3,4 are applied
to a single caesium atom confined by an intracavity FORT, producing excitation in the TEM00
mode of the cavity. Photons which escape mirror M2 are detected by single photon-counting
avalanche photodiodes D1,2. (b) Energy level diagram for the one-atom laser, as discussed in the
text.


4.1.  The  one-atom  laser 

An  optically  driven  single  atom  bound  to  a  resonant  cavity  is,  in  some  sense,  a  laser  extrapolated 
to  its  fundamental,  conceptual  limit  (theoretical  descriptions  of  such  devices  have  existed  for  a 
number  of  years  [58-67]).  In  this  analogy,  the  single  atom  serves  as  a  gain  medium  which,  as 
it  ‘lases’,  couples  photons  into  the  resonant  mode  of  the  cavity.  However,  the  emission  from 
such a device (like the device itself) is qualitatively very unlike a conventional (semi-classical)
laser,  exhibiting  a  variety  of  manifestly  quantum  properties. 

In a recent experiment to explore these phenomena [68, 69], a field Ω3 is applied from the
side of the cavity (i.e., transverse to the cavity axis) which pumps the caesium atom from the
(6S1/2, F = 3) ground state to the (6P3/2, F' = 3') excited state (see figure 4). The atomic
population is rapidly transferred to (6S1/2, F = 4) via strong coupling to the cavity (which is
tuned to be resonant with the (6P3/2, F' = 3' → 6S1/2, F = 4) transition), leaving a single
excitation in the ‘laser’ mode. A second field Ω4, continuously applied to the atom, pumps to
(6P3/2, F' = 4') from which spontaneous emission returns the system to its (initial) ground
state. As the resulting photons exit the cavity at a rate 2κ, their arrival times are recorded
using a pair of single photon-counting avalanche photodiodes. Experimentally, we observe
‘thresholdless lasing’ and a maximum intracavity photon number which is rate-limited due to
the (irreversible) recycling of the one atom through its energy levels by Ω3,4. Moreover, the
one-atom  laser  exhibits  photon  antibunching  and  sub-Poissonian  photon  statistics,  evidence 
that  it  is  a  manifestly  quantum  light  source.  For  comparison  with  our  experimental  results,  we 
have  extensively  analysed  the  theory  of  ‘lasing’  in  the  strong-coupling  regime,  as  discussed 
in  [69]. 


4.2.  Deterministic  generation  of  single  photons 

While  the  one-atom  laser  produces  a  nonclassical  stream  of  photons,  it  is  also  possible  to 
add  an  extra  layer  of  control  to  the  system  in  order  to  reliably  and  deterministically  produce 
one photon on demand. We have shown that by iteratively pulsing the Ω3,4 beams, a single
photon can be generated in the cavity mode with near unit efficiency [70]. Operationally,
the Ω3(t) pulse, in concert with an atom strongly coupled to a cavity on resonance with
(6P3/2, F' = 3' → 6S1/2, F = 4), drives coherent, adiabatic transfer of atomic population
between hyperfine ground states [71]. Simultaneously, this process introduces a single photon
into the intracavity field, which subsequently escapes with a spatial profile fixed by the TEM00
mode of the cavity and with a temporal profile determined by that of the Ω3(t) pulse. A short
pulse of Ω4 light incoherently repumps the atom to the F = 3 manifold and resets the system
to  generate  subsequent  photons. 

Figure 5. Single photon generation on demand. (a) The temporal profile of single photons
generated from one caesium atom strongly coupled to a cavity [70]. The full-width at
half-maximum is τ_FWHM = 120 ns and the long tail is the result of inhomogeneous preparation of initial
Zeeman states. (b) The correlation function C(τ) as a function of time delay τ for counts from two
single photon-counting photodiodes. The significant suppression of the peak at τ = 0 indicates
that spurious two-photon events are rare.


In the regime of strong coupling, the rate g(r) is sufficiently large that single photons are
generated within the cavity with inferred efficiency φ_G = 1.15 ± 0.18, where the uncertainty
is determined from measurements of systematic losses and rare occurrences of two-photon
events. Unpolarized photons escape the cavity with efficiency (69 ± 10)% and are detected by
two photoelectric detectors at the ports of a 50/50 beam splitter, leading to an overall detection
efficiency of ~2.4% (intracavity photon to photoelectric event). The temporal full-width at
half-maximum of the photons is τ_FWHM = 120 ns for an Ω3(t) pulse of uniform intensity
over a duration of 1 μs (see figure 5(a)). The mean trap lifetime for a single atom during
this process was 140 ms, limited by heating from Ω3,4, allowing 1.4 × 10^4 single photons to
be generated from each individual atom. The small probability of a two-photon event taking
place is clearly evident from the suppression of a peak at τ = 0 in the correlation function
between the two detectors, figure 5(b). As a metric for how closely this system approximates
an ideal single-photon source we consider the quantity R, which quantifies the suppression
of two-photon events relative to a coherent state (R ~ 1 for a weak coherent state and
approaches infinity for an ideal single-photon source). Averaged over all generation attempts,
we measure R = 15.9 ± 1.0, which approaches R ≈ 150 when considering only attempts
occurring late in the trapping interval. We thereby demonstrate the predominantly single-photon
character of our source and hypothesize that rare two-photon events are principally
the result of ‘contamination’ from rare occurrences in which two atoms are simultaneously
loaded into the trap, both generating photons [70].
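
As a rough consistency check on the figures quoted above, the short sketch below derives the average time per generation attempt implied by the trap lifetime and the photon count per atom. The cycle time and repetition rate are inferences from those two numbers, not values stated in the text, and the detected-event estimate assumes one intracavity photon per attempt.

```python
# Bookkeeping for the single-photon source, using only numbers quoted in the text.
trap_lifetime_s = 140e-3       # mean trap lifetime during photon generation
photons_per_atom = 1.4e4       # single photons generated per trapped atom
detection_efficiency = 0.024   # intracavity photon -> photoelectric event

# Implied average time per generation attempt (an inference, not a quoted value).
cycle_time_s = trap_lifetime_s / photons_per_atom
repetition_rate_hz = 1.0 / cycle_time_s

# Expected photoelectric events per atom, assuming one intracavity photon per attempt.
detected_per_atom = photons_per_atom * detection_efficiency

print(f"implied cycle time     : {cycle_time_s * 1e6:.1f} us")
print(f"implied repetition rate: {repetition_rate_hz / 1e3:.0f} kHz")
print(f"detected events/atom   : {detected_per_atom:.0f}")
```

On these assumptions each atom yields only a few hundred detected events, which is why statistics such as R are accumulated over many loaded atoms.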

The  non-negligible  likelihood  of  two  or  more  atoms  being  loaded  into  the  FORT  is  an 
unavoidable  consequence  of  our  Poissonian  technique  for  introducing  cold  atoms  to  the  cavity. 
As in earlier work, a cloud of ~10^5 sub-Doppler cooled caesium atoms is released from a
MOT  and  allowed  to  fall  freely  through  the  narrow  space  between  the  two  cavity  mirrors 
which,  due  to  its  prohibitive  geometry,  admits  only  a  few  atoms  per  attempt.  We  can  control 
the  mean  number  by  adjusting  the  efficiency  of  the  cooling  beams  which  prepare  the  MOT, 
but  not  without  detriment  to  the  frequency  with  which  we  load  single  atoms.  As  a  solution 
to  this  problem,  the  Caltech  group  has  developed  a  technique  whereby  we  can  observe  in  real 
time discrete, ‘step-like’ increases in the transmission of the probe beam as, one-by-one, multiple
atoms  are  heated  out  of  the  FORT  [72].  In  future  work,  this  protocol  will  provide  an  efficient 
means  for  ensuring  that  precisely  N  =  1  atom  is  always  loaded,  or  that  a  certain  number 
N > 1 is always loaded, as required by some schemes for processing quantum information
[27, 73].
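
The trade-off described above, between suppressing multi-atom loads and keeping a useful rate of single-atom loads, can be illustrated with Poisson statistics. This is a minimal sketch; the mean atom numbers are hypothetical and are not taken from the experiment.

```python
import math

def poisson_pmf(n: int, mean: float) -> float:
    """Probability of loading exactly n atoms when the mean number is `mean`."""
    return math.exp(-mean) * mean**n / math.factorial(n)

def loading_summary(mean: float) -> dict:
    """Probabilities of empty, single-atom and multi-atom loading attempts."""
    p0 = poisson_pmf(0, mean)
    p1 = poisson_pmf(1, mean)
    p_multi = 1.0 - p0 - p1
    return {
        "empty": p0,
        "single": p1,
        "multi": p_multi,
        # Fraction of non-empty attempts contaminated by two or more atoms.
        "multi_given_loaded": p_multi / (1.0 - p0),
    }

for mean in (0.05, 0.2, 1.0):  # hypothetical mean atom numbers per attempt
    s = loading_summary(mean)
    print(f"mean={mean:4.2f}  P(1)={s['single']:.3f}  "
          f"P(>=2 | loaded)={s['multi_given_loaded']:.3f}")
```

Lowering the mean reduces contamination among loaded trials but also lowers the probability of loading one atom at all, which is the detriment noted in the text.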


4.3. Intracavity Raman transitions and sideband cooling


Beyond  these  advances  enabled  by  the  state-insensitive  FORT  [41,  70,  72],  there  exist  a 
number  of  experimental  parameters  which  remain  inaccessible  without  more  precise  control 
over  the  atom  and  its  motion.  Among  these  are  the  location  of  the  atom  r  relative  to  the  QED 
field (and, implicitly, the range of coupling rates g(r) that the atom experiences as it moves within
the  mode  of  the  cavity)  and  the  characteristics  of  the  ambient  magnetic  field  at  r.  In  order  to 
resolve  these  concerns  and  gain  some  a  priori  knowledge  about  atoms  trapped  in  our  FORT, 
we  have  developed  a  novel  technique  for  driving  stimulated  Raman  transitions  between  the 
hyperfine  ground  states  of  an  atom  in  the  FORT. 

The  presence  of  the  cavity  places  certain  geometric  constraints  on  the  beams  which 
drive  these  transitions,  so  we  have  chosen  to  use  the  intracavity  FORT  field  not  only  as 
a trap but also as one arm of the characteristic Raman Λ configuration. The second
arm is an auxiliary ‘Raman’ laser, phase-locked and orthogonally polarized relative to the
FORT and offset by a frequency Δ_Raman = Δ_HF + δ, where Δ_HF = 9.192 632 GHz is the
caesium 6S1/2 (F = 3, F = 4) hyperfine splitting and δ is a variable frequency detuning. Both lasers
drive the same cavity mode, with the Raman beam off cavity resonance by Δ_Raman, and
generate intracavity fields with individual Rabi frequencies (Ω_FORT, Ω_Raman). In the limit that
Δ' = (ν_A − ν_FORT) ≫ {Ω_FORT, Ω_Raman, Δ_Raman, δ, γ} (where ν_A, ν_FORT are the frequencies of the
caesium D2 line and the FORT laser, respectively), the net effective Rabi frequency is given by

$$\Omega_E = \frac{\Omega_{\mathrm{FORT}}\,\Omega_{\mathrm{Raman}}}{2\Delta'} \qquad (10)$$
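
To give a feel for the scales in equation (10), the sketch below evaluates the one-photon detuning Δ' from the FORT wavelength quoted in the text and a caesium D2 wavelength of 852.4 nm (an assumed value for this illustration), then applies the formula. The two single-beam Rabi frequencies are hypothetical placeholders chosen only to satisfy the large-detuning limit; all frequencies are ordinary (not angular) frequencies in Hz.

```python
C = 299_792_458.0  # speed of light in m/s

def optical_frequency_hz(wavelength_m: float) -> float:
    """Optical frequency corresponding to a vacuum wavelength."""
    return C / wavelength_m

def effective_rabi_frequency(omega_fort: float, omega_raman: float,
                             delta_prime: float) -> float:
    """Equation (10): two-photon Rabi frequency in the large-detuning limit.

    All three arguments must be expressed in the same frequency units.
    """
    return omega_fort * omega_raman / (2.0 * delta_prime)

LAMBDA_D2_M = 852.4e-9    # caesium D2 line (assumed value for this sketch)
LAMBDA_FORT_M = 935.6e-9  # 'magic' FORT wavelength quoted in the text

delta_prime_hz = (optical_frequency_hz(LAMBDA_D2_M)
                  - optical_frequency_hz(LAMBDA_FORT_M))

# Hypothetical single-beam Rabi frequencies, for illustration only.
omega_fort_hz = 40e9
omega_raman_hz = 1e8

omega_e_hz = effective_rabi_frequency(omega_fort_hz, omega_raman_hz, delta_prime_hz)
print(f"one-photon detuning  : {delta_prime_hz / 1e12:.1f} THz")
print(f"effective Rabi freq. : {omega_e_hz / 1e3:.0f} kHz (hypothetical inputs)")
```

The detuning of roughly 31 THz is many orders of magnitude larger than the 9.2 GHz hyperfine splitting, which is the regime in which equation (10) applies.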


As the detuning δ is varied, it is possible to map out a spectrum of resonances between the
various Zeeman and motional substates of the two hyperfine ground manifolds. After an atom
is cooled into the FORT, a small magnetic field is applied along the cavity axis to break any
degeneracy of Zeeman states so the atom may be optically pumped into the (F = 3, m_F = 0)
ground state. The Raman laser at a fixed detuning δ is then pulsed ON for a duration τ_Raman
in order to drive a θ_Raman = τ_Raman Ω_E rotation in the (F = 3, 4) basis. Next, a probe pulse
resonant with the (F = 4 → F' = 5') transition detects whether the atom is in the F = 4
ground state (using the type of cavity QED interaction discussed in the first section; large
(small) transmission of the probe means the atom is decoupled from (coupled to) the cavity,
namely in the F = 3 (F = 4) ground state). Many iterations of this procedure yield the
probability P(δ) to drive a Raman transition at that specific detuning. As δ is varied, the result
is a spectrum such as that in figure 6(a), showing a clearly resolved peak corresponding to
the carrier (F = 3, m_F = 0) → (F = 4, m_F = 0) transition (with linewidth Ω_E) as well
as sidebands corresponding to Δn = ±2 vibrational transitions, which will be discussed in
detail below. By scanning over a broader range, it is possible to map out the full spectrum of
Δm_F = 0 transitions (or Δm_F = ±1, in a transverse bias field) from which information about
the magnitude and direction of the local magnetic field can be extracted. Similarly, when the
Raman pulse duration τ_Raman at δ ~ 0 MHz is scanned in steps much smaller than Ω_E⁻¹, it is
possible to map out the so-called ‘Rabi flopping,’ the coherent transfer of atomic population
between ground states (figure 6(b)).
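
The pulse-and-probe procedure above measures a transfer probability that, for an ideal two-level system, follows the textbook Rabi formula. The sketch below implements that idealized formula only; it ignores decoherence, motional sidebands and the loss of contrast seen in the measured data, and the Rabi frequency used is a hypothetical value.

```python
import math

def transfer_probability(omega_e: float, delta: float, tau: float) -> float:
    """Ideal two-level probability of F = 3 -> F = 4 transfer after a pulse.

    omega_e : effective Rabi frequency (rad/s)
    delta   : two-photon detuning from the carrier (rad/s)
    tau     : pulse duration (s)
    """
    omega_gen = math.hypot(omega_e, delta)  # generalized Rabi frequency
    if omega_gen == 0.0:
        return 0.0
    return (omega_e / omega_gen) ** 2 * math.sin(0.5 * omega_gen * tau) ** 2

def pi_pulse_duration(omega_e: float) -> float:
    """Pulse length for which the rotation angle tau * omega_e equals pi."""
    return math.pi / omega_e

omega_e = 2 * math.pi * 100e3  # hypothetical effective Rabi frequency (rad/s)
t_pi = pi_pulse_duration(omega_e)

# Scan the pulse duration in steps much smaller than 1/omega_e.
for k in range(0, 9):
    tau = k * t_pi / 4
    print(f"tau = {tau * 1e6:5.2f} us  P = {transfer_probability(omega_e, 0.0, tau):.3f}")
```

Scanning `delta` at fixed `tau` in the same function gives the idealized carrier lineshape, whose width is set by the effective Rabi frequency.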

[Figure 6 axes: (a) Raman detuning (MHz); (b) Raman pulse duration (μs).]

Figure 6. A Raman spectrum. (a) The ‘carrier’ transition (m_F = 0 → m_F = 0) and Δn = ±2
motional sidebands measured by driving stimulated Raman transitions between the hyperfine
ground states of a single caesium atom confined in an intracavity FORT. Imposed over the trace
are the results of cooling measurements undertaken on each of the resonances. The extended trap
lifetime observed by driving the red transition constitutes empirical evidence of Raman sideband
cooling. (b) ‘Rabi flopping’ on the Δm_F = 0 ‘carrier’ transition as discussed in the text.

Within the context of atomic systems bound to harmonic potentials, Raman transitions
between quantized vibrational levels have become a standard tool. Raman sideband cooling
is an essential component of experiments involving trapped ions [54] and has also been
demonstrated for alkali atoms in free space [74, 75]. In the Fock basis we denote the
vibrational state of an atom in our FORT as n. By tuning our Raman beam (blue, red) of
the carrier transition by twice the harmonic frequency of our FORT, it is possible to excite the
atom from n to (n + 2, n − 2), the two sidebands observed in figure 6(a). Note that because the
fields associated with the Raman and FORT beams are symmetric with respect to the boundary
conditions set by the cavity, only even-numbered Δn transitions are allowed. The positions of
the sidebands in the figure correspond reasonably well to the expectation ±2ν0 = ±1.1 MHz,
where ν0 is the vibrational frequency for harmonic motion at the antinode of the FORT, which
is independently determined from the FORT and caesium parameters. However, we do not yet
understand the observed lineshapes nor the loss of contrast evidenced in figure 6, which may
be due to the considerable anharmonicity of the FORT for weakly bound atoms. Preliminary
attempts at cooling centre-of-mass motion by driving Raman vibrational sidebands empirically
suggest that we can extend the lifetime of trapped atoms within our cavity, as indicated in
the bar plots superimposed over the Raman scan. In a series of trials, atoms loaded into the
FORT were driven on one of the blue, red or carrier transitions while being probed at regular
intervals. For each, the corresponding trapping durations τ, represented by the relative sizes
of the bars, and probabilities P that an atom loaded at the beginning of a 1 s long trial would
survive until the end of the trial were recorded. On average, atoms driven on the red (cooling)
sideband remained in the cavity substantially longer than those driven on the other motional
transitions.
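
The link between the sideband spacing and the trap parameters can be sketched with the harmonic approximation for a standing-wave dipole trap, U(z) ≈ −U0 cos²(2πz/λ), which gives an axial frequency ν0 = (1/λ)·sqrt(2U0/m). This model and the caesium mass value are assumptions of the sketch, not a description of how ν0 was determined in the experiment.

```python
import math

K_B = 1.380649e-23                 # Boltzmann constant, J/K
AMU = 1.66053906660e-27            # atomic mass unit, kg
M_CS = 132.905 * AMU               # mass of caesium-133 (assumed value)

def axial_frequency_hz(depth_j: float, wavelength_m: float) -> float:
    """Axial vibrational frequency at the bottom of one standing-wave well."""
    return math.sqrt(2.0 * depth_j / M_CS) / wavelength_m

def depth_from_frequency_j(nu0_hz: float, wavelength_m: float) -> float:
    """Trap depth implied by a measured axial frequency (inverse of the above)."""
    return 0.5 * M_CS * (nu0_hz * wavelength_m) ** 2

LAMBDA_FORT_M = 935.6e-9
nu0_hz = 1.1e6 / 2                 # sidebands at +/- 2*nu0 = +/- 1.1 MHz

depth_j = depth_from_frequency_j(nu0_hz, LAMBDA_FORT_M)
print(f"implied trap depth U0/kB = {depth_j / K_B * 1e3:.2f} mK")
print(f"check: nu0 = {axial_frequency_hz(depth_j, LAMBDA_FORT_M) / 1e3:.0f} kHz")
```

Within this simple model the quoted sideband positions correspond to a depth of about 2 mK, the same order as the U0 ~ 3 mK mentioned earlier for the state-insensitive FORT.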

4.4. The vacuum-Rabi spectrum of one-and-the-same atom

Assisted  by  the  capability  to  perform  axial  cooling  using  Raman  transitions,  our  group  was 
recently  able  to  undertake  measurements  of  the  vacuum-Rabi  spectrum  for  one-and-the-same 
atom.  This  is  in  contrast  to  the  measurements  discussed  above  [32,  37]  and  other  recent 
observations  made  with  atoms  trapped  in  a  FORT  [76],  all  of  which  require  averaging  over 
a large ensemble of atoms (e.g., 10^3-10^4 atoms in [76]). After a single atom is loaded into
the FORT, a probe beam of frequency ω_p, varied over a range near the atom-cavity resonance
(ω_A = ω_C1), is mode-matched into the cavity, and the transmission of the probe T(ω_p) is
recorded. After each probe interval, a cycle of Raman sideband cooling is interspersed before
the probe frequency is iterated forwards and the process repeated. By choosing only those
trials for which the empty cavity resonance at ω_p = ω_C1 is suppressed (i.e., those with strong
resonant absorption indicating that an atom is coupled to the cavity), we are able to obtain
the vacuum-Rabi spectrum for precisely one atom (six of which, for six separate atoms, were
randomly selected and are shown in figure 7) [77].

Figure 7. Vacuum-Rabi splitting of one-and-the-same atom. The complete vacuum-Rabi
transmission spectra T(ω_p) for six atoms, selected at random from a pool of 28 such spectra.
The error bars represent statistical uncertainties. The blue trace is the solution of the master
equation for the system [77].

For  comparison,  also  included  in  figure  7  is  the  solution  to  the  steady-state  master  equation 
for  this  system,  incorporating  only  known  experimental  parameters  and  averaged  over  the  top 
1/3 of FORT wells for which g(r) is closest to its maximum value, g0 (i.e., g(r) > 0.87 g0).
The results are in quite good agreement with the data, and the characteristic two-peaked
vacuum-Rabi  structure  is  clearly  present  for  each  atom.  The  asymmetric  features  of  the 
spectrum  (i.e.,  peak  heights,  centroid  locations)  are  principally  the  result  of  the  small  Zeeman 
state-dependent  ac-Stark  shifts  induced  by  the  FORT  in  conjunction  with  optical  pumping 
effects  due  to  the  probe.  In  summation,  these  spectra  contain  detailed  quantitative  information 
about g(r), indicating that atoms trapped and cooled within the FORT exist in a narrow range
of  near  maximal  values.  This  result  is  emblematic  of  the  type  of  measurement  which  we 
expect  the  Raman  technique  to  enable  in  the  future. 
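
The two-peaked structure discussed above can be reproduced qualitatively with the standard weak-driving transmission formula for a two-level atom in a single-mode cavity with coincident resonances. This is only a sketch: it is not the multi-level master-equation solution used for figure 7, it produces a symmetric spectrum, and the rates (g0, κ, γ)/2π = (34, 4.1, 2.6) MHz are assumed here for illustration.

```python
def transmission(delta: float, g: float, kappa: float, gamma: float) -> float:
    """Weak-probe cavity transmission versus probe detuning `delta`.

    Normalized to 1 for an empty cavity (g = 0) probed on resonance.
    All rates share the same units; kappa and gamma are amplitude decay rates.
    """
    numerator = kappa * (gamma - 1j * delta)
    denominator = (kappa - 1j * delta) * (gamma - 1j * delta) + g**2
    return abs(numerator / denominator) ** 2

G0, KAPPA, GAMMA = 34.0, 4.1, 2.6   # MHz, illustrative values

detunings = [0.5 * k for k in range(-120, 121)]          # -60 ... +60 MHz
spectrum = [transmission(d, G0, KAPPA, GAMMA) for d in detunings]
peak_detuning = max(zip(spectrum, detunings))[1]

print(f"T at line centre      : {transmission(0.0, G0, KAPPA, GAMMA):.1e}")
print(f"|peak detuning|       : {abs(peak_detuning):.1f} MHz")
print(f"empty-cavity T(0)     : {transmission(0.0, 0.0, KAPPA, GAMMA):.2f}")
```

The suppression at line centre is the signature used in the text to select trials in which an atom is coupled to the cavity, and the peak positions track the coupling rate g(r).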


5.  Conclusion 

We  have  discussed  the  evolution  of  experiments  in  optical  cavity  quantum  electrodynamics, 
emphasizing  those  recent  experiments  enabled  by  intracavity  state-insensitive  optical  dipole 
trapping.  This  work  includes  a  demonstration  of  a  one-atom  laser,  deterministic  generation 
of  single  photons,  the  capacity  for  driving  stimulated  Raman  transitions  between  hyperfine 
ground  states  of  a  trapped  atom,  and  the  observation  of  the  vacuum-Rabi  spectrum,  the 
hallmark  of  strong  coupling,  for  one-and-the-same  atom. 



In  recent  years,  many  advances  in  cavity  QED  have  been  driven  by  an  interplay  with 
quantum  information  science  [78].  In  the  near  future,  the  tools  discussed  above  will  begin 
to  play  an  important  role  as  it  becomes  possible  to  perform  complex  quantum  information 
protocols  [27-30].  In  particular  it  may  even  be  possible,  via  Raman  sideband  cooling,  to  enter 
a  regime  requiring  a  quantized  treatment  for  all  degrees  of  freedom  in  QED,  namely  the  state 
of  the  atom,  its  centre-of-mass  motion,  and  the  field  to  which  it  is  coupled.  This  hearkens 
back  to  Einstein  and  his  introduction  of  the  quantum  hypothesis,  which,  one  hundred  years 
ago,  set  the  stage  for  the  remarkable  experimental  advances  of  today. 

Photon blockade in an optical cavity with one
trapped atom

At low temperatures, sufficiently small metallic¹ and semiconductor²
devices exhibit the ‘Coulomb blockade’ effect, in which
charge transport through the device occurs on an electron-by-electron
basis³. For example, a single electron on a metallic island
can block the flow of another electron if the charging energy of the
island greatly exceeds the thermal energy. The analogous effect of
‘photon blockade’ has been proposed for the transport of light
through an optical system; this involves photon-photon interactions
in a nonlinear optical cavity⁴⁻¹³. Here we report observations
of photon blockade for the light transmitted by an optical
cavity containing one trapped atom, in the regime of strong atom-cavity
coupling¹⁴. Excitation of the atom-cavity system by a first
photon blocks the transmission of a second photon, thereby
converting an incident poissonian stream of photons into a sub-poissonian,
anti-bunched stream. This is confirmed by measurements
of the photon statistics of the transmitted field. Our
observations of photon blockade represent an advance over
traditional nonlinear optics and laser physics, into a regime
with dynamical processes involving atoms and photons taken
one-by-one.

An analogy between electron transport in mesoscopic electronic
devices and photon transport through strongly coupled optical
systems was originally suggested in ref. 5. These authors proposed
that an effect similar to Coulomb blockade for electrons¹⁻³ might be
possible for photons by using photon-photon interactions in a
nonlinear optical cavity⁵. In this scheme, strong dispersive interactions
enabled by electromagnetically induced transparency (EIT)
cause the presence of a ‘first’ photon within the cavity to block the
transmission of a ‘second’ photon, leading to an ordered flow of
photons in the transmitted field.

After resolution of an initial difficulty⁶, subsequent work has
confirmed that such photon blockade is indeed feasible for a single
intracavity atom by way of a multi-state EIT scheme⁷⁻⁹. Photon
blockade is possible in other settings, including in concert with
Coulomb blockade¹⁰ and via tunnelling with localized surface
plasmons¹¹. Photon blockade has also been predicted for a two-state
atom coupled to a cavity mode⁴,⁹,¹²,¹³. As illustrated in Fig. 1a, the
underlying mechanism is the anharmonicity of the Jaynes-Cummings
ladder of eigenstates⁴,¹⁵. Resonant absorption of a photon
of frequency ω₋ to reach the state |1,−⟩ (where |n,+(−)⟩ denotes
the higher- (lower-) energy eigenstate with n excitations) ‘blocks’ the
absorption of a second photon at ω₋ because transitions to |2,±⟩ are
detuned from resonance.
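
The anharmonicity argument can be made concrete with the resonant Jaynes-Cummings eigenvalues, E(n,±) = nω0 ± √n·g0 (in units with ħ = 1). The sketch below computes how far a second probe photon at ω₋ is from the |1,−⟩ → |2,±⟩ transitions; the coupling g0/2π = 34 MHz is used purely as an illustrative number, and the calculation is done in a frame where ω0 = 0.

```python
import math

def jc_energy(n: int, branch: int, omega0: float, g0: float) -> float:
    """Energy of the Jaynes-Cummings dressed state |n, +/-> on resonance (hbar = 1).

    branch is +1 for the upper and -1 for the lower state of the n-excitation doublet.
    """
    if n == 0:
        return 0.0
    return n * omega0 + branch * math.sqrt(n) * g0

def second_photon_detuning(branch: int, omega0: float, g0: float) -> float:
    """Detuning of the |1,-> -> |2,branch> transition from a probe at omega_minus."""
    omega_probe = jc_energy(1, -1, omega0, g0)            # resonant with |0> -> |1,->
    step = jc_energy(2, branch, omega0, g0) - jc_energy(1, -1, omega0, g0)
    return step - omega_probe

G0 = 34.0  # MHz, illustrative coupling strength
for branch, label in ((-1, "|2,->"), (+1, "|2,+>")):
    d = second_photon_detuning(branch, 0.0, G0)
    print(f"second photon is detuned from |1,-> -> {label} by {d:.1f} MHz")
```

The smaller detuning, (2 − √2)·g0, is what must exceed the linewidths for the second photon to be blocked.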

Whereas electrons interact directly via Coulomb repulsion, photon-photon
interactions must be mediated by matter. Furthermore,
verification of this effect requires measurements of the quantum
statistics of the field; in contrast, Coulomb blockade can be inferred
directly from mean transport. Scattering from a single atom in free
space, for example, provides a fundamental example of photon
blockade¹⁶, albeit with the fluorescent field distributed over 4π and
the flux limited by the rate of spontaneous decay γ. In contrast,
cavity-mediated schemes offer the possibility of photon emission
into a collimated spatial mode with high efficiency and at a rate set by
the cavity decay rate κ, which can be much larger than γ. Achieving
photon blockade for a single atom in a cavity requires us to operate in
the regime of strong coupling, for which the frequency scale g0
associated with reversible evolution of the atom-cavity system
exceeds the dissipative rates (γ, κ) (ref. 14).

Here we report observations of photon blockade in the light
transmitted by an optical cavity containing one atom strongly
coupled to the cavity field. For coherent excitation at the cavity
input, the photon statistics for the cavity output are investigated by
measurement of the intensity correlation function g(2)(τ), which
demonstrates the manifestly nonclassical character of the transmitted
field. Explicitly, we find g(2)(0) = (0.13 ± 0.11) < 1 with g(2)(0) <
g(2)(τ), so that the output light is both subpoissonian and
antibunched¹⁷. We find that g(2)(τ) rises to unity at a time τ ≃ 45 ns,
which is consistent with the lifetime τ₋ = 2/(γ + κ) = 48 ns for the
state |1,−⟩ associated with the blockade. Over longer timescales,
cavity transmission exhibits modulation arising from the oscillatory
motion of the atom trapped within the cavity mode. We use
this modulation to make an estimate of the energy distribution for
the atomic centre-of-mass motion and infer a maximum energy
E/k_B ~ 250 μK, where k_B is the Boltzmann constant.

Figure 1 | The atomic level structure used for implementation of the photon
blockade effect, and a simple diagram of the experiment. a, Atomic level
diagram showing the lowest-energy states for a two-state atom of transition
frequency ω_A coupled (with single-photon Rabi frequency g0) to a mode of
the electromagnetic field of frequency ω_C, with ω_A = ω_C ≡ ω0 (ref. 15).
Two-photon absorption is suppressed for a probe field ω_p (arrows) tuned to excite
the transition |0⟩ → |1,−⟩, ω_p = ω0 − g0, leading to g(2)(0) < 1 (ref. 13).
b, Eigenvalue structure for the (F = 4, m_F) ↔ (F' = 5', m'_F) transition
coupled to two degenerate cavity modes l_y,z, as discussed in the
Supplementary Information. Two-photon absorption is likewise blocked for
excitation tuned to the lowest eigenstate (arrows). c, Simple diagram of the
experiment. BS, beam splitter.

The schematic of our experiment in Fig. 1c illustrates the Fabry-Perot
cavity formed by mirrors (M1, M2) into which single optically
cooled caesium atoms are loaded. Atoms are trapped within the
cavity by a far-off-resonance trap (FORT), which is created by
exciting a TEM00 cavity mode at λ_F = 935.6 nm (ref. 18). To achieve
strong coupling, we use the 6S1/2, F = 4 → 6P3/2, F' = 5' transition
of the D2 line in caesium at λ_A = 852.4 nm (subscript A refers to
‘atom’), for which the maximum rate of coherent coupling is
g0/2π = 34 MHz for (F = 4, m_F = ±4) → (F' = 5', m'_F = ±5). The
transverse decay rate for the 6P3/2 atomic states is γ/2π = 2.6 MHz,
while the cavity field decays at rate κ/2π = 4.1 MHz. The parameters
of the cavity are further discussed in the Methods.
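
With the three rates quoted above, the strong-coupling condition and the dressed-state lifetime τ₋ = 2/(γ + κ) stated earlier in the text can be checked numerically. This is a plain arithmetic sketch; the only inputs are the quoted values of g0, γ and κ.

```python
import math

MHZ = 2 * math.pi * 1e6        # converts a rate quoted as x/2pi in MHz to rad/s

g0 = 34.0 * MHZ                # maximum coherent coupling rate
gamma = 2.6 * MHZ              # transverse atomic decay rate
kappa = 4.1 * MHZ              # cavity field decay rate

# Lifetime of the dressed state |1,-> associated with the blockade.
tau_minus_s = 2.0 / (gamma + kappa)

# Strong coupling: reversible evolution faster than both dissipative rates.
strong_coupling = g0 > max(gamma, kappa)

print(f"tau_minus = {tau_minus_s * 1e9:.1f} ns")
print(f"g0/kappa = {g0 / kappa:.1f}, g0/gamma = {g0 / gamma:.1f}, "
      f"strong coupling: {strong_coupling}")
```

The result of about 47.5 ns agrees with the 48 ns lifetime quoted in the text to within rounding.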

A variety of factors make our atom-cavity system more complex
than the simple situation described by the Jaynes-Cummings eigenstates,
including most significantly that (1) the cavity supports
two modes l_y,z with orthogonal linear polarizations (ŷ, ẑ) near λ_A =
852.4 nm as described in the Methods section, and (2) a multiplicity
of Zeeman states are individually coupled to these modes for
transitions between the manifolds (F = 4, m_F) ↔ (F' = 5', m'_F). An
indication of the potential for this system to achieve photon blockade
is provided in Fig. 1b, which displays the actual eigenvalue structure
for the first two excited manifolds obtained by direct diagonalization
of the interaction hamiltonian, as discussed in the Supplementary
Information. As for the basic two-state system, excitation to the
lowest-energy state in the one-excitation manifold ‘blocks’ subsequent
excitation because the transitions to the two-excitation
manifold are out of resonance.

To substantiate this picture quantitatively, we present in Fig. 2
theoretical results from the steady-state solution to the master
equation in various situations, all for the case of coincident atomic
and cavity resonances $\omega_A = \omega_{C_1} \equiv \omega_0$.
(Subscripts $C_1$ and $C_2$ refer to the cavity resonances near
$\lambda_A$ and $\lambda_F$ respectively.) Beginning with the ideal
setting of a two-state atom coupled to a single cavity mode, we display
in Fig. 2a results for the probe transmission spectrum $T(\omega_p)$
and the intensity correlation function $g^{(2)}(0)$ of the field
$\mathcal{E}_t$ transmitted by mirror $M_2$ for excitation by a
coherent-state probe $\mathcal{E}_p$ of variable frequency $\omega_p$
incident upon the cavity mirror $M_1$. Clearly evident in $T(\omega_p)$
are two peaks at $\omega_p = \omega_\pm \equiv \omega_0 \pm g_0$
associated with the vacuum-Rabi splitting for the states
$|1,\pm\rangle$. At these peaks, $\mathcal{E}_p$ is detuned for
excitation $|1,\pm\rangle \to |2,\pm\rangle$, resulting in
$g^{(2)}(0) < 1$ for $\mathcal{E}_t$. The poissonian photon statistics
of the incident probe are thereby converted to subpoissonian statistics
for the transmitted field by way of the photon blockade effect
illustrated in Fig. 1a. For strong coupling in the weak-field limit,
$g^{(2)}(0) \propto (\kappa + \gamma)^2 / g_0^2$ for
$\omega_p = \omega_\pm$ (ref. 12), hence the premium on achieving
$g_0 \gg (\kappa, \gamma)$. By contrast, for
$\omega_p = \omega_0 \pm g_0/\sqrt{2}$, $\mathcal{E}_p$ is resonant
with the two-photon transition $|0\rangle \to |2,\pm\rangle$,
resulting in superpoissonian statistics with $g^{(2)}(0) \gg 1$. For
$\omega_p = \omega_0$, there is extremely large bunching due to quantum
interference between $\mathcal{E}_p$ and the atomic polarization
(refs 12, 19).
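The resonance conditions quoted above can be checked with a short numerical sketch. It assumes the standard Jaynes-Cummings level structure for a two-state atom and a single mode (energies $n\omega_0 \pm \sqrt{n}\,g_0$ for the states $|n,\pm\rangle$) and uses the parameters of the Fig. 2 caption. The function names are illustrative and not taken from the paper, and only the dimensionless ratio $(\kappa+\gamma)^2/g_0^2$ is evaluated, since the text gives no prefactor for $g^{(2)}(0)$.

```python
import math

# Parameters quoted in the Fig. 2 caption, as (rate / 2 pi) in MHz.
G0, KAPPA, GAMMA = 33.9, 4.1, 2.6

def jc_levels(n, g0=G0):
    """Energies of |n,-> and |n,+> relative to n*omega_0, in MHz (assumed JC ladder)."""
    shift = math.sqrt(n) * g0
    return (-shift, +shift)

def n_photon_resonances(n, g0=G0):
    """Probe detunings (omega_p - omega_0)/2pi at which n probe photons reach |n,+->."""
    return tuple(e / n for e in jc_levels(n, g0))

one_photon = n_photon_resonances(1)   # vacuum-Rabi peaks at -g0 and +g0
two_photon = n_photon_resonances(2)   # two-photon resonances at -g0/sqrt(2), +g0/sqrt(2)

# Offset from omega_0 that a photon needs to drive |1,-> -> |2,->.
step_energy = jc_levels(2)[0] - jc_levels(1)[0]
# Mismatch between a probe sitting on the lower peak and that second step.
blockade_detuning = one_photon[0] - step_energy

# Dimensionless ratio setting the scale of g2(0) in the weak-field limit.
scaling = (KAPPA + GAMMA) ** 2 / G0 ** 2

print(f"one-photon peaks: {one_photon[0]:.1f}, {one_photon[1]:.1f} MHz")
print(f"two-photon resonances: {two_photon[0]:.1f}, {two_photon[1]:.1f} MHz")
print(f"second-step mismatch on lower peak: {blockade_detuning:.1f} MHz")
print(f"(kappa + gamma)^2 / g0^2 = {scaling:.3f}")
```

Under these assumptions the second excitation step is mismatched by $(2-\sqrt{2})\,g_0$, about 20 MHz, which is several times the combined linewidth $\kappa + \gamma$. This is the sense in which the first photon 'blocks' the second.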

In Fig. 2b we examine the more complex situation relevant to our
actual experiment, namely a multi-state atom coupled to two cavity
modes with orthogonal polarizations $\hat{y}$, $\hat{z}$. Most directly
related to the simple case of Fig. 2a is to excite one polarization
eigenmode with the incident probe, taken here to be $\mathcal{E}^z_p$,
and to detect the transmitted field $\mathcal{E}^z_t$ for this same
polarization, with the transmission spectrum and intensity correlation
function denoted by $T_{zz}(\omega_p)$, $g^{(2)}_{zz}(0)$,
respectively. Even for the full multiplicity of states for the
$F = 4 \to F' = 5'$ transition coupled to the two cavity modes
$l_{y,z}$, $T_{zz}(\omega_p)$ displays a rather simple structure, now
with a multiplet structure in place of the single vacuum-Rabi peak
around $\omega_p \approx \omega_0 \pm g_0$. For a probe frequency tuned
to the eigenvalues $\omega_p = \omega_0 \pm g_0$,
$g^{(2)}_{zz}(0) \approx 0.7$, once again dropping below unity as in
Fig. 2a.

An alternative scheme is to detect along $\hat{z}$, but excite along
orthogonal polarization $\hat{y}$, with the respective transmission and
correlation functions $T_{yz}(\omega_p)$, $g^{(2)}_{yz}(0)$ also shown
in Fig. 2b. Similar to $T_{zz}(\omega_p)$, $T_{yz}(\omega_p)$ exhibits
a multiplet structure in the vicinity of
$\omega_p \approx \omega_0 \pm g_0$ owing to the nature of the first
excited states of the atom-cavity system. At the extremal
$\omega_p = \omega_0 \pm g_0$, $g^{(2)}_{yz}(0)$ reaches a value
$g^{(2)}_{yz}(0) \approx 0.03$, much smaller than for either
$g^{(2)}(0)$ in Fig. 2a, or $g^{(2)}_{zz}(0)$ in Fig. 2b, for the same
values of $(g_0, \kappa, \gamma)$. Our preliminary hypothesis is that
this reduction relates to the absence of the superposed driving field
$\mathcal{E}^y_p$ with the transmitted field $\mathcal{E}^z_t$ of
orthogonal polarization $\hat{z}$ (ref. 20); photons in the mode $l_z$
derive from emissions associated with the atomic components of
atom-field eigenstates.

Tuning the probe to $\omega_p = \omega_0 \pm g_0$ has the additional
benefit of reducing sensitivity to atomic position, which varies
experimentally owing to atomic motion and the multiplicity of trapping
sites within the cavity (ref. 21). The atomic position affects the
transmission via the position dependence of the coupling
$g = g_0 \psi(\mathbf{r})$, where $\psi$ is the TEM$_{00}$ spatial mode
at $\lambda_{C_1}$ with maximum $|\psi| = 1$, and $\mathbf{r}$ is the
position of the atom. $T_{yz}(\omega_p)$ is small when
$|\omega_p - \omega_0| \gtrsim g$, so atoms which have a
lower-than-expected value of $g$ will have a reduced contribution to
the photon statistics.

An important step in the implementation of this strategy is our
recent measurement of the vacuum-Rabi spectrum $T_{zz}(\omega_p)$ for
one trapped atom (ref. 21). In that work we obtained quantitative
agreement on an atom-by-atom basis between our observations and an
extension of the theoretical model used to generate the various plots
in Fig. 2b. The extended model incorporates a.c.-Stark shifts from the
FORT as well as cavity birefringence. This model predicts that
corrections to $g^{(2)}_{yz}(0)$ due to these effects are small for our
parameters, as discussed in the Supplementary Information.

Figure 2 | Theoretical results for the transmission spectra and intensity
correlation functions. a, $T(\omega_p)$, $g^{(2)}(0)$; b,
$T_{zz}(\omega_p)$, $g^{(2)}_{zz}(0)$ (dashed) and $T_{yz}(\omega_p)$,
$g^{(2)}_{yz}(0)$ (red) from the steady-state solution to the master
equation. Included are all transitions
$(F = 4, m_F) \leftrightarrow (F' = 5', m'_F)$ with their respective
coupling coefficients $g_0^{(m_F, m'_F)}$, as well as the two cavity
modes $l_{y,z}$, here assumed to be degenerate in frequency (see
Supplementary Information for further discussion). The blue dotted
lines indicate poissonian statistics. Parameters are
$(g_0, \kappa, \gamma)/2\pi = (33.9, 4.1, 2.6)$ MHz, and the probe
strength is such that the intracavity photon number on resonance
without an atom is 0.05.

With these capabilities, we now report measurements of
$g^{(2)}_{yz}(\tau)$ for the light transmitted by a cavity containing a
single trapped atom. We tune the probe $\mathcal{E}^y_p$ to
$(\omega_p - \omega_0)/2\pi = -34$ MHz, near $-g_0$, and acquire
photoelectric counting statistics of the field $\mathcal{E}^z_t$ by way
of two avalanche photodiodes ($D_1$, $D_2$), as illustrated in Fig. 1c.
From the record of these counts, we are able to determine
$g^{(2)}_{yz}(\tau)$ by using the procedures discussed in ref. 22. Data
are acquired for each trapped atom by cycling through probing, testing,
and cooling intervals (of durations $\Delta t_{\rm probe} = 500$ μs,
$\Delta t_{\rm test} = 100$ μs and $\Delta t_{\rm cool} = 1.4$ ms,
respectively) using a procedure similar to that of ref. 21. The test
beam is polarized along $\hat{z}$ and resonant with the cavity. A
repumping beam transverse to the cavity axis and resonant with
$6S_{1/2}, F = 3 \to 6P_{3/2}, F' = 4'$ also illuminates the atom
during the probe and test intervals. This beam prevents accumulation of
population in the $F = 3$ ground state caused by the probe
off-resonantly exciting the $F = 4 \to F' = 4'$ transition. All
probing/cooling cycles end after an interval $\Delta t_{\rm tot} = 0.3$ s,
at which point a new loading cycle is initiated. We select for the
presence of an atom by requiring that
$T_{zz}(\omega_p \approx \omega_{C_1}) \lesssim 0.35$ for the test
beam. We use only those data records associated with probing intervals
after which the presence of an atom was detected and for which the
presence of an atom was detected in all preceding intervals. If there
is no atom and the probe is tuned to be resonant with the cavity
($\omega_p = \omega_{C_1}$), then the photon number in mode $l_y$ due
to $\mathcal{E}^y_p$ is 0.21 and the polarizing beam splitter at the
output of the cavity (PBS in Fig. 1c) suppresses detection of this
light by a factor of $\sim 94$.

Figure 3 | Experimental measurements of the intensity correlation function
$g^{(2)}_{yz}(\tau)$ for incident excitation with polarization along
$\hat{y}$ and detection with orthogonal polarization $\hat{z}$. a,
$g^{(2)}_{yz}(\tau)$ over the interval $|\tau| < 1.0$ μs demonstrates
that the transmitted field exhibits both subpoissonian photon
statistics $g^{(2)}_{yz}(0) = (0.13 \pm 0.11) < 1$ and photon
antibunching $g^{(2)}_{yz}(0) < g^{(2)}_{yz}(\tau)$ (ref. 17). b,
$g^{(2)}_{yz}(\tau)$ over longer intervals $|\tau| < 10$ μs displays a
pronounced modulation due to axial motion of the trapped atom. c, The
Fourier transform $\tilde{g}(f)$ of $g^{(2)}_{yz}(\tau)$ with the
independently determined minimum and maximum frequencies $\nu_{\min}$
and $\nu_0$ for axial motion in a FORT well indicated by the dotted
lines. $g^{(2)}_{yz}(\tau)$ is plotted with 6-ns resolution in a and
with 12-ns resolution in b.

Figure 3 presents an example of $g^{(2)}_{yz}(\tau)$ determined from
the recorded time-resolved coincidences at ($D_1$, $D_2$). In Fig. 3a,
the manifestly nonclassical character of the transmitted field is
clearly observed with a large reduction in $g^{(2)}_{yz}(0)$ below
unity, $g^{(2)}_{yz}(0) = (0.13 \pm 0.11) < 1$, corresponding to the
subpoissonian character of the transmitted field, and with
$g^{(2)}_{yz}(0) < g^{(2)}_{yz}(\tau)$ as a manifestation of photon
antibunching. We find that $g^{(2)}_{yz}(\tau)$ rises to unity at a
time $\tau \approx 45$ ns, which is consistent with a simple estimate
of $\tau_- = 2/(\gamma + \kappa) = 48$ ns based upon the lifetime for
the state $|1,-\rangle$.
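The simple estimate $\tau_- = 2/(\gamma + \kappa)$ can be reproduced directly from the rates in the Fig. 2 caption. The sketch below assumes those values are $\kappa/2\pi$ and $\gamma/2\pi$ in MHz, as the caption states; the function name is illustrative.

```python
import math

KAPPA_MHZ = 4.1   # kappa / 2 pi from the Fig. 2 caption
GAMMA_MHZ = 2.6   # gamma / 2 pi from the Fig. 2 caption

def recovery_time_ns(kappa_mhz, gamma_mhz):
    """Return 2 / (gamma + kappa) in ns for rates given as (rate / 2 pi) in MHz."""
    total_rate = 2 * math.pi * (kappa_mhz + gamma_mhz) * 1e6   # rad/s
    return 2.0 / total_rate * 1e9

tau_minus = recovery_time_ns(KAPPA_MHZ, GAMMA_MHZ)
print(f"tau_- = {tau_minus:.1f} ns")
```

The result is about 47.5 ns, which rounds to the 48 ns quoted in the text and is close to the observed rise time of roughly 45 ns.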

Although for small $|\tau|$ our observations of $g^{(2)}_{yz}(\tau)$
are in reasonable agreement with the predictions from our theoretical
model, there are significant deviations on longer timescales.
Modulation that is not present in the model is evident in Fig. 3b,
which arises from the centre-of-mass motion of the trapped atom. In
support of this assertion, Fig. 3c displays the Fourier transform
$\tilde{g}(f)$ of $g^{(2)}_{yz}(\tau)$, which exhibits a narrow peak at
frequency $f_0 \approx 535$ kHz just below the independently determined
frequency $\nu_0 \approx 544$ kHz for harmonic motion of a trapped atom
about an antinode of the FORT in the axial direction $x$. This
modulation is analogous to that observed in ref. 23 for
$g^{(2)}(\tau)$ for the light from a single ion, which arose from
micromotion of the ion in the radio-frequency trap.

Here, $U(\mathbf{r}) = U_0 \sin^2(2\pi x/\lambda_{C_2})
\exp(-2\rho^2/w_{C_2}^2)$ is the FORT potential, which gives rise to an
anharmonic ladder of vibrational states with energies $\{E_m\}$. Here
$m = 0$ to $m_{\max} = 99$ correspond to the bound states in the axial
dimension for radial coordinate $\rho \equiv \sqrt{y^2 + z^2} = 0$. The
anharmonicity leads to the observed offset $f_0 < \nu_0$ due to the
distribution of energies for axial motion in the FORT well. Indeed, the
frequency $\nu_{\min} = (E_{m_{\max}} - E_{m_{\max}-1})/h$ at the top
of the well is approximately half that at the bottom of the well,
$\nu_0 = (E_1 - E_0)/h$. By comparing the measured distribution of
frequencies exhibited by $\tilde{g}(f)$ with the calculated axial
frequencies $\{\nu_m\}$, we estimate that those atoms from which data
was obtained are trapped in the lowest-lying axial states
$m \lesssim 10$, which corresponds to a maximum energy
$E/k_B \approx 250$ μK. This energy estimate is consistent with other
measurements of $g^{(2)}_{yz}(\tau)$ that we have made, as well as the
Fourier transform of the record of the transmitted intensity and the
transmission spectra of ref. 21.
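As a rough consistency check on the axial frequency quoted above, the sketch below expands the $\sin^2$ potential about an antinode and evaluates the resulting harmonic frequency. It assumes the FORT depth $|U_0|/h = 43$ MHz and wavelength 935.6 nm given in Methods and a cesium mass of 132.905 u. The harmonic formula $\nu_0 = (1/2\pi)\sqrt{2|U_0|k^2/m}$ with $k = 2\pi/\lambda_{C_2}$ is a textbook approximation, not the full anharmonic calculation described in the text, and all names are illustrative.

```python
import math

H = 6.62607015e-34          # Planck constant, J s
KB = 1.380649e-23           # Boltzmann constant, J/K
AMU = 1.66053906660e-27     # atomic mass unit, kg
M_CS = 132.905 * AMU        # mass of a cesium-133 atom, kg

U0_OVER_H = 43e6            # FORT depth |U0|/h in Hz (Methods)
LAMBDA_FORT = 935.6e-9      # FORT wavelength in m (Methods)

def axial_frequency_hz(depth_hz, wavelength, mass):
    """Harmonic frequency at the bottom of a well of depth h*depth_hz and form sin^2(2 pi x / wavelength)."""
    k = 2 * math.pi / wavelength
    omega = math.sqrt(2 * depth_hz * H * k ** 2 / mass)
    return omega / (2 * math.pi)

nu0 = axial_frequency_hz(U0_OVER_H, LAMBDA_FORT, M_CS)

# Well depth expressed as a temperature, in mK.
depth_mk = U0_OVER_H * H / KB * 1e3

# Energy of level m = 10 in the purely harmonic approximation, in microkelvin.
e10_uk = (10 + 0.5) * H * nu0 / KB * 1e6

print(f"nu0 = {nu0 / 1e3:.0f} kHz, depth = {depth_mk:.2f} mK, E_10/kB = {e10_uk:.0f} uK")
```

The harmonic estimate comes out near 543 kHz, close to the independently determined 544 kHz. The harmonic energy for $m = 10$ is somewhat above the quoted 250 μK, as expected when the true level spacing decreases with $m$.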

We have demonstrated photon blockade for the transmission of an
optical cavity strongly coupled to a single trapped atom
(refs 4-9, 12, 13). The observed nonclassical photon statistics for the
transmitted field result from strong nonlinear interactions at the
single-photon level, in analogy with the phenomena of Coulomb blockade
for electron transport (refs 1-3). Extensions of our work include
operation in a pulsed mode, as was analysed in ref. 5, thereby
realizing a source for single photons 'on demand' (ref. 22). As we
improve the effectiveness of our cooling procedure, we should be able
to explore the dependence of $g^{(2)}_{yz}(\tau)$ on probe detuning,
$\omega_p - \omega_0$, as well as to move to higher levels of
excitation to increase the intracavity photon number towards unity and
the output flux towards the maximum value $\sim\kappa$ for
subpoissonian photons.

METHODS 

Cavity and detection parameters. The physical length of the cavity used in this
work is 42.2 μm and the finesse is $4.3 \times 10^5$. The cavity length
is independently stabilized such that a TEM$_{00}$ longitudinal mode at
$\lambda_{C_1}$ is resonant with the free-space atomic transition at
$\lambda_A$ and another TEM$_{00}$ mode at $\lambda_{C_2}$ is resonant
at $\lambda_F$. At the cavity centre $x = 0$, the mode waists
$w_{C_{1,2}} = \{23.4, 24.5\}$ μm at
$\lambda_{C_{1,2}} = \{852.4, 935.6\}$ nm.

The TEM$_{00}$ longitudinal mode for the FORT is driven by a linearly
polarized input field $\mathcal{E}_{\rm FORT}$, resulting in nearly
equal a.c. Stark shifts for Zeeman states in the $6S_{1/2}, F = 3, 4$
manifold. At an antinode of the field, the peak value of the trapping
potential for these states is $U_0/h = -43$ MHz for all our
measurements. Zeeman states of the $6P_{3/2}, F' = 5'$ manifold
experience a similar trapping potential, but with a weak dependence on
$m'_F$ (ref. 18).

Stress-induced birefringence in the cavity mirrors leads to a mode
splitting $\Delta\omega_{C_1}/2\pi = 4.4 \pm 0.2$ MHz of the two cavity
modes $l_{y,z}$ with orthogonal linear polarizations
$(\hat{y}, \hat{z})$. $\mathcal{E}_{\rm FORT}$ is linearly polarized
and aligned along $\hat{z}$, the higher-frequency mode.

The efficiency for photon escape from the cavity, limited by losses
inherent to the mirror substrates, is $\alpha_{e2} = 0.6 \pm 0.1$. The
propagation efficiency from $M_2$ to detectors ($D_1$, $D_2$) is
$\alpha_P = 0.41 \pm 0.03$, with each detector then receiving half of
the photons. The avalanche photodiodes ($D_1$, $D_2$) have quantum
efficiencies $\alpha_D = 0.49 \pm 0.05$.
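For orientation, the three efficiencies above can be multiplied to give an overall probability that an intracavity photon is registered. The sketch below does this and combines the quoted uncertainties in quadrature. Treating the three errors as independent is an assumption made here for illustration, not a statement from the paper, and the names are hypothetical.

```python
import math

# Efficiencies quoted in Methods as (value, one-sigma uncertainty).
ESCAPE = (0.60, 0.10)        # photon escape from the cavity
PROPAGATION = (0.41, 0.03)   # propagation from M2 to the detectors
QUANTUM = (0.49, 0.05)       # avalanche photodiode quantum efficiency

def combined_efficiency(*stages):
    """Product of stage efficiencies; relative errors added in quadrature (assumed independent)."""
    value = math.prod(v for v, _ in stages)
    rel_err = math.sqrt(sum((e / v) ** 2 for v, e in stages))
    return value, value * rel_err

total, total_err = combined_efficiency(ESCAPE, PROPAGATION, QUANTUM)
per_detector = total / 2     # each detector receives half of the photons

print(f"total = {total:.3f} +/- {total_err:.3f}, per detector = {per_detector:.3f}")
```

Under these assumptions roughly 12% of photons are detected overall, or about 6% at each photodiode.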

Photon statistics. The transmission spectrum $T(\omega_p)$ is proportional to the
ratio of photon flux $\langle \mathcal{E}_t^\dagger \mathcal{E}_t \rangle$
transmitted by $M_2$ to the flux $|\mathcal{E}_p|^2$ incident upon
$M_1$, and normalized such that a cavity without an atom has a resonant
transmission of unity, i.e. $T(\omega_p = \omega_{C_1}) = 1$. For a
field with intensity operator $I(t)$,
$g^{(2)}(\tau) \equiv \langle : I(t) I(t+\tau) : \rangle /
(\langle : I(t) : \rangle \langle : I(t+\tau) : \rangle)$, where the
colons denote time and normal ordering (ref. 17).
$g^{(2)}_{yz}(\tau)$, displayed in Fig. 3a and shown with a 6-ns
resolution, has been corrected for background counts due to detector
dark counts and scattered light from the repumping beam. Without this
correction, $g^{(2)}_{yz}(0) \approx (0.18 \pm 0.10)$ is directly
derived from the recorded counts.
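To make the normalization in the definition of $g^{(2)}(\tau)$ concrete, the following sketch estimates a normalized cross-correlation from two binned count records, one per detector. It is a generic estimator written for illustration only. It is not the procedure of ref. 22, it applies no background correction, and the function and variable names are hypothetical.

```python
def g2_cross(n1, n2, max_lag):
    """Normalized cross-correlation of two equal-length binned count records, lags 0..max_lag."""
    if len(n1) != len(n2):
        raise ValueError("count records must have the same length")
    size = len(n1)
    mean1 = sum(n1) / size
    mean2 = sum(n2) / size
    result = []
    for lag in range(max_lag + 1):
        # Pair bin i of detector 1 with bin i + lag of detector 2.
        pairs = size - lag
        coincidences = sum(n1[i] * n2[i + lag] for i in range(pairs)) / pairs
        result.append(coincidences / (mean1 * mean2))
    return result

# Toy records: the two detectors never click in the same bin.
d1 = [1, 0, 1, 0, 1, 0, 1, 0]
d2 = [0, 1, 0, 1, 0, 1, 0, 1]
g2 = g2_cross(d1, d2, max_lag=1)
```

In this toy record the zero-lag value vanishes while the lag-one value exceeds unity, the qualitative signature of antibunching, $g^{(2)}(0) < g^{(2)}(\tau)$.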
We investigate the suitability of toroidal microcavities for strong-coupling cavity quantum electrodynamics
(QED). Numerical modeling of the optical modes demonstrates a significant reduction of the modal volume
with respect to the whispering gallery modes of dielectric spheres, while retaining the high-quality factors
representative of spherical cavities. The extra degree of freedom of toroid microcavities can be used to achieve
improved cavity QED characteristics. Numerical results for atom-cavity coupling strength $g$, critical atom
number $N_0$, and critical photon number $n_0$ for cesium are calculated and shown to exceed values currently
possible using Fabry-Perot cavities. Modeling predicts coupling rates $g/2\pi$ exceeding 700 MHz and critical
atom numbers approaching $10^{-7}$ in optimized structures. Furthermore, preliminary experimental measurements
of toroidal cavities at a wavelength of 852 nm indicate that quality factors in excess of $10^{8}$ can be obtained in
a 50-μm principal diameter cavity, which would result in strong-coupling values of
$(g/(2\pi), n_0, N_0) = (86\ \mathrm{MHz},\ 4.6 \times 10^{-4},\ 1.0 \times 10^{-3})$.
I. INTRODUCTION

The  use  of  an  optical  microcavity  can  greatly  enhance  the 
interaction  of  an  atom  with  the  electromagnetic  field  such 
that  even  a  single  atom  or  photon  can  significantly  change 
the  dynamical  evolution  of  the  atom-cavity  system  [1]. 
Achieving  the  regime  of  “strong  coupling”  [2,3]  is  critically 
dependent  on  the  characteristics  of  the  optical  cavity  and 
generally  requires  the  optical  modes  to  be  confined  in  a  small 
mode  volume  for  extended  periods  of  time  (or  equivalently 
high  Q  factor). 

Recent experimental realizations of strong coupling have
employed high-finesse Fabry-Perot (FP) optical microcavities [4-9].
Our experiments at Caltech include the realization of an "atom-cavity
microscope" with a single atom bound in orbit by single photons [4]
and the development of a laser that operates with "one and the same"
atom [10]. Fabry-Perot cavities, while possessing ultrahigh-quality
factors and finesse, are difficult to manufacture and control,
requiring sophisticated dielectric mirror coatings as well as accurate
feedback for resonant wavelength control. Due in part to these
reasons, there has been increased interest in other microcavity
systems which not only can address some or all of the limitations of
Fabry-Perot cavities, but which in principle can have improved optical
properties.

Based upon the pioneering work of Braginsky and colleagues [11],
whispering-gallery-mode cavities have also been investigated for
cavity QED (CQED) experiments for many years [12]. Experimental
studies have demonstrated Q factors approaching $10^{10}$ in a silica
microsphere whispering gallery cavity [13,14], with values exceeding
$10^{8}$ readily achievable over a broad range of cavity diameters and
wavelengths. The combination of their very low cavity losses, small
mode volumes, and their relative ease of fabrication makes them
promising candidates for experiments in CQED [15,16]. Furthermore, the
ability to couple these cavities with record coupling efficiencies to
an optical fiber [17] (the medium of choice for low-loss transport of
classical and nonclassical states [18]) is fundamentally important in
CQED and bears promise for realizing quantum networks.

Recently, a new type of whispering-gallery-mode optical
microcavity was demonstrated, which not only retains the high-quality
factors of spherical cavities, but also has significant advantages in
fabrication reproducibility, control, and mode structure. These
cavities consist of a toroidally shaped silica cavity supported by a
silicon pillar on a microelectronic chip [19]. The toroidal cavity
shape allows an extra level of geometric control over that provided by
a spherical cavity and thus begs the question as to how these
structures compare with silica microspheres and other microcavity
designs for strong-coupling cavity QED. In this paper we numerically
investigate the suitability of toroidal microcavities for
strong-coupling cavity QED experiments, and for purposes of
comparison, we focus on the interaction with atomic cesium [4,20]. We
show that toroid microcavities can achieve ultrahigh-quality factors
exceeding $10^{8}$ while simultaneously obtaining very large coupling
rates between the cavity and a cesium atom. It is found that these
cavities not only surpass the projected limits of FP technology [20],
but also either exceed or compare favorably to other cavity designs
such as photonic band-gap devices [21,22]. Last, we present
preliminary experimental measurements of quality factors for toroidal
cavities at a wavelength of 852 nm, suitable for strong-coupling CQED
with atomic cesium. These results show that currently attainable Q
values are already quite promising.

II.  STRONG  COUPLING  IN  AN  ATOM-CAVITY  SYSTEM 

The coupling rate $g$ between an atomic system and an
electromagnetic field is related to the single-photon Rabi frequency
$\Omega = 2g$ and can be expressed in terms of the atomic and cavity
parameters by [1]

$$g(\mathbf{r}) = \gamma_\perp \left| E(\mathbf{r})/E_{\max} \right| \sqrt{V_a/V_m}, \qquad (1)$$

$$V_a = 3c\lambda^2/(4\pi\gamma_\perp), \qquad (2)$$

where $\gamma_\perp$ is the transverse atomic dipole transition rate,
$|E(\mathbf{r})/E_{\max}|$ denotes the normalized electric field
strength at the atom's location $\mathbf{r}$, $V_a$ is a characteristic
atomic interaction volume (which depends on the atomic dipole
transition rate, the transition wavelength $\lambda$, and the speed of
light $c$), and $V_m$ is the cavity-electromagnetic-mode volume.
Assuming the atom interacts with the electromagnetic field for a time
$T$, strong atom-field coupling occurs if the rate of coupling exceeds
all dissipative mechanisms, i.e., $g > (\kappa, \gamma_\perp, T^{-1})$.
In this expression $\kappa$ denotes the cavity field decay rate, given
in terms of the cavity quality factor $Q$ by
$\kappa = \pi c/(\lambda Q)$. The degree of strong coupling can also be
related to a set of normalized parameters [1],

$$n_0 = \gamma_\perp^2/(2g^2), \qquad (3)$$

$$N_0 = 2\gamma_\perp\kappa/g^2, \qquad (4)$$

where $n_0$ is the critical photon number, which is the number of
photons required to saturate an intracavity atom, and $N_0$ is the
critical atom number, which gives the number of atoms required to have
an appreciable effect on the cavity transmission. Note that
$(N_0, n_0) < 1$ provides a necessary but not sufficient condition for
strong coupling.
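The normalized parameters of Eqs. (3) and (4) are easy to evaluate once $g$, $\gamma_\perp$, and $Q$ are fixed. The sketch below uses the values quoted in the abstract ($g/2\pi = 86$ MHz, 852-nm cesium transition with $\gamma_\perp/2\pi = 2.6$ MHz) and assumes $Q = 10^8$ exactly, whereas the abstract states only that $Q$ is "in excess of" $10^8$. Function names are illustrative.

```python
import math

C = 299_792_458.0   # speed of light, m/s

def cavity_decay_rate(wavelength, q_factor):
    """Cavity field decay rate kappa = pi c / (lambda Q), in rad/s."""
    return math.pi * C / (wavelength * q_factor)

def critical_photon_number(gamma_perp, g):
    """n0 = gamma_perp^2 / (2 g^2), Eq. (3)."""
    return gamma_perp ** 2 / (2 * g ** 2)

def critical_atom_number(gamma_perp, kappa, g):
    """N0 = 2 gamma_perp kappa / g^2, Eq. (4)."""
    return 2 * gamma_perp * kappa / g ** 2

TWO_PI = 2 * math.pi
gamma_perp = TWO_PI * 2.6e6                 # rad/s
g = TWO_PI * 86e6                           # rad/s
kappa = cavity_decay_rate(852.359e-9, 1e8)  # rad/s, assuming Q = 1e8 exactly

n0 = critical_photon_number(gamma_perp, g)
N0 = critical_atom_number(gamma_perp, kappa, g)
strongly_coupled = g > max(kappa, gamma_perp)

print(f"kappa/2pi = {kappa / TWO_PI / 1e6:.2f} MHz, n0 = {n0:.2e}, N0 = {N0:.2e}")
```

With these inputs $n_0 \approx 4.6 \times 10^{-4}$, matching the abstract, while $N_0 \approx 1.2 \times 10^{-3}$ is the same order as the quoted $1.0 \times 10^{-3}$; a $Q$ somewhat above $10^8$ would lower it further, since $N_0 \propto 1/Q$.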

Examining these parameters, we see that only the critical
atom number $N_0 \propto V_m/Q$ is dependent on the cavity-loss rate
(or equivalently Q factor). It is the possibility of realizing
extremely low critical atom numbers with ultrahigh-Q microcavities
that has fostered the investigation of silica microspheres for
strong-coupling CQED experiments. However, the geometry of a spherical
dielectric dictates a definite relationship between cavity-mode volume
$V_m$ and the associated quality factor $Q$, and hence limits the value
of the coupling parameter $g \propto V_m^{-1/2}$ that can be reached
while still maintaining ultrahigh-quality factors [23]. This is a
result of the fact that to achieve large atom-cavity coupling rates
(comparable to or exceeding those of FP cavities) the cavity diameter
must be made small [8-μm-diameter sphere gives
$g/(2\pi) \approx 740$ MHz] in order to both lower the modal volume and
to increase the electric field strength at the atomic position
(assumed to be the cavity surface at the point of maximum electric
field strength). However, at the optimum radius for atom-coupling
strength, the tunneling loss of the microcavity results in a low
achievable Q factor ($Q \approx 4 \times 10^{4}$), thereby raising the
critical atom number. While the relatively large mode volumes of
silica microsphere cavities preclude them from competing with
ultrasmall mode volume cavities (such as photonic band-gap cavities)
on the basis of coupling strength alone, there is the possibility to
access simultaneously both ultrahigh-Q and small mode volume, using
toroidal microresonators.


FIG. 1. Scanning electron micrograph of a toroidal microcavity.
The principal and minor diameters are denoted by $D$ and $d$,
respectively.

III.  TOROIDAL  MICRORESONATORS 

Toroidal microresonators are chip-based microcavities
that possess ultrahigh-Q ($> 10^{8}$) whispering-gallery type modes
[19]. The realization of ultrahigh-Q chip-based resonators allows
improvements in fabrication and control, while additionally allowing
integration with complementary optical, mechanical, or electrical
components. In brief, these resonators are fabricated by standard
lithographic and etching techniques, followed by a laser-reflow
process, as outlined in Ref. [19]. The combination of thermal
isolation of the initial preform periphery and thermal heat sinking of
the preform interior through the strong heat conduction of the silicon
support pillar results in a preferential melting of the preform along
the disk periphery under CO$_2$ laser irradiation. Surface tension
then induces a collapse of the silica disk preform, resulting in a
toroidally shaped boundary, with the final geometry controlled by a
combination of irradiation flux and exposure time. Importantly, as the
optical mode resides in the extremely uniform and smooth (reflowed)
periphery of the structure, the quality factors of optical
whispering-gallery modes can achieve ultrahigh-Q performance,
exceeding $10^{8}$. Figure 1 shows a scanning electron micrograph of
the side view of a typical toroidal microcavity. Quality factors as
high as $4 \times 10^{8}$ at a wavelength of 1550 nm (corresponding to
a photon lifetime of $\sim 300$ ns) have been measured [24].
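The correspondence between the measured quality factor and the quoted photon lifetime follows from the standard relation $\tau = Q/\omega = Q\lambda/(2\pi c)$. This relation is assumed here, since the text quotes only the result. A minimal sketch with an illustrative function name:

```python
import math

C = 299_792_458.0   # speed of light, m/s

def photon_lifetime_ns(q_factor, wavelength):
    """Energy decay time Q / omega = Q * lambda / (2 pi c), returned in ns."""
    return q_factor * wavelength / (2 * math.pi * C) * 1e9

tau = photon_lifetime_ns(4e8, 1550e-9)
print(f"photon lifetime = {tau:.0f} ns")
```

This gives about 329 ns, consistent with the lifetime of roughly 300 ns stated above.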

IV.  MICROTOROID  NUMERICAL  MODELING 

FIG. 2. Electric field magnitude for the whispering-gallery
modes of a spherical (top row) cavity of diameter 18 μm and a
toroidal cavity (bottom row) with principal diameter of 18 μm and
a minor diameter of 1 μm. The left (right) column shows the TE-
(TM-) polarized mode near 850 nm. The arrows indicate the location
of the maximum external electric field strength, where we assume
the atom is located. The dotted lines in the two-dimensional
field distribution indicate the cross section where the electric field is
displayed.

In order to investigate the properties of microtoroids for
CQED, this paper will focus on the D2 transition of cesium which
occurs at a wavelength of 852.359 nm ($\gamma_\perp/2\pi = 2.6$ MHz)
[20], with scaling to other systems accomplished in the fashion of
Ref. [23]. Fundamentally, the coupling between an atom and a cavity
field can be specified by four parameters: the atomic transition
moment, the cavity field strength at the atom's location, the cavity
mode volume $V_m$, and the cavity quality factor $Q$. Since the optical
modes are confined to the interior dielectric in
whispering-gallery-type resonators, the atom can interact only with the
evanescent field of the cavity mode. In the following discussion, the
atom is assumed to be located near the resonator surface at the
location where the electric field strength is largest, as illustrated
in Fig. 2. For TM-polarized modes (defined such that the dominant
electric field component is in the radial direction) this occurs at
the outer cavity boundary in the equatorial plane, while for
TE-polarized modes (dominant electric field component in the
azimuthal-vertical direction) the location of the maximum external
field strength is more complicated. As the toroidal geometry is
compressed with respect to a sphere (i.e., reducing the ratio of
minor-to-principal toroid diameter), the maximum field strength for a
TE-polarized mode changes from the equatorial outer cavity boundary to
approaching the azimuthal axis (see Fig. 2). While the precise
localization of the atom at the cavity evanescent field maximum has
been analyzed in detail [25,26], such localization has not yet been
achieved experimentally. Nonetheless, this assumption allows a simple
way to characterize the relative merit of this cavity geometry with
respect to other cavity designs. Also, in what follows we will only
consider the fundamental radial and azimuthal modes for both
polarizations (TE and TM), as they possess the smallest modal volumes
and thus the highest coupling strengths.

FIG. 3. Electric field profiles for a toroidal cavity with a principal
diameter $D = 20$ μm and minor diameters $d = 20$, 12, 6, 3, 1.5,
and 0.75 μm. The calculations correspond to a TM-polarized mode
near 850 nm. The optical mode behaves as a whispering-gallery-type
mode until the minor diameter is below approximately 1.5 μm,
at which point the mode approaches that of a step-index optical
fiber [28].

The microtoroid geometry, which exhibits a dumbbell-shaped
cross section, can in most cases be considered a torus, as the
presence of the supporting disk structure only affects the optical
mode when the torus diameter becomes comparable to the radial extent
of the optical mode. As shown in Fig. 3, this point occurs when the
toroid minor diameter (i.e., the cross-sectional diameter of the
torus) is below approximately 1.5 μm for a principal diameter of
16 μm. Furthermore, through improvements in fabrication the influence
of the toroid support can in principle be minimized. In contrast to FP
and microsphere cavities, the optical modes of a toroid do not possess
analytic solutions. While one can derive approximate expressions for
the optical behavior of these structures for both the low transverse
compression (spherelike) and high transverse compression (step-index,
fiberlike) regimes, we are mostly interested in the intermediate
geometrical regimes, as these are both experimentally accessible and
retain the most desirable properties of whispering-gallery-type
microcavities. To accomplish this task, a two-dimensional
finite-element eigenmode-eigenvalue solver was used to characterize
the optical modes of the cavity over the complete geometrical range,
after explicitly accounting for the rotational symmetry. The optical
modes were calculated in a full-vectorial model, which provides the
complete electric field dependence. The accuracy of the numerical
technique was carefully verified by comparison with results using the
analytical solution for a microsphere cavity [27]. The results for the
mode volumes, resonance wavelengths, and field profiles were in good
agreement (fractional error was less than $10^{-4}$ and $10^{-2}$ for
the resonance wavelength and modal volume, respectively). Furthermore,
the error in the radiation quality factor was less than 10% over a
wide range of radiation Q values ($10^{3}$-$10^{14}$), demonstrating
that this method can give the accuracy required to investigate the
fundamental radiation-loss limits in the cavity geometries of interest
in this work. Because for smaller cavity geometries the resonance
wavelengths do not necessarily coincide with the cesium transition of
interest, the data in this work were evaluated by using values
calculated at the closest resonance wavelengths, both blueshifted and
redshifted with respect to the desired resonance, to extrapolate values
at the desired wavelength (the mode volumes were linearly extrapolated
and the radiation quality factors exponentially extrapolated as a
function of wavelength).
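The wavelength correction described above can be sketched as follows: the mode volume is interpolated linearly between the two bracketing resonances, and the radiation quality factor is interpolated linearly in its logarithm, which is one way to realize an exponential dependence on wavelength. The function name and the numerical inputs in the usage example are hypothetical and chosen only to illustrate the arithmetic; they are not values from this work.

```python
import math

def estimate_at_target(lam_blue, lam_red, v_blue, v_red, q_blue, q_red, lam_target):
    """Estimate (mode volume, radiation Q) at lam_target from two bracketing resonances."""
    # Fractional position of the target between the blue and red resonances.
    t = (lam_target - lam_blue) / (lam_red - lam_blue)
    # Mode volume: linear in wavelength.
    volume = v_blue + t * (v_red - v_blue)
    # Radiation Q: exponential in wavelength, i.e. linear in log(Q).
    log_q = math.log(q_blue) + t * (math.log(q_red) - math.log(q_blue))
    return volume, math.exp(log_q)

# Hypothetical inputs: resonances at 840 nm and 870 nm bracketing a 855-nm target.
volume, q_rad = estimate_at_target(
    lam_blue=840e-9, lam_red=870e-9,
    v_blue=30.0, v_red=32.0,        # mode volumes in cubic micrometres (made up)
    q_blue=1e9, q_red=1e7,          # radiation Q values (made up)
    lam_target=855e-9,
)
```

Midway between the two resonances, the linear rule returns the arithmetic mean of the volumes while the exponential rule returns the geometric mean of the Q values, which is why the two quantities are treated differently.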

A.  Mode  volume 

FIG. 4. Calculated mode volumes for a silica toroidal
microresonator versus minor diameter for principal diameters of 20,
18, and 16 μm. The plot shows both TM (squares) and TE (circles)
polarizations. As the minor diameter is reduced a slow reduction of
modal volume due to confinement in the azimuthal direction occurs,
followed by a fast reduction for large confinement when the optical
mode is strongly compressed in both the radial and azimuthal
directions.

The optical-mode volume is determined by

$$V_m = \frac{\int_{V_Q} \epsilon(\mathbf{r})\, |E(\mathbf{r})|^2 \, d^3r}{|E_{\max}|^2}, \qquad (5)$$

where  Vq  represents  a  quantization  volume  of  the  electro¬ 
magnetic  field  and  \E\  is  the  electric  field  strength  [26].  In 
these  calculations,  we  have  chosen  the  quantization  volume 
cross  section  to  consist  of  a  square  region  of  approximately 
10  yam  width  and  height  centered  about  the  radial  cavity 
boundary.  This  choice  allows  the  mode  volume  to  be  deter¬ 
mined  to  a  good  accuracy  while  minimizing  computational 
requirements.  As  a  further  confirmation  of  the  validity  of  this 
approach,  we  note  that  the  radiation  loss  is  weak  for  the 
range  of  geometries  modeled  in  this  work,  resulting  in  only  a 
marginal  difference  in  the  numerically  calculated  mode  vol¬ 
ume  for  different  choices  of  quantization  volume. 
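
As an illustration of how the mode-volume definition above can be evaluated over a finite quantization cross section, the sketch below integrates an axisymmetric field on a 10 μm square window. Everything specific here is an assumption made for the example: the Gaussian field profile, the relative permittivity set to 1 everywhere (so the result is a purely geometric volume), the grid size, and the function names. It is not the finite-element procedure used in the paper.

```python
import math

def mode_volume_axisymmetric(eps, e2, r_min, r_max, z_min, z_max, n=201):
    """Riemann-sum estimate of  integral(eps*|E|^2 dV) / max(|E|^2)
    for a field that is independent of the azimuthal angle.

    eps(r, z) and e2(r, z) return the permittivity and |E|^2 at a point.
    """
    dr = (r_max - r_min) / (n - 1)
    dz = (z_max - z_min) / (n - 1)
    integral = 0.0
    peak = 0.0
    for i in range(n):
        r = r_min + i * dr
        for j in range(n):
            z = z_min + j * dz
            val = e2(r, z)
            # Volume element of a ring of radius r: 2*pi*r dr dz.
            integral += eps(r, z) * val * 2.0 * math.pi * r * dr * dz
            peak = max(peak, val)
    return integral / peak

# Hypothetical test field: Gaussian spot of 1/e half-width w centred at R0.
R0, w = 8.0, 0.5   # micrometres (illustrative values)
field_sq = lambda r, z: math.exp(-((r - R0) ** 2 + z ** 2) / w ** 2)
unit_eps = lambda r, z: 1.0

# 10 um x 10 um quantization window centred on r = R0, z = 0.
v_m = mode_volume_axisymmetric(unit_eps, field_sq, R0 - 5.0, R0 + 5.0, -5.0, 5.0)
print(f"V_m ~ {v_m:.2f} um^3")
```

For this test field the integral has the closed form $2\pi^{2} R_0 w^{2}$, which gives a quick check that the window is large enough: enlarging it further changes the result only marginally, mirroring the remark in the text about the choice of quantization volume.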

Figure  4  shows  the  calculated  modal  volume  for  the  fun¬ 
damental  mode  of  a  toroidal  cavity  as  a  function  of  minor 
diameter  and  for  principal  diameters  ranging  from 
16 to 20 μm. For clarity, only data for minor diameters below
4 μm are shown. Both TM (squares) and TE (circles)
polarizations  are  shown.  The  calculations  show  a  reduction 
of  modal  volume  for  both  polarizations  as  the  toroid  minor 
diameter  is  decreased.  This  is  expected  when  considering  the 
additional  confinement  provided  by  the  toroid  geometry  be¬ 
yond  the  spherical  geometry,  as  illustrated  in  the  electric  field 
plots  of  Fig.  3.  As  the  minor  diameter  is  decreased,  there  is 
initially  a  slow  reduction  of  modal  volume,  which  agrees 
very  well  with  a  simple  model  that  accounts  for  transverse 
guiding (azimuthal direction) using an approximate one-dimensional
harmonic oscillator model. This approach results
in a reduction of modal volume which scales as
$(d/D)^{1/4}$ with respect to that of a spherical cavity. This formula
holds for minor diameters greater than approximately
2 μm for the principal diameters considered in this work.
For  smaller  diameters,  the  spatial  confinement  becomes 
strong  enough  that  the  optical  mode  is  additionally  com¬ 
pressed  in  the  radial  direction.  This  results  in  a  faster  reduc¬ 
tion  of  modal  volume,  with  the  optical  modes  approaching 
those of a step-index optical fiber (this occurs for a minor
diameter below approximately 1 μm) [28]. The mode volume
reduces until the point where the optical mode becomes
delocalized  due  to  the  weak  geometrical  confinement,  caus¬ 
ing  a  finite  minimum  value.  Determination  of  the  exact  point 
of  the  minimum  modal  volume  upon  reduction  of  minor  di¬ 
ameter  (for  a  fixed  principal  diameter)  can  be  uncertain,  as 
the  choice  of  quantization  volume  now  plays  a  critical  role 
(as  discussed  above).  For  this  reason  the  results  in  Fig.  4 
show  the  modal  volume  only  for  inner  diameters  down  to 
0.65 μm, where mode volume determination was unambiguous.

Calculation  of  the  modal  volume  and  the  maximum  elec¬ 
tric  field  amplitude  at  the  exterior  cavity  equatorial  boundary 
is  straightforward,  giving  a  simple  way  to  calculate  both  the 
coupling  strength  and  the  critical  photon  number.  In  order  to 
obtain the cavity decay rate $\kappa$ and the critical atom number
$N_0$, however, the cavity Q factor must be determined.

B.  Quality  factor 

The  radiation  loss  of  the  optical  modes  of  a  spherical 
cavity  is  easily  found  by  consideration  of  the  analytic  char¬ 
acteristic  equation  [29] 

$$n^{1-2b}\,\frac{[nkR\, j_\ell(nkR)]'}{nkR\, j_\ell(nkR)} = \frac{[kR\, h_\ell^{(1)}(kR)]'}{kR\, h_\ell^{(1)}(kR)},$$

where n is the refractive index of the spherical cavity (the
external index is assumed to be unity), R is the cavity radius,
b represents the polarization of the optical mode (1 for TM
and 0 for TE), and $j_\ell$ ($h_\ell^{(1)}$) represents the spherical Bessel
(Hankel) function. The prime denotes differentiation with respect
to the argument of the Bessel (Hankel) function. This
equation accounts for radiation loss through the use of an
outgoing wave outside the cavity, as given by the complex
Hankel function of the first kind. Solution of this equation
results in a complex wave number $k = k_{Re} + i k_{Im}$, which determines
both the resonance wavelength ($\lambda = 2\pi/k_{Re}$) and the
radiation quality factor [$Q_{rad} = k_{Re}/(2 k_{Im})$].
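
The last step, reading the resonance wavelength and radiation quality factor off a complex wave number, can be sketched as below. This is only an illustration of the two relations just stated; the complex root is a made-up value rather than a solution of the characteristic equation, and the function name is ours. The magnitude of the imaginary part is used so that the result does not depend on the sign convention chosen for the outgoing wave.

```python
import math

def resonance_from_complex_k(k):
    """Return (wavelength, Q_rad) for a complex wave number k = k_Re + i*k_Im.

    wavelength = 2*pi / k_Re   (same length unit as 1/k)
    Q_rad      = k_Re / (2*|k_Im|)
    """
    wavelength = 2.0 * math.pi / k.real
    q_rad = k.real / (2.0 * abs(k.imag))
    return wavelength, q_rad

# Hypothetical root: resonance at 0.852 um with a radiation Q of 1e8.
k_re = 2.0 * math.pi / 0.852          # rad per micrometre
k_example = complex(k_re, k_re / (2.0 * 1.0e8))

wavelength_um, q_rad = resonance_from_complex_k(k_example)
print(f"lambda = {wavelength_um:.3f} um, Q_rad = {q_rad:.2e}")
```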

However, while the spherical solution can provide some
insight into the scaling of the radiation quality factor for
toroidal cavities where the minor diameter is large (sphere
like), the radiation quality factor when the optical mode is
strongly confined (as represented by small minor diameters) is
expected to decrease much more rapidly. Figure 5 shows numerical
calculations of the radiative quality factor as the minor
diameter is decreased for principal diameters of
16, 18, and 20 μm. We observe an initially slow reduction of
the radiative quality factor in the geometrical regime where
the minor diameter exceeds the radial extent of the optical
mode (i.e., where the optical mode exhibits whispering-gallery
behavior). As the minor diameter is reduced to a level
comparable to or smaller than the radial extent of the optical
mode (step-index-fiber-like regime), the drop-off of the
radiative Q is much more dramatic, with a decrease of over an
order of magnitude for a reduction of inner diameter of just
50 nm.

FIG. 5. Calculated radiation loss for a toroidal microcavity as a
function of minor diameter, for principal diameters of 20, 18, and
16 μm. Both TM (squares) and TE (circles) polarizations are
shown. The data show a slow reduction of Q as the minor diameter
is reduced while the mode behaves primarily as a
whispering-gallery-type mode. However, as the geometrical confinement
increases to such a point as the optical mode approaches that of a
step-index fiber, there is a significant reduction of the quality factor.

The total optical loss of a cavity has contributions not
only from radiation loss, but also from other dissipative
mechanisms, such as intrinsic material absorption, losses
resulting from both surface and bulk scattering, and losses
stemming from contaminants on the resonator surface [30].
Among the dominant contaminants that adversely affect
the cavity Q are OH and water adsorbed onto the cavity surface.
While prior investigations of these loss mechanisms
have resulted in approximate expressions for water absorption
and surface scattering [14,31], only very large resonators
were studied, as opposed to the much smaller diameter cavities
studied in this work. To obtain an improved estimate of
the effect of water on the small diameter cavities in this
paper, a simple model was used which determines the fraction
of optical energy absorbed by a monolayer of water
located at the cavity surface. This method gives an estimated
water-limited quality factor greater than $10^{10}$ for a
monolayer of water on a spherical resonator with a principal
diameter of 50 μm. While the water-limited quality factor
will be slightly lower for the smaller principal diameter cavities
in this work and also slightly lower due to the increased
overlap between the optical mode and the cavity surface in a
toroidal geometry, these values are comparable to the quality
factor due solely to the intrinsic absorption of silica in the
800-nm wavelength band. Since, in principle, proper fabrication
can prevent the presence of water and OH and minimize
surface scattering, we focus only on the contributions
from intrinsic silica absorption and radiation loss.
These two mechanisms put a fundamental limit on the Q
possible in these structures.

FIG. 6. Total quality factor for a toroidal microcavity versus
minor diameter for principal diameters of 16, 18, and 20 μm. Both
TE (circles) and TM (squares) polarizations are shown. The total
quality factor is composed of the radiative quality factor from Fig.
5 along with the silica-absorption-limited $Q_{mat} = 2.4 \times 10^{10}$ at a
wavelength of 852 nm. The plots indicate that the total quality factor
is limited by silica absorption when the principal diameter is
larger than 16 μm and the minor diameter is larger than approximately
1 μm. Furthermore, both polarizations have similar quality
factors over the range of geometries studied.

Figure 6 shows the calculated total quality factor for
principal toroid diameters in the range of 16-20 μm, as
a function of the minor diameter. The total quality factor is
calculated through the relation $1/Q_{total} = 1/Q_{rad} + 1/Q_{mat}$,
where only radiation loss and silica absorption are included.
For principal diameters less than 18 μm, there is a monotonic
decrease in quality factor as the minor diameter is decreased.
This is a result of the whispering-gallery-loss increase
due to the additional confinement. For larger principal
diameters, the overall quality factor is clamped near the limiting
value resulting from silica absorption for most minor
diameters (with only a slight decrease as minor diameter is
reduced), until the minor diameter is small enough that the
radiative quality factor decreases below the quality factor
due to silica absorption. For the principal diameters studied
in this work, this point occurs at a minor diameter of around
1 μm.
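
The way the two loss channels combine, and the clamping behavior just described, can be checked with a short sketch. The function name is ours, and the default material limit of 2e10 is only the approximate silica-absorption value quoted in the text; the radiation Q values passed in are illustrative.

```python
def total_quality_factor(q_rad, q_mat=2.0e10):
    """Combine loss channels: 1/Q_total = 1/Q_rad + 1/Q_mat.

    q_mat defaults to roughly the silica-absorption limit near 852 nm
    (an approximate value taken from the text).
    """
    return 1.0 / (1.0 / q_rad + 1.0 / q_mat)

# Radiation loss negligible: Q_total is clamped at the material limit.
q_clamped = total_quality_factor(1.0e36)
# Radiation loss dominant: Q_total follows Q_rad.
q_rad_limited = total_quality_factor(1.0e7)
# Equal contributions: Q_total is half of either.
q_equal = total_quality_factor(2.0e10)
print(f"{q_clamped:.2e} {q_rad_limited:.2e} {q_equal:.2e}")
```

Because the losses add as reciprocals, the smaller of the two quality factors always dominates, which is why the curves in Fig. 6 stay flat until the radiative Q falls to the material limit.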

C.  Cavity  QED  parameters 

The determination of the coupling strength from the
modal volume follows from Eq. (1). Figure 7 shows the
atom-cavity coupling rate $g/(2\pi)$ for various toroid principal
diameters as the toroid minor diameter is decreased. It can be
seen that there is a monotonic rise in g for higher-aspect-ratio
toroids (i.e., D/d), as a direct result of the compression of
modal volume. The rate of increase of g as the minor diameter
is reduced increases dramatically as the toroid geometry
transitions from a whispering-gallery-type mode to a
strongly confined step-index fiber-type mode. This is due not
only to the faster rate of reduction of mode volume in the
step-index-fiber-like regime as the minor diameter is decreased,
but also to the increase in electric field strength
at the cavity surface [as $g \propto |\vec{E}|\,(V_m)^{-1/2}$]. Note that the
coupling strengths shown do not correspond to the absolute
maximum for these structures, as this work has focused on
the simultaneous realization of high-quality factors and small
modal volume. Therefore, mode volumes were calculated
only down to where the radiation quality factor is equal to or
slightly exceeds $10^{7}$. Also, as mentioned previously, by making
this restriction we prevent any uncertainty in the calculated
mode volumes (and hence g) through the definition of
the modal quantization volume. Under these assumptions,
the calculations indicate that coupling parameters exceeding
700 MHz are possible.

FIG. 7. Atom-cavity coupling parameter g vs minor diameter (μm) for
toroidal cavities having a principal diameter of 16, 18, and 20 μm,
with g increasing for smaller principal diameters. Both TE (circles)
and TM (squares) polarizations are shown. The plots indicate that
the coupling strength increases dramatically as the minor diameter
decreases below 1.5 μm, which is a result of the rapid reduction of
mode volume and the increased electric field strength at the cavity
surface.

Figure  8  shows  the  corresponding  critical  photon  numbers 
($n_0$). The results reveal that values as low as $6 \times 10^{-6}$ are
possible, with the associated quality factors exceeding $10^{7}$.
As  will  be  discussed  in  more  detail  in  the  next  section,  this 
value  is  not  only  comparable  to  the  fundamental  limit  of  FP 
technology,  but  also  vastly  exceeds  that  possible  for  fused 
silica  microspheres  with  a  comparable  quality  factor. 

One of the primary reasons high-Q whispering-gallery-mode
cavities are promising for CQED is their very low
critical atom number. Figure 9 shows the calculated critical
atom number versus minor diameter for toroid principal diameters
of 16, 18, and 20 μm. The plot shows that for the
larger principal diameters of 18 and 20 μm there is a minimum
in the critical atom number as the toroidal minor diameter
is reduced. The minimum occurs near a minor diameter
of 1 μm. This minimum arises from the clamping of the total
quality factor (to the quality factor resulting from silica
absorption) for larger minor diameters when the principal
diameter is greater than approximately 18 μm. Thus, by reducing
the minor diameter for a fixed principal diameter, the
quality factor is nearly unchanged while the coupling
strength is monotonically increasing. The critical atom number
decreases until the region where the minor diameter is
such that the overall Q is determined by whispering-gallery
loss. At this point the critical atom number increases
approximately exponentially. The plot for the 20 μm principal
diameter shows that in a toroidal geometry slightly larger
principal diameters can offer some benefit, as the minor
diameter can be compressed more strongly while maintaining
high radiative quality factors and, thereby, lowering the
critical atom number. A critical atom number of approximately
$2 \times 10^{-7}$ is possible using a toroid principal diameter of
20 μm and a minor diameter of 1 μm.

FIG. 8. Critical photon number $n_0$ vs minor toroid diameter for
a cavity with principal diameters of 16, 18, and 20 μm. Both TE
(circles) and TM (squares) polarizations are shown. The plots show
that as both toroid principal diameter and minor diameter are
reduced, the critical photon number decreases. This follows directly
from the behavior of the atom-cavity coupling parameter g, as
indicated in Fig. 7. The calculations show that critical photon
numbers of $6 \times 10^{-6}$ are possible (with quality factors exceeding $10^{7}$).

FIG. 9. Critical atom number $N_0$ vs minor diameter for a toroidal
microcavity with principal diameters of 16, 18, and 20 μm. For
small minor diameters the critical atom number decreases as the
principal diameter increases. Both TE (circles) and TM (squares)
polarizations are shown. The plots indicate that there is a minimum
value of the critical atom number near $2 \times 10^{-7}$ for a toroidal cavity
with a principal diameter of 20 μm and an inner diameter of 1 μm
(TM mode).

V.  EXPERIMENTAL  MEASUREMENT 
OF  MICROTOROIDS  FOR  STRONG-COUPLING 
CAVITY  QED  AT  852  nm 

The  presented  numerical  results  indicate  that  toroidal 
cavities  can  theoretically  obtain  high  values  of  atom-cavity 
coupling  while  simultaneously  retaining  an  extremely  low 
critical  photon  number  and  in  particular  an  exceedingly 
small  critical  atom  number.  While  in  principle  the  critical 
atom  number  can  be  more  than  100  times  smaller  than  any 
currently  demonstrated  cavity,  the  necessity  of  realizing 
material-limited quality factors exceeding $2 \times 10^{10}$ is
experimentally challenging. The current record for any cavity is
$9 \times 10^{9}$ [14], in a large-diameter microsphere cavity, whereas
for toroidal cavities quality factors as high as $4 \times 10^{8}$ at a
resonance wavelength of 1550 nm have been realized [24].
However, for cavity quality factors much larger than $10^{8}$, the
dominant  dissipative  mechanism  in  the  atom-cavity  system  is 
the  radiative  decay  rate  of  the  atomic  medium,  which  is 
2.61  MHz  for  the  D2  transition  of  cesium.  For  this  reason 
more  “modest”  quality  factors,  in  the  range  of  current  experi¬ 
mentally  achievable  values  (e.g.,  a  few  hundred  million)  are 
attractive.  As  these  values  are  currently  realizable  for  toroi¬ 
dal  cavities  at  a  wavelength  of  1550  nm,  we  have  investi¬ 
gated  experimentally  the  quality  factors  and  fabrication  lim¬ 
its  for  structures  designed  for  strong  coupling  to  the  cesium 
transition  at  a  wavelength  of  852  nm. 

As  toroidal  cavities  are  fabricated  using  a  combination  of 
lithography  and  a  silica  reflow  process,  the  advantages  of 
lithographic  control  and  parallelism  are  obtained  and,  in  fact, 
are  a  significant  step  forward  over  spherical  cavities.  As  the 
shape of the initial silica preform dictates the maximum possible
principal and minor diameter and is lithographically
formed, precise control of the geometry dimensions is possible.
Reproducible principal diameters ranging from
>100 to 12 μm have been fabricated. This lower value,
while currently dictated by the available laser power in our
setup, is sufficient to obtain the range of principal diameters
optimally suited for CQED, as indicated above. While the
capability to obtain reproducible principal diameters is a
significant improvement over spherical cavities, the ability to
accurately control the minor diameter is particularly important
to CQED. As noted previously [19], the final minor diameter
of the fabricated structures results from a combination
of factors: the initial silica preform thickness,
the supporting pillar size, and the laser irradiation intensity
and duration. Minor diameters as small as 3 μm at principal
diameters as low as 12 μm have been realized experimentally.

We  have  measured  the  quality  factor  of  a  series  of  fiber- 
taper-coupled  toroidal  microcavities  at  a  wavelength  of 
852  nm,  using  an  experimental  apparatus  similar  to  previous 
work  [19,32].  The  excitation  laser  was  a  New  Focus  Vortex 
laser  with  a  tunability  of  40  GHz  with  a  center  wavelength  of 
852.359  nm.  The  laser  output  was  double  passed  through  an 
acousto-optic modulator for the purpose of performing a cavity
ringdown measurement. The resulting beam could
be extinguished by a TTL electrical control signal, with a
corresponding optical decay time of 15 ns. This beam was
then  coupled  into  a  single-mode  850  nm  fiber  and  subse¬ 
quently  interacted  with  the  toroidal  resonators  through  the 
tapered  portion  of  the  fiber.  Due  to  the  limited  tuning  range 
of  the  excitation  laser  (which  is  less  than  the  free-spectral 
range  between  fundamental  modes  in  the  cavity  principal 
diameters  of  interest),  overlap  of  a  fundamental  resonance 
with  the  laser  wavelength  range  was  difficult.  Obtaining  an 
optical  fundamental  mode  at  852.359  nm  was  achieved  by 
thermally  shifting  the  optical  resonance  through  the  use  of  a 
Peltier  heating  element,  which  allowed  tuning  of  the  cavity 
resonance  by  up  to  approximately  50  GHz.  Upon  realization 
of  a  fundamental  cavity  resonance  at  the  proper  wavelength, 
the intrinsic quality factor was inferred in two ways (Fig. 10):
through cavity ringdown [19] and through the threshold for
stimulated Raman scattering [32]. The results of both measurements
were in agreement and resulted in a measured
quality factor as high as $Q_{total} = 1.2 \times 10^{8}$ in a cavity with a
principal diameter of 50 μm and a minor diameter of 6 μm.
For this cavity geometry, the whispering-gallery loss is negligible
($Q_{rad} \sim 10^{36}$) compared to the intrinsic silica absorption
loss, such that the overall theoretical quality factor can
be as high as $Q_{total} \sim 2 \times 10^{10}$. We expect upon further
measurements that this quality factor can be increased to levels
comparable to measurements performed at a wavelength of
1550 nm ($4 \times 10^{8}$).
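
The ringdown estimate can be reproduced with the standard relation between photon lifetime and loaded quality factor, $Q_L = \omega\tau$. The sketch below is a hedged illustration using the 22 ns lifetime quoted in the Fig. 10 caption and the laser center wavelength of 852.359 nm; the function name is ours, and the subsequent correction for taper loading and backscattering that yields the intrinsic Q is not modeled here.

```python
import math

C_LIGHT = 299_792_458.0  # speed of light in vacuum, m/s

def loaded_q_from_lifetime(tau_s, wavelength_m):
    """Loaded quality factor Q_L = omega * tau for photon lifetime tau."""
    omega = 2.0 * math.pi * C_LIGHT / wavelength_m  # angular frequency, rad/s
    return omega * tau_s

q_loaded = loaded_q_from_lifetime(22e-9, 852.359e-9)
print(f"Q_L = {q_loaded:.2e}")
```

The result is close to the loaded value of about $0.48 \times 10^{8}$ reported for the ringdown measurement.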

While this cavity geometry is far from the optimal geometry
suggested in this paper, this structure was chosen in
order to increase the likelihood of finding a fundamental
resonance at 852 nm. Even for this relatively large structure,
cavity QED parameters of $(g/(2\pi), n_0, N_0)$ = (86 MHz,
$4.6 \times 10^{-4}$, $1.0 \times 10^{-3}$) are calculated. Comparison of these
values to current FP cavities [4,10,20] indicates that even without
additional improvements in fabrication these results are
close in coupling strength and improved with respect to the
critical atom number. Additionally, if we restrict the geometry
and overall quality factor to values which are currently
realizable (i.e., a quality factor of $10^{8}$ at a wavelength of
852 nm with a minor diameter of 3.5 μm, which represents a
reasonably comfortable margin from the actual current limits),
the optimal principal diameter is 13 μm (this geometry
has a radiative quality factor of $1.8 \times 10^{8}$). For these values
the TM-polarized optical mode would have CQED parameters
of $(g/(2\pi), n_0, N_0)$ = (450 MHz, $1.7 \times 10^{-5}$, $4.5 \times 10^{-5}$),
which are far superior to current FP cavities.

FIG. 10. Experimental measurement of the intrinsic quality factor
for a toroidal microresonator at a resonance wavelength of
852 nm. The main figure shows the generation of stimulated Raman
scattering, illustrated by the secondary peak located at a wavelength
of 890 nm. The threshold pump power for stimulated Raman scattering
(53 μW) can be used to infer the intrinsic quality factor of
$1 \times 10^{8}$ for this cavity. The inset shows the temporal cavity decay
resulting from a series of ringdown measurements for a different
toroidal microcavity. The measured photon lifetime of $\tau = 22$ ns
corresponds to a loaded quality factor of $Q_L = 0.48 \times 10^{8}$. After
correcting for fiber-taper loading and the presence of backscattering, an
intrinsic quality factor of $1.2 \times 10^{8}$ is obtained.
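
The parameter sets quoted in this section can be cross-checked with a short calculation. The sketch below rests on assumptions that are not restated in this part of the text: the usual definitions $n_0 = \gamma_\perp^2/(2g^2)$ and $N_0 = 2\gamma_\perp\kappa/g^2$, a cavity field decay rate $\kappa/(2\pi) = c/(2\lambda Q)$, and $\gamma_\perp/(2\pi) = 2.61$ MHz for the cesium D2 line. The function names are ours. With these choices the quoted values are recovered to the stated precision.

```python
C_LIGHT = 299_792_458.0     # speed of light, m/s
GAMMA_PERP_MHZ = 2.61       # cesium D2 transverse decay rate / (2*pi), MHz

def cavity_decay_rate_mhz(q, wavelength_m):
    """Field decay rate kappa/(2*pi) in MHz, assuming kappa = omega/(2*Q)."""
    return C_LIGHT / wavelength_m / (2.0 * q) / 1.0e6

def critical_photon_number(g_mhz, gamma_mhz=GAMMA_PERP_MHZ):
    """n0 = gamma^2 / (2 g^2) (assumed standard definition)."""
    return gamma_mhz ** 2 / (2.0 * g_mhz ** 2)

def critical_atom_number(g_mhz, kappa_mhz, gamma_mhz=GAMMA_PERP_MHZ):
    """N0 = 2 gamma kappa / g^2 (assumed standard definition)."""
    return 2.0 * gamma_mhz * kappa_mhz / g_mhz ** 2

# Measured cavity: g/(2*pi) = 86 MHz, Q = 1.2e8 at 852.359 nm.
kappa_meas = cavity_decay_rate_mhz(1.2e8, 852.359e-9)
n0_meas = critical_photon_number(86.0)
N0_meas = critical_atom_number(86.0, kappa_meas)

# Currently realizable design: g/(2*pi) = 450 MHz, Q = 1e8.
kappa_design = cavity_decay_rate_mhz(1.0e8, 852.359e-9)
n0_design = critical_photon_number(450.0)
N0_design = critical_atom_number(450.0, kappa_design)

print(f"measured: n0 = {n0_meas:.1e}, N0 = {N0_meas:.1e}")
print(f"design:   n0 = {n0_design:.1e}, N0 = {N0_design:.1e}")
```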


VI.  COMPARISON  OF  MICROTOROIDS 
WITH  OTHER  RESONATORS  FOR  CAVITY  QED 

Table  I  presents  a  comparison  of  CQED  parameters  for 
various  cavity  types  including  toroidal,  FP,  and  photonic 
crystal.  To  date,  most  experimental  work  has  involved  the 
use  of  Fabry-Perot  cavities,  with  current  state-of-the-art  fab¬ 
rication  technology  allowing  the  attainment  of  coupling 
strengths  of  110  MHz,  with  corresponding  critical  atom 
numbers of $6 \times 10^{-3}$ [4]. Estimates on the theoretical performance
limits of FP cavities have also been investigated [20],
predicting coupling rates as large as 770 MHz, with a corresponding
critical atom number of $2 \times 10^{-4}$. While this level
of  performance  may  be  theoretically  possible,  the  current 
necessity  of  expensive  and  sophisticated  high-reflection  di¬ 
electric  mirror  coatings  does  not  bode  well  for  easy  improve¬ 
ments  with  respect  to  current  technology.  This  is  one  of  the 
reasons  silica  microspheres  are  of  such  high  interest.  Calcu¬ 
lation  of  the  limits  possible  with  silica  microspheres  [23] 
shows  that  not  only  is  it  possible  to  obtain  high  values  of 


atom-cavity  coupling  solely  by  changing  the  cavity  diameter, 
which  is  easily  in  the  realm  of  current  fabrication  capability, 
but their ultrahigh quality factors result in significant improvement
in the critical atom number (with values approaching
$3 \times 10^{-6}$ possible provided that silica absorption-limited
quality factors can be obtained). Even using quality
factors in the range of a few hundred million, which is already
experimentally demonstrated, critical atom numbers
around $10^{-4}$ are possible, which is comparable to the FP
limit.  From  the  analysis  of  the  previous  section,  we  see  that 
toroidal  cavities  can  attain  coupling  strengths  comparable  to 
or  exceeding  the  best  values  possible  for  either  FP  or  micro¬ 
sphere  cavities,  while  at  the  same  time  providing  much  lower 
critical  atom  numbers.  As  discussed  previously,  this  arises 
from  the  extra  level  of  geometrical  control  possible  in  a  to- 
roidally  shaped  cavity,  which  allows  one  to  retain  both  the 
high-coupling  strength  representative  of  small-mode  volume 
cavities  while  preserving  high-quality  factors.  Clearly  this 
fact,  along  with  other  advantages  in  control  and  reproducibil¬ 
ity  over  spherical  cavities,  suggests  that  these  structures  are 
promising  for  CQED  experiments. 

Last, a comparison with photonic band-gap (PBG) cavities
is also provided in the table. Due to the realization of
optical mode volumes near the fundamental limit in a dielectric
cavity [21], combined with recent results demonstrating
reasonably high-quality factors (~45 000) [33], these cavities
are strong candidates for chip-based strong-coupling
CQED [22]. While these structures can potentially achieve
atom-cavity coupling strengths $g \sim 17$ GHz [22], far greater
than those possible in a silica dielectric cavity, their much
lower quality factors result in greater critical atom numbers
than possible in toroidal microcavities. For example, the
work of Ref. [22] projects $N_0 = 6.4 \times 10^{-5}$. We also note that
the correspondingly lower quality factors also result in modest
ratios of coupling to dissipation $g/\max(\gamma_\perp, \kappa)$ (a figure of
merit indicative of the number of Rabi oscillations which
occur) of ~4 [22], much lower than predicted for toroidal
structures (~165). Furthermore, we can consider an additional
figure of merit: namely, the "rate of optical information
per atom" [1], given by $R = g^{2}/\kappa$. The table indicates
that toroidal cavities compare favorably with PBG cavities in
this figure of merit as well.
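
The two figures of merit discussed above can be evaluated with a few lines. The sketch below is illustrative only: the function names are ours, and it assumes that R is computed with g and κ expressed as angular frequencies (so that rates given as frequency/(2π) in MHz are multiplied by 2π), and that κ can be backed out of a quoted critical atom number through the assumed definition $N_0 = 2\gamma_\perp\kappa/g^2$. With the experimental Fabry-Perot values quoted in the text (110 MHz and $N_0 \approx 6 \times 10^{-3}$, taken here as $6.1 \times 10^{-3}$ from Table I), these assumptions give a ratio near 7.8 and a rate near $5.4 \times 10^{3}$ Mbits/sec.

```python
import math

GAMMA_PERP_MHZ = 2.61  # cesium D2 transverse decay rate / (2*pi), MHz

def kappa_from_critical_atom_number(g_mhz, n_atoms, gamma_mhz=GAMMA_PERP_MHZ):
    """Invert N0 = 2 gamma kappa / g^2 (assumed definition) for kappa/(2*pi)."""
    return n_atoms * g_mhz ** 2 / (2.0 * gamma_mhz)

def coupling_to_dissipation(g_mhz, kappa_mhz, gamma_mhz=GAMMA_PERP_MHZ):
    """Figure of merit g / max(gamma_perp, kappa)."""
    return g_mhz / max(gamma_mhz, kappa_mhz)

def optical_information_rate_mbits(g_mhz, kappa_mhz):
    """R = g^2 / kappa in Mbits/sec, with g and kappa as angular frequencies."""
    return 2.0 * math.pi * g_mhz ** 2 / kappa_mhz

# Fabry-Perot experimental state of the art: g/(2*pi) = 110 MHz, N0 = 6.1e-3.
kappa_fp = kappa_from_critical_atom_number(110.0, 6.1e-3)
ratio_fp = coupling_to_dissipation(110.0, kappa_fp)
rate_fp = optical_information_rate_mbits(110.0, kappa_fp)
print(f"kappa/(2 pi) = {kappa_fp:.1f} MHz, ratio = {ratio_fp:.1f}, R = {rate_fp:.2e}")
```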


VII.  CONCLUSION 


Our work has demonstrated that toroidal resonators are
promising cavities for investigation of the coupling of an
atomic system to the electromagnetic field in the regime of
strong coupling. Not only are these structures arguably simpler
to manufacture and control than other structures such as
microspheres and FP cavities, but they also allow integration on a
silicon chip, paving the way for the addition of atom traps
[34] and waveguides which can enhance the capability and
possibly reduce the experimental complexity of CQED studies.
Furthermore, we note that in addition to the enhanced
performance benefit of having a toroidal geometry, the capability
to retain a relatively large resonator diameter over
other structures results in a smaller free-spectral range
(FSR). This allows not only easier tuning of the cavity resonance
location to correspond precisely to the atomic transition
wavelength, but also may allow integration of a supplemental
far-off-resonance trap by exciting the cavity at a
multiple of the free-spectral range. The realization of a cavity
with a smaller FSR may allow a closer matching of a secondary
resonance location to the pump wavelength which
corresponds to state-insensitive trapping of atomic cesium
[7], which can simplify the atom-cavity dynamics. The use
of a silica dielectric whispering-gallery cavity also allows
operation over a broad range of wavelengths, with very-high-quality
factors possible for nearly all resonances. This is in
strong contrast to the mirror reflectivity limits of coated FP
cavities.

TABLE I. Summary of the relevant parameters for cavity QED for a variety of resonator systems. The table shows both the experimental
state of the art [23] and the projected limits for a Fabry-Perot cavity [20], plus current experimental results with silica microspheres [16].
Furthermore, a theoretical comparison between silica microspheres [23], photonic band-gap cavities [22], and toroidal microresonators (this
work) is also given. The results indicate that toroidal cavities can uniformly exceed the performance on these parameters for both FP cavities
and silica microspheres. Comparison with PBG cavities indicates that toroids possess much lower atom-cavity coupling strengths (as a result
of their much larger mode volumes), but still result in greatly improved critical atom numbers due to their very large quality factors.

Resonator system                            Coupling      Critical      Critical      Coupling to    Rate of optical
                                            coefficient   photon        atom          dissipation    information
                                            g/(2π)        number        number        ratio          R = g²/κ
                                            (MHz)         n_0           N_0           g/max(γ⊥,κ)    (Mbits/sec)

Fabry-Perot experimental state of the art   110           2.8 × 10^-4   6.1 × 10^-3   7.8            5.4 × 10^3
Fabry-Perot projected limits                770           5.7 × 10^-6   1.9 × 10^-4   36             1.7 × 10^5
Microsphere experimental (D = 120 μm)       24            5.5 × 10^-3   3.0 × 10^-2   7.2            1.1 × 10^3
Microsphere theory
  Maximum g (D = 7.25 μm)                   750           6.1 × 10^-6   7.3 × 10^-1   0.01           4.5 × 10^1
  Minimum N_0 (D = 18 μm)                   280           4.3 × 10^-5   3.1 × 10^-6   107            1.1 × 10^7
Photonic band-gap cavity                    17000         7.6 × 10^-9   6.4 × 10^-5   3.9            5.1 × 10^5
Toroidal microcavity theory
  Maximum g                                 >700          6.0 × 10^-6   2.0 × 10^-4   40             1.6 × 10^5
  Minimum N_0                               430           2.0 × 10^-5   2.0 × 10^-7   165            1.6 × 10^8

The ability to connect distant quantum nodes with high efficiency,
preferably over optical fiber, is very desirable for quantum
networks. Using FP cavities, optical fiber coupling is possible;
however, the overall coupling efficiency is modest (~70%).
Fiber-taper-coupled microtoroids allow coupling efficiencies in
excess of 99% [17], above both FP and PBG cavities (97%) [35].
This capability to obtain near-complete input and output coupling
efficiencies strongly suggests the use of fiber-coupled silica
whispering-gallery cavities, such as microtoroids, as building
blocks to enable high-performance quantum networks.


As a further note, the use of higher-index-contrast dielectric
material can allow additional improvements in the performance of
these structures. The use of silica as the dielectric of choice in
both the spherical geometry and in the toroidal microcavities
studied in this work was convenient, as these structures not only
possess record-high quality factors but are currently producible.
However, as the radiative quality factor of a
whispering-gallery-type cavity is strongly dependent on the
refractive index difference between the structure and external
environment, much smaller modal volumes are possible for a given
quality factor with the use of a higher-index resonator material.
In fact, this is one of the reasons PBG cavities fabricated from
silicon or other high-index dielectrics can obtain ultrasmall mode
volumes. A simple comparison of the mode volume possible in a
silicon toroid shows that a mode volume only about 10 times larger
than that of PBG cavities is possible, with much higher quality
factors. While this work has focused on silica microcavities, the
reflow process is a relatively flexible method, thus suggesting
that it may be possible to also create high-index, ultrahigh-Q
cavities which come closer to the large coupling strengths of PBG
cavities while further improving the critical atom number.

Last, the current experimental ability to obtain large coupling
strengths with quality factors exceeding 10^8 is promising for the
immediate use of these structures in strong-coupling studies. We
are currently pressing forward on improving the fabrication
capabilities and losses of these structures. Coupled with the
intrinsic fiber-optic compatibility of these structures and the
demonstration of near-lossless excitation and extraction of
optical energy from these structures using tapered optical fibers
[17], toroidal microcavities can provide a highly advantageous
experimental system for the investigation of strong-coupling
cavity QED.

We report an investigation to establish the physical mechanisms responsible for decoherence in
the generation of photon pairs from atomic ensembles, via the protocol of Duan et al. for long-
distance quantum communication [Nature (London) 414, 413 (2001)], and present the experimental
techniques necessary to properly control the process. We develop a theory to model in detail the
decoherence process in experiments with magneto-optical traps. The inhomogeneous broadening of
the ground state by the trap magnetic field is identified as the principal mechanism for decoherence.
The theory includes the Zeeman structure of the atomic hyperfine levels used in the experiment, and
the polarization of both excitation fields and detected photons. In conjunction with our theoretical
analysis, we report a series of measurements to characterize and control the coherence time in our
experimental setup. We use copropagating stimulated Raman spectroscopy to access directly the
ground-state energy distribution of the ensemble. These spectroscopic measurements allow us to
switch off the trap magnetic field in a controlled way, optimizing the repetition rate for single-photon
measurements. With the magnetic field off, we then measure nonclassical correlations for pairs of
photons generated by the ensemble as a function of the storage time of the single collective atomic
excitation. We report coherence times longer than 10 μs, corresponding to an increase of two orders
of magnitude compared to previous results in cold ensembles. The coherence time is now two orders
of magnitude longer than the duration of the excitation pulses. The comparison between these
experimental results and the theory shows good agreement. Finally, we employ our theory to devise
ways to improve the experiment by optical pumping to specific initial states.


I.  INTRODUCTION 

Quantum memory is a key resource for many quantum-information
protocols. Usually it is associated with the basic requirements
for quantum computation [1, 2], but in recent years quantum
communication protocols have also started to rely on it. The
requirement of memory was introduced in quantum communication as
part of the idea of quantum repeaters [3, 4], a possible solution
to the problem of quantum communication over long distances. In
this case, memory is essential to increase the probability of
success of the chain of conditional steps that underlies the
protocol, and it makes scalable quantum networks feasible.

A significant step toward the realization of the quantum repeater
idea was a proposal by Duan, Lukin, Cirac, and Zoller (DLCZ) for
its implementation using linear optics and atomic ensembles [5].
The DLCZ protocol is based on the generation of single photons by
spontaneous Raman scattering in atomic ensembles [6]. The
detection of a single photon in the forward-propagating mode
heralds the presence of a single collective atomic excitation in
the sample, due to a collective enhancement effect. This
excitation can be stored for a time up to the coherence time of
the ground states of the atoms and then converted back into a
light field. Entanglement of distant ensembles in the
excitation-number basis is generated by interference [7], and
extended to longer distances by entanglement swapping [8, 9]. The
final pairs of ensembles, far apart, can then be used for
entanglement-based quantum cryptography [5, 10], probabilistic
quantum teleportation, and violation of Bell inequalities. This
proposal has received much attention in the past two years and
several groups are presently pursuing its experimental
implementation [11, 12, 13, 14, 15, 16, 17, 18].

In this article, we analyze the decoherence processes present in
the DLCZ protocol, and describe experiments to mitigate the
problem. We construct a theory for the decoherence process in the
photon-pair generation. In particular, our analysis concentrates
on its implementation with cold atomic ensembles, but many results
should also apply to studies with room-temperature ensembles in
vapor cells. We propose various strategies to increase the
system's coherence time, and introduce experimental techniques
necessary for its characterization and control. We also report the
first experimental steps in this direction, with an increase of
more than two orders of magnitude in the coherence time with
respect to the previously reported works with cold atoms
[11, 14, 16, 17, 18].

The coherence times reported up to now by the several groups
working on the implementation of the protocol are all shorter than
or of the order of a couple of microseconds. Furthermore, for all
experiments to date, the reported coherence times are of the order
of the duration of the excitation pulses. However, to use this
system as a quantum memory, it is important to obtain storage
times much longer than the excitation pulses. Moreover, for the
DLCZ protocol to become a viable alternative for long-distance
quantum communication, a long coherence time is crucial and major
efforts are required to increase it. The main goal of the present
article is then to provide the initial steps in this direction,
and to establish several techniques and ideas for the next steps.

Only two types of systems have been employed in the experiments up
to now: vapor cells [12, 13, 15] and cold atoms in magneto-optical
traps [11, 14, 16, 17, 18]. In both systems, however, the
experiments have not yet achieved their respective
state-of-the-art coherence times. The vapor-cell studies, for
example, did not employ paraffin-coated cells [19, 20]; the
coherence times were effectively limited to the time the atoms
take to diffuse out of the excitation region, which is of the
order of microseconds. Recently, high-fidelity atomic quantum
memory of the state of a light pulse was achieved with such
paraffin-coated cells [21], with memory times of up to 4 ms.
Coherence times of tens of milliseconds, however, are commonly
achieved in this system [22], and there are reports of coherence
times as high as one second [19]. The difference in these values
is largely due to measurements of decay of different coherent
processes [19]. How the coherence required for the generation of
photon pairs from atomic vapors will decay as the atoms collide
with the walls of paraffin-coated cells is still to be determined.

The use of atomic traps to generate photon pairs for the DLCZ
protocol has the advantage of providing a high density of atoms
distributed in a small spectral region, due to the suppression of
Doppler broadening by the cooling process. This allows the use of
excitation laser pulses tuned closer to resonance, which requires
much less power and makes it easier to filter the excitation
pulses from the Raman-scattered photons. However, atomic traps
also introduce a different set of complications. In the case of
the magneto-optical traps (MOT) used up to now, the magnetic field
of the trap induces decoherence on a timescale of the order of, or
smaller than, a few hundred nanoseconds [16, 17, 18]. The first
results with the MOT magnetic field off are reported in the
present article, with coherence times on the order of 10 μs. As
will be discussed below in detail, a better nulling of the
magnetic field combined with optical pumping to specific Zeeman
levels might increase the coherence time, in a straightforward
way, to hundreds of microseconds.

Further improvements with MOTs would face the problem of diffusion
of atoms from the excitation region and, most troublesome, from
the MOT itself. This problem can in principle be mitigated by
improved cooling techniques. However, along these lines, it would
be difficult to increase the coherence time above a couple of
milliseconds. A possible solution then is to use an optical dipole
trap to hold the atoms during the write-and-read process.
Hyperfine coherence times of hundreds of milliseconds have already
been observed in such traps [23, 24].

In the following, Secs. II and IV are devoted to theoretical
results and Sec. III to associated experiments. In Sec. II A we
give a general introduction to the photon-pair generation process
behind the DLCZ protocol. In Sec. II B, we derive a theory for the
probability of joint detection of the photon pairs generated from
an atomic ensemble in a magneto-optical trap. This theory is a
direct extension of the previous theoretical treatment reported in
Ref. [6], to which we add explicitly the reading process and the
Zeeman structure of the levels. In this way, we are able to model
the action of the magnetic field on the atoms, and to study the
dependence of the correlations on the light polarization.

Section III describes an experimental investigation leading to the
nulling of the magnetic field in the photon-pair correlation
measurements, with the subsequent increase in the system coherence
time and degree of correlation. In Sec. III A, we describe a
series of Raman-spectroscopy experiments to characterize the
system and optimize the process of zeroing the magnetic field. We
determine the set of experimental conditions that result in a good
compromise between atomic density and magnetic field cancellation,
which we used in the correlation measurements. Section III B then
describes measurements of nonclassical correlations for the photon
pairs generated by the MOT. We compare results with the magnetic
field on and with the magnetic field off. The field-off
measurements present a higher degree of correlation, and a
coherence time a hundred times larger. We compare the shape of the
experimental curves with the magnetic field on and off to our
theory, obtaining good agreement. We also show how the two-photon
wavepacket that describes the detailed temporal structure of the
photon-pair generation is modified by the magnetic field.

Finally, based on the procedure for comparison between theory and
experiment described in Sec. III B, we formulate in Sec. IV a
proposal to improve our experimental signal. We suggest using a
combination of optical pumping to a specific initial state and
polarization of the light fields to increase both our detection
efficiency and coherence time. Section V is dedicated to our
conclusions.


II.  THEORY 

The basic theory for the DLCZ protocol is described in Refs. [5]
and [6]. The general idea of the protocol is treated in Ref. [5],
while Ref. [6] gives a detailed analysis of the collective
emission of photons through spontaneous Raman scattering following
excitation by free-space light. Section II B provides an extension
of the theoretical treatment of Ref. [6] to better account for our
experimental conditions. The emphasis here is the modeling of the
decoherence process due to external magnetic fields, in particular
for experiments using magneto-optical traps. To model this
decoherence, the essential elements to be introduced into the
previous theory of Ref. [6] are the Zeeman structure of all levels
and an explicit treatment of the reading process. On the other
hand, the theory in Sec. II B is a simplification of the treatment
of Ref. [6] concerning the spatial mode of the photons. We
consider only the forward, collectively enhanced emission. The
reading process is also treated in a simplified, perturbative way,
while the experiments are done with stronger read pulses on
resonance. This latter difference between theory and experiment
will result in some noticeable discrepancy in Sec. III B 2, where
we discuss measurements of the two-photon wavepacket of the
pair-generation process. In general, however, the comparison
between theory and experiment performed in Sec. III B results in
very good agreement, which indicates that the theory in Sec. II B
takes into account the essential physical elements behind the
decoherence process.

A.  Photon  pair  generation 

The building block of the DLCZ protocol is an ensemble of $N$
identical atoms with a lambda-type energy level configuration as
shown in Fig. 1, which we briefly discuss here in an ideal
setting. In the experiments discussed in this article, the lower
states $|g\rangle$ and $|s\rangle$ are hyperfine sublevels of the
electronic ground state of cesium atoms. First, all atoms are
prepared in the state $|g\rangle$. By sending in a weak,
off-resonant laser pulse, one atom of the ensemble might be
transferred from $|g\rangle$ to $|s\rangle$, thus emitting a
photon (field 1) at a frequency or polarization different from the
original exciting field. A key element of the protocol is the
collective enhancement of this spontaneous Raman scattering in a
forward direction, which is determined by the spatial mode of the
laser pulse and the geometry of the excitation region [6]. If the
laser intensity is low enough so that two excitations are very
unlikely, the detection of the photon generated in this process is
a signature that the ensemble was excited to a symmetrical
collective state [5, 6], which in the ideal case can be explicitly
written as

\[ |1_a\rangle = \frac{1}{\sqrt{N}} \sum_{i=1}^{N} |g\rangle_1 \cdots |s\rangle_i \cdots |g\rangle_N , \tag{1} \]

where the sum goes over all atoms addressed by the laser pulse,
and $|1_a\rangle$ indicates the state of the atomic ensemble with
just one excitation. This is the "writing" step of the protocol
(Fig. 1a).

Since the excitation probability $\chi$ is very small, the whole
state of the system consisting of atoms and forward-scattered mode
of light is in the following form:

\[ |\phi\rangle = |0_a\rangle |0_1\rangle + e^{i\beta} \sqrt{\chi}\, |1_a\rangle |1_1\rangle + O(\chi) , \tag{2} \]

where $\chi \ll 1$, $|n_1\rangle$ stands for the state of the
forward-propagating light field 1 with $n$ photons, $\beta$ is a
phase set by propagation to and from the ensemble, and
$|0_a\rangle \equiv \bigotimes_i |g\rangle_i$. $O(\chi)$
represents all the other possible excitation processes, which in
the ideal case occur with probabilities of order $\chi^2$. The
system remains in this state for a time on the order of the
lifetime of the ground states. By sending in a second ("read")
pulse resonant with the $|s\rangle \to |b\rangle$ transition, the
state of the atomic ensemble can be transferred deterministically
(read out) to another forward-propagating light field 2 at the
$|b\rangle \to |g\rangle$ transition (see Fig. 1b). In this way,
it is possible to access the quantum state of the atoms. This
reading process is then closely related to low-light-level
electromagnetically induced transparency [25, 26]. After the read
out, the state of the system becomes:

\[ |\phi\rangle = |0_1\rangle |0_2\rangle + e^{i\gamma} \sqrt{\chi}\, |1_1\rangle |1_2\rangle + O(\chi) , \tag{3} \]

where $\gamma$ is a phase that includes $\beta$ and the
propagation phases to and from the ensemble related to the reading
process. Fields 1 and 2 now exhibit strong correlations in the
photon-number basis, and can be described as photon pairs. These
nonclassical correlations can be measured by photoelectric
detection. Since field 2 maps the state of the atoms, the
correlations between field 1 and field 2 can then be used to infer
correlations between field 1 and the collective atomic excitations
in the sample.

FIG. 1: Relevant level structure of the atoms in the ensemble
for (a) writing and (b) reading processes, with $|g\rangle$ the
initial ground state and $|s\rangle$ the ground state for storing
an excitation. $|a\rangle$ and $|b\rangle$ are excited states. The
transition $|g\rangle \to |a\rangle$ is initially coupled by a
classical laser pulse (write beam) detuned from resonance, and the
forward-scattered Stokes light (field 1) comes from the transition
$|a\rangle \to |s\rangle$, which has different polarization or
frequency from the write light. A classical read pulse then
couples the transition $|s\rangle \to |b\rangle$, leading to the
emission of forward-scattered anti-Stokes light (field 2) from the
transition $|b\rangle \to |g\rangle$.

B.  Decoherence 

In order to analyze the decoherence process in the generation of
pairs from an atomic ensemble as described in Sec. II A, we need
to expand the theoretical treatment of Ref. [6] to include other
experimentally relevant features. For our experiments in
particular, it is essential to include the splitting of the Zeeman
structure of the atomic ground states due to the magnetic field.
The MOT quadrupole field generates an inhomogeneous distribution
of splittings throughout the ensemble. As the system evolves in
time, this results in dephasing between different regions of the
atomic cloud, and in a corresponding decay of the coherence of the
collective state. It is also important to include explicitly the
reading process in the theory. For simplicity, this is done by
considering a read process similar to the write process, i.e.,
with small probability of excitation and detuned from the excited
state. Note that in the actual experiment, the read beam is
stronger than the write beam and is on resonance. This will lead
to small discrepancies when comparing the experimental results to
the theory, which will be discussed in Sec. III B 2.
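
The dephasing mechanism described above can be illustrated with a toy model. The sketch below is not the theory developed in this section: it assumes a one-dimensional Gaussian cloud in a linear field gradient, so that the Zeeman shift of each atom is proportional to its position, and it follows only a single Zeeman pathway. The rms frequency spread `sigma_omega` and the function name are hypothetical.

```python
import cmath
import math


def collective_coherence(t, sigma_omega, n_points=4001, span=6.0):
    """Return |<exp(i*omega*t)>|^2 averaged over a Gaussian spread of Zeeman shifts.

    t           -- storage time in seconds
    sigma_omega -- rms spread of the Zeeman shift across the cloud in rad/s (assumed value)
    """
    total = 0j
    norm = 0.0
    for n in range(n_points):
        x = -span + 2.0 * span * n / (n_points - 1)  # position in units of the rms cloud radius
        weight = math.exp(-0.5 * x * x)              # Gaussian atomic density
        omega = sigma_omega * x                      # linear gradient: shift proportional to position
        total += weight * cmath.exp(1j * omega * t)  # each region accumulates its own phase
        norm += weight
    return abs(total / norm) ** 2


sigma = 2.0 * math.pi * 1.0e6  # hypothetical 1 MHz rms spread of Zeeman shifts
for t_ns in (0, 50, 100, 200, 400):
    print(t_ns, "ns:", collective_coherence(t_ns * 1e-9, sigma))
```

In this model the coherence falls off as exp(-σ²t²), so a spread of order 1 MHz dephases the collective state within a few hundred nanoseconds, while reducing the spread lengthens the coherence time in proportion.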

The inclusion of Zeeman structure in the theory allows a detailed
discussion of the effect of light polarization in the experiment.
This is important for evaluating different excitation and
detection schemes. It also gives a better description of the
initial state, and of its role in the subsequent coherent pair
generation. Together, the analysis of different polarization
schemes and of different initial states led to specific proposals
of ways to improve the whole process. These features of the theory
are not specifically related to the MOT magnetic field, and should
apply to pair generation in other systems, like vapor cells or
dipole traps.

Our treatment starts by considering a sample of $N$ four-level
atoms, such as in Fig. 2. The four levels represent manifolds of
Zeeman sublevels and are indicated by their respective $F$ quantum
numbers. A specific state of the $F_j$ manifold of the $i$-th atom
is represented by its ket $|m_j\rangle_i$, where $m_j$ is the
azimuthal quantum number. Two pumping fields act on the system,
namely a write field $\vec{\mathcal{E}}_{ga}$ and a read field
$\vec{\mathcal{E}}_{sb}$, where

\[ \vec{\mathcal{E}}_{ga}(\vec{r},t) = u_w(\vec{r},t)\, e^{i(k_w z - \omega_w t)}\, \vec{e}_{p_w} , \tag{4a} \]

\[ \vec{\mathcal{E}}_{sb}(\vec{r},t) = u_r(\vec{r},t)\, e^{i(k_r z - \omega_r t)}\, \vec{e}_{p_r} , \tag{4b} \]

which couple the transitions $F_g \to F_a$ and $F_s \to F_b$,
respectively. The functions $u_w$ and $u_r$ give the slowly
varying envelopes of the write and read pulses, respectively, and
$\vec{e}_{p_w}$ and $\vec{e}_{p_r}$ are their polarization
vectors. As a result of their action, two Raman fields are
spontaneously generated in the sample:

\[ \vec{\mathcal{E}}_{sa}(\vec{r},t) \propto \sum_{p_1} \int d\vec{k}_1\, a_{\vec{k}_1 p_1}\, e^{i(\vec{k}_1 \cdot \vec{r} - \omega_{k_1} t)}\, \vec{e}_{p_1} , \tag{5a} \]

\[ \vec{\mathcal{E}}_{gb}(\vec{r},t) \propto \sum_{p_2} \int d\vec{k}_2\, b_{\vec{k}_2 p_2}\, e^{i(\vec{k}_2 \cdot \vec{r} - \omega_{k_2} t)}\, \vec{e}_{p_2} , \tag{5b} \]

where $\omega_{k_i} = |\vec{k}_i| c$ and $p_i$ is a label for the
field polarization. $a_{\vec{k}_1 p_1}$ and $b_{\vec{k}_2 p_2}$
are the annihilation operators for the Raman fields 1 and 2,
respectively, which couple the transitions $F_s \to F_a$ and
$F_g \to F_b$. The state of field 1 with just one photon excited
in mode $\vec{k}_1 p_1$ will be designated by
$|1_{\vec{k}_1 p_1}\rangle$. A similar notation will be used for
field 2.

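The Hamiltonian for the system of $N$ atoms can be written as

\[ H(t) = H_0 + V(t) , \tag{6} \]

where

\[
\begin{aligned}
H_0 = \sum_{i=1}^{N} \Bigg[ & \sum_{m_s=-F_s}^{F_s} \left( -\hbar\omega_s + \mu_B g_s m_s B_{z_i} \right) |m_s\rangle_i \langle m_s|
 + \sum_{m_g=-F_g}^{F_g} \mu_B g_g m_g B_{z_i}\, |m_g\rangle_i \langle m_g| \\
& + \sum_{m_a=-F_a}^{F_a} \hbar\omega_a\, |m_a\rangle_i \langle m_a|
 + \sum_{m_b=-F_b}^{F_b} \hbar\omega_b\, |m_b\rangle_i \langle m_b| \Bigg]
\end{aligned}
\tag{7}
\]

is the free-atom Hamiltonian, and

\[
\begin{aligned}
V(t) = \sum_{i=1}^{N} \Bigg[ & \sum_{m_a=-F_a}^{F_a} \sum_{m_g=-F_g}^{F_g} \left( -\vec{d}_{m_a m_g} \cdot \vec{\mathcal{E}}_{ga} \right) |m_a\rangle_i \langle m_g|
 + \sum_{m_s=-F_s}^{F_s} \sum_{m_a=-F_a}^{F_a} \left( -\vec{d}_{m_s m_a} \cdot \vec{\mathcal{E}}^{\dagger}_{sa} \right) |m_s\rangle_i \langle m_a| \\
& + \sum_{m_b=-F_b}^{F_b} \sum_{m_s=-F_s}^{F_s} \left( -\vec{d}_{m_b m_s} \cdot \vec{\mathcal{E}}_{sb} \right) |m_b\rangle_i \langle m_s|
 + \sum_{m_g=-F_g}^{F_g} \sum_{m_b=-F_b}^{F_b} \left( -\vec{d}_{m_g m_b} \cdot \vec{\mathcal{E}}^{\dagger}_{gb} \right) |m_g\rangle_i \langle m_b| \Bigg]
\end{aligned}
\tag{8}
\]

gives the time-dependent interaction Hamiltonian. Here
$\vec{d}_{jk}$ is the dipole moment for the $j \to k$ transition,
$\mu_B$ the Bohr magneton, $g_j$ the hyperfine Landé factor for
level $F_j$, and $B_{z_i}$ the magnetic field at the position of
the $i$-th atom. The magnetic field direction is taken as the
quantization $z$ axis. We neglect the Zeeman splitting of the
excited states since we want to investigate a situation where it
is always smaller than the excited-state natural linewidths. The
factors $-\vec{d}_{jk} \cdot \vec{\mathcal{E}}_{kj}$ can also be
written as

\[ -\vec{d}_{m_a m_g} \cdot \vec{\mathcal{E}}_{ga} = K_{m_a m_g}\, u_w(\vec{r}_i,t)\, e^{i(k_w z_i - \omega_w t)} , \tag{9a} \]

\[ -\vec{d}_{m_s m_a} \cdot \vec{\mathcal{E}}^{\dagger}_{sa} = \sum_{p_1} \int d\vec{k}_1\, K^{\vec{k}_1 p_1}_{m_s m_a}\, a^{\dagger}_{\vec{k}_1 p_1}\, e^{-i(\vec{k}_1 \cdot \vec{r}_i - \omega_{k_1} t)} , \tag{9b} \]

\[ -\vec{d}_{m_b m_s} \cdot \vec{\mathcal{E}}_{sb} = K_{m_b m_s}\, u_r(\vec{r}_i,t)\, e^{i(k_r z_i - \omega_r t)} , \tag{9c} \]

\[ -\vec{d}_{m_g m_b} \cdot \vec{\mathcal{E}}^{\dagger}_{gb} = \sum_{p_2} \int d\vec{k}_2\, K^{\vec{k}_2 p_2}_{m_g m_b}\, b^{\dagger}_{\vec{k}_2 p_2}\, e^{-i(\vec{k}_2 \cdot \vec{r}_i - \omega_{k_2} t)} , \tag{9d} \]

where $K_{m_a m_g}$, $K^{\vec{k}_1 p_1}_{m_s m_a}$, $K_{m_b m_s}$,
and $K^{\vec{k}_2 p_2}_{m_g m_b}$ are coupling constants for the
corresponding transitions.

FIG. 2: Energy level scheme considered for the atomic ensembles.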

The temporal evolution of the coupled system consisting of
ensemble + Raman fields is described by the evolution of its
density matrix $\rho(t)$. In the interaction picture, the
corresponding operator $\rho_I(t)$ is given by

\[ \rho_I(t) = U_I(t)\, \rho(0)\, U_I^{\dagger}(t) , \tag{10} \]

where $U_I(t)$ is the temporal evolution operator, and the initial
state $\rho(0)$ can be written as

\[ \rho(0) = \rho_{F_1}(0) \otimes \rho_{F_2}(0) \otimes \rho_1(0) \otimes \rho_2(0) \otimes \cdots \otimes \rho_N(0) , \tag{11} \]

with $\rho_{F_1}(0)$ the initial state of field 1, $\rho_{F_2}(0)$
the initial state of field 2, and $\rho_i(0)$ the initial state of
the $i$-th atom. For most of what follows, we will be interested
in the case where fields 1 and 2 are initially in vacuum states,
$\rho_{F_1}(0) = |vac_{F_1}\rangle \langle vac_{F_1}|$ and
$\rho_{F_2}(0) = |vac_{F_2}\rangle \langle vac_{F_2}|$, and all
atoms are initially in the same incoherent distribution over the
Zeeman sublevels of the $F_g$ state:

\[ \rho_i(0) = \sum_{m_g=-F_g}^{F_g} D_{m_g}\, |m_g\rangle_i \langle m_g| , \tag{12} \]

with $D_{m_g}$ giving the probability of finding an atom in the
$m_g$ state at $t = 0$. In Sec. IV, however, we will consider the
case where all the atoms are optically pumped into one of the
Zeeman sublevels ($m_F = 0$).

The operator $U_I(t)$ can be written as a Dyson series in the form

\[ U_I(t) = 1 + \sum_{i=1}^{N} U_i^{(1)}(t) + \sum_{i=1}^{N} U_i^{(2)}(t) + \cdots , \tag{13} \]

where

\[
\begin{aligned}
U_i^{(1)}(t) &= \left( -\frac{i}{\hbar} \right) \int_0^{t} dt'\, V_i(t') , \\
U_i^{(2)}(t) &= \left( -\frac{i}{\hbar} \right)^{2} \int_0^{t} dt' \int_0^{t'} dt''\, V_i(t')\, V_i(t'') ,
\end{aligned}
\tag{14}
\]

and so on. The single-atom interaction operator $V_i(t)$ is
defined from the expression for the general interaction
Hamiltonian $V_I(t)$ in the interaction picture as

\[ V_I(t) = e^{iH_0 t/\hbar}\, V(t)\, e^{-iH_0 t/\hbar} = \sum_{i=1}^{N} V_i(t) . \tag{15} \]


1. Probability for joint detections

We want to calculate, to the lowest order of perturbation theory,
the probability of detecting a single photon in field 1 followed
by another photon in field 2. The first step is then to calculate
the restriction of the coupled state $\rho(t)$ to the space of
states of fields 1 and 2:

\[ \rho_{F_1 F_2}(t) = \mathrm{Tr}_A \left[ \rho(t) \right] . \tag{16} \]

The symbol $\mathrm{Tr}_A$ indicates a partial trace over all
atomic states. The probability for detecting two photons, one in
mode $\vec{k}_1 p_1$ and the other in mode $\vec{k}_2 p_2$, up to
time $t$ is then given by

\[
\begin{aligned}
p_{12}(t, \vec{k}_1 p_1, \vec{k}_2 p_2) &= \langle 1_{\vec{k}_1 p_1}| \langle 1_{\vec{k}_2 p_2}|\, \rho_{F_1 F_2}(t)\, |1_{\vec{k}_2 p_2}\rangle |1_{\vec{k}_1 p_1}\rangle \\
&= \langle 1_{\vec{k}_1 p_1}| \langle 1_{\vec{k}_2 p_2}|\, \mathrm{Tr}_A \left[ \rho(t) \right] |1_{\vec{k}_2 p_2}\rangle |1_{\vec{k}_1 p_1}\rangle .
\end{aligned}
\tag{17}
\]

Since all atoms are initially in the ground state $F_g$, the
lowest-order term of series (13) that results in a single photon
in field 1 and another photon in field 2 is the fifth term, which
accounts for the four transitions carried successively by the
write field, photon 1, read field, and photon 2, respectively.
Substituting Eqs. (10) and (13) into Eq. (17) and keeping only the
lowest-order term, we arrive then at

\[ p_{12}(t, \vec{k}_1 p_1, \vec{k}_2 p_2) = \sum_{i,j=1}^{N} \langle 1_{\vec{k}_1 p_1}| \langle 1_{\vec{k}_2 p_2}|\, \mathrm{Tr}_A \left[ U_i^{(4)}(t)\, \rho(0)\, U_j^{(4)\dagger}(t) \right] |1_{\vec{k}_2 p_2}\rangle |1_{\vec{k}_1 p_1}\rangle . \tag{18} \]

Note that $U_k^{(4)}$ acts only on the $k$-th atom. Thus, the
trace $\mathrm{Tr}_A$ on each term of the double sum can be
written as a partial trace over the states of the atoms on which
the $U^{(4)}$ operators act, since all other atoms remain in their
initial state. Two different cases are present in Eq. (18). If
$i \neq j$, the two operators act on two different atoms and the
initial state $\rho(0)$ simplifies to
$\rho_{F_1}(0) \otimes \rho_{F_2}(0) \otimes \rho_i(0) \otimes \rho_j(0)$.
If $i = j$, then
$\rho(0) \to \rho_{F_1}(0) \otimes \rho_{F_2}(0) \otimes \rho_i(0)$.
With these observations in mind, we see that Eq. (18) can then be
written as
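\[
\begin{aligned}
p_{12}(t, \vec{k}_1 p_1, \vec{k}_2 p_2) ={}& \sum_{\substack{i,j=1 \\ i \neq j}}^{N} \langle 1_{\vec{k}_1 p_1}| \langle 1_{\vec{k}_2 p_2}|\, \mathrm{Tr}_i \left[ U_i^{(4)}(t)\, \rho_i(0) \right] |vac_{F_2}\rangle |vac_{F_1}\rangle \\
& \times \langle vac_{F_1}| \langle vac_{F_2}|\, \mathrm{Tr}_j \left[ \rho_j(0)\, U_j^{(4)\dagger}(t) \right] |1_{\vec{k}_2 p_2}\rangle |1_{\vec{k}_1 p_1}\rangle \\
& + \sum_{i=1}^{N} \langle 1_{\vec{k}_1 p_1}| \langle 1_{\vec{k}_2 p_2}|\, \mathrm{Tr}_i \left[ U_i^{(4)}(t)\, \rho_{F_1}(0) \otimes \rho_{F_2}(0) \otimes \rho_i(0)\, U_i^{(4)\dagger}(t) \right] |1_{\vec{k}_2 p_2}\rangle |1_{\vec{k}_1 p_1}\rangle .
\end{aligned}
\tag{19}
\]

Substituting Eq. (12), we have

\[
\begin{aligned}
p_{12}(t, \vec{k}_1 p_1, \vec{k}_2 p_2) ={}& \left| \sum_{i=1}^{N} \sum_{m_g=-F_g}^{F_g} D_{m_g} A_i(m_g, m_g) \right|^{2}
 + \sum_{i=1}^{N} \sum_{m_g=-F_g}^{F_g} \sum_{m_g'=-F_g}^{F_g} D_{m_g} \left| A_i(m_g', m_g) \right|^{2} \\
& - \sum_{i=1}^{N} \left| \sum_{m_g=-F_g}^{F_g} D_{m_g} A_i(m_g, m_g) \right|^{2} ,
\end{aligned}
\tag{20}
\]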
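
The three-term structure of Eq. (20) can be checked with a short intermediate step. The shorthand $a_i$ below is introduced here purely for illustration and is not part of the original notation; the step assumes the vacuum initial field states and the atomic state of Eq. (12).

```latex
% Shorthand (introduced here): single-atom amplitude averaged over the initial populations
a_i \equiv \sum_{m_g=-F_g}^{F_g} D_{m_g}\, A_i(m_g, m_g)

% The i \neq j part of Eq. (19) factorizes into independent single-atom factors,
% and the restricted double sum is the full square minus its diagonal:
\sum_{\substack{i,j=1 \\ i \neq j}}^{N} a_i\, a_j^{*}
  = \Bigl| \sum_{i=1}^{N} a_i \Bigr|^{2} - \sum_{i=1}^{N} |a_i|^{2}

% The i = j part of Eq. (19) sums probabilities over every final sublevel m_g':
\sum_{i=1}^{N} \sum_{m_g, m_g'} D_{m_g}\, \bigl| A_i(m_g', m_g) \bigr|^{2}
```

The full square carries $N^2$ terms, whereas the two single sums carry only $N$ terms each.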


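Finally, substituting the specific expressions for $U_i^{(4)}(t)$
and $V_i(t)$, we find that $A_i(m_g, m_g)$ can be written as

\[
\begin{aligned}
A_i(m_g, m_g) ={}& \sum_{m_s=-F_s}^{F_s} \frac{d(m_g, m_s)}{\hbar^{4}}\, e^{i(k_r z_i + k_w z_i - \vec{k}_1 \cdot \vec{r}_i - \vec{k}_2 \cdot \vec{r}_i)} \\
& \times \int_0^{t} dt'\, e^{i(\Delta\omega_{k_2} - \Delta_r + a_{ig}) t'} \\
& \times \int_0^{t'} dt''\, u_r(\vec{r}_i, t'')\, e^{i(\Delta_r - a_{is}) t''} \\
& \times \int_0^{t''} dt'''\, e^{i(\Delta\omega_{k_1} - \Delta_w + a_{is}) t'''} \\
& \times \int_0^{t'''} dt''''\, u_w(\vec{r}_i, t'''')\, e^{i(\Delta_w - a_{ig}) t''''} ,
\end{aligned}
\tag{23}
\]

where $\Delta_w = \omega_a - \omega_w$,
$\Delta_r = \omega_b + \omega_s - \omega_r$,
$\Delta\omega_{k_1} = \omega_{k_1} - \omega_w - \omega_s$,
$\Delta\omega_{k_2} = \omega_{k_2} - \omega_r + \omega_s$, and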
\[ d(m_g, m_s) = \sum_{m_b=-F_b}^{F_b} \sum_{m_a=-F_a}^{F_a} K^{\vec{k}_2 p_2}_{m_g m_b}\, K_{m_b m_s}\, K^{\vec{k}_1 p_1}_{m_s m_a}\, K_{m_a m_g} \tag{24} \]

gives the strength of a specific excitation pathway in which the
atom starts at $m_g$, then goes to $m_s$, and ends at $m_g$ again.
The Zeeman splittings are written in terms of the parameters
$a_{ig} = \mu_B g_g m_g B_{z_i}/\hbar$ and
$a_{is} = \mu_B g_s m_s B_{z_i}/\hbar$.
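
To give a sense of scale for the parameters $a_{ig}$ and $a_{is}$, the sketch below evaluates Zeeman shifts of the form $\mu_B g_F m B/h$. It is an illustration only: it assumes the cesium 6S1/2 hyperfine levels with approximate Landé factors g_F = +1/4 for F = 4 and g_F = -1/4 for F = 3, and it does not assume which of the two levels plays the role of $F_g$ or $F_s$. The function name is hypothetical.

```python
# Bohr magneton divided by Planck's constant, in MHz per gauss.
MU_B_OVER_H_MHZ_PER_G = 1.3996

# Assumption: cesium 6S1/2 ground state, approximate hyperfine Lande factors.
G_F = {4: 0.25, 3: -0.25}


def zeeman_shift_mhz(f, m, b_gauss):
    """Return the Zeeman shift (a/2pi, in MHz) of sublevel m of hyperfine level f."""
    if abs(m) > f:
        raise ValueError("sublevel m must satisfy |m| <= F")
    return MU_B_OVER_H_MHZ_PER_G * G_F[f] * m * b_gauss


# Shifts of the stretched sublevels in a 1 G field (an illustrative field value).
for f in (4, 3):
    print(f"F = {f}, m = {f}: {zeeman_shift_mhz(f, f, 1.0):+.4f} MHz")
```

Because the two hyperfine levels have Landé factors of opposite sign, the relative phase between sublevels of $F_g$ and $F_s$ generally accumulates at a rate set by the sum of the two shift magnitudes, which is why field variations well below 1 G across the cloud already matter on microsecond timescales.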


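The amplitudes appearing in Eq. (20) are defined as

\[ A_i(m_g', m_g) = \langle 1_{\vec{k}_1 p_1}| \langle 1_{\vec{k}_2 p_2}|\, {}_i\langle m_g'|\, U_i^{(4)}(t)\, |m_g\rangle_i\, |vac_{F_2}\rangle |vac_{F_1}\rangle . \tag{21} \]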


Note  that  the  first  term  on  the  right  side  of  Eq.  (20) 
scales  as  TV2,  while  the  two  remaining  terms  scale  with 
N  only.  Since  we  are  interested  in  the  limit  of  large  TV, 
we  can  then  approximate 


Pi2(t,k1p1,k2P2) 


N  Fg 

DrngAi^rrig^mg) 

7=1  mg  =  —  Fg 

(22) 


Thus,  for  large  TV,  only  transitions  that  start  and  end  in 
the  same  state  contribute  to  the  pair  generation.  This 
result  can  be  understood  as  a  constructive  interference 
between  all  pathways  that  connect  the  ensemble  back  to 
its  initial  state,  after  which  it  is  not  possible  to  distin¬ 
guish  which  atom  made  the  transition  [27].  Pathways 
connecting  different  initial  and  final  states  leave  a  trace 
in  the  ensemble,  which  in  principle  can  give  information 
on  which  specific  atom  made  the  transition.  In  this  last 
case,  the  number  of  possible  pathways  generating  the  pair 
of  photons  is  then  linearly  proportional  to  the  number  of 
atoms  TV.  Eq  (22)  expresses  the  collective  enhancement 
that  is  essential  to  the  scheme  of  ref.  [5] . 
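The $N^2$ versus $N$ scaling of the terms in Eq. (20), which underlies the approximation of Eq. (22), can be checked numerically. The sketch below is a toy model only: the function name `pair_probability_terms` and its parameters are illustrative, and it assumes that every atom shares the same randomly drawn amplitude matrix $A_i(m'_g, m_g)$ (perfect phase matching) and a uniform Zeeman population $D_{m_g}$.

```python
import numpy as np

def pair_probability_terms(n_atoms, amplitude=1e-3, n_zeeman=9):
    """Return the three contributions of Eq. (20) for a toy ensemble.

    Toy assumptions (not from the paper): all atoms share one random
    amplitude matrix a[m_final, m_initial], and D_m is uniform.
    """
    rng = np.random.default_rng(0)  # fixed seed: same matrix on every call
    a = amplitude * (rng.normal(size=(n_zeeman, n_zeeman))
                     + 1j * rng.normal(size=(n_zeeman, n_zeeman)))
    d = np.full(n_zeeman, 1.0 / n_zeeman)

    # sum_m D_m A(m, m) for a single atom (pathways returning to the initial state)
    diag_sum = np.sum(d * np.diag(a))

    collective = abs(n_atoms * diag_sum) ** 2                  # first term, ~ N^2
    incoherent = n_atoms * np.sum(d[None, :] * abs(a) ** 2)    # second term, ~ N
    correction = n_atoms * abs(diag_sum) ** 2                  # third term (subtracted), ~ N
    return collective, incoherent, correction

for n in (10, 1000, 100000):
    c, inc, cor = pair_probability_terms(n)
    print(n, c / (inc - cor))  # ratio of collective to remaining terms grows linearly with N
```

In this toy model the ratio of the collective term to the remaining terms grows linearly with the number of atoms, which is why only the first term is kept for large $N$.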


2.  Forward  emission 

In order to simplify the following analysis while keeping the essential trends of the temporal dynamics, we now focus on the forward, resonant emission from the atomic ensemble. In the forward direction, the light emitted by the sample satisfies the phase-matching condition

$$
k_r z_i + k_w z_i - \mathbf{k}_1\cdot\mathbf{r}_i - \mathbf{k}_2\cdot\mathbf{r}_i = 0 . \tag{25}
$$

The resonance conditions for the Raman fields are $\Delta\omega^{w}_{k_1} = 0$ and $\Delta\omega^{r}_{k_2} = 0$. A discussion of deviations from these conditions can be found in Ref. [6].

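Under these assumptions, and with the slowly varying envelope functions written as

$$
u_r(\mathbf{r}_i,t) = q_r(\mathbf{r}_i)\, f_r(t) , \tag{26a}
$$

$$
u_w(\mathbf{r}_i,t) = q_w(\mathbf{r}_i)\, f_w(t) , \tag{26b}
$$

Equation (23) becomes

$$
A_i(m_g,m_g) = q_r(\mathbf{r}_i)\, q_w(\mathbf{r}_i) \sum_{m_s=-F_s}^{F_s} \frac{d(m_g,m_s)}{\hbar^4}\, F(t,z_i) , \tag{27}
$$

with

$$
\begin{aligned}
F(t,z_i) = {} & \int_0^{t} dt'\, e^{i(-\Delta_r + a_{ig})t'} \int_0^{t'} dt''\, f_r(t'')\, e^{i(\Delta_r - a_{is})t''} \\
& \times \int_0^{t''} dt'''\, e^{i(-\Delta_w + a_{is})t'''} \int_0^{t'''} dt''''\, f_w(t'''')\, e^{i(\Delta_w - a_{ig})t''''} .
\end{aligned}
\tag{28}
$$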


Note that the $F$ function depends on the parameters of a specific atom only through $z_i$, which specifies its position along the quantization axis. In this way, after a certain time, atoms in different parts of the ensemble contribute to the probability amplitude of the process with different phases.

If we consider a uniform distribution of atoms throughout the beam path, and neglect the $z$ dependence of the $q$ functions, the sum over all atoms may be transformed into the following integral:

$$
\begin{aligned}
\sum_{i=1}^{N} q_r(\mathbf{r}_i)\, q_w(\mathbf{r}_i) & \to \frac{N}{V} \iiint dx\, dy\, dz\; q_r(x,y)\, q_w(x,y) \\
& = \iint dx\, dy\, \frac{q_r(x,y)\, q_w(x,y)}{A}\; \frac{N}{L} \int dz \\
& = \langle q_r(x,y)\, q_w(x,y) \rangle\, N \int_{-L/2}^{L/2} \frac{dz}{L} ,
\end{aligned}
\tag{29}
$$

where $V = AL$ gives the volume of the excitation region, $A$ its transverse area, and $L$ its length.
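The replacement of the sum over atoms by an integral over $z$ can be illustrated with a quick Monte Carlo check. The sketch below is only an illustration under stated assumptions: the position-dependent factor is taken to be a pure phase $e^{i\kappa z}$ (the symbol $\kappa$ and the function name `atom_sum_vs_integral` are hypothetical), and the atoms are drawn uniformly along the sample length.

```python
import numpy as np

def atom_sum_vs_integral(n_atoms, kappa_l, seed=0):
    """Compare sum_i exp(i*kappa*z_i) with N * integral dz/L exp(i*kappa*z).

    kappa_l is the dimensionless product kappa * L; atoms are uniform in
    z/L on [-1/2, 1/2]. Both choices are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    s = rng.uniform(-0.5, 0.5, size=n_atoms)          # s = z_i / L
    discrete = np.sum(np.exp(1j * kappa_l * s))       # explicit sum over atoms
    # integral over s of exp(i*kappa_l*s) = sin(kappa_l/2)/(kappa_l/2);
    # np.sinc(x) is sin(pi x)/(pi x), hence the argument kappa_l/(2 pi)
    continuum = n_atoms * np.sinc(kappa_l / (2 * np.pi))
    return discrete, continuum

discrete, continuum = atom_sum_vs_integral(200_000, 3.0)
print(abs(discrete - continuum) / 200_000)  # small compared with 1 for large N
```

The residual difference is statistical and of relative size $1/\sqrt{N}$, so the continuum limit is well justified for the large atom numbers considered here.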

Substituting Eqs. (27) and (29) in Eq. (22), we finally obtain

$$
p^{th}_{12}(t) = C \left| \sum_{m_g=-F_g}^{F_g} \sum_{m_s=-F_s}^{F_s} D_{m_g}\, d(m_g,m_s) \int_{-L/2}^{L/2} \frac{dz}{L}\, F(t,z) \right|^2 , \tag{30}
$$

where

$$
C = \frac{N^2}{\hbar^8} \left| \langle q_r(x,y)\, q_w(x,y) \rangle \right|^2 \tag{31}
$$

is a constant. After the read pulse has left the sample (i.e., when $t \to \infty$), Expression (30) is then proportional to the total probability of detecting the pair of photons in one trial. Details on how to compare this expression to the experimental results will be discussed in Sec. III B 1. In the experimentally important case of square pulses, it is straightforward to obtain analytical expressions for both $F(t,z)$ and $p^{th}_{12}(t)$ in the limit of large $\Delta_w$ and $\Delta_r$.


3.  Probability  density 

Equation (30) gives the total probability of detecting one photon in field 2 after detecting a photon in field 1. Now we want to obtain the probability of finding photon 2 between times $t_2$ and $t_2 + \Delta t_2$ and photon 1 between times $t_1$ and $t_1 + \Delta t_1$, for small $\Delta t_2$ and $\Delta t_1$.


The first step in this calculation is to note that Eq. (30) can be written as

$$
p^{th}_{12}(t) = |\phi(t)|^2 . \tag{32}
$$

The function $\phi(t)$ then gives a probability amplitude for the process in which the two photons are found up to time $t$. It consists of an integral over all possible pairs of detection times $(t_2, t_1)$, representing different excitation pathways, and can in principle also be written as

$$
\phi(t) = \int_0^{t} dt_1 \int_{t_1}^{t} dt_2\; \mathcal{P}(t_2,t_1) , \tag{33}
$$

where we considered explicitly $t_2 > t_1$. $\mathcal{P}(t_2,t_1)$ then represents a density of probability amplitude.

The probability amplitude for finding photon 2 between times $t_2$ and $t_2 + \Delta t_2$, and photon 1 between times $t_1$ and $t_1 + \Delta t_1$, can then be obtained by restricting the temporal integral in Eq. (30). Since all the temporal dynamics in Eq. (30) is in the function $F(t,z)$, we first need to calculate the restriction of $F(t,z)$ for these specific processes. In order to do so, note that, in the fourth-order integral of $F(t,z)$, the emission of photon 2 is described by the last integral (over $t'$), while the emission of photon 1 is described by the third integral (over $t'''$). The restriction of $F(t,z)$ for the emission of photon 2 between times $t_2$ and $t_2 + \Delta t_2$, and photon 1 between times $t_1$ and $t_1 + \Delta t_1$, is then given by [28]

$$
\begin{aligned}
G(t_2,\Delta t_2,t_1,\Delta t_1) = {} & \int_{t_2}^{t_2+\Delta t_2} dt'\, e^{i(-\Delta_r + a_{ig})t'} \int_0^{t'} dt''\, f_r(t'')\, e^{i(\Delta_r - a_{is})t''} \\
& \times \int_{t_1}^{t_1+\Delta t_1} dt'''\, e^{i(-\Delta_w + a_{is})t'''} \int_0^{t'''} dt''''\, f_w(t'''')\, e^{i(\Delta_w - a_{ig})t''''} .
\end{aligned}
\tag{34}
$$

Equation (34) can be directly evaluated for the case of square pulses and large detunings, such that $\Delta_r, \Delta_w \gg \Delta t_2^{-1}, \Delta t_1^{-1}$. If the time intervals are also small compared to the timescale of the oscillations determined by the Zeeman shifts (i.e., $\Delta t_2, \Delta t_1 \ll a_g^{-1}, a_s^{-1}$), then Eq. (34) can be written as

$$
G(t_2,\Delta t_2,t_1,\Delta t_1) = g(t_2,t_1)\, \Delta t_1\, \Delta t_2 , \tag{35}
$$

with

$$
g(t_2,t_1) = \frac{f_r(t_2)\, f_w(t_1)}{i\Delta_r\; i\Delta_w}\, e^{i(a_g - a_s)(t_2 - t_1)} . \tag{36}
$$

In this case, $F(t,z)$ can be obtained from

$$
F(t,z) = \int_0^{t} dt_1 \int_{t_1}^{t} dt_2\; g(t_2,t_1) . \tag{37}
$$

An important remark is that, since any pulse envelope can be approximated by a sum of square pulses of different intensities and small duration, Eq. (36) is indeed valid for arbitrary pulse shapes, as long as the temporal variation of the envelope occurs on a much longer timescale than $\Delta t_1$ or $\Delta t_2$.
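As a consistency check of the double integral relating $F(t,z)$ to the amplitude density $g(t_2,t_1)$, the sketch below evaluates it numerically for the simplest case and compares with a closed form. It takes Eq. (36) in the form $g = f_r(t_2) f_w(t_1)\, e^{i(a_g - a_s)(t_2 - t_1)} / (i\Delta_r\, i\Delta_w)$ and assumes, purely for illustration, unit-height envelopes that are on for the whole interval $[0,t]$; the function names and the numerical values are hypothetical.

```python
import numpy as np

def g_density(t2, t1, delta_zeeman, det_r, det_w):
    """Amplitude density for unit envelopes; delta_zeeman stands for a_g - a_s."""
    return np.exp(1j * delta_zeeman * (t2 - t1)) / ((1j * det_r) * (1j * det_w))

def f_numeric(t, delta_zeeman, det_r, det_w, n=400):
    """Midpoint-rule estimate of int_0^t dt1 int_{t1}^t dt2 g(t2, t1)."""
    h = t / n
    mid = (np.arange(n) + 0.5) * h
    t1, t2 = np.meshgrid(mid, mid, indexing="ij")    # t1 varies along rows
    vals = g_density(t2, t1, delta_zeeman, det_r, det_w)
    # keep cells with t2 > t1; diagonal cells are half inside the region
    weights = np.triu(np.ones((n, n)), k=1) + 0.5 * np.eye(n)
    return np.sum(weights * vals) * h * h

def f_closed(t, delta_zeeman, det_r, det_w):
    """Closed form of the same integral for constant unit envelopes."""
    x = delta_zeeman * t
    return (np.exp(1j * x) - 1 - 1j * x) / (delta_zeeman**2 * det_r * det_w)

# hypothetical values: times in microseconds, angular frequencies in rad/microsecond
args = (2.0, 2 * np.pi * 0.5, 2 * np.pi * 10, 2 * np.pi * 10)
print(f_numeric(*args), f_closed(*args))
```

For vanishing Zeeman shift the closed form reduces to $-t^2/(2\Delta_r\Delta_w)$, the area of the integration triangle times the constant value of $g$.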

The connection between $g(t_2,t_1)$ and the density of probability amplitude $\mathcal{P}(t_2,t_1)$ is then made through the relation

$$
\mathcal{P}(t_2,t_1) = \sqrt{C} \sum_{m_g=-F_g}^{F_g} \sum_{m_s=-F_s}^{F_s} D_{m_g}\, d(m_g,m_s) \int_{-L/2}^{L/2} \frac{dz}{L}\, g(t_2,t_1) . \tag{38}
$$

Finally, the probability density for detecting one photon from field 1 at time $t_1$ and another from field 2 at time $t_2$ is given by

$$
V(t_2,t_1) = |\mathcal{P}(t_2,t_1)|^2 . \tag{39}
$$

This is the quantity to be compared with the experimental results of Sec. III B 2 for the two-photon wavepacket of the photon pair.

III.  EXPERIMENTS 

Up to now, experimental implementations of the DLCZ protocol in MOTs have been plagued by extremely short coherence times [11, 14, 16, 17]. As discussed above, this short coherence time results from the action of the MOT quadrupole field on the Zeeman structure of the hyperfine ground states. In the following, we describe a series of experiments that allowed us to obtain photon pairs from the trapped atomic cloud in a situation of very small magnetic field. In this way, we were able to measure coherence times of more than 10 μs (more than two orders of magnitude longer than the duration of the excitation pulses), and two-photon wavepackets for the photon pairs that do not exhibit distortion by decoherence even when the write and read pulses cease overlapping in time [17].

The crucial point is to turn off the MOT magnetic field and determine the experimental conditions with the best tradeoff between high repetition rate and high optical density. Note that the atoms fly away from the trap and the density starts to decrease when the magnetic field is turned off. Hence, the MOT field has to be turned off as fast as possible, to decrease the transient time and maximize the region with low magnetic field and high density. A fast turn-off of the magnetic field in our metallic vacuum chambers, however, is not straightforward and requires specific techniques, as will be discussed in Sec. III A.

Inside each MOT-off period, it is possible to conduct many trials of the photon pair experiments. These are photon counting measurements that require many events in order to acquire good statistics. Hence, we would like to have as many MOT-off periods as possible to accumulate a large number of trials. However, the MOT needs some time to recover its original density after each off period, and this time limits how often it can be turned off while still keeping a high enough atomic density.

During the process of turning off the magnetic field and determining the proper conditions for the photon counting experiments, it was essential to be able to perform simpler experiments giving direct access to the ground-state broadening by the magnetic field. We therefore chose to set up a copropagating stimulated Raman spectroscopy apparatus to help us in this process. The results of the Raman spectroscopy measurements and the investigation to determine the best experimental conditions for the photon pair generation are also described in Sec. III A.

The nonclassical correlation experiments are discussed in Sec. III B. There we show that the coherence time increases by more than two orders of magnitude once the magnetic field is switched off, and describe measurements of the shape of the two-photon wavepacket in both situations. In this section, we also compare the experimental results with the theory of Sec. II B.


A.  Characterization  and  magnetic  field  nulling 

As anticipated above, we use copropagating stimulated Raman spectroscopy [29] to probe directly the broadening of the hyperfine ground states. Our choice of this specific technique is based on the fact that it is insensitive to Doppler broadening, but very sensitive to any broadening caused by magnetic fields, exactly like the spontaneous Raman emission process underlying the photon pair generation in our experiment. Stimulated Raman transitions (see Fig. 3a) are two-photon transitions connecting one ground-state hyperfine level to the other, in which a single photon is absorbed from one Raman beam and another photon is emitted into the other beam by stimulated emission through a virtual level, which is located 3 GHz below the cesium D2 line in our setup.

The Raman process is resonant if the frequency difference of the two Raman beams equals the ground-state hyperfine interval, 9.192631770 GHz for cesium. In the absence of collisions and transit broadening, this two-photon resonance is very sharp, with a linewidth limited only by the power and duration of the Raman beams [29]. In this way, since the specific value of the hyperfine interval for transitions between $|m_g\rangle$ and $|m_s\rangle$ states changes with the magnetic field, scanning the frequency of one Raman beam with respect to the other gives direct information on the frequency distribution of the possible two-photon resonances shifted by the magnetic field, i.e., on the broadening of the ground state.

Our setup for Raman spectroscopy is shown in Fig. 3a. The two Raman beams and a probe beam are coupled to the same polarization-maintaining fiber, which takes the beams close to the MOT and provides good mode matching between them. The probe beam is coupled with the same polarization as the Raman field connecting the F = 3 ground state to the virtual level; the other Raman field is coupled with the orthogonal polarization. The lens at the fiber output focuses the beam to a diameter of 150 μm in the MOT region. After the fiber, the beams pass through a 50/50 beam splitter cube. The transmitted parts of the beams are used as a reference to compensate for power fluctuations. The reflected part is directed to the MOT, forming an angle of about θ ≈ 3° with the quadrupole-field z axis. The shaded area around the z axis in Fig. 3a indicates the path of one of our trapping beams. The absorption of the probe beam by the atoms in the MOT is then measured with a second detector, by comparing the probe pulse height with the MOT on and off.


FIG. 3: (a) Experimental Raman spectroscopy setup. The Raman beams and the probe beam are coupled into a polarization-maintaining (PM) fiber and sent through a beam splitter cube (BS). The reflected part is focused into the sample at an angle of 3 degrees with respect to the quadrupole-field z axis, while the transmitted part is used as a reference. (b) Relevant level structures and laser frequencies for Raman spectroscopy.

Before the Raman pulses reach the MOT, an optical pumping cycle moves the whole atomic population to just one of the hyperfine ground states. Note that for the following experiments, we make no attempt to optically pump the atoms into a specific Zeeman state. Hence, the atomic ensemble is unpolarized and all Zeeman substates are populated. The action of the Raman pulses, of about 150 μs duration and 10 μW power, then transfers some population to the initially empty level if their relative detuning matches one of the two-photon transitions of the sample. The probe pulse has a duration of 5 μs and comes 50 μs after the Raman pulses. It is resonant with the cycling transition connecting the initially empty ground state to the $6P_{3/2}$ level [F = 4 → F' = 5 if the empty ground state is F = 4, F = 3 → F' = 2 for an empty F = 3 state]. The probe power is about 50 nW, to guarantee a low saturation of the transition. It is then very sensitive to any change in the initial population, and its absorption indicates that the Raman pulses succeeded in transferring some population from one ground state to the other.

In this way, a plot of the optical depth of the medium for the probe pulse as a function of the detuning between the two Raman fields gives a direct measure of the ensemble distribution of energies in the ground states. Examples of such plots with the MOT magnetic field on and off are shown in Figs. 4a and 4b, respectively. In Fig. 4b the Raman pulses are delayed 4 ms from the moment the magnetic field was turned off, and the nulling of the field was performed using additional bias coils located around the MOT and looking for a reduced width of the Raman trace. From Fig. 4a to 4b, the width of the signal is reduced by more than two orders of magnitude, from 5 MHz to about 20 kHz. The 20 kHz linewidth of Fig. 4b, however, also includes about 10 kHz that comes from power broadening by the Raman beams. To measure this power broadening, we applied an extra DC field in the z direction in order to split the central peak between the various $m_F \to m'_F$ transitions, and then measured the width of the magnetic-field-insensitive transition $m_F = 0 \to m'_F = 0$.

FIG. 4: (a) Raman trace with the quadrupole MOT magnetic field on. The trace represents the absorption of the probe pulse following the Raman beams, as a function of the Raman detuning δ. The linewidth (FWHM) is around 5 MHz. (b) Raman trace 4 ms after the quadrupole field has been switched off. The fitted linewidth is 20 kHz, including 10 kHz of power broadening due to the Raman beams.

As mentioned above, the quadrupole field of the MOT should be switched off as fast as possible, in order to maintain the high optical density needed for the DLCZ-type experiments. However, switching off the magnetic field generated by the MOT coils is usually delayed for two reasons. First, the current in the coils decays exponentially, with a time constant proportional to the inductance of the coils. Second, the field decay time is increased by eddy currents in the metallic part of our vacuum chamber. Depending on the metallic configuration of chamber and coils, the transient period can last for tens of ms. In order to obtain a faster transient, we use a fast-switching electronic circuit [30, 31]. This circuit allows a quick reversal of the current in the quadrupole coils in order to compensate for the eddy currents, and resulted in a substantial reduction of the transient time in our system.

A detailed description of the magnetic field transient is given in Fig. 5a, which plots the Raman scan linewidth as a function of the delay from the moment the field was switched off. Figure 5a then shows the timescale over which the ground state has its energy-distribution profile changed from Fig. 4a to Fig. 4b. We can see that after a few milliseconds, the linewidth asymptotically reaches a plateau, given by the residual DC field in the chamber, which we estimate in this case to be on the order of 10 mG. The dashed line in Fig. 5a indicates the measured power broadening. Shorter transients can be obtained with a different metallic chamber configuration (as in Ref. [30]) or by using nonmetallic vacuum chambers.
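To connect a residual field of order 10 mG with a Raman linewidth of tens of kHz, the sketch below estimates the Zeeman shift of the two-photon resonances. It is an order-of-magnitude illustration only: it borrows the value $|\mu_B g_F/h| = 0.35$ MHz/G quoted later in the paper for both cesium ground-state hyperfine levels, assumes a uniform field, and the function names and the restriction to $m_g = m_s$ transitions in the spread are hypothetical simplifications that ignore polarization selection rules and line strengths.

```python
ZEEMAN_MHZ_PER_GAUSS = 0.35  # |mu_B g_F / h| for both Cs ground hyperfine levels (value quoted in the paper)

def raman_shift_khz(m_g, m_s, b_gauss):
    """Shift (kHz) of the |F=4, m_g> <-> |F=3, m_s> two-photon resonance.

    The two Lande factors have opposite signs, so the shift of the
    transition frequency is proportional to (m_g + m_s).
    """
    return 1e3 * ZEEMAN_MHZ_PER_GAUSS * (m_g + m_s) * b_gauss

def shift_spread_khz(b_gauss, f_s=3):
    """Spread (kHz) between the extreme m_g = m_s resonances (illustrative choice)."""
    shifts = [raman_shift_khz(m, m, b_gauss) for m in range(-f_s, f_s + 1)]
    return max(shifts) - min(shifts)

print(raman_shift_khz(1, 0, 0.010))  # shift per unit of (m_g + m_s) at 10 mG, in kHz
print(shift_spread_khz(0.010))       # spread of the m_g = m_s resonances at 10 mG, in kHz
```

Each unit of $m_g + m_s$ shifts the resonance by a few kHz at 10 mG, so the unresolved set of Zeeman components spans tens of kHz, the same order as the plateau linewidth.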


FIG. 5: (a) Evolution of the ground-state linewidth and (b) of the optical depth of the sample as a function of the delay from the time when the current in the MOT coils is switched off. The linewidth is measured with Raman spectroscopy. The dashed line represents the measured power broadening due to the Raman beams. The OD is determined by measuring the absorption of a probe pulse in the sample. In both graphs, the dashed area represents the window used for measuring correlations at the single-photon level.

In order to estimate the optimal region for photon counting measurements, it is important to independently measure the decay of the optical depth after the magnetic field is switched off. In our setup this is done in a straightforward way by turning off the Raman beams and using a probe pulse close to resonance with the ground state that holds all the atomic population. The results of such a measurement are shown in Fig. 5b, for which the population was initially pumped to F = 4 and the probe tuned 10 MHz below the F = 4 → F' = 5 transition. The optical depth values in Fig. 5b were obtained from the absorption at 10 MHz detuning, assuming a Lorentzian lineshape for the atomic transition with a natural linewidth corrected for power broadening by the probe beam.

Together, the results in Figs. 5a and 5b allow us to determine an optimal window for the experiments of Sec. III B, i.e., between 3 and 5.5 ms (dashed region in both figures). The lower limit of this region is determined by the moment when the residual magnetic field reaches a reasonably small value corresponding to an acceptable decoherence time, and the upper limit by the restriction that the density should not vary too much throughout the region. We accepted a variation of about 30% in the density. The linewidth varies by about 30 kHz in the same interval.

A better cancellation of the magnetic field can in principle lead to even smaller linewidths and, consequently, longer coherence times. However, improvements along this line will eventually be limited by a different problem: the diffusion of atoms out of the excitation region. This effect of course depends on the temperature of the sample and on the diameter of the excitation beams. In order to directly measure this diffusion time, we again use Raman spectroscopy. In this case, Raman traces are recorded as a function of the delay between the Raman pulses and the probe. The measurement is done when the magnetic field is off, such that there is only one narrow peak in the Raman trace, as in Fig. 4b. In this case, the area of the peak profile is proportional to the number of atoms in the excitation region. Figure 6 shows a plot of this area as a function of delay. We see that the population decays with a time constant of 900 μs, as given by an exponential fit to the data (solid line). Note that this measurement was done with beams of 150 μm diameter, while in the correlation measurements described later we use beams of 60 μm diameter, leading to a diffusion time of the order of 360 μs.

FIG. 6: Diffusion of atoms out of the excitation region. The solid line is an exponential fit with a time constant of 900 μs. The Raman beam diameter is 150 μm. The horizontal axis is the probe delay (μs).
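The 360 μs estimate for the 60 μm beams is consistent with assuming that the transit time scales linearly with the beam diameter. The sketch below spells out that arithmetic and the corresponding exponential loss of atoms; the linear scaling is an assumption made for illustration, and the function names are hypothetical.

```python
import math

def scaled_transit_time_us(tau_ref_us, d_ref_um, d_um):
    """Rescale a measured transit (diffusion) time to another beam diameter.

    Assumes the time to leave the beam is proportional to its diameter.
    """
    return tau_ref_us * d_um / d_ref_um

def remaining_fraction(delay_us, tau_us):
    """Fraction of atoms still in the excitation region after delay_us,
    for an exponential decay with time constant tau_us."""
    return math.exp(-delay_us / tau_us)

tau_60 = scaled_transit_time_us(900.0, 150.0, 60.0)
print(tau_60)                             # time constant for 60 um beams, in us
print(remaining_fraction(10.0, tau_60))   # fraction left after a 10 us storage time
```

On the 10 μs scale of the coherence times discussed here, the loss of atoms from the excitation region is therefore only a few percent.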


B.  Nonclassical  correlations 

In order to characterize the coherence time of the system for various quantum information applications, e.g., for the DLCZ protocol or for the generation of conditional single photons, the measurements must be performed at the single-photon level. In particular, one must know how long a single excitation can be stored in the quantum memory. For this purpose, we perform correlation measurements between fields 1 and 2 as a function of the time delay $\Delta t$ between write and read pulses, thereby probing how the nonclassical character of these correlations (and hence of the correlations between field 1 and the collective atomic excitations) is preserved during the storage process.

In order to investigate the quantum nature of the correlations, we use the fact that there exists a well-defined border between the classical and quantum domains for fields 1 and 2 that can be operationally accessed via coincidence detection, as was first demonstrated in the pioneering work by Clauser [32]. In this way, we measure the joint detection probability $p_{12}$ for detecting a photon in both fields 1 and 2 in the same trial, and the probabilities $p_1$ and $p_2$ to register a single detection event in field 1 and field 2, respectively. By splitting field $i$ with a 50-50 beamsplitter and directing the outputs to two detectors, the joint probabilities $p_{ii}$ are also measured, where $i = 1$ or 2. Fields for which the Glauber-Sudarshan phase-space function is well behaved (i.e., classical fields) are constrained by a Cauchy-Schwarz inequality for the various probabilities [32, 33], namely:

$$
R = \frac{[g_{12}(t)]^2}{g_{11}\, g_{22}} \leq 1 , \tag{40}
$$

where $g_{11} = p_{11}/p_1^2$, $g_{22} = p_{22}/p_2^2$, $g_{12}(t) = p_{12}/(p_1 p_2)$, and $t$ denotes the time separation between the detection of photons 1 and 2. In our system, $g_{11} = g_{22} = 2$ in the ideal case. However, in practice, $g_{11}$ and $g_{22}$ are measured to be smaller than 2, due to various experimental imperfections. Hence in our case measuring $g_{12} > 2$ heralds nonclassical correlations, and in the following we will use this quantity as another figure of merit to quantify the loss of coherence in the quantum memory.
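The normalized correlation functions and the Cauchy-Schwarz ratio can be computed directly from the measured probabilities. The sketch below follows the definitions $g_{11} = p_{11}/p_1^2$, $g_{22} = p_{22}/p_2^2$, $g_{12} = p_{12}/(p_1 p_2)$ and $R = g_{12}^2/(g_{11} g_{22})$; the function name and the probabilities in the usage example are hypothetical and are not data from the experiment.

```python
def normalized_correlations(p1, p2, p12, p11, p22):
    """Return (g12, g11, g22, R) from per-trial detection probabilities.

    p1, p2   : single-detection probabilities for fields 1 and 2
    p12      : joint probability for one detection in each field
    p11, p22 : joint probabilities for two detections in the same field
    """
    g12 = p12 / (p1 * p2)
    g11 = p11 / p1**2
    g22 = p22 / p2**2
    r = g12**2 / (g11 * g22)   # classical fields satisfy r <= 1
    return g12, g11, g22, r

# hypothetical probabilities, for illustration only
g12, g11, g22, r = normalized_correlations(p1=1e-3, p2=1e-3, p12=1e-5,
                                           p11=2e-6, p22=2e-6)
print(g12, g11, g22, r)
print("nonclassical" if r > 1 else "compatible with classical fields")
```

For statistically independent fields ($p_{12} = p_1 p_2$, $p_{ii} = p_i^2$) the same function returns $R = 1$, the classical boundary.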

The experimental setup used to measure nonclassical correlations between fields 1 and 2 is shown in Fig. 7. As already mentioned, the sample consists of a cold ensemble of cesium atoms in a magneto-optical trap. Each trial consists of a period of cooling and trapping, and of a period of measurement during which all the beams responsible for cooling and trapping the atoms are switched off. During the measurement period, the atoms are initially prepared in level $|g\rangle$ (F = 4) by optical pumping with a laser beam resonant with the transition $6S_{1/2}(F = 3) \to 6P_{3/2}(F' = 4)$.

A laser pulse with 150 ns duration from the write beam then illuminates the sample. The write beam is tuned near the $|g\rangle \to |a\rangle$ transition (corresponding to F = 4 → F' = 4 of the D2 line, at 852 nm) and induces spontaneous Raman scattering to the initially empty level $|s\rangle$ (F = 3). The intensity of the pulse is made sufficiently weak that the probability of creating more than one excitation in the symmetric collective mode is very low. After a variable delay $\Delta t$, the stored excitation is converted into a photon in field 2 by sending a read pulse tuned to the transition $|s\rangle \to |b\rangle$ (corresponding to the F = 3 → F' = 4 transition of the D1 line, at 894 nm). The write and read beams are orthogonally polarized and combined at the polarizing beam splitter PBS1 (see Fig. 7). At PBS1, the write and read beams are spatially mode matched with a measured overlap of about 93%. The beams are focused to a waist of about 30 μm in the sample region.


FIG.  7:  Experimental  setup.  Write  and  read  pulses  propagate 
sequentially  into  a  cloud  of  cold  Cs  atoms  (MOT),  generating 
pairs  of  correlated  output  photons  1  and  2.  The  write  and 
read  pulses  have  orthogonal  polarizations,  are  combined  at 
polarizing  beam  splitter  PBS1,  and  then  focused  in  the  Cs 
MOT with a waist of approximately 30 μm. The output fields
are  split  by  PBS2,  which  also  serves  as  a  first  stage  of  filtering 
the  (write,  read)  beams  from  the  (1,2)  fields.  For  example, 
field  2  is  transmitted  by  PBS2  to  be  subsequently  registered 
by  detector  D3  or  D4  while  the  read  pulse  itself  is  reflected 
at  PBS2.  Further  filtering  is  achieved  by  passing  each  of  the 
outputs  from  PBS2  through  separate  frequency  filters.  SM 
stands  for  single  mode. 


After the MOT, fields 1 and 2 are detected at the two different outputs of PBS2. A challenging aspect of the experiment is to separate the classical pulses from the weak nonclassical fields, since they are temporally and spatially overlapped, and their frequencies are only 9 GHz apart. This is done in several steps, which are explained in detail in Refs. [11], [14], and [17]. After the filters, fields 1 and 2 are coupled into optical fibers, split by 50/50 fiber beam splitters, and detected by four single-photon silicon avalanche photodiodes (APDs). Finally, the electronic signals of the APDs are sent to a data acquisition card, in order to record the detection events and analyze the correlations.


1.  Coherence  time  measurements 

In order to characterize the system's coherence time, we measure $g_{12}$ and $R$ as a function of the delay $\Delta t$ between write and read pulses. We then compare the theoretical quantity $\tilde{p}_{12}(\Delta t) = \xi\, p^{th}_{12}(\Delta t)$ to the measured $g_{12}(\Delta t)$ by way of a single overall scaling parameter $\xi$ for all $\Delta t$, as the rate of single counts in fields 1 and 2 ($p_1$ and $p_2$) is measured not to depend on $\Delta t$, to within 20%. In Fig. 8a we show our results for $g_{12}$ with the MOT magnetic field on, together with the corresponding theoretical fit. This figure was already presented in a previous article [17] and shows a fast decay of the coherence between fields 1 and 2, taking place on a time scale of less than 200 ns. Note, however, that the coherence time is actually smaller than 100 ns, since the write pulse itself has a duration of 150 ns. The repetition rate of the trials in this case is 250 kHz. The rate of coincidence events (detection of photon 1 and photon 2 within the same trial) is between 2 and 3 counts per second.

The  theoretical  joint  probability  pfy  is  calculated  from 
Eq.  (30),  assuming  (7=1.  In  this  way,  we  need  to  per¬ 
form  integrals  of  the  F  function  over  the  £  coordinate. 
This  function  depends  on  2  only  through  the  parameters 
ag  and  as.  The  atomic  ensemble  is  assumed  to  be  ini¬ 
tially  unpolarized,  i.e.,  with  the  atoms  evenly  distributed 
among  all  Zeeman  states  of  the  | g)  level.  For  the  ground 
states  of  Cesium,  we  have  that  the  hyperfine  Lande  fac¬ 
tors  gg  and  gs  of  levels  | g)  and  |s),  respectively,  are  given 
by  fiBgg/h  =  —/jbBgs/h  =  0.35MHz/G,  so  that  we  can 
write 

ag  =  2nKmg  (|-)  ,  (41a) 

as  =  —2nKms  (-^)  ,  (41b) 

where  we  considered  the  magnetic  field  for  the  MOT  in 
the  form  Bz  =  bz,  with  b  the  field  gradient  in  the  center 
of  the  MOT,  and  the  constant  K  given  by 

k  ,  asigk .  (i2) 

The value of $K m_F$ gives an estimate for the inhomogeneous
broadening associated with level $|F, m_F\rangle$ due to the
magnetic-field gradient $b$. Note that writing $a_g$ and $a_s$ as
in Eqs. (41) allows us to perform all spatial integrations
over the dimensionless coordinate $s = z/L$, and to combine
many of the relevant experimental parameters in a
single parameter ($K$). For our experiment, $L = 3.6$ mm
and $b = 8.7$ G/cm, so that $K = 1.1$ MHz. This $K$ value
is consistent with the measurement of the ground-state
broadening shown in Fig. 4a.
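
The quoted value of $K$ can be checked numerically. The short sketch below assumes the form $K = (\mu_B g_g/h)\,b\,L$, which is what Eqs. (41) imply for a linear field $B_z = bz$; the function and variable names are illustrative only and are not part of the original analysis.

```python
import math

# Zeeman shift per unit field for the Cs ground state, mu_B * g_g / h,
# as quoted in the text (MHz per gauss).
ZEEMAN_MHZ_PER_G = 0.35


def broadening_constant_mhz(gradient_g_per_cm: float, length_cm: float) -> float:
    """K in MHz for a linear field B_z = b z across a sample of length L.

    Assumes K = (mu_B g_g / h) * b * L (illustrative reconstruction).
    """
    return ZEEMAN_MHZ_PER_G * gradient_g_per_cm * length_cm


def phase_rate_rad_per_us(k_mhz: float, m: int, s: float) -> float:
    """a_g = 2 pi K m_g s at the dimensionless position s = z / L (rad/us)."""
    return 2.0 * math.pi * k_mhz * m * s


# Values quoted in the text: b = 8.7 G/cm, L = 3.6 mm = 0.36 cm.
K = broadening_constant_mhz(gradient_g_per_cm=8.7, length_cm=0.36)
print(f"K = {K:.2f} MHz")
```

With the numbers given in the text this evaluates to approximately 1.1 MHz, in agreement with the value used in the fits.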

The solid curve in Fig. 8a shows the theoretical fitting
of $p_{1,2}(\Delta t)$ to the experimental data. We considered
$K = 1.1$ MHz in the theory, as estimated above for our
experimental conditions. The only fitting parameter used
was $\xi$, which was found to be $\xi = 1.05 \times 10^{8}$. Note that
the theoretical quantity $p_{1,2}^{\mathrm{th}}$ gives the probability for joint
detection of the two photons, while $g_{12}$ is a measure of
this joint probability normalized by the probability of
uncorrelated coincidence detections. Thus the scaling factor
$\xi$ should be given roughly by the inverse of the probability
for these uncorrelated coincidences. A theoretical estimate
for this value is given by $\xi_{\mathrm{th}} = [p_{1,2}^{\mathrm{th}}(\Delta t \to \infty)]^{-1}$,
i.e., the inverse of the theoretical joint probability after
the coherence has completely decayed. For the solid
curve in Fig. 8a, we find $\xi_{\mathrm{th}} = 1.96 \times 10^{8}$. The difference
between $\xi$ and $\xi_{\mathrm{th}}$ can be attributed to other sources of
uncorrelated coincidences (such as dark counts in the
detectors, or leakage from the filters) that are not accounted
for by the theory, which leads to $\xi < \xi_{\mathrm{th}}$. It is also
important to keep in mind that the noise floor is higher when
the pulses are overlapping, since there is more leakage
from the filters in this condition. This results in some
extra discrepancy when comparing theory to experiment
by means of a single scaling parameter for all regions of
Fig. 8a.
FIG. 8: Measurement of $g_{12}$ as a function of the storage
time: (a) with the quadrupole field on (taken from [17]) and
(b) with the quadrupole field off. The observed decay in (b)
is consistent with the residual magnetic field in the chamber,
as measured by Raman spectroscopy.

The $g_{12}(\Delta t)$ measurements with the magnetic field off are
presented in Fig. 8b. In this case, we use the information
acquired from the investigation of Sec. III A and turn off
the field for a duration of 5.5 ms, at a 40 Hz repetition rate.
From the magnetic-field-off period, we use for correlation
measurements only the 2.5 ms window shown in Fig. 5.
This 2.5 ms window is then divided into 208 trial periods
of 12 μs, which results in an overall repetition rate of
8.3 kHz. At the beginning of each trial, the trap light of
the MOT (tuned to the $F = 4$ to $F' = 5$ transition of the
$D_2$ line) is turned on for 0.6 μs, and its repumper laser
(tuned from $F = 3$ to $F' = 4$) for 1 μs. This procedure
prepares the system in the proper initial state, with all
atoms in the $F = 4$ hyperfine level of the ground state.
In this case, the rate of coincidence counts drops to about
0.33 coincidences/s.
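
The timing figures quoted above follow from simple arithmetic, which the sketch below reproduces. It uses only the numbers stated in the text (40 Hz field-off cycle, 2.5 ms window, 208 trials); the variable names are illustrative.

```python
# Timing budget for the field-off measurements, using values from the text.
FIELD_OFF_RATE_HZ = 40.0      # how often the quadrupole field is switched off
WINDOW_S = 2.5e-3             # usable measurement window per field-off period
TRIALS_PER_WINDOW = 208       # write/read trials inside each window

# Duration of a single trial, in microseconds.
trial_period_us = WINDOW_S / TRIALS_PER_WINDOW * 1e6

# Trials per second averaged over the whole cycle, in kHz.
overall_rate_khz = FIELD_OFF_RATE_HZ * TRIALS_PER_WINDOW / 1e3

# Fraction of wall-clock time actually spent inside the measurement window.
duty_cycle = FIELD_OFF_RATE_HZ * WINDOW_S

print(f"trial period  = {trial_period_us:.2f} us")
print(f"overall rate  = {overall_rate_khz:.2f} kHz")
print(f"duty cycle    = {duty_cycle:.2f}")
```

The overall trial rate is therefore about 30 times lower than the 250 kHz used with the field on, which accounts for part of the drop in the coincidence rate.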
Figure 8b then shows an increase of more than two
orders of magnitude in the coherence time of the system
when the magnetic field is turned off. The coherence
time is now limited mainly by the rate at which
we can turn off the magnetic field, and also to some extent
by our ability to magnetically isolate the system.
Note that in Fig. 5a the Raman-trace linewidth indicates
that the magnetic field in the measurement window is
still decaying. The solid curve in Fig. 8b gives the decay
theoretically expected for a magnetic-field gradient such
that $K = 12$ kHz, corresponding to magnetic fields of
the order of or smaller than 30 mG acting on the ensemble.
This gives a reasonable approximation to the behavior of
$g_{12}$ under the action of the residual magnetic field, even
though the spatial dependence of this field can be more
complicated than a simple linear gradient. The change in
$K$ from 1.1 MHz to 12 kHz is consistent with the reduction
of the ground-state linewidth between the two cases,
as measured directly by the Raman spectroscopy setup.
Finally, for Fig. 8b, $\xi = 0.67 \times 10^{8}$ and $\xi_{\mathrm{th}} = 2.2 \times 10^{8}$.

From Fig. 8b, we see that the correlations are still
highly nonclassical after a storage time of 10 μs. However,
from the theoretical fitting we can infer that $g_{12}$
should become smaller than 2 at about 25 μs, which gives
an estimate for our quantum memory time.

As discussed above, the measurements with $g_{12} > 2$
give a strong indication of the nonclassical correlations
observed in our system, based on reasonable assumptions
for $g_{11}$ and $g_{22}$. The most appropriate verification of the
nonclassical nature of fields 1 and 2, however, is given by
the measurement of $R$ as defined in Eq. (40). Such measurements
with the magnetic field off are shown in Fig. 9.
More specifically, in Fig. 9a we show the measurements
of $g_{11}$ and $g_{22}$ for the same data points of Fig. 8b.
Substituting the results of Figs. 8b and 9a in (40), we then
obtain the values of $R$ shown in Fig. 9b, which confirm
the strong nonclassical correlation present in our system
for more than 10 μs.
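
To make the role of $R$ concrete, the sketch below evaluates a Cauchy-Schwarz ratio from the three normalized correlation functions. Eq. (40) is not reproduced in this section, so the form $R = g_{12}^2/(g_{11} g_{22})$, with $R \le 1$ for classical fields, is an assumption here; the numerical inputs are hypothetical and are not data from Figs. 8 and 9.

```python
def cauchy_schwarz_ratio(g12: float, g11: float, g22: float) -> float:
    """Return R = g12**2 / (g11 * g22).

    Assumed form of the ratio; classical fields would satisfy R <= 1.
    """
    if g11 <= 0.0 or g22 <= 0.0:
        raise ValueError("autocorrelations must be positive")
    return g12 ** 2 / (g11 * g22)


def is_nonclassical(g12: float, g11: float, g22: float) -> bool:
    """True when the (assumed) classical bound R <= 1 is violated."""
    return cauchy_schwarz_ratio(g12, g11, g22) > 1.0


# Hypothetical values: autocorrelations at the thermal-field level of 2
# and a cross-correlation well above 2.
example_R = cauchy_schwarz_ratio(g12=10.0, g11=2.0, g22=2.0)
print(f"R = {example_R:.1f}")
```

Under this assumed form, if $g_{11}$ and $g_{22}$ do not exceed 2, any $g_{12} > 2$ gives $R > 1$, which is why $g_{12} > 2$ serves as a practical indicator when the autocorrelations are hard to measure.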

The $R$ measurement presents considerably larger error
bars than that of $g_{12}$. This comes from the large statistical
uncertainties involved in the determination of $g_{22}$, which
requires measurement of the two-photon component of
field 2 [14]. For this reason, we decided to carry out a
much longer run of the experiment for the longest coherence
time we were able to probe, 10 μs, which resulted
in the considerably smaller statistical error of this point.


2.  Two-photon  wavepackets 

Central to the DLCZ protocol is the ability to write
and read collective spin excitations into and out of an
atomic ensemble, with efficient conversion of discrete spin
excitations to single-photon wavepackets. A critical aspect
of such wavepackets is that they are emitted into
well-defined spatiotemporal modes to enable quantum
interference between emissions from separate ensembles
(e.g., for entanglement-based quantum cryptography [5]).


The high efficiencies achieved in the work of Ref. [14]
enabled us to investigate in detail the temporal properties
of the nonclassical correlations between emitted photon
pairs [17], providing a direct look at various important
features of the two-photon wavepacket (field 1 + field 2)
generated by the system. In the following analysis, our
main quantity of interest is $p_\tau(t_1, t_2)$, the joint probability
for photoelectric detection of photon 1 at time $t_1$ and
photon 2 at time $t_2$ within a time window of duration
$\tau$. The times for this quantity are counted starting from
the beginning of the write pulse. This quantity is
determined from the record of time-stamped detections on
all four photodetectors. The detectors have a time
resolution of 2 ns (minimum bin size), but usually we need
to consider larger bins to acquire enough events for the
statistics.
FIG. 9: (a) Measurement of $g_{11}$ (open squares) and $g_{22}$ (open
circles) as a function of the storage time. (b) Measurement
of the coefficient $R$ as a function of the storage time. The large
statistical errors are mainly due to statistical uncertainties in
the measurement of $g_{11}$ and $g_{22}$. The points at 10 μs have
been measured for a much longer time and exhibit smaller
statistical error.

In our earlier experiments [17], we focused on two
cases: (I) nearly simultaneous application of write and
read pulses with offset $\Delta t = 50$ ns shorter than the
duration of either pulse, and (II) consecutive (nonoverlapping)
application of write and read pulses with $\Delta t = 200$ ns.
Results for $p_\tau(t_1, t_2)$ are presented in Fig. 10. In
case (I), Fig. 10a shows that $p_\tau(t_1, t_2)$ peaks along the
line $t_2 - t_1 = \delta t_{12} \simeq 50$ ns with a width $\Delta t_{12} \simeq 60$ ns, in
correspondence to the delay $\delta t_{12}$ and duration $\Delta t_{12}$ for
read-out associated with the transition $|s\rangle \to |b\rangle \to |g\rangle$
given an initial transition $|g\rangle \to |a\rangle \to |s\rangle$ [12]. In case
(II), with the read pulse launched 200 ns after the write
pulse, the excitation is “stored” in the atomic ensemble
until the readout. The production of correlated photon
pairs should now be distributed along $t_2 \simeq \Delta t + \delta t_{12}$
with width $\simeq \Delta t_{12}$. Instead, as shown in Fig. 10c,
$p_\tau(t_1, t_2)$ peaks towards the end of the write pulse (i.e.,
$t_1 > 100$ ns), and near the beginning of the read pulse
(i.e., $200 < t_2 < 300$ ns). Early events for field 1 lead
to fewer correlated events for field 2, as $p_\tau(t_1, t_2)$ decays
rapidly beyond the line $t_2 - t_1 \simeq 175$ ns. The
marked contrast between $p_\tau(t_1, t_2)$ for $\Delta t = 50$ and 200
ns results in a diminished ability for the conditional
generation of single photons from excitation stored within
the atomic ensemble [14] and, more generally, for the
implementation of the DLCZ protocol for increasing $\Delta t$.
The underlying mechanism is again decoherence within
the ensemble.

By contrast, when the magnetic field is turned off, this
distortion in the two-photon wavepacket is eliminated
due to the extended coherence time. We now observe
the shape shown in Fig. 10e. The delay in Fig. 10e is
$\Delta t = 1$ μs.

The theoretical results corresponding to these three
situations are shown in frames (b), (d), and (f) of Fig. 10.
These are plots of Eq. (39) averaged over 4 ns time
windows for both $t_2$ and $t_1$, the same time window used
for the experimental data. We also considered pulses of
trapezoidal shape, with 20 ns rise time, and FWHM of
150 ns for the write pulse and 120 ns for the read pulse.
These values correspond to the experimental parameters.
The only effect of both the time window and the pulse rise
time is to smooth the edges of the distribution. In contrast
to the case of integrated probabilities, it is necessary
here to introduce more details in the description
of the pulse shapes, since the theoretical description for
this signal predicts that it is directly related to the pulse
profiles [see Eq. (36)].
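
The pulse model and the bin averaging described above can be sketched as follows. This is only an illustration of a trapezoidal envelope and a 4 ns window average, not an evaluation of Eq. (39); it assumes that the "rise time" is the duration of a linear 0 to 100% ramp (so the half-maximum points sit at the ramp midpoints), and all function names are hypothetical.

```python
def trapezoid(t_ns: float, start_ns: float, fwhm_ns: float,
              rise_ns: float = 20.0) -> float:
    """Unit-height trapezoidal envelope with linear ramps of length rise_ns.

    With linear ramps the half-maximum points are the ramp midpoints,
    so the flat top lasts fwhm_ns - rise_ns.
    """
    flat_ns = fwhm_ns - rise_ns
    total_ns = flat_ns + 2.0 * rise_ns
    x = t_ns - start_ns
    if x <= 0.0 or x >= total_ns:
        return 0.0
    if x < rise_ns:
        return x / rise_ns                 # rising edge
    if x <= rise_ns + flat_ns:
        return 1.0                         # flat top
    return (total_ns - x) / rise_ns        # falling edge


def bin_average(f, bin_start_ns: float, width_ns: float = 4.0,
                samples: int = 40) -> float:
    """Average f over one time bin using midpoint sampling."""
    step = width_ns / samples
    return sum(f(bin_start_ns + (i + 0.5) * step) for i in range(samples)) / samples


def write_pulse(t_ns: float) -> float:
    """Write pulse: FWHM 150 ns, starting at t = 0 (values from the text)."""
    return trapezoid(t_ns, start_ns=0.0, fwhm_ns=150.0)


def read_pulse(t_ns: float, delay_ns: float = 200.0) -> float:
    """Read pulse: FWHM 120 ns, launched delay_ns after the write pulse."""
    return trapezoid(t_ns, start_ns=delay_ns, fwhm_ns=120.0)


# Envelope averaged over the 4 ns bin centred on the first half-maximum point.
edge_value = bin_average(write_pulse, bin_start_ns=8.0)
```

Averaging over 4 ns bins changes the envelope only near the 20 ns ramps, consistent with the statement that the time window and rise time merely smooth the edges of the distribution.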

The most striking point in these figures
is that the theory offers a reasonable explanation
for the data from consecutive pulses ($\Delta t = 200$ ns) with
magnetic field on, but not for overlapping pulses or $\Delta t =
1$ μs with magnetic field off. This discrepancy can be
simply  understood,  however,  if  we  remember  that  one 
of  the  main  approximations  of  our  theory  is  to  consider 
low  intensities  for  both  write  and  read  pulses.  At  low 
intensities  and  zero  magnetic  field,  the  theory  gives  a 
small  and  constant  probability  for  the  photon  2  emission 
after  photon  1.  From  Eq.  (36),  we  see  that  the  magnetic 
field  introduces  different  phases  for  different  groups  of 
atoms.  These  different  phases  are  proportional  to  the 
time  difference  between  the  emission  of  photons  2  and 
1,  and  result  in  an  overall  decay  of  the  probability  of 
emission of the second photon over time. In Figs. 10b
and 10f, however, we see that the predicted decay time is
much longer than the one inferred from the experimental
data.


On the other hand, for the actual experiment, the high
intensity of the read pulse should lead to a fast emission
of photon 2 once the atom is transferred to level $F_s$. This
is consistent with the short duration of correlation $\Delta t_{12}$
in Figs. 10a and 10e, which can be understood as coming
from the fast depletion of the $F_s$ state. However,
this reasoning cannot explain the shape of Fig. 10c, since
the strong excitation alone should result in a similar fast
depletion at the beginning of the read pulse for any
detection time of photon 1 (as seen in Fig. 10e). The good
agreement between Figs. 10c and 10d comes from the
fact that the decay due to the magnetic field takes place
before the delayed readout process occurs. The shape
in Fig. 10c is then a convolution of a uniform excitation
probability over $t_1$ (as in Fig. 10e) with the
excitation-probability distribution of Fig. 10d.


FIG. 10: Theory and experiment for two-photon wavepackets
$p_\tau(t_1, t_2)$. (a) Measured two-photon wavepackets for the case
where write and read pulses are overlapped with a delay of 50
ns, with the quadrupole magnetic field on. (b) Theoretical
predictions for the same conditions as in (a). (c) Measured
two-photon wavepackets for the case of consecutive (nonoverlapping)
write and read pulses with a delay of 200 ns, with
quadrupole field on. (d) Theoretical predictions for the same
conditions as in (c). (e) Measured two-photon wavepackets
for nonoverlapping write and read pulses, with quadrupole
field off. The delay between write and read pulses is 1 μs. (f)
Theoretical predictions for the same conditions as in (e). The
vertical scales are given in arbitrary units proportional to the
joint probability of detecting photons 1 and 2. See text for
further details.
IV. OPTICAL PUMPING

The theory developed to explain the data in Fig. 8
can also be used to devise new ways to improve the
system. The inclusion of the Zeeman structure in the
theory, for example, allows the study of different polarization
schemes for both classical excitation and photon
detection. It also allows the investigation of the role of
the atomic initial state on the measured correlations. In
Fig. 11 we give two examples of possible ways to improve
the system. The solid and dashed lines in the figure
represent the two experimental conditions of Fig. 8
(initially unpolarized samples with $K = 1.1$ MHz and
$K = 12$ kHz), but now with the same scaling factor.
The dash-dotted curve shows how the $K = 12$ kHz curve
changes if the system is initially spin polarized, with all
atoms in the $|F = 4, m_F = 0\rangle$ state. Note that in this
case the value of $p_{1,2}$ increases considerably, and the system
develops a plateau coming from the predominant
transition $|F = 4, m_F = 0\rangle \to |F = 3, m_F = 0\rangle \to |F = 4, m_F = 0\rangle$,
which is magnetic-field insensitive. Furthermore,
it is possible to devise a polarization scheme of excitation
that allows only this specific transition for any
$\Delta t$, e.g., when the write pulse and field-1 detection are
$\sigma^+$ polarized, and the read pulse and field-2 detection are
$\sigma^-$. This is the case for the dotted curve in Fig. 11.




FIG. 11: Variation of $p_{1,2}$ with the delay $\Delta t$ between write and
read pulses for (solid curve) $K = 1.1$ MHz and an unpolarized
sample, (dashed curve) $K = 12$ kHz and an unpolarized
sample, and (dash-dotted curve) $K = 12$ kHz and an initially
spin-polarized sample with all atoms in $|F = 4, m_F = 0\rangle$. The
dotted curve corresponds to an initially spin-polarized sample
classically excited by fields with polarizations such that only a
magnetic-field-insensitive transition is allowed; see text for details.
The same arbitrary scaling factor was used for all curves.

The idealized improvements described by the dotted
and dash-dotted curves of Fig. 11, however, will probably
be limited by two effects that are not taken into account
by the theory. First, in our experimental setup we should
see a decay with a timescale on the order of 360 μs due to
the average time the cold atoms take to cross the 60 μm
beam diameter of the classical write and read pulses. Second,
the theory assumes the presence of a magnetic field
predominantly in the $z$ direction, which defines the
quantization axis. This can be obtained by applying an extra
DC magnetic field along that direction [34, 35], but any
residual transverse field should lead to some decay of the
plateau. In spite of these restrictions, however, we believe
that such improvements could lead to an increase
of more than an order of magnitude over the largest
experimental decoherence time of Fig. 8. It is also clear
that careful preparation of the initial state benefits
the magnitude of the measured correlations.
This is an important point that should also be
taken into account when considering the implementation
of the DLCZ protocol in vapor cells.


V.  CONCLUSION 

We have presented a detailed study of the decoherence
processes in the generation of photon pairs from
atomic ensembles, via the DLCZ protocol of Ref. [5]. We
have identified the main cause of decoherence for cold
atoms in magneto-optical traps as being the inhomogeneous
broadening of the hyperfine ground states due to
the quadrupole magnetic field used to trap the atoms. A
detailed theory has been developed to model this effect.
We also reported a series of measurements to characterize
and control the decoherence using copropagating
stimulated Raman scattering. These measurements allowed
us to switch off the quadrupole magnetic field in a
controlled way. With the magnetic field off, we observed
highly nonclassical correlations between the two emitted
photons, for a storage time of up to 10 μs, an improvement
of more than two orders of magnitude compared to
previous results with cold atoms. Furthermore, contrary
to all related experiments reported up to now, the coherence
time is now two orders of magnitude larger than the
duration of the excitation pulses. This is a crucial step
towards using atomic ensembles as a quantum memory to store
conditional single-photon states or entanglement between
two distant ensembles.
Measurement-induced entanglement for excitation
stored  in  remote  atomic  ensembles 

C.  W.  Chou1,  H.  de  Riedmatten1,  D.  Felinto1,  S.  V.  Polyakov1,  S.  J.  van  Enk2  &  H.  J.  Kimble1 


A critical requirement for diverse applications in quantum
information science is the capability to disseminate quantum resources
over complex quantum networks1,2. For example, the coherent
distribution of entangled quantum states together with quantum
memory (for storing the states) can enable scalable architectures
for quantum computation3, communication4 and metrology5.
Here we report observations of entanglement between two atomic
ensembles located in distinct, spatially separated set-ups.
Quantum interference in the detection of a photon emitted by one of the
samples projects the otherwise independent ensembles into an
entangled state with one joint excitation stored remotely in $10^5$
atoms at each site6. After a programmable delay, we confirm
entanglement by mapping the state of the atoms to optical fields
and measuring mutual coherences and photon statistics for these
fields. We thereby determine a quantitative lower bound for the
entanglement of the joint state of the ensembles. Our observations
represent significant progress in the ability to distribute and store
entangled quantum states.

Entanglement  is  a  uniquely  quantum  mechanical  property  of  the 
correlations  among  various  components  of  a  physical  system.  Initial 
demonstrations  of  entanglement  were  made  for  photon  pairs 
from  the  fluorescence  in  atomic  cascades7,8  and  from  parametric 
down-conversion9.  More  recently,  entanglement  has  been  recognized 
as  a  critical  resource  for  accomplishing  tasks  that  are  otherwise 
impossible  in  the  classical  domain1.  Spectacular  advances 
have  been  made  in  the  generation  of  quantum  entanglement  for 
diverse  physical  systems1,2,  including  entanglement  stored  for  many 
seconds  in  trapped  ions  for  distances  on  the  millimetre  scale10,11, 
long-lived  entanglement  of  macroscopic  quantum  spins  persisting 
for  milliseconds  on  the  centimetre  scale12,  and  remote  entanglement 
carried  by  photon  pairs  over  distances  of  tens  of  kilometres  of  optical 
fibres13. 

For  applications  in  quantum  information  science,  entanglement 
can  be  created  deterministically  by  precisely  controlling  quantum 
dynamics  for  a  physical  system,  or  probabilistically  by  quantum 
interference  in  a  suitable  measurement  with  random  instances  of 
success.  In  the  latter  case,  it  is  essential  that  success  be  heralded 
unambiguously  so  that  the  resulting  entangled  state  is  available  for 
subsequent  use.  In  either  case,  quantum  memory  is  required  to  store 
the  entangled  states  until  they  are  required  for  the  protocol  at  hand. 

There are by now several examples of entanglement generated ‘on
demand’1, beginning with the realization of the Einstein-Podolsky-Rosen
(EPR) paradox for continuous quantum variables14 and the
deterministic entanglement of the discrete internal states of two
trapped ions15. Important progress has been made towards
measurement-induced entanglement on various fronts, including the
observation of entanglement between a trapped ion and a photon (ref. 16
and references therein).


Here, we report the initial observation of entanglement created
probabilistically from quantum interference in the measurement
process, with the resulting entangled state heralded unambiguously
and stored in quantum memory for subsequent use. As illustrated in
Fig. 1, the detection of a photon from either of two atomic
ensembles (L, R) in an indistinguishable fashion results in an
entangled state with one ‘spin’ excitation shared at a distance of
2.8 m between the ensembles and distributed symmetrically among
~$10^5$ atoms at each site6. Confirmation of entanglement is achieved
by mapping this stored excitation onto light fields after a 1-μs delay6,17
and by suitable measurements of the quantum statistics of the
resulting optical fields. Our results provide the first realization of
the capability to transfer a stored entangled state of matter to an
entangled state of light.

Our experiment is motivated by the protocol of Duan, Lukin,
Cirac and Zoller (DLCZ)6 for the realization of scalable quantum
communication networks with atomic ensembles. The DLCZ protocol
introduced a number of ideas for quantum information
processing and is the subject of active investigation. In this direction,
nonclassical correlations17-24 and entanglement25 have been observed
between pairs of photons emitted by a single atomic ensemble.
Observations of coherence between two cylindrical volumes of cold
rubidium atoms within a single magneto-optical trap have also been
reported26, although entanglement was not demonstrated between
the two regions27,28.

A simple schematic of our experiment is given in Fig. 1, with
further details provided in refs 17, 21 and 23. For the writing stage of
the protocol, two classical pulses traverse the L and R ensembles in
parallel and generate fields 1L, 1R by spontaneous Raman scattering
(see Fig. 1a). The intensity of the pulses is made sufficiently weak that
the probability of creating more than one excitation in the symmetric
collective mode6 of the ensemble is very low21.

Entanglement between the L and R ensembles is created by
combining the output fields 1L, 1R on the beamsplitter BS$_1$, with
outputs directed to two photodetectors D$_{1a}$, D$_{1b}$ (see Fig. 1a). For
small excitation probability and with unit overlap of the fields at
BS$_1$, a detection event at D$_{1a}$ or D$_{1b}$ arises indistinguishably from
either field 1L or 1R, so that the L and R ensembles are projected
into an entangled state, which in the ideal case can be written
as6,29:

$$|\Psi_{L,R}\rangle = \epsilon_L |1\rangle_L |0\rangle_R \pm e^{i\eta_1} \epsilon_R |0\rangle_L |1\rangle_R \tag{1}$$

where $|0\rangle_{L,R}$, $|1\rangle_{L,R}$ refer to the two ensembles L and R with 0 and
1 collective excitations respectively, $\epsilon_L$ (or $\epsilon_R$) is the normalized
amplitude of photon generation from ensemble L (or R), and the
sign (+ or −) is set by whichever detector records the event. The
phase $\eta_1 = \Delta\beta_w + \Delta\gamma_1$, where $\Delta\beta_w$ is the phase difference of the
write beams at the L and R ensembles, and $\Delta\gamma_1$ is the phase
difference acquired by the 1L and 1R fields in propagation from the
ensembles to the beamsplitter BS$_1$. We note that to achieve
entanglement as in equation (1), $\eta_1$ has to be kept constant
from trial to trial.

To verify the entanglement, we map the delocalized atomic
excitation into a field state by applying simultaneously strong read
beams at the two ensembles (see Fig. 1b). If the state transfer were to
succeed with unit probability, the conditional state $|\Psi_{L,R}\rangle$ of the
ensembles would be mapped to an entangled state of two modes for
the Stokes fields 2L and 2R given in the ideal case by6,29:

$$|\Phi_{LR}\rangle = \epsilon_L |1\rangle_{2L} |0\rangle_{2R} \pm e^{i(\eta_1 + \eta_2)} \epsilon_R |0\rangle_{2L} |1\rangle_{2R} \tag{2}$$

where $|0\rangle_{2L,2R}$, $|1\rangle_{2L,2R}$ refer to the Raman fields 2L, 2R with 0, 1
photons, respectively. Here, $\eta_2 = \Delta\beta_r + \Delta\gamma_2$, where $\Delta\beta_r$ is the phase
difference of the read beams at the L and R ensembles, and $\Delta\gamma_2$ is the
phase difference acquired by the 2L and 2R fields in propagation from
the ensembles to the beamsplitter BS$_2$ in Fig. 1b. In our experiment,
the phases $\eta_1$ and $\eta_2$ can be independently controlled and are actively
stabilized by utilizing auxiliary fields at 1.06 μm that co-propagate
along the paths of the write and read beams and of the 1L, 1R and 2L,
2R fields.

Of course, the states in equations (1) and (2) are idealizations that
must be generalized to describe our actual experiment6,27,29.
Specifically, the presence of various sources of noise necessarily transforms
these pure states into mixed states. Equations (1) and (2) also neglect
the vacuum contribution as well as higher-order terms, which are
intrinsic to the DLCZ protocol and which otherwise arise from diverse
experimental imperfections. Moreover, the above analysis assumes
that all excitations are in the correct ‘modes’ (both for optical fields
and for the collective atomic ‘spin flips’), that excitations of the
ensembles map one-to-one to photons in fields 1 and 2, and that
diverse sources of background light are absent.

The  procedure  that  we  have  devised  to  provide  a  robust,  model- 
independent  determination  of  entanglement  is  based  upon  quantum 
tomography  of  the  2L  and  2R  fields  (see  Supplementary  Information 
for  details).  Because  entanglement  cannot  be  increased  by  local 
operations  on  either  of  the  two  ensembles,  the  entanglement  for 
the  state  of  the  ensembles  will  be  always  greater  than  or  equal  to  that 




Figure 1 | An overview of our experiment to entangle two atomic ensembles
is shown. a, Set-up for generating entanglement between two pencil-shaped
ensembles L and R located within spherical clouds of cold caesium atoms.
The atomic level structure for the writing process consists of the initial
ground state $|g\rangle$ ($6S_{1/2}$, $F = 4$ level of atomic caesium), the ground state $|s\rangle$
for storing a collective spin flip ($6S_{1/2}$, $F = 3$ level), and the excited level $|e\rangle$
($6P_{3/2}$, $F' = 4$). The transition $|g\rangle \to |e\rangle$ in each ensemble is initially coupled
by a write pulse detuned from resonance to generate the forward-scattered
anti-Stokes field 1 from the transition $|e\rangle \to |s\rangle$. The L and R ensembles are
excited by synchronized writing pulses obtained from beamsplitter BS$_w$.
After filtering, the anti-Stokes fields 1L and 1R are collected, coupled to
fibre-optic channels, and interfere at beamsplitter BS$_1$, with outputs directed
towards two single-photon detectors D$_{1a}$ and D$_{1b}$. b, Schematic for
verification of entanglement between the L and R ensembles by conversion
of atomic to field excitation by way of simultaneous read pulses obtained
from BS$_r$. The read pulses reach the samples after a programmable delay
from the write pulses, and couple the transition $|s\rangle \to |e'\rangle$ ($|e'\rangle$ being the
$6P_{1/2}$, $F' = 4$ level), leading to the emission of the forward-scattered Stokes
fields 2L and 2R from the transition $|e'\rangle \to |g\rangle$. The upper inset shows the
configuration used to measure the diagonal elements of $\tilde{\rho}_{2L,2R}$ in equation
(3) from the photo-detection events at D$_{2a}$, D$_{2b}$ and D$_{2c}$. Reconfiguring the
fibre connections, we can easily pass from the configuration of the upper
inset to the one of the lower inset, which is used to generate interference of
the 2L and 2R fields at beamsplitter BS$_2$ to measure the off-diagonal
coherence $d$ in $\tilde{\rho}_{2L,2R}$. In a and b, the incident write and read beams are
orthogonally polarized and combined at a polarizing beamsplitter (not
shown), and are focused to a waist of about 30 μm in the sample region. All
beamsplitters BS are polarization-maintaining fibre beamsplitters. The
~12 m arms of both write and read interferometers are actively stabilized
using an auxiliary Nd:YAG laser at 1.06 μm.
measured for the state of the light fields. Specifically, conditioned
upon a detection at D1a or D1b, we consider the density matrix:


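$$
\tilde{\rho}_{2_L,2_R} = \frac{1}{P}
\begin{pmatrix}
p_{00} & 0 & 0 & 0 \\
0 & p_{01} & d & 0 \\
0 & d^{*} & p_{10} & 0 \\
0 & 0 & 0 & p_{11}
\end{pmatrix}
\qquad (3)
$$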


which is written in the basis |n⟩_{2L}|m⟩_{2R}, with the number of photons
{n, m} = {0, 1}. p_{ij} is then the probability to find i photons in mode
2L and j photons in mode 2R, and d gives the coherence between the
|1⟩_{2L}|0⟩_{2R} and |0⟩_{2L}|1⟩_{2R} states. ρ̃_{2L,2R} is obtained from the full density
matrix ρ_{2L,2R} by restricting it to the subspace where there is at most
one photon in each mode, with then P = p_{00} + p_{01} + p_{10} + p_{11}. The
concurrence C(ρ̃_{2L,2R}) for ρ̃_{2L,2R} provides a lower bound for the
concurrence C(ρ_{2L,2R}) for ρ_{2L,2R} [C(ρ_{2L,2R}) ≥ P C(ρ̃_{2L,2R})], so we devise
measurements to deduce the various components of ρ̃_{2L,2R}. The
concurrence C(ρ̃_{2L,2R}) can then be calculated from equation (3) by
way of ref. 30:


$$
P\,C(\tilde{\rho}_{2_L,2_R}) = \max\left(2|d| - 2\sqrt{p_{00}\,p_{11}},\; 0\right) \qquad (4)
$$


The entanglement of formation E follows directly from C, where E
and C both range from 0 to 1 for our system and E is a monotonically
increasing function of C (ref. 30).
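The monotonic relation between E and C can be illustrated numerically. The sketch below assumes that the relation intended by ref. 30 is the standard two-qubit formula E = h[(1 + √(1 − C²))/2], with h the binary entropy; the function names are illustrative and not taken from the original analysis.

```python
import math


def binary_entropy(x: float) -> float:
    """Shannon entropy in bits of the two-outcome distribution (x, 1 - x)."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1.0 - x) * math.log2(1.0 - x)


def entanglement_of_formation(concurrence: float) -> float:
    """Two-qubit entanglement of formation E(C), with C clipped to [0, 1].

    Assumes the standard relation E = h((1 + sqrt(1 - C^2)) / 2).
    """
    c = min(max(concurrence, 0.0), 1.0)
    return binary_entropy((1.0 + math.sqrt(1.0 - c * c)) / 2.0)


if __name__ == "__main__":
    # Sample concurrences, from unentangled (0) to maximally entangled (1).
    for c in (0.0, 2.4e-3, 0.7, 1.0):
        print(f"C = {c:g} -> E = {entanglement_of_formation(c):.3g}")
```

Under this assumption E vanishes at C = 0, reaches 1 at C = 1, and grows monotonically in between, so any strictly positive lower bound on C implies a strictly positive E.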

As a first step in the determination of C we measure the diagonal
elements p_{ij}. As shown in Fig. 1b, the field-2 output of each ensemble
is directed to different sets of detectors in order to record photon-counting
probabilities for the fields 2L, 2R separately. From the record
of photoelectric counting events, we then deduce the diagonal
elements of ρ̃_{2L,2R}, which are listed in Table 1. From equation (4)
and noting that |d|² ≤ p_{10}p_{01}, a necessary requirement for C > 0 is
that there be a suppression of two-photon events relative to the
square of the probability for single-photon events for the fields 2L, 2R,
that is: h_c^(2) ≡ p_{11}/(p_{10}p_{01}) < 1. For our measurements, we find h_c^(2) =
0.30 ± 0.04 for events conditioned on detection at D1a, and h_c^(2) =
0.35 ± 0.04 for events conditioned on D1b (ref. 21). In contrast, for
non-conditioned events, we find h_nc^(2) = 0.99 ± 0.04.
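As a quick consistency check, the suppression parameter p_{11}/(p_{10}p_{01}) can be recomputed from the central values listed in Table 1. The following is only a sketch: it ignores the quoted uncertainties, and the dictionary layout and function name are illustrative.

```python
# Central values of the diagonal elements from Table 1, keyed by the
# heralding detector. Uncertainties are deliberately left out of this sketch.
TABLE_1 = {
    "D1a": {"p00": 0.98510, "p10": 7.38e-3, "p01": 7.51e-3, "p11": 1.7e-5},
    "D1b": {"p00": 0.98501, "p10": 6.19e-3, "p01": 8.78e-3, "p11": 1.9e-5},
}


def two_photon_suppression(p10: float, p01: float, p11: float) -> float:
    """Ratio p11 / (p10 * p01); a value below 1 is necessary for C > 0."""
    return p11 / (p10 * p01)


if __name__ == "__main__":
    for detector, p in TABLE_1.items():
        h = two_photon_suppression(p["p10"], p["p01"], p["p11"])
        print(f"{detector}: h = {h:.2f}")
```

Both ratios come out close to 0.3, in line with the conditioned values quoted in the text within their stated uncertainties.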

The second step in our tomography protocol is to determine the
coherence term d in equation (3), which we accomplish by adding a
relative phase shift φ for the fields 2L, 2R, and by combining them at
the beamsplitter BS2 shown in Fig. 1b. By recording the conditional
count rate after the beamsplitter as a function of φ, we can measure an
interference fringe with a visibility V, with |d| then following from V
and the p_{ij}. Roughly, for 50/50 beamsplitters and neglecting higher-order
terms (that are employed in our actual analysis), we would have
|d| ≈ V(p_{10} + p_{01})/2.

Figure 2 shows conditional counts N2a, N2b + N2c as functions of
φ. These data demonstrate that the indistinguishable character of
measurement events at detectors D1a (Fig. 2a) and D1b (Fig. 2b)
induces a high degree of coherence between the otherwise independent
ensembles L, R (refs 6 and 26). Indeed, we deduce visibilities
V_{1a} = (70 ± 2)% and V_{1b} = (71 ± 2)% for the associated
conditional states.


Table 1 | Diagonal elements of the density matrix ρ̃_{2L,2R}, deduced from
the records of photoelectric counts

Probability    D1a                       D1b
p_00           0.98510 ± 0.00007         0.98501 ± 0.00007
p_10           (7.38 ± 0.05) × 10^-3     (6.19 ± 0.04) × 10^-3
p_01           (7.51 ± 0.05) × 10^-3     (8.78 ± 0.05) × 10^-3
p_11           (1.7 ± 0.2) × 10^-5       (1.9 ± 0.2) × 10^-5

The values of p_{ij} are referenced to the location of detectors D2a,2b,2c and were obtained by
considering unit detection efficiency, which gives a more conservative (smaller) lower bound
for the concurrence than the actual (larger) field concurrence for finite efficiency < 1. See the
Supplementary Information for further details, and equation (3).


A notable feature of these results is that the interference fringes
have relative phase π for the cases of detection at D1a, D1b, in
agreement with equations (1) and (2). We observe similar fringes if
the phase η_1 between the write beams is varied instead of φ. Moreover,
if the fields 1L, 1R are combined at the beamsplitter BS1 with
orthogonal polarizations (by way of the half-wave plate in Fig. 1a), we
find that the visibility from interference of fields 2L, 2R drops to near
zero, because in this case there is no longer measurement-induced
entanglement associated with quantum interference for detection of
fields 1L, 1R (see Supplementary Information).

With equation (4), the measured values for the visibility V and for
the various p_{ij} are sufficient to deduce a lower bound for the
concurrence C for the field state ρ_{2L,2R} at the location of detectors
D2a,2b,2c. With no correction for detection efficiencies or propagation
losses, and without subtraction of any background, we find:

$$
C^{1a}(\rho_{2_L,2_R}) = (2.4 \pm 0.6)\times 10^{-3} > 0, \qquad
C^{1b}(\rho_{2_L,2_R}) = (1.9 \pm 0.6)\times 10^{-3} > 0 \qquad (5)
$$


conditioned upon detection at either D1a or D1b. This conclusively
demonstrates a non-zero degree of entanglement between the
ensembles, albeit with the concurrence C_{L,R} small. The small difference
between the concurrence for the states conditioned on D1a or
D1b can be explained by an asymmetry in BS1 (see Supplementary
Information).
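The order of magnitude of these bounds can be reproduced from equation (4) with the rough estimate |d| ≈ V(p_{10} + p_{01})/2. The sketch below uses only the central values from Table 1 and the quoted visibilities; it neglects the higher-order terms used in the actual analysis, so it is expected to agree with equation (5) only within the stated uncertainties. Function and variable names are illustrative.

```python
import math


def concurrence_lower_bound(p00: float, p10: float, p01: float,
                            p11: float, visibility: float) -> float:
    """Right-hand side of equation (4), using |d| ~ V * (p10 + p01) / 2.

    This is the simplified 50/50-beamsplitter estimate; higher-order
    corrections are not included.
    """
    d = visibility * (p10 + p01) / 2.0
    return max(2.0 * d - 2.0 * math.sqrt(p00 * p11), 0.0)


# Central values from Table 1 together with the measured visibilities.
CONDITIONED_ON_D1A = dict(p00=0.98510, p10=7.38e-3, p01=7.51e-3, p11=1.7e-5, visibility=0.70)
CONDITIONED_ON_D1B = dict(p00=0.98501, p10=6.19e-3, p01=8.78e-3, p11=1.9e-5, visibility=0.71)

if __name__ == "__main__":
    for label, values in (("D1a", CONDITIONED_ON_D1A), ("D1b", CONDITIONED_ON_D1B)):
        print(f"{label}: P*C ~ {concurrence_lower_bound(**values):.2e}")
```

Both estimates land at a few times 10^-3. The bound is small because the coherence term 2|d| only narrowly exceeds 2√(p_{00}p_{11}) when the vacuum component p_{00} is close to 1.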

Beyond the firm lower bound given by equation (5), we can make a
better estimate of the degree of entanglement C_{L,R} between the L and
R ensembles by using detailed measurements of the propagation
Figure 2 | Coherence between the atomic ensembles L, R induced by a
measurement event of the fields 1L and 1R at detector D1a or D1b. Shown is
the number of coincidences N2a (triangles) and N2b + N2c (circles)
recorded by the respective detectors D2a,2b,2c for the fields 2L and 2R with the
interferometer arrangement of Fig. 1b as a function of the relative phase φ.
In a, N2a,2b,2c are conditioned upon a detection event at D1a with no count at
D1b, while in b, N2a,2b,2c are conditioned upon an event at D1b with no count
at D1a. At each setting of φ, data are acquired for 150 s with a detection
window of width 190 ns. Although the interference fringes have comparable
visibility, the different sizes arise from unequal quantum efficiencies for
detectors D2a and D2b,2c (see Supplementary Information). The visibility
values are obtained from an average of the visibilities of the red and black
curves, respectively. Error bars reflect ± one standard deviation due to the
finite number of counts.
efficiencies from the atomic ensembles to the plane z_0 of the detectors
shown in Fig. 1b (see Supplementary Information). Figure 3 gives an
inference of the density matrix ρ^{z_i}_{2L,2R} and thereby of the concurrence
C_{z_i}(ρ^{z_i}_{2L,2R}) at z_0 and at two other locations z_{i=1,2} along the path from
the ensembles to the detectors (see Fig. 1b), assuming a constant
visibility. In general, C increases in direct correspondence to the
reduced level of losses for the 2L and 2R fields at locations closer to the
ensembles. At location z_2 corresponding to the output edges of the
atomic ensembles, we find the result:


$$
C^{1a}_{z_2}(\rho^{z_2}_{2_L,2_R}) \simeq 0.021 \pm 0.006 > 0, \qquad
C^{1b}_{z_2}(\rho^{z_2}_{2_L,2_R}) \simeq 0.016 \pm 0.006 > 0 \qquad (6)
$$


To move beyond this result, we need more detailed information
about the efficiencies ξ_{L,R} with which stored excitation in the atomic
ensembles is converted to the propagating light fields 2L and 2R. Our
earlier measurements included comparisons to a simple model (ref. 21) and
Figure 3 | Inference of the concurrence C_{z_i} (a) and density matrix ρ̃^{z_i}_{2L,2R} (b)
at the three locations z_i indicated in Fig. 1b. At each location, two pairs of
results are given corresponding to the measurement-induced state created
from detection at D1a and D1b, taking into account the efficiency of the
detectors and propagation losses. a, Concurrence C_{z_i} for two different
detection windows Δt_d at D2a,2b,2c. Filled symbols are for Δt_d = 190 ns,
enough to acquire the whole temporal wavepacket of field 2. Open symbols
are for Δt_d = 120 ns. We see then that the degree of entanglement can be
further enhanced, similar to the increase of nonclassical correlations
between fields 1 and 2 reported in ref. 23 for specific detection windows for
these fields. All values shown in this figure, including the ones for z_0, are
already corrected for the efficiencies of the detectors. Error bars reflect ± 1
standard deviation, taking into account the finite number of counts and
the uncertainties in the efficiency and propagation loss. b, Density matrix
ρ̃^{z_i}_{2L,2R} given in the basis |n⟩_{2L}|m⟩_{2R} corresponding to equation (3) with
{n, m} = {0, 1} for Δt_d = 190 ns.


allowed an inference ξ_{L,R} ≈ 0.10 ± 0.05. The measurement of the
losses together with the values of p_{ij} at the detectors yields
p_{10} + p_{01} ≈ 11% at the output of the ensembles (z_2 plane) for our
current experiment. This value together with the estimated ξ_{L,R} then
indicates that p_{00} → 0 for the conditional state ρ_{L,R} of the ensembles,
so that C_{L,R} ≈ V ≈ 0.7, suggesting that ρ_{L,R} is close to the ideal
entangled state of equation (1). The low measured values for the
entanglement between fields 2L and 2R are apparently principally a
consequence of the low readout efficiency ξ_{L,R} of the atomic
excitation. We stress that this inference of C for the state inside the
ensembles must be confirmed by subsequent experiments and is
offered here to provide some insight into future prospects for
quantum protocols with entangled ensembles. This also emphasizes
that a central point in subsequent work should be the improvement
of ξ_{L,R}.
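The arithmetic behind this inference can be sketched as follows. The sketch assumes a simple linear-loss picture in which a stored single excitation is converted to a field-2 photon with probability equal to the readout efficiency, and it neglects any two-excitation component; both are assumptions made here for illustration and are not the detailed model of the original analysis. The function and parameter names are hypothetical.

```python
def infer_ensemble_state(single_photon_prob_at_output: float,
                         readout_efficiency: float,
                         visibility: float) -> tuple[float, float]:
    """Return (vacuum probability, concurrence estimate) for the stored state.

    Assumes linear loss: P(photon at output) = efficiency * P(stored excitation),
    and a negligible two-excitation term, so equation (4) reduces to C ~ V * p1.
    """
    p_excitation = min(single_photon_prob_at_output / readout_efficiency, 1.0)
    p_vacuum = 1.0 - p_excitation
    concurrence = visibility * p_excitation
    return p_vacuum, concurrence


if __name__ == "__main__":
    # p10 + p01 ~ 11% at the z2 plane, readout efficiency ~ 0.10, V ~ 0.7.
    p_vacuum, concurrence = infer_ensemble_state(0.11, 0.10, 0.70)
    print(f"p00 ~ {p_vacuum:.2f}, C ~ {concurrence:.2f}")
```

With the central values the inferred excitation probability saturates at 1, which is the sense in which p_{00} → 0. Because the efficiency carries a 50% relative uncertainty, the resulting estimate of C is correspondingly uncertain.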

In conclusion, we have achieved entanglement between a pair of
atomic ensembles separated by 2.8 m, with the entangled state
involving one spin excitation within a collective system of roughly
10^5 atoms at each site L and R. The entangled state is generated by
and conditioned upon an initial detection event, and is thus
produced in a probabilistic fashion. However, this initial event
heralds unambiguously the creation of an entangled state between
L and R ensembles, which is physically available for subsequent use,
as, for example, by mapping to propagating optical fields, which can
in principle be accomplished with high efficiency. We emphasize that
our measurements relate to an actual physical state of the L and R
ensembles and of the 2L and 2R fields, and are not an inference of a
state based upon post-selection. Our work provides the first example
of a stored atomic entangled state that can be transferred to entangled
light fields, and significantly extends laboratory capabilities for
entanglement generation, with now entangled states of matter stored
with separation a thousand-fold larger than was heretofore possible
for qubits. With our current set-up, we have demonstrated Δt_s ≈ 1 μs
for storing entanglement. However, this should readily be extended
to Δt_s ≈ 10 μs, and new trapping schemes have the potential to lead
to Δt_s ≈ 1 s (ref. 17). The distance scale for separating the L and R
ensembles is limited by the length l_0 ≈ 2 km for fibre-optic attenuation
at our write wavelength of 852 nm. Extensions to scalable
quantum networks over larger distances will require the realization of
a quantum repeater (ref. 6), for which we have now laid the essential
foundation.

METHODS 

Atomic ensembles and optical pulses. Each of the L and R atomic ensembles is
obtained from caesium atoms in a magneto-optical trap (MOT) (refs 17, 21). Measurements
are carried out in a cyclic fashion consisting first of a period of cooling and
trapping to form the MOT, followed by an interval during which the magnetic
fields for the MOT are switched off. After waiting 3 ms for the magnetic field to
decay (ref. 17), we initiate a sequence of measurement trials, where for each trial the
atoms are initially prepared in level |g⟩. The write pulse is at 852 nm, with a
duration of 150 ns and is detuned 10 MHz below the |g⟩ → |e⟩ transition. The
read pulse is at 894 nm, with a duration of 130 ns and is resonant with the
|s⟩ → |e'⟩ transition. At the end of each trial, the sample is pumped back to level
|g⟩ by illuminating the atomic cloud with trapping and repumping light for
0.7 μs and 1 μs respectively, and then a new trial is initiated with a period of 3 μs.
The total duration for a sequence of measurement trials is 5 ms, after which the
measurement interval is terminated and a new MOT is formed in preparation for
the next sequence of trials at a rate of 40 Hz.
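The timing figures above fix the trial rate of the experiment. The short sketch below works through that arithmetic, reading the trial period as 3 μs, the trial sequence as 5 ms and the cycle rate as 40 Hz; the constant and function names are illustrative.

```python
# Timing parameters from the Methods, expressed in integer nanoseconds
# to avoid floating-point rounding in the division.
TRIAL_PERIOD_NS = 3_000            # one write/read trial every 3 microseconds
SEQUENCE_DURATION_NS = 5_000_000   # 5 ms of trials per MOT cycle
CYCLE_RATE_HZ = 40                 # MOT cycles per second


def trials_per_sequence() -> int:
    """Number of complete trials that fit in one 5 ms measurement interval."""
    return SEQUENCE_DURATION_NS // TRIAL_PERIOD_NS


def trials_per_second() -> int:
    """Total number of trials attempted per second of wall-clock time."""
    return trials_per_sequence() * CYCLE_RATE_HZ


def measurement_duty_cycle() -> float:
    """Fraction of wall-clock time spent in the measurement interval."""
    return SEQUENCE_DURATION_NS * CYCLE_RATE_HZ / 1e9


if __name__ == "__main__":
    print(trials_per_sequence(), trials_per_second(), measurement_duty_cycle())
```

On these numbers each 25 ms cycle contains about 1,666 trials, and roughly 20% of the time is spent measuring, the remainder being used to reload the MOT and let the magnetic field decay.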
We propose a scheme to achieve quantum computation with neutral atoms whose interactions are catalyzed
by single photons. Conditional quantum gates, including an N-atom Toffoli gate and nonlocal gates on remote
atoms, are obtained through cavity-assisted photon scattering in a manner that is robust to random variation in
the atom-photon coupling rate and which does not require localization in the Lamb-Dicke regime. The dominant
noise in our scheme is automatically detected for each gate operation, leading to signalled errors which do
not preclude efficient quantum computation even if the error probability is close to unity.
Neutral atoms in optical cavities have been one of the
pioneering avenues for the implementation of quantum computation
and networking [1-4]. Nevertheless, the experimental
requirements associated with these approaches turn out to
be very challenging. In particular, although significant experimental
advances have been reported recently in transmitting
and trapping single atoms in high-finesse cavities
[4-12], no experiment has yet achieved a well-defined number
of atoms N ≥ 2 each of which is strongly coupled to the
cavity mode, individually addressable, and localized to the
Lamb-Dicke limit, as is required for the protocol of Ref. [1].
To realize a more scalable system, Chapman et al. proposed
an architecture in which a transverse optical lattice is employed
to translate atoms into and out of a high-finesse cavity
for entangling gate operations [6]. Transport that preserves
internal state coherence has been demonstrated for
both ions [13] and atoms [14]. However, although the approach
of Ref. [6] does solve the problem of separate addressing
of many atoms in a tiny cavity, there remain significant
obstacles to achieving Lamb-Dicke confinement [15]
and strong coupling for any scheme that has yet been proposed.

To overcome these difficulties and to provide several capabilities
for quantum logic, in this paper we propose a
scheme for atomic quantum gates whereby atom-atom interactions
are catalyzed by single photons in a fashion that is
robust to various sources of practical noise. More specifically,
a controlled phase-flip gate between two atoms is
achieved by cavity-assisted scattering of a single-photon
pulse from the cavity in which the atoms are localized [16].
This gate is insensitive to uncertainties in the atom-photon
coupling rate, thereby obviating the requirement for Lamb-Dicke
localization. It is also robust to all sources of photon
loss, including, for instance, atomic spontaneous emission,
photon collection and detection inefficiency, and any vacuum
component in the scattering pulse. Such noise is automatically
detected for each gate, leading to a finite failure probability
of the gate operation. As shown in Refs. [17,18], efficient
quantum computation can nevertheless be achieved
even if the associated failure probability is close to unity.
Moreover, our scheme can be readily extended to achieve a
Toffoli gate for N atoms in a single step and to realize nonlocal
gates on remote atoms trapped in different cavities. The
direct N-bit gate could lead to more efficient construction of
quantum circuits, and the nonlocal gates on remote atoms
naturally integrate local computation with quantum networking.

To explain the idea of the gate operation, we first consider
two atoms in a single-sided cavity. To have a scalable architecture,
one can follow Ref. [6] and assume there are transverse
optical lattice potentials to move the target atoms into
and out of the cavity [19,20]. Each atom has three relevant
levels as shown in Fig. 1. The qubit is represented by different
hyperfine levels |0⟩ and |1⟩ in the ground-state manifold.
The atomic transition from |1⟩ to an excited level |e⟩ is resonantly
coupled to a cavity mode a_c. The state |0⟩ is decoupled
due to the large hyperfine splitting.

To perform a collective quantum gate on the two atoms,
we reflect a single-photon pulse from the cavity. This single-photon
pulse, with its state denoted as |p⟩, is resonant with
the bare cavity mode a_c. If the photon pulse is sufficiently
long (with its bandwidth ΔΩ much smaller than the cavity
decay rate κ), reflection of the pulse from a resonant cavity
absent an atom will leave the pulse shape almost unchanged
but will flip its global phase, as we later characterize in detail.
For the case that both of the atoms are in the |0⟩ state,
this is precisely the nature of the resonant reflection since
FIG. 1. (Color online) (A) Schematic setup for implementation
of the controlled phase flip (CPF) gate on two atoms inside the
cavity through the photon-scattering interaction. Any pair of atoms
can be transmitted into the cavity for a collective gate operation
through a transverse optical lattice potential as suggested in Refs.
[6,21]. For a more robust implementation of the gate, we add a
single-photon detector to detect the output photon pulse as illustrated
inside the dashed box. (B) The relevant level structure of the
atoms and the coupling configuration.
there is negligible atom-cavity coupling and hence no shift of
the resonant frequency of the cavity mode. After reflection,
the atom-photon state |0⟩_1|0⟩_2|p⟩ evolves into −|0⟩_1|0⟩_2|p⟩,
where the subscripts 1, 2 denote the two intracavity atoms.
However, if either or both of the atoms are in the state |1⟩,
the effective frequency of the dressed cavity mode will be
shifted due to the atom-cavity coupling, which is described
by the Hamiltonian

$$
H = \hbar \sum_{i=1,2} g_i \left( |e\rangle_i\langle 1|\, a_c + |1\rangle_i\langle e|\, a_c^{\dagger} \right). \qquad (1)
$$

If the coupling rates g_i ≫ (ΔΩ, κ, γ_s), where γ_s is the rate of
spontaneous decay of |e⟩, then the frequency shift will have a
magnitude comparable with g_i, so that the incident single-photon
pulse will be reflected by an off-resonant cavity.
Hence, both the shape and global phase will remain unchanged
for the reflected pulse. Due to this property, the
component states |0⟩_1|1⟩_2|p⟩, |1⟩_1|0⟩_2|p⟩, and |1⟩_1|1⟩_2|p⟩ are
likewise unaffected by the reflection process. The net effect of
these two subprocesses is that the reflection of a single-photon
pulse from the cavity actually performs a controlled
phase-flip gate (CPF) U_{12} = exp(iπ|00⟩_{12}⟨00|) on the two atoms
while leaving the photon state unchanged (unentangled).
Hence, in the ideal case the reflected photon can be utilized
to catalyze subsequent gate operations.
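The action of the ideal CPF gate on the two-atom computational basis can be made concrete with a small sketch. It represents a two-atom pure state as four amplitudes in the order |00⟩, |01⟩, |10⟩, |11⟩ and checks that the gate turns a product state into an entangled one; the helper names are illustrative and the photon is not modelled.

```python
BASIS = ("00", "01", "10", "11")


def apply_cpf(state: list[complex]) -> list[complex]:
    """Apply U12 = exp(i*pi*|00><00|): only the |00> amplitude changes sign."""
    return [-amp if label == "00" else amp for label, amp in zip(BASIS, state)]


def is_product_state(state: list[complex], tol: float = 1e-12) -> bool:
    """A two-qubit pure state is a product state iff a00*a11 - a01*a10 = 0."""
    a00, a01, a10, a11 = state
    return abs(a00 * a11 - a01 * a10) < tol


if __name__ == "__main__":
    # Both atoms prepared in (|0> + |1>)/sqrt(2).
    plus_plus = [0.5, 0.5, 0.5, 0.5]
    after_gate = apply_cpf(plus_plus)
    print(is_product_state(plus_plus), is_product_state(after_gate))
```

The input state factorizes while the output does not, which is the sense in which a single reflection acts as an entangling gate. Applying the gate twice returns the original state, as expected for a phase flip.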

However, in a realistic setting our scheme can be performed
in a more robust fashion by detecting the output
pulse with a single-photon detector. By this means, gate errors
due to all sources of photon loss, including atomic spontaneous
emission, cavity mirror absorption and scattering,
imperfection in the photon source, and photon collection and
detection inefficiencies, are always signaled by the absence
of a photon count. As a result, these dominant sources of
noise only lead to probabilistic signaled errors, which yield a
finite failure probability of the gate but which have no contribution
to the gate infidelity if the operation succeeds (i.e.,
if a photon count is registered). For this class of errors, efficient
quantum computation is possible with an arbitrarily
small gate success probability p [17]. Compared with deterministic
gates, the required extra computational overhead
due to the small gate success probability p scales efficiently
(polynomially) both with 1/p and the computational scale
characterized by the number of qubits n [17]. Because of this
robustness, the input single-photon pulse can also be replaced
by a simple weak coherent pulse |α⟩ with the mean
photon number |α|² ≪ 1. This replacement does not give any
essential problem in terms of scaling, although the individual
gate efficiency (the success probability) is indeed significantly
reduced by a factor of |α|².
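The factor |α|² follows from the Poissonian photon statistics of a coherent state. The sketch below evaluates the vacuum, single-photon and multi-photon probabilities for a given mean photon number; the example value 0.1 is a hypothetical choice for illustration, and the multi-photon fraction is shown only as a side observation that is not discussed in the text.

```python
import math


def photon_number_probability(n: int, mean_photon_number: float) -> float:
    """Poisson probability of finding n photons in a coherent state."""
    return (math.exp(-mean_photon_number) * mean_photon_number ** n
            / math.factorial(n))


def multi_photon_probability(mean_photon_number: float) -> float:
    """Probability of two or more photons in the pulse."""
    return (1.0 - photon_number_probability(0, mean_photon_number)
            - photon_number_probability(1, mean_photon_number))


if __name__ == "__main__":
    mean = 0.1  # hypothetical |alpha|^2, chosen only for illustration
    p1 = photon_number_probability(1, mean)
    print(f"P(1) = {p1:.4f}, P(>=2)/P(1) = {multi_photon_probability(mean) / p1:.3f}")
```

For a small mean photon number the single-photon probability is close to |α|² itself, while the multi-photon contribution is suppressed by roughly a further factor of |α|²/2.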

Before going to the detailed theoretical characterization of
the gate fidelity and efficiency, we next present some extensions
of the above scheme. First, our scheme can be readily
extended to perform a Toffoli gate on N atoms in a single
time step. If one reflects a single-photon pulse from a cavity
with N atoms trapped inside, the pulse will have a flip of its
global phase if and only if all the atoms are in the |0⟩ state.
So, this reflection performs a Toffoli gate U_{12···N}
= exp(iπ|00···0⟩_{12···N}⟨00···0|) on all the atoms while leaving
the photon state unentangled. This direct N-bit gate could


FIG. 2. (Color online) Schematic of the setup for implementation
of nonlocal gates on two atoms 1 and 2 trapped in distant
cavities. Not shown are circulators (e.g., Faraday devices) to redirect
the output beams along paths distinct from the inputs. See the
text for further explanation.

lead to more efficient construction of circuits for quantum
computation. For instance, the reflection operation in
Grover's search algorithm can be realized in a single step
with the N-bit Toffoli gate [21].
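The connection to Grover's algorithm can be checked directly for a small register. The sketch below builds the N-atom phase gate as a diagonal of signs, conjugates it with Hadamard rotations on every atom, and compares the result with a reflection about the uniform superposition |s⟩, that is I − 2|s⟩⟨s|. This equals Grover's inversion about the mean up to a global sign. The single-atom Hadamard rotations are assumed to be available and the helper names are illustrative.

```python
import math


def hadamard_entry(x: int, y: int, n: int) -> float:
    """Matrix element <x| H^(tensor n) |y> of the n-qubit Hadamard transform."""
    sign = -1.0 if bin(x & y).count("1") % 2 else 1.0
    return sign / math.sqrt(2 ** n)


def toffoli_phases(n: int) -> list[float]:
    """Diagonal of exp(i*pi*|0...0><0...0|): -1 on |0...0>, +1 elsewhere."""
    return [-1.0 if k == 0 else 1.0 for k in range(2 ** n)]


def hadamard_conjugate(phases: list[float], n: int) -> list[list[float]]:
    """Dense matrix of H^(tensor n) * diag(phases) * H^(tensor n)."""
    dim = 2 ** n
    return [[sum(hadamard_entry(x, k, n) * phases[k] * hadamard_entry(k, y, n)
                 for k in range(dim))
             for y in range(dim)]
            for x in range(dim)]


def reflection_about_uniform(n: int) -> list[list[float]]:
    """Matrix of I - 2|s><s| with |s> the uniform superposition."""
    dim = 2 ** n
    return [[(1.0 if x == y else 0.0) - 2.0 / dim for y in range(dim)]
            for x in range(dim)]


if __name__ == "__main__":
    n = 3
    built = hadamard_conjugate(toffoli_phases(n), n)
    target = reflection_about_uniform(n)
    worst = max(abs(built[x][y] - target[x][y])
                for x in range(2 ** n) for y in range(2 ** n))
    print(f"largest deviation: {worst:.1e}")
```

The two matrices agree to numerical precision, so one photon reflection plus single-atom rotations suffices for the reflection step, whatever the number of atoms.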

Second, the above scheme can also be extended to perform
nonlocal gates on two remote atoms trapped in different
cavities, as illustrated in Fig. 2. A similar nonlocal gate was
also proposed recently in a different system with rare-earth
atoms [22]. For this purpose, one uses a single-photon (or
weak-coherent) pulse which is in an equal superposition state
(|H⟩ + |V⟩)/√2 of the H and V polarization components. With
a polarization beam splitter (PBS1), the H and V components
of the pulse are "bounced" back from the atom-cavity system
and a mirror M, respectively, with the reflection from M
leaving the incident pulse unchanged. The overall reflection
from the cavity and the mirror M actually performs the gate
operation U_{1p} = exp(iπ|0H⟩_{1p}⟨0H|) on atom 1 and the photon
pulse p, so that there is a phase flip only when the atom is in
the state |0⟩ and the photon is in the polarization |H⟩ [16].
The pulse is reflected successively from the two cavity
setups, with a half-wave plate (HWP1) inserted into the optical
path between the two reflections which performs a
Hadamard rotation on the photon's polarization, |H⟩ → (|H⟩
+ |V⟩)/√2, |V⟩ → (|V⟩ − |H⟩)/√2. The photon is detected by
two single-photon detectors D1 and D2 after the reflections,
corresponding to a measurement of its polarization in the
basis (|V⟩ ± |H⟩)/√2 (after the HWP2 and the PBS3; see Fig.
2). For a detection event in D2, a phase flip operation σ^z_1 is
performed on atom 1, while no operation is applied if D1
clicks. The net effect of these operations is the desired CPF
gate U_{12} = exp(iπ|00⟩_{12}⟨00|) on the two remote atoms 1, 2.
Among other applications, this nonlocal gate and its extension
to multiple atom-cavity systems provide a convenient
avenue for quantum networking. As before for the case of a
single cavity, in this distributed setting any noise leading to
photon loss is always signaled by the absence of a photon
count from either D1 or D2.
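The sequence of reflections, polarization rotation, detection and conditional phase flip can be followed step by step in a small state-vector sketch. It tracks the amplitudes of the joint state of atom 1, atom 2 and the photon polarization, assuming ideal lossless reflections. Which of the two polarization outcomes is labelled D1 or D2 is not fixed by the text, so the assignment of the (|V⟩ + |H⟩)/√2 outcome to D2 below is an assumption, as are all function names.

```python
import math

S = 1.0 / math.sqrt(2.0)


def cavity_reflection(state: dict, atom: int) -> dict:
    """U_{jp}: sign flip when atom j (0 or 1) is in |0> and the photon is |H>."""
    return {(a, b, pol): (-amp if pol == "H" and (a, b)[atom] == 0 else amp)
            for (a, b, pol), amp in state.items()}


def half_wave_plate(state: dict) -> dict:
    """|H> -> (|H> + |V>)/sqrt2, |V> -> (|V> - |H>)/sqrt2."""
    out = {}
    for (a, b, pol), amp in state.items():
        targets = (("H", S), ("V", S)) if pol == "H" else (("V", S), ("H", -S))
        for new_pol, coeff in targets:
            out[(a, b, new_pol)] = out.get((a, b, new_pol), 0.0) + amp * coeff
    return out


def detect(state: dict, sign: int) -> dict:
    """Project the photon onto (|V> + sign*|H>)/sqrt2; unnormalised atom state."""
    atoms = {}
    for (a, b, pol), amp in state.items():
        coeff = S if pol == "V" else sign * S
        atoms[(a, b)] = atoms.get((a, b), 0.0) + coeff * amp
    return atoms


def run_protocol(atom_in: dict, sign: int) -> dict:
    """Full sequence for one detection outcome, including the feed-forward."""
    state = {}
    for (a, b), amp in atom_in.items():
        state[(a, b, "H")] = amp * S   # photon starts in (|H> + |V>)/sqrt2
        state[(a, b, "V")] = amp * S
    state = cavity_reflection(state, atom=0)
    state = half_wave_plate(state)
    state = cavity_reflection(state, atom=1)
    atoms = detect(state, sign)
    if sign == +1:  # assumed to be the D2 outcome: apply sigma_z on atom 1
        atoms = {(a, b): (-amp if a == 1 else amp) for (a, b), amp in atoms.items()}
    return atoms


def is_cpf_up_to_global_phase(atom_in: dict, atom_out: dict, tol: float = 1e-12) -> bool:
    """True if atom_out is proportional to exp(i*pi*|00><00|) acting on atom_in."""
    ideal = {k: (-v if k == (0, 0) else v) for k, v in atom_in.items()}
    ratios = [atom_out[k] / ideal[k] for k in ideal if abs(ideal[k]) > tol]
    return abs(ratios[0]) > tol and all(abs(r - ratios[0]) < tol for r in ratios)


if __name__ == "__main__":
    atom_in = {(0, 0): 0.1, (0, 1): 0.3, (1, 0): 0.5, (1, 1): math.sqrt(0.65)}
    for sign in (+1, -1):
        print(sign, is_cpf_up_to_global_phase(atom_in, run_protocol(atom_in, sign)))
```

In this idealized model both outcomes occur with probability 1/2, and after the conditional phase flip each leaves the two atoms in the CPF-transformed state up to a global phase.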

We now present a more detailed theoretical model of our
scheme and characterize the influence of some practical
sources of noise. The input single-photon pulse with a normalized
shape function f_in(t) and a duration T can be described
by the state |p⟩ = ∫_0^T f_in(t) a†_in(t) dt |vac⟩, where |vac⟩ denotes
the vacuum state and a†_in(t) is the one-dimensional
optical field operator with the commutation relation
[a_in(t), a†_in(t')] = δ(t − t') [23]. The cavity mode a_c is driven by
the input field a_in(t) through the Langevin equation [23]

$$
\dot{a}_c = -i[a_c, H] - (\kappa/2)\,a_c - \sqrt{\kappa}\,a_{in}(t), \qquad (2)
$$

where κ is the cavity decay rate and the Hamiltonian H is
given in Eq. (1) for the case of two atoms; generalization to
multiple atoms is straightforward. To account for atomic
spontaneous emission with a rate γ_s, we add an effective
term (−iγ_s)|e⟩⟨e| to the Hamiltonian H. The output field
a_out(t) of the cavity is connected with the input through the
input-output relation a_out(t) = a_in(t) + √κ a_c.
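The two limiting behaviours described earlier, a phase flip for an empty resonant cavity and an essentially unchanged pulse for a strongly coupled atom, can be read off from Eq. (2) and the input-output relation in the long-pulse limit. The sketch below sets the time derivatives to zero for a resonant photon and a single atom in |1⟩, which gives an amplitude reflection coefficient r = 1 − κ/(κ/2 + g²/Γ). Here Γ denotes the amplitude decay rate of |e⟩ produced by the effective spontaneous-emission term; its relation to γ_s depends on the convention, so it is kept as a free parameter. This quasi-static estimate is an illustration and not the numerical method of Refs. [16,24].

```python
def reflection_coefficient(g: float, kappa: float, excited_decay: float) -> float:
    """Quasi-static amplitude reflection for a resonant photon, one atom in |1>.

    Steady state of  da_c/dt = -i*g*sigma - (kappa/2)*a_c - sqrt(kappa)*a_in
    and              dsigma/dt = -i*g*a_c - excited_decay*sigma,
    combined with a_out = a_in + sqrt(kappa)*a_c. `excited_decay` is the
    assumed amplitude decay rate of |e>.
    """
    atom_term = g * g / excited_decay
    return 1.0 - kappa / (kappa / 2.0 + atom_term)


if __name__ == "__main__":
    kappa = 1.0
    # Empty cavity (or atom in the decoupled state |0>): g = 0.
    print(reflection_coefficient(0.0, kappa, excited_decay=0.5))
    # Coupled atom with g = 3*kappa; excited_decay = kappa/2 is an assumed value.
    print(reflection_coefficient(3.0 * kappa, kappa, excited_decay=0.5))
```

For g = 0 the coefficient is exactly −1, the global phase flip. For g² ≫ κΓ it approaches +1, and the small shortfall 1 − r² is the fraction of the pulse lost to spontaneous emission in this simple picture.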

The final atom-photon state can be numerically solved
from the above set of equations through discretization of the
continuum optical fields (for details on the numerical
method, see Refs. [16,24]). We use the following two quantities
to characterize the imperfections in our scheme. (1)
Due to various sources of photon loss, photons in the cavity
may be lost, with then no photon count at the detectors.
Hence, we calculate the success probability of a photon
count at the detector to characterize the efficiency of the
scheme. (2) Even if a photon emerges, there may still be
imperfections of the atomic gate mainly due to the shape
distortion of the photon pulse after reflection from the cavity,
which can be characterized through the gate fidelity. Without
loss of the photon, the final atom-photon state can be written
as |Ψ_out⟩ = Σ_{i1 i2} c_{i1 i2} |i1 i2⟩_a |p⟩_{i1 i2}, where Σ_{i1 i2} c_{i1 i2} |i1 i2⟩_a
(i1, i2 = 0, 1) is the general form for the input state of the two
atoms. The output photon state |p⟩_{i1 i2} corresponds to the
atomic component |i1 i2⟩_a and is given by |p⟩_{i1 i2}
= ∫_0^T f_{i1 i2}(t) a†_out(t) dt |vac⟩ with a shape f_{i1 i2}(t). Ideally, the output
state |Ψ_out⟩ would have the shape functions f_{00}(t) =
−f_in(t) and f_{i1 i2}(t) = f_in(t) (for i1 i2 ≠ 00), which realizes a perfect
CPF gate U_{12} on the atoms. Hence to characterize the
gate imperfection, we calculate the fidelity F,
which is directly extendable to any number
of atoms. In the following calculation of the fidelity F, we
choose the input state [(|0⟩ + |1⟩)/√2]^{⊗N} for the case of N
atoms.

The results from our calculations are summarized in Fig. 3. First, Fig. 3(a) shows the component pulse shape $f^{\rm out}_{i_1 i_2}(t)$ corresponding to a Gaussian input $f_{\rm in}(t)$ for the case of two atoms. Only the component $f^{\rm out}_{00}(t)$ has a notable shape distortion; all others are basically indistinguishable from the input. To account for random variation in the coupling rates $g_i$, we have also calculated the pulse shapes for $g_i$ varying from $2\kappa$ to $6\kappa$. The output pulse shapes are nearly identical for $g_i$ varying in this range, which is typical of current experiments [4-10]. Figure 3(b) shows the corresponding fidelity $F$ of the CPF (or Toffoli) gate from the shape distortion noise with the atom number $N = 2, 3, 4, 5$. The fidelity $F$ improves with increasing pulse duration $T$, since the shape distortion is reduced for longer pulses. $F$ also increases with the atom number $N$, which is a bit surprising but actually reasonable: for the $N$-atom state $[(|0\rangle + |1\rangle)/\sqrt{2}]^{\otimes N}$, the fraction of the component $|0\rangle^{\otimes N}$ goes down as $1/2^N$, and the pulse shape distortion noise comes dominantly from this component. Because the component $|0\rangle^{\otimes N}$ dominates the contribution to the gate infidelity, $F$ is also very insensitive to variation of the coupling rates $g_i$. We have verified that there is no notable change of $F$ ($\delta F < 10^{-4}$) in Fig. 3(b) for $g_i$ varying from $2\kappa$ to $6\kappa$.

FIG. 3. (Color online) (A) The shape functions $|f(t)|$ for the input pulse (solid curve) and the reflected pulses with the atoms in different component states $|i_1 i_2\rangle_a$. The shape function for the atoms in the state $|00\rangle_a$ is shown by the dash-dot curve. With the coupling rate $g$ in a typical range from $2\kappa$ to $6\kappa$, the shape functions for the atoms in all the other component states are indistinguishable from that of the input pulse (the solid curve). We have assumed a Gaussian shape for the input pulse with $f_{\rm in}(t) \propto \exp[-(t - T/2)^2/(T/5)^2]$, where $t$ ranges from 0 to $T$ and $T = 210/\kappa$ for this example. (B) The gate fidelity versus the number of atoms with the pulse duration $T = 100/\kappa$ (the dotted curve) and $T = 210/\kappa$ (the solid curve), respectively. (C) The photon loss probability $P_{\rm sp}$ due to atomic spontaneous emission shown as a function of the coupling rate $g$ in units of $\kappa$ with the atom number $N = 2, 3, 4$. The dotted curve shows $P_{\rm sp}$ calculated from the empirical formula given in the text for $N = 4$. (D) Comparison of the photon loss $P_{\rm sp}$ for a constant coupling rate $g = 3\kappa$ (the solid curve) and for a time-varying rate $g_i(t) = 3\kappa[1 + \sin(\nu t + \phi_i)/3]$ (the dotted curve) for the $i$th atom, where $\nu = \kappa/6$ corresponds to a typical atom's axial oscillation frequency in the trap, and the $\phi_i$ are taken as random numbers accounting for the atoms' random initial positions. $g_i(t)$ is chosen so that its maximum and minimum differ by a factor of 2, which exceeds that in current experiments [9]. Other parameters for (A) and (B) are $\gamma_s = \kappa$ and $g = 3\kappa$, and for (C) and (D), $\gamma_s = \kappa$ and $T = 210/\kappa$.

DUAN, WANG, AND KIMBLE, PHYSICAL REVIEW A 72, 032333 (2005)

Photon loss from any source does not affect the gate fidelity but instead reduces the gate efficiency (success probability). A fundamental source of photon loss is atomic spontaneous emission. Figure 3(c) shows the failure probability $P_{\rm sp}$ of the gate due to this source of noise, with the noise rate $\gamma_s = \kappa$. For $N$ atoms with equal $g_i = g$, the probability $P_{\rm sp}$ can be well fit by an empirical formula $P_{\rm sp} \approx P_{\rm emp} \equiv \sum_{n=1}^{N} \frac{N!}{n!\,(N-n)!\,2^N} \left[1 + n g^2/(\kappa\gamma_s)\right]^{-1}$. The empirical $P_{\rm emp}$ can be understood as a probability averaged over all the Dicke-state components in the input state $[(|0\rangle + |1\rangle)/\sqrt{2}]^{\otimes N}$, with the $n$th Dicke component having an effective coupling rate $\sqrt{n}\,g$ to the cavity mode. We have also simulated the loss probability $P_{\rm sp}$ when the coupling rates $g_i$ are different and vary during the gate operation, for instance, as would be caused by the atoms' thermal motion. With some typical choice of the relevant experimental parameters, the result is shown in Fig. 3(d), which is qualitatively similar to the constant coupling rate case with an effective average over $|g_i|$. Other sources of photon loss can be similarly characterized. For instance, with a finite photon collection and detection efficiency $\eta$, the success probability of each gate will simply be reduced by a factor of $\eta$.
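The empirical loss formula is easy to evaluate numerically. The following is a minimal sketch, not the authors' simulation code: the function name `p_emp` and its arguments are hypothetical, rates are expressed in units of $\kappa$, and the sum is assumed to run over $n = 1, \dots, N$ (the component with no coupled atom contributes no spontaneous emission).

```python
from math import comb


def p_emp(n_atoms: int, g_over_kappa: float, gamma_over_kappa: float = 1.0) -> float:
    """Empirical spontaneous-emission loss averaged over Dicke components.

    Assumes the sum runs over n = 1..N; all rates are in units of kappa.
    """
    # g^2 / (kappa * gamma_s), written with both rates in units of kappa
    ratio = g_over_kappa**2 / gamma_over_kappa
    total = 0.0
    for n in range(1, n_atoms + 1):
        weight = comb(n_atoms, n) / 2**n_atoms  # binomial weight of the nth Dicke component
        total += weight / (1.0 + n * ratio)     # effective coupling sqrt(n) * g
    return total


if __name__ == "__main__":
    for n_atoms in (2, 3, 4):
        print(n_atoms, p_emp(n_atoms, g_over_kappa=3.0))
```

Under these assumptions the loss falls as the coupling rate grows, as in the trend described for Fig. 3(c).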

In summary, we have proposed a new scheme for robust atomic gates by way of interactions mediated by cavity-assisted photon scattering. These gates are robust to all sources of photon loss that are typically the dominant source of noise in experimental implementations, and are furthermore insensitive to randomness in the coupling rates caused by fluctuations in atomic position. Beyond two-atom gates illustrated in Fig. 1, our scheme can also be employed for realization of an $N$-atom Toffoli gate in a single step and for the implementation of nonlocal gates on distant atoms as in Fig. 2. We have characterized the efficacy of our scheme through exact numerical simulations that incorporate various sources of experimental noise. These results demonstrate the practicality of our scheme by way of current experimental technology.

Note added. After submission of this work, we were informed that a similar idea was also investigated by the authors X.-M. Lin et al. (unpublished).
Optimally  extracting  information  from  measurements  performed  on  a  physical  system  requires
an  accurate  model  of  the  measurement  interaction.  Continuously  probing  the  collective  spin  of  an 
Alkali  atom  cloud  via  its  interaction  with  an  off-resonant  optical  probe  is  an  important  example 
of  such  a  measurement  where  realistic  modeling  at  the  quantum  level  is  possible  using  standard 
techniques  from  atomic  physics.  Typically,  however,  tutorial  descriptions  of  this  technique  have 
neglected  the  multilevel  structure  of  realistic  atoms  for  the  sake  of  simplification.  In  this  paper 
we  account  for  the  full  multilevel  structure  of  Alkali  atoms  and  derive  the  irreducible  form  of  the 
polarizability  Hamiltonian  describing  a  typical  dispersive  quantum  measurement.  For  a  specific  set 
of  parameters,  we  then  show  that  semiclassical  predictions  of  the  theory  are  consistent  with  our 
experimental  observations  of  polarization  scattering  by  a  polarized  cloud  of  laser-cooled  Cesium 
atoms.  We  also  derive  the  signal-to-noise  ratio  under  a  single  measurement  trial  and  use  this  to 
predict  the  rate  of  spin-squeezing  with  multilevel  Alkali  atoms  for  arbitrary  detuning  of  the  probe 
beam. 

PACS  numbers:  03.65.Ta,  42.50.Lc,  02.30.Yy 


I.  INTRODUCTION 


Information gained by performing measurements on a quantum system can reduce uncertainty about one or more of its physical observables. It is, however, a basic property of quantum mechanics that measurements are invasive in the sense that they necessarily degrade one's ability to make subsequent predictions about the values of complementary observables [1]. This type of disturbance is often called measurement backaction, and it is a natural consequence of the Hamiltonian coupling between a probe (such as an electromagnetic field mode) and the system of interest. In a special class of "backaction evading" experimental scenarios, it is possible to channel the disturbance into observables that are not dynamically coupled to the main quantities of interest. When such measurements are performed with minimal technical imperfection on systems whose initial preparations are sufficiently pure, they qualify as what the quantum optics literature calls quantum non-demolition (QND) measurements [2, 3], and it is then possible to create conditionally squeezed states of the measured observable.

While measurement-induced squeezing can easily be understood in an abstract sense, predicting the precise degree of squeezing that can be achieved in a realistic experiment requires detailed physical modeling of the system-probe interaction (in addition to any operative decoherence mechanisms). Squeezed states of atomic spins have recently emerged [4-7] as a versatile and robust resource for quantum information science [8, 9] and quantum metrology [10-14]. In these contexts, the degree of spin squeezing is directly linked to entanglement measures, to achievable reductions in averaging times for precision measurement, and to achievable improvements over communication protocols that utilize only classical information resources.

Theoretical analyses of measurement-induced spin squeezing typically consider a system of $N \gg 1$ atoms whose collective spin is described by an observable

$$\hat{\mathbf{F}} = \sum_{i=1}^{N} \hat{\mathbf{f}}^{(i)} \qquad (1)$$

where $\hat{\mathbf{f}}^{(i)} = \cdots \otimes \hat{1}_{i-1} \otimes \hat{\mathbf{f}} \otimes \hat{1}_{i+1} \otimes \cdots$ is the angular momentum (vector) operator for the $i$th atom. Cartesian components $\hat{F}_x$, $\hat{F}_y$ and $\hat{F}_z$ follow from this in an obvious way. Under physical conditions that preserve permutation symmetry of the label $i$, the collective spin of an initially polarized atomic sample can be restricted [15] to its maximum angular momentum shell. The associated Hilbert sub-space is spanned by eigenstates $|F, M\rangle$ of the collective spin observable $\hat{\mathbf{F}}$ that satisfy $\hat{F}^2 |F, M\rangle = \hbar^2 F(F+1) |F, M\rangle$, where $F = Nf$ for atoms with individual spin $f$.
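The construction in Eq. (1) can be checked directly for a few atoms. The sketch below is illustrative only: the helper names `spin_matrices`, `collective` and `check_max_shell` are hypothetical, units are chosen with $\hbar = 1$, and the atom number is kept tiny because the matrices grow as $(2f+1)^N$. It builds the collective operators as sums of tensor products and confirms that the fully polarized product state lies in the $F = Nf$ shell.

```python
from functools import reduce

import numpy as np


def spin_matrices(f):
    """Return (fx, fy, fz) for spin f in the basis m = f, f-1, ..., -f (hbar = 1)."""
    dim = int(round(2 * f)) + 1
    m = f - np.arange(dim)
    fz = np.diag(m).astype(complex)
    # <m+1| f_+ |m> = sqrt(f(f+1) - m(m+1)) sits on the first superdiagonal
    raising = np.diag(np.sqrt(f * (f + 1) - m[1:] * (m[1:] + 1)), k=1).astype(complex)
    fx = (raising + raising.conj().T) / 2
    fy = (raising - raising.conj().T) / 2j
    return fx, fy, fz


def collective(op, n_atoms):
    """Sum over atoms of 1 x ... x op x ... x 1, with op acting on atom i."""
    dim = op.shape[0]
    eye = np.eye(dim, dtype=complex)
    total = np.zeros((dim**n_atoms, dim**n_atoms), dtype=complex)
    for i in range(n_atoms):
        factors = [op if j == i else eye for j in range(n_atoms)]
        total += reduce(np.kron, factors)
    return total


def check_max_shell(f=0.5, n_atoms=3):
    """True if the fully polarized state satisfies F^2 |psi> = F(F+1) |psi> with F = N f."""
    fx, fy, fz = spin_matrices(f)
    Fx, Fy, Fz = (collective(op, n_atoms) for op in (fx, fy, fz))
    F2 = Fx @ Fx + Fy @ Fy + Fz @ Fz
    psi = np.zeros(Fz.shape[0], dtype=complex)
    psi[0] = 1.0  # every atom in |f, m = f>
    F = n_atoms * f
    return bool(np.allclose(F2 @ psi, F * (F + 1) * psi))


if __name__ == "__main__":
    print(check_max_shell(0.5, 3), check_max_shell(1.0, 2))
```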

FIG. 1: (A) Graphical representation of the spin-polarized atomic sample as a classical magnetization vector with transverse quantum uncertainty. (B) Schematic of the transverse quantum uncertainties for coherent and squeezed spin states.

FIG. 2: Schematic of an experimental apparatus for continuous measurement of collective spin in an Alkali atom sample based on polarimetric detection of a forward scattered probe laser. Information gained from the measurement can be used to achieve conditional spin squeezing.

It is natural to conceptualize the quantum state of such a system as a Bloch-like magnetization vector $\mathbf{F} = [\langle F_x \rangle, \langle F_y \rangle, \langle F_z \rangle]$ plus a transverse uncertainty $\Delta F_\perp$ associated with the variances of $F_x$, $F_y$ and $F_z$ (see Fig. 1). The transverse uncertainty $\Delta F_\perp$ can never vanish since $F_x$, $F_y$ and $F_z$ do not commute; this constraint can be interpreted to mean that we can never have perfect knowledge of the orientation of the collective magnetization.
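The constraint on the transverse uncertainty can be made quantitative with the standard Robertson relation. The worked example below is a textbook statement added for illustration, written for a coherent spin state polarized along $z$ (an assumed orientation), rather than a result taken from this paper.

```latex
% Robertson uncertainty relation for the collective spin components
\Delta F_x \, \Delta F_y \;\ge\; \frac{\hbar}{2}\,\bigl|\langle \hat{F}_z \rangle\bigr| .
% Coherent spin state polarized along z, with F = N f:
\langle \hat{F}_z \rangle = \hbar F , \qquad
\Delta F_x^2 = \Delta F_y^2 = \frac{\hbar^2 F}{2} ,
% so the bound is saturated with equal transverse variances.
```

A squeezed spin state reduces one transverse variance below $\hbar^2 F/2$ at the expense of the other.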

Conditional spin-squeezing experiments operate on the principle that weakly measuring $F_z$ gradually reduces its uncertainty below that of the initially prepared spin state. A typical apparatus for such procedures is depicted in Fig. 2. Continuous measurement of $F_z$ is implemented by passing a linearly polarized probe laser through an atomic sample prepared in an initial (approximate) coherent state [16] by optical pumping [17]. Qualitatively speaking, the atoms rotate (via state-dependent optical activity) the probe polarization by an amount proportional to $F_z$ [18]. In a quantum analysis the atoms and optical probe field evolve into an entangled state [19] as the result of this scattering interaction. Detection of the scattered probe field then yields information about $F_z$ via these correlations [20-24].

The interpretation of existing squeezing experiments has (at least implicitly) assumed that polarimetric detection of the forward scattered probe laser yields a detector photocurrent,

$$y_t\, dt = \sqrt{M}\, F_z\, dt + dW_t . \qquad (2)$$

Here $M$ is a constant (known as the measurement strength) that describes the rate at which photodetection provides information about $F_z$. The $dW_t$ are noise increments which exhibit Gaussian white noise statistics, $E[dW_t] = 0$ and $dW_t^2 = dt$ [25].
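To make the role of $M$ concrete, the photocurrent of Eq. (2) can be simulated for a fixed, classical value of $F_z$. This is a sketch under stated assumptions, not the filtering treatment used in the literature cited above: $F_z$ is treated as a constant number, and the function names and parameter values are hypothetical.

```python
import numpy as np


def simulate_photocurrent(fz, strength, dt, n_steps, seed=0):
    """Return increments y_t dt = sqrt(M) * Fz * dt + dW_t for a fixed Fz."""
    rng = np.random.default_rng(seed)
    # Wiener increments: zero mean, variance dt
    dW = rng.normal(0.0, np.sqrt(dt), size=n_steps)
    return np.sqrt(strength) * fz * dt + dW


def estimate_fz(increments, strength, dt):
    """Estimate Fz by averaging the integrated photocurrent over the record."""
    total_time = dt * len(increments)
    return increments.sum() / (np.sqrt(strength) * total_time)


if __name__ == "__main__":
    record = simulate_photocurrent(fz=2.0, strength=1.0, dt=1e-2, n_steps=1_000_000)
    print(estimate_fz(record, strength=1.0, dt=1e-2))
```

For this simple averaging estimator the statistical error scales as $1/\sqrt{MT}$ for a record of duration $T$, which is why a larger measurement strength resolves $F_z$ faster.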

The measurement strength, $M$, is the key parameter for predicting the degree of squeezing that can be achieved as a result of the measurement. It is thus important to determine $M$ in terms of physical properties of the atomic sample and optical probe. While the form of Eq. (2) has been derived in previous analyses [20-25], this has generally been done relative to a simplifying assumption that the atoms behave qualitatively as spin-1/2 particles. But measurement-induced spin squeezing experiments have utilized Alkali atoms with higher spin [5, 7, 26], and recent data show that the deviation from spin-1/2 behavior can be significant [27]. While nonlinearities in the atom-probe scattering process are not always bad (proposals for capitalizing on these effects for quantum state tomography are being explored [28]), they do raise complications for spin-squeezing experiments by invalidating the form of Eq. (2).

We find that the photocurrent in Eq. (2) can be recovered even for higher-spin atoms by suppressing tensor scattering interactions via a properly chosen experimental geometry. Using standard techniques [29, 30] to address the atom-probe scattering physics, combined with a semiclassical treatment of the atomic magnetization vector, we derive an expression for the measurement strength, $M$, in terms of characteristic experimental parameters. This allows us to obtain an expression for the photocurrent in terms of the duration of the measurement and the properties of the atomic system and the probe laser. We observe close agreement between our scattering model and data obtained using an apparatus of the type in Fig. 2.

Finally, we derive an expression for the signal-to-noise ratio of the measurement photocurrent which can be used to calculate the rate of spin-squeezing in experiments of the form shown in Fig. 2. The results we obtain are valid in the short-measurement limit in which atomic decoherence due to scattering probe light in unobserved (non-paraxial) electromagnetic field modes [31] can be safely ignored. Current spin-squeezing experiments all fall into this regime where the degree of quantum uncertainty reduction is small compared to the Heisenberg spin-squeezing limit [4].


II.  CONTINUOUS  MEASUREMENT  AND  THE  MULTILEVEL  ATOM-PROBE  HAMILTONIAN

We begin by considering the experiment depicted in the schematic of Fig. 2. An off-resonant linearly polarized probe beam is sent through a cloud of cold spin-polarized atoms. The forward scattered polarization state of the light is then detected using a polarimeter, consisting of polarization shifting waveplates, a polarizing beam splitter, and two detectors.

In general, it is a rather complicated problem to predict the output polarization state of the probe beam after it has passed through the spatially extended atom cloud. We can simplify the problem to one dimension by assuming that the beam is predominantly forward scattered due to the coherent re-radiation from a large number of atoms. This approximation can be extracted from a full three-dimensional model of the diffraction as considered in references [32-34]. Under this assumption, we only consider paraxial modes of the beam. Neglecting non-paraxial modes prevents us from computing the decoherence rate of the atomic magnetization, but it does not limit our ability to analyze the dynamics in the small-decoherence (short measurement time) limit.

Even in the one-dimensional problem, the depth of the atomic cloud along the probe direction introduces further complications. To simplify further, we assume that the overall optical density of the cloud is small enough that the total rotation of the optical polarization state due to the atoms is small. This allows us to neglect propagation effects by which the atoms in the front edge of the cloud would see a substantially different input state than the back edge of the cloud. These effects lead to complicated multi-mode dynamics which are considered (along with the tensor polarizability) in [26].

Under these approximations, we approach the simplified scattering problem as follows. The probe beam consists of two orthogonal polarizations and, for each polarization component, the continuous beam is divided into a series of distinct spatial traveling-wave modes, each with a length equal to the depth of the atomic cloud, $L$. Thus each atom interacts with a pair of orthogonal polarization modes with the same spatial profile for a length of time $\delta t = L/c$. This approach clearly avoids propagation effects by allowing all of the atoms to interact with the same modes simultaneously. After the two polarization modes corresponding to one time-slice have interacted with the cloud for the discrete time $\delta t$, those two modes are detected with the polarimeter, a new time-slice begins to interact with the cloud, and the process is repeated, leading to a continuous measurement. More detailed approaches to continuous measurement with discrete modes can be found in references [24, 35].

Now we define the electric-field and polarization operators associated with each of these optical modes before considering the Hamiltonian interaction of the probe beam with a single multilevel atom. This procedure is discussed in more detail in Appendix A.


A.  Probe  Field  Polarization  States 


For each traveling-wave spatial mode, we consider the field operators

$$\hat{\mathbf{E}}^{(-)} = \sqrt{\hbar g} \left[ \hat{a}_-^{\dagger} \vec{e}_-^{\,*} + \hat{a}_+^{\dagger} \vec{e}_+^{\,*} \right] \qquad (3)$$

and

$$\hat{\mathbf{E}}^{(+)} = \sqrt{\hbar g} \left[ \hat{a}_- \vec{e}_- + \hat{a}_+ \vec{e}_+ \right] \qquad (4)$$

where $\hat{a}_-^{\dagger}$ and $\hat{a}_-$ are Heisenberg-picture creation and annihilation operators for the z-axis propagating mode with left circular polarization and $\hat{a}_+^{\dagger}$ and $\hat{a}_+$ are the creation and annihilation operators for right circular polarization. Each field operator implicitly refers to a single traveling mode as discussed above, and we omit indices for the modes unless they are required for clarity. The coefficient $g = \omega_0/(2\epsilon_0 V)$ is a form factor, $V$ will be taken to be the volume of the atomic cloud, and $\vec{e}_-$ and $\vec{e}_+$ are the (complex) spherical basis vectors for left and right helicity.


In the expansion of the polarizability Hamiltonian we get terms which can be recast as Schwinger boson operators

$$\begin{aligned}
\hat{S}_0 &= \tfrac{1}{2} \left( \hat{a}_+^{\dagger} \hat{a}_+ + \hat{a}_-^{\dagger} \hat{a}_- \right) \\
\hat{S}_x &= \tfrac{1}{2} \left( \hat{a}_-^{\dagger} \hat{a}_+ + \hat{a}_+^{\dagger} \hat{a}_- \right) = \tfrac{1}{2} \left( \hat{a}_y^{\dagger} \hat{a}_y - \hat{a}_x^{\dagger} \hat{a}_x \right) \\
\hat{S}_y &= \tfrac{i}{2} \left( \hat{a}_-^{\dagger} \hat{a}_+ - \hat{a}_+^{\dagger} \hat{a}_- \right) \\
\hat{S}_z &= \tfrac{1}{2} \left( \hat{a}_+^{\dagger} \hat{a}_+ - \hat{a}_-^{\dagger} \hat{a}_- \right)
\end{aligned} \qquad (5)$$

These operators obey the usual angular momentum commutation relations and the components form a basis for the Stokes vector which is used to represent the polarization state of the light. The quantity $\hat{S}_0$ is proportional to the number of photons interacting with the atomic system in one time increment. On any given measurement, the quantity $\hat{S}_0$ and a single component of the Stokes vector representing the polarization state (e.g., $\hat{S}_x$) can be measured with an appropriate selection of polarization rotating waveplates situated after the atoms and prior to the polarizing beam-splitter. In the usual configuration (of Fig. 2), $\hat{S}_x$ is measured without any waveplates, $\hat{S}_y$ is measured with a half-waveplate that rotates the linear polarization by 45 degrees, and $\hat{S}_z$ is measured by adding a quarter-waveplate that completely circularizes linearly polarized light.
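The statement that the Schwinger operators of Eq. (5) obey angular momentum commutation relations can be verified numerically. The sketch below is only an illustration: the function name `stokes_operators` is hypothetical, and the Fock space is truncated at `n_max` photons per mode, so the relations are tested only on states that stay below the cutoff (returned as a projector).

```python
import numpy as np


def stokes_operators(n_max=4):
    """Two-mode Schwinger operators on a Fock space truncated at n_max photons per mode."""
    dim = n_max + 1
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)  # single-mode annihilation operator
    eye = np.eye(dim)
    a_plus = np.kron(a, eye)    # right-circular mode (first tensor factor)
    a_minus = np.kron(eye, a)   # left-circular mode (second tensor factor)
    ap_dag, am_dag = a_plus.conj().T, a_minus.conj().T
    s0 = 0.5 * (ap_dag @ a_plus + am_dag @ a_minus)
    sx = 0.5 * (am_dag @ a_plus + ap_dag @ a_minus)
    sy = 0.5j * (am_dag @ a_plus - ap_dag @ a_minus)
    sz = 0.5 * (ap_dag @ a_plus - am_dag @ a_minus)
    # Projector onto states with fewer than n_max photons in each mode,
    # where the truncation does not distort the operator products.
    below_cutoff = (np.arange(dim) < n_max).astype(float)
    projector = np.diag(np.kron(below_cutoff, below_cutoff))
    return s0, sx, sy, sz, projector


def commutator(a, b):
    return a @ b - b @ a


if __name__ == "__main__":
    s0, sx, sy, sz, p = stokes_operators()
    print(np.allclose(commutator(sx, sy) @ p, 1j * sz @ p))
```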

In the case where a full quantum mechanical description is used, this choice of basis will change the nature of the information gained from the measurement which is then used to update the conditional collective quantum state describing the atoms. In other words, the choice of basis will lead to a different unravelling of the conditional dynamics.


B.  Scattering  Hamiltonian 

We now introduce the polarizability Hamiltonian that determines the joint evolution of the single-atom spin and the polarization of the traveling-wave optical mode. Subsequently, we summarize the results from Appendix A where we derive a more convenient and intuitive way of representing the irreducible components of the Hamiltonian in terms of atomic spin operators instead of dipole operators.

For a field which is off-resonant to the transition of interest, the usual dipole Hamiltonian can be approximated and recast into a polarizability form. This can be derived, for example, by using adiabatic elimination under the assumption that the off-resonant field only weakly populates the excited states. The polarizability Hamiltonian [17, 30, 36] is then expressed as

$$\hat{H} = \sum_{f,f'} \hat{\mathbf{E}}^{(-)} \cdot \frac{P_f\, \hat{\mathbf{d}}\, P_{f'}\, \hat{\mathbf{d}}^{\dagger} P_f}{\hbar \Delta_{f,f'}} \cdot \hat{\mathbf{E}}^{(+)} \qquad (6)$$

where we omit indices identifying the particular atom and spatial optical mode being considered. This definition consists of several terms which are also defined in Appendix A. The negative and positive frequency probe field operators, $\hat{\mathbf{E}}^{(-)}$ and $\hat{\mathbf{E}}^{(+)}$, describe the creation and annihilation of photons in the contributing probe modes. The atomic operators $\hat{\mathbf{d}}^{\dagger}$ and $\hat{\mathbf{d}}$ are the vector dipole raising and lowering operators. The ground and excited state angular momentum numbers are given by $f$ and $f'$, respectively. The probe detuning, $\Delta_{f,f'} = \omega - \omega_{f,f'}$, is defined as the difference between the probe frequency $\omega$ and a particular atomic resonance frequency. For the purposes of this paper we consider all of the population to remain in one ground state manifold ($f = 4$ for Cesium, ignoring $f = 3$) and sum only over the excited states ($f' = 2, 3, 4, 5$). The operators $P_f$ and $P_{f'}$ are projectors onto the ground and excited states, respectively.

This Hamiltonian has a satisfying physical interpretation as a scattering interaction: the atom is first brought from its ground state to a virtual excited state via the raising operator, $\hat{\mathbf{d}}^{\dagger}$, by annihilating a photon from the probe field through $\hat{\mathbf{E}}^{(+)}$. Then, the temporarily excited atom returns to a (potentially different) ground state by emitting a photon into a (potentially different) scattered probe mode via $\hat{\mathbf{d}}$ and $\hat{\mathbf{E}}^{(-)}$.

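The central operator in the scattering Hamiltonian,

$$\hat{\boldsymbol{\alpha}}_{f,f'} = P_f\, \hat{\mathbf{d}}\, P_{f'}\, \hat{\mathbf{d}}^{\dagger} P_f , \qquad (7)$$

commonly called the atomic polarizability tensor, is a dyad involving vector operators [29]. Thus $\hat{\boldsymbol{\alpha}}_{f,f'}$ is a rank-2 spherical tensor that can be decomposed into irreducible components,

$$\hat{\boldsymbol{\alpha}}_{f,f'} = \hat{\boldsymbol{\alpha}}^{(0)}_{f,f'} + \hat{\boldsymbol{\alpha}}^{(1)}_{f,f'} + \hat{\boldsymbol{\alpha}}^{(2)}_{f,f'} . \qquad (8)$$

The scattering Hamiltonian similarly decomposes into irreducible spherical tensor operators,

$$\hat{H} = \hat{H}^{(0)} + \hat{H}^{(1)} + \hat{H}^{(2)} \qquad (9)$$

where

$$\hat{H}^{(j)} = \sum_{f'} \hat{\mathbf{E}}^{(-)} \cdot \frac{\hat{\boldsymbol{\alpha}}^{(j)}_{f,f'}}{\hbar \Delta_{f,f'}} \cdot \hat{\mathbf{E}}^{(+)} . \qquad (10)$$

Here $\hat{H}^{(0)}$ is a scalar contribution, $\hat{H}^{(1)}$ transforms as a vector, and $\hat{H}^{(2)}$ transforms as a rank-2 symmetric tensor in the group representation theory of SO(3). Were the atomic system composed of spin-1/2 particles, it would be possible to neglect the rank-2 Hamiltonian [30] (as will become explicit); however, we cannot do so for higher-spin Alkali atoms [26, 27, 37].

The full Hamiltonian for the collective atomic spin resulting from $N$ atoms is obtained by taking the symmetric sum of these single particle operators.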


C.  Hamiltonian  Decomposition 

Now we recast the single atom Hamiltonian of Eqs. (9, 10) into irreducible terms involving only atomic spin operators $\hat{f}_i$ and probe polarization operators $\hat{S}_i$, then discuss each component in physical terms. The derivation of these expressions is sketched in Appendix A.


1.  The  Scalar  Hamiltonian 


The scalar scattering Hamiltonian, $\hat{H}^{(0)}$, can be represented as a product of operators on the separate atomic and probe field Hilbert spaces. This is accomplished by combining the expressions for the field mode operators, Eqs. (3) and (4), with the rank-0 irreducible component of the atomic polarizability tensor. Evaluating this Hamiltonian using the form of the rank-0 atomic polarizability derived in Appendix A leads to the scalar scattering Hamiltonian,

$$\hat{H}^{(0)} = g \sum_{f'} \frac{\alpha^{(0)}_{f,f'}}{\Delta_{f,f'}}\, \frac{2}{3}\, \hat{S}_0\, \hat{1}_f \qquad (11)$$

where the constants $\alpha^{(0)}_{f,f'}$, defined in equation (A28) of the appendix, are related to the transition dipole matrix elements for the atomic hyperfine transitions.

This rank-0 Hamiltonian couples the atomic identity operator $\hat{1}_f$ to the field mode number operator $\hat{S}_0$ and can be interpreted as an atomic state-independent light shift. It therefore affects both polarization modes of the probe field in an equivalent manner and will not influence the measurement process since it does not provide any state-dependent information. However, this Hamiltonian would be important if the measurement was meant to distinguish between populations across hyperfine states (e.g. $f = 3$ and $f = 4$ using homodyne detection) instead of across the sub-level populations within one hyperfine state (using polarimetry, as discussed here). This term is also of importance if the Hamiltonian is being considered as a spatially dependent potential for the atoms (e.g., in an optical lattice).


2.  The  Vector  Hamiltonian 


The vector contribution to the atom-probe scattering Hamiltonian, $\hat{H}^{(1)}$, can be evaluated in a similar manner using expressions for the rank-1 polarizability derived in the appendix,

$$\hat{H}^{(1)} = g \sum_{f'} \frac{\alpha^{(1)}_{f,f'}}{\Delta_{f,f'}}\, \hat{S}_z \hat{f}_z . \qquad (12)$$

Here, the vector polarizability constant, $\alpha^{(1)}_{f,f'}$, is given by Eq. (A29), and $\hat{f}_z$ is the z-component of the (single-particle) atomic spin angular momentum.

The rank-1 Hamiltonian can be interpreted as causing a differential phase shift on the two circular polarization modes by an amount that is proportional to the z-component of the atomic angular momentum. Thus the vector Hamiltonian leads to optical activity in the atomic sample and produces the familiar Faraday rotation effect often used to address continuous measurement of collective spin [18, 20, 22-24].


3.  The  Tensor  Hamiltonian 


Finally, the tensor Hamiltonian, $\hat{H}^{(2)}$, can be evaluated using expressions for the rank-2 polarizability derived in the appendix to give

$$\hat{H}^{(2)} = g \sum_{f'} \frac{\alpha^{(2)}_{f,f'}}{\Delta_{f,f'}} \left( \hat{S}_x \left( \hat{f}_x^2 - \hat{f}_y^2 \right) + \hat{S}_y \left( \hat{f}_x \hat{f}_y + \hat{f}_y \hat{f}_x \right) + \hat{S}_0 \left( 3 \hat{f}_z^2 - f(f+1) \hat{1}_f \right) / 3 \right) . \qquad (13)$$

Here, the tensor polarizability constant, $\alpha^{(2)}_{f,f'}$, is given by Eq. (A30).

The rank-2 Hamiltonian couples spin coordinates to the elliptical components of the probe laser field and produces a second-order light shift proportional to the atomic quadrupole moment. These terms vanish for $f = 1/2$ (as can be seen by evaluating the operators within the parentheses above) but are non-zero for any higher spin number. For a linearly polarized input beam, the tensor term leads to an elliptically polarized scattered probe field [26, 29]. The rank-2 interaction potentially limits the validity of any analysis of a continuous measurement of collective atomic spin in Alkali atoms based on the qualitative behavior of spin-1/2 particles.
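The claim that the rank-2 spin operators vanish for $f = 1/2$ but not for higher spin can be checked by direct matrix evaluation. The following sketch is illustrative only: the helper names are hypothetical and units are chosen with $\hbar = 1$.

```python
import numpy as np


def spin_matrices(f):
    """Return (fx, fy, fz) for spin f in the basis m = f, f-1, ..., -f (hbar = 1)."""
    dim = int(round(2 * f)) + 1
    m = f - np.arange(dim)
    fz = np.diag(m).astype(complex)
    raising = np.diag(np.sqrt(f * (f + 1) - m[1:] * (m[1:] + 1)), k=1).astype(complex)
    fx = (raising + raising.conj().T) / 2
    fy = (raising - raising.conj().T) / 2j
    return fx, fy, fz


def tensor_operators(f):
    """The three atomic operators that multiply S_x, S_y and S_0 in the tensor Hamiltonian."""
    fx, fy, fz = spin_matrices(f)
    identity = np.eye(fz.shape[0])
    return (fx @ fx - fy @ fy,
            fx @ fy + fy @ fx,
            3 * fz @ fz - f * (f + 1) * identity)


def tensor_norms(f):
    """Frobenius norm of each tensor operator; all zero means the tensor term drops out."""
    return [float(np.linalg.norm(op)) for op in tensor_operators(f)]


if __name__ == "__main__":
    for f in (0.5, 1.0, 4.0):
        print(f, tensor_norms(f))
```

The norms are zero (to rounding) for $f = 1/2$ and finite for $f = 1$ and $f = 4$, the Cesium ground-state manifold considered here.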


D.  Semiclassical  Evolution  of  Probe  State 

We can greatly simplify the dynamics by eliminating atomic evolution due to the probe beam and only considering the evolution of the probe beam due to the atomic state. Under this semiclassical approximation, we replace all atomic operators with their expectation values with respect to an assumed fixed spin state. (This is the opposite of the semiclassical situation often considered in atom-light interactions where the atomic system is considered quantum mechanically while the optical beam is made classical.) For a large ensemble of atoms and small interaction times, fixing the atomic state will accurately reproduce the mean behavior of the measured photocurrent corresponding to one of the Stokes vector components. This is confirmed experimentally in the next section, where the atomic state is fixed and adiabatically positioned with a magnetic holding field. The holding field serves to both position the atomic state and protect it from the influence of the probe light, such that the analysis of this section remains valid even for long interaction times or large optical depth clouds. Ultimately, however, probe induced decoherence will dominate all interactions. In the final section, we then reconsider the full analysis including the atomic quantum noise (related to spin squeezing) for a particular alignment of the collective spin state.

FIG. 3: Definition of the spherical coordinate angles used to describe the orientation of the collective atomic magnetization vector, $\mathbf{F}$, relative to the fixed laboratory cartesian coordinate system. The polarization vector of the input probe light resides in the xy-plane and forms an angle, $\theta_p$, with respect to the laboratory x-axis.

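We approximate the $N$-atom Hamiltonian, $\hat{H}_N$, by replacing the single-atom operators with their expectation value taken with respect to an optically pumped spin pointing in the direction given by $\theta$ and $\varphi$ in spherical coordinates (Fig. 3). In other words, for an individual atom operator $\hat{O}_f$, we take

$$\hat{O}_f \rightarrow \langle \hat{O}_f \rangle = \langle \Psi(\theta, \varphi) | \hat{O}_f | \Psi(\theta, \varphi) \rangle \qquad (14)$$

where $|\Psi(\theta, \varphi)\rangle = \exp[-i \hat{f}_z \varphi] \exp[-i \hat{f}_y \theta]\, |f, f\rangle_z$.

The relevant operators from the Hamiltonian decomposition are given by

$$\langle \hat{f}_z \rangle = f \cos\theta \qquad (15)$$

$$\langle \hat{f}_x^2 - \hat{f}_y^2 \rangle = f(f - 1/2) \sin^2\theta \cos 2\varphi \qquad (16)$$

$$\langle \hat{f}_x \hat{f}_y + \hat{f}_y \hat{f}_x \rangle = f(f - 1/2) \sin^2\theta \sin 2\varphi \qquad (17)$$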
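The expectation values in Eqs. (15)-(17) can be reproduced numerically by rotating the stretched state $|f, f\rangle_z$. This is a minimal sketch with hypothetical helper names, $\hbar = 1$, and arbitrarily chosen test angles; the rotation operators are built from eigendecompositions rather than a library matrix exponential.

```python
import numpy as np


def spin_matrices(f):
    """Return (fx, fy, fz) for spin f in the basis m = f, f-1, ..., -f (hbar = 1)."""
    dim = int(round(2 * f)) + 1
    m = f - np.arange(dim)
    fz = np.diag(m).astype(complex)
    raising = np.diag(np.sqrt(f * (f + 1) - m[1:] * (m[1:] + 1)), k=1).astype(complex)
    fx = (raising + raising.conj().T) / 2
    fy = (raising - raising.conj().T) / 2j
    return fx, fy, fz


def rotation(generator, angle):
    """exp(-i * angle * generator) for a Hermitian generator, via its eigendecomposition."""
    evals, evecs = np.linalg.eigh(generator)
    return (evecs * np.exp(-1j * angle * evals)) @ evecs.conj().T


def coherent_state(f, theta, phi):
    """|Psi(theta, phi)> = exp(-i fz phi) exp(-i fy theta) |f, f>."""
    fx, fy, fz = spin_matrices(f)
    stretched = np.zeros(fz.shape[0], dtype=complex)
    stretched[0] = 1.0  # |f, m = f>
    return rotation(fz, phi) @ rotation(fy, theta) @ stretched


def expectation(op, psi):
    return float(np.real(np.vdot(psi, op @ psi)))


if __name__ == "__main__":
    f, theta, phi = 4, 0.7, 0.3
    fx, fy, fz = spin_matrices(f)
    psi = coherent_state(f, theta, phi)
    print(expectation(fz, psi), f * np.cos(theta))
    print(expectation(fx @ fx - fy @ fy, psi),
          f * (f - 0.5) * np.sin(theta) ** 2 * np.cos(2 * phi))
```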

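Within the semiclassical approximation, we obtain an effective
scattering interaction Hamiltonian that only involves
operators on the probe field Hilbert space. Ignoring
all terms proportional to $\hat{S}_0$ (because it commutes
with each term of the semiclassical Hamiltonian) we have

$$\hat{H} = \hat{H}^{(1)} + \hat{H}^{(2)} = \frac{\hbar}{\delta t}\left(\gamma_x\hat{S}_x + \gamma_y\hat{S}_y + \gamma_z\hat{S}_z\right) \qquad (18)$$

which leads to a rotation of the Stokes vector $\mathbf{S}$ about a
vector $\boldsymbol{\gamma} = [\gamma_x, \gamma_y, \gamma_z]$ according to the evolution operator

$$\hat{U}_{\delta t} = \exp\left[-\frac{i}{\hbar}\hat{H}\,\delta t\right] = \exp\left[-i\left(\gamma_x\hat{S}_x + \gamma_y\hat{S}_y + \gamma_z\hat{S}_z\right)\right] \qquad (19)$$

where $\delta t = L/c$ is the interaction (transit) time of the discrete
spatial modes of the probe beam across the atomic
cloud of length L. The rotation vector $\boldsymbol{\gamma}$ is defined by

$$\gamma_x = \gamma_0\, f(f-1/2)\sin^2\theta\,\cos 2\varphi \sum_{f'} \frac{\alpha^{(2)}_{f,f'}}{\alpha_0\,\Delta_{f,f'}} \qquad (20)$$

$$\gamma_y = \gamma_0\, f(f-1/2)\sin^2\theta\,\sin 2\varphi \sum_{f'} \frac{\alpha^{(2)}_{f,f'}}{\alpha_0\,\Delta_{f,f'}} \qquad (21)$$

$$\gamma_z = \gamma_0\, f\cos\theta \sum_{f'} \frac{\alpha^{(1)}_{f,f'}}{\alpha_0\,\Delta_{f,f'}} \qquad (22)$$

Here we have normalized by the state-independent polarizability
constant (see Appendix A)

$$\alpha_0 = \frac{3\epsilon_0\hbar\Gamma\lambda_0^3}{8\pi^2} = |\langle j'\|\hat{d}\|j\rangle|^2\,\frac{(2j'+1)}{(2j+1)} \qquad (23)$$

such that $\alpha^{(j)}_{f,f'}/\alpha_0$ is dimensionless. The rotation
strength is represented by

$$\gamma_0 = \frac{N g\,\delta t\,\alpha_0}{\hbar}$$

where we have used the field coefficient $g = \omega_0/(2\epsilon_0 V)$,
the atomic resonance frequency $\omega_0$, and the interaction
volume (the volume of the atomic sample) $V = AL$.

From an experimental standpoint, it is useful to note
that $\gamma_0$ is directly related to the on-resonance optical
depth OD of the atomic sample and the decay rate $\Gamma$
via

$$\gamma_0 = \left(\frac{\Gamma}{4}\right)\mathrm{OD} \qquad (24)$$

where

$$\mathrm{OD} = N\frac{\sigma_0}{A} = N\,\frac{3\lambda_0^2/2\pi}{\pi r^2}. \qquad (25)$$

The quantity $\sigma_0$ is the resonant atomic scattering cross
section and $A = \pi r^2$ is the cross-sectional area of the
atomic sample.

In Appendix B, the equations for a general rotation of
$\mathbf{S}$ about $\boldsymbol{\gamma}$ are given. Here we specialize to the case where
the input beam is linearly polarized in the x-direction
such that $\langle\hat{S}_y\rangle = \langle\hat{S}_z\rangle = 0$. The output expectation
values are then given by

$$\langle\hat{S}'_x\rangle = \langle\hat{S}_x\rangle\left(\cos\gamma + \frac{\gamma_x^2}{\gamma^2}(1-\cos\gamma)\right)$$

$$\langle\hat{S}'_y\rangle = \langle\hat{S}_x\rangle\left(-\frac{\gamma_z}{\gamma}\sin\gamma + \frac{\gamma_x\gamma_y}{\gamma^2}(1-\cos\gamma)\right) \qquad (26)$$

$$\langle\hat{S}'_z\rangle = \langle\hat{S}_x\rangle\left(\frac{\gamma_y}{\gamma}\sin\gamma + \frac{\gamma_z\gamma_x}{\gamma^2}(1-\cos\gamma)\right)$$

Taking the total rotation angle small ($\gamma \ll 1$) this becomes
(to second order in $\gamma$)

$$\langle\hat{S}'_x\rangle \approx \langle\hat{S}_x\rangle\left(1 - \frac{\gamma_y^2+\gamma_z^2}{2}\right) \qquad (27)$$

$$\langle\hat{S}'_y\rangle \approx \langle\hat{S}_x\rangle\left(-\gamma_z + \frac{\gamma_x\gamma_y}{2}\right) \qquad (28)$$

$$\langle\hat{S}'_z\rangle \approx \langle\hat{S}_x\rangle\left(\gamma_y + \frac{\gamma_z\gamma_x}{2}\right) \qquad (29)$$

In this semiclassical approximation, we have completely
neglected any evolution of the atomic state due
to the probe beam. We demonstrate in the next section
that the above model agrees well with experimental data
when the spin state is fixed with a magnetic holding field.


III. EXPERIMENTAL RESULTS

In this section, we show that the model described above
is consistent with representative data collected from our
experiment with laser cooled Cs atoms and balanced polarimetric
detection of a forward-scattered, off-resonant
probe laser field.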

A.  Experimental  Apparatus 

Figure 4 provides a schematic of the major components
of the experimental apparatus. Our single-particle
alkali atom spin system is the $6^2S_{1/2}(f=4)$ ground state
hyperfine manifold in $^{133}$Cs with $4\hbar$ of intrinsic angular
momentum due to a combination of the $i = 7/2$ nuclear
spin and the $s = 1/2$ spin of an unpaired 6s valence electron.
We obtain cold atom samples from a $10^{-9}$ Torr
background Cs vapor using standard laser cooling and
trapping techniques by collecting more than $10^9$ atoms
in a magneto-optic trap (MOT). Trapping beams are derived
from a 150 mW injection-locked diode laser tuned
(11-15) MHz red of the Cs $6^2S_{1/2}(f=4) \rightarrow 6^2P_{3/2}(f'=5)$
cycling transition. Each 35 mW trapping beam has
an approximately constant intensity profile and a 2.5
cm diameter. A 10 mW repump laser tuned to the
$6^2S_{1/2}(f=3) \rightarrow 6^2P_{3/2}(f'=4)$ transition is used to prevent
atomic population from decaying out of the trapping cycling
transition.

Following the atom collection phase, the sample is sub-Doppler
cooled [38] to a temperature of $T \sim 10\ \mu$K and
the initial x-polarized spin state is prepared with a circularly
polarized 100 $\mu$W optical pumping beam (pulsed
for 2-4 ms) propagating along the x-axis and tuned to
the $(f=4) \rightarrow (f'=4)$ hyperfine transition. A 100 mG magnetic
holding field is applied along the laboratory x-axis
to define the optical pumping direction.


FIG. 4: Schematic of our experimental apparatus in which collective spin angular momentum of a cloud of laser cooled Cs
atoms is measured by polarimetric detection of a scattered off-resonant probe laser. Ambient magnetic field fluctuations are
suppressed by magnetic shielding and can be monitored with a fluxgate magnetometer (FG) situated near the atomic sample.
Components not shown include the optical pumping laser (aligned along the laboratory x-axis) and external trim coils used to
zero ambient magnetic fields and their first order gradients.

Continuous measurement of the polarized atomic ensemble
is implemented with a nearly quantum shotnoise-limited
probe laser that can be detuned from the
$6^2S_{1/2}(f=4) \rightarrow 6^2P_{3/2}(f'=5)$ Cs transition over a range
$\Delta = \pm 1.4$ GHz. The probe beam is linearly polarized by
a high extinction Glan-Thompson prism prior to passing
through the cold atom cloud, and the orientation of
the linear polarization vector with respect to the laboratory
coordinate system may be rotated via an input half-waveplate.
The scattered probe field is detected with a
polarimeter constructed from a Glan-Thompson polarizing
beam splitter and a DC-balanced photodetector with
>1 MHz measurement bandwidth.

A computer controls the experiment timing and
records the polarimeter output as well as diagnostic information
including background magnetic field fluctuations
(measured with a fluxgate magnetometer) and
atom number (measured by fluorescence imaging). The
computer enables/disables the measurement by controlling
a shutter on the probe laser, constructed from a
switched acousto-optic modulator, with 100 ns resolution.
Magnetic fields with magnitudes up to $\sim 0.5$ G can
be applied in arbitrary (time-dependent) directions by
driving 3 pairs of computer-controlled Helmholtz coils,
oriented along the laboratory x-, y-, and z-axes, with a
bandwidth of $\sim 1$ MHz.

Background magnetic field fluctuations are suppressed
through a combination of passive $\mu$-metal shielding and
field cancellation via external trim coils. Each atom
preparation (trapping, cooling and optical pumping) and
measurement cycle is synchronized with respect to the
60-Hz building power lines to suppress the effects of induced
magnetic fields. Slow magnetic drifts due to natural
and anthropogenic sources are cancelled by adjusting the
external trim coils based on the output of the fluxgate
magnetometer.


B.  Verification  of  the  Probe  Scattering  Model 

Our model of the scattered probe polarization as a
function of the orientation of the atomic magnetization
vector was compared against experiment by observing
the polarimeter photocurrent as the orientation of
the atomic polarization was varied according to different
specified paths in the laboratory coordinate system.
This was accomplished as follows. An x-polarized cold
atom sample was prepared according to the description
above and an x-axis magnetic holding field of 100 mG
was applied. At this point, the probe shutter was opened
and the balanced polarimeter photocurrent was monitored
while the orientation of the magnetic holding field
was varied according to the specified path. The rate of
change of the holding field orientation was chosen to be
slow (ms) compared to the atomic Larmor precession frequency
(hundreds of kHz) such that the atomic magnetization
vector adiabatically followed the path traced by
the holding field. Furthermore, the holding field was large
enough to dominate the probe light induced dynamics at
short times, but not so strong as to shift the levels significantly
compared to the detuning.

With a strong enough holding field, the spin state (and
hence the semiclassical rotation vector $\boldsymbol{\gamma}$) will be fixed
across the spatially extended cloud. Because rotations of
the Stokes vector about the same vector commute,
the semiclassical analysis of the previous section will be
valid even for large optical depth samples where the total
optical polarization rotation is significant.

This process was performed for two different adiabatic
paths on the atomic Bloch sphere:

• xz-Plane Rotation: the atomic magnetization
follows a path beginning along the x-axis and rotates
around the y-axis: $\theta = \pi/2 \rightarrow -\pi/2$ with
fixed $\varphi = 0$.

• xy-Plane Rotation: the atomic magnetization
follows a path beginning along the x-axis and rotates
around the z-axis: $\varphi = 0 \rightarrow \pi$ with fixed
$\theta = \pi/2$.


FIG. 5: Comparison of our model of continuous measurement with photocurrents obtained from the experiment with
$N = 1 \times 10^9$ Cs atoms in an $r = 4$ mm spherical trap and a $P = 10\ \mu$W probe field blue-detuned from the $(f=4) \rightarrow (f'=5)$ D2
hyperfine transition ($\lambda_0 = 852$ nm). Each trajectory is averaged 10 times. (A) For an input probe beam with x-polarization
and a detuning of 150 MHz, $S_y$ and $S_z$ were measured for both the xz and xy trajectories (described in the text) resulting in the
solid curves. All trajectory times are $\tau = 2$ ms, during which we observe some atomic decoherence which causes the prediction
(dotted curves) to stray from the data. (B) As a function of probe detuning, we plot the peak of the $S_y$ measurement (for the
xz trajectory) which depends only on rank-1 terms and the peak of the $S_z$ measurement (for the xy trajectory) which depends
only on rank-2 terms. The predicted behavior (solid curves) shows good agreement with the data out to large detunings where
the curves asymptote to the $1/\Delta$ and $1/\Delta^2$ lines provided to guide the eye.

We chose these two trajectories because they highlight
the different contributions from the rank-1 and rank-2
scattering interactions, as seen from Eqs. (20, 21, 22).
The xz-plane trajectory, where $\varphi = 0$, virtually eliminates
the rank-2 tensor contribution to the photocurrent,
leaving nearly ideal Faraday rotation. Conversely,
the xy-plane rotation eliminates rank-1 contributions and
produces elliptical scattered probe polarizations.


1.  Measuring  the  Scattering  Probe  Stokes  Vector 

Fig. 5A compares the measured polarimeter photocurrents
(solid curves) for these two adiabatic trajectories
with those predicted by our atom-field scattering model
(dotted curves). The input state was polarized in the x
direction and because the total polarization rotation angle
induced by the atoms, $\gamma$, was small, we measured only
the other two components $S_y$ and $S_z$ with the appropriate
arrangement of waveplates prior to the polarimeter.
For measuring $S_y$ a single half-waveplate is placed prior
to the polarizing beamsplitter (PBS) to rotate the polarization
by 45 degrees, and for $S_z$ a quarter-waveplate is
used to circularize the initial linearly polarized light.


We now refer to Eq. (28) and Eq. (29) to explain the
observed trajectories. For the xz trajectory, we have
$\gamma_y = 0$ such that $S_y$ contains a large linear term in $\gamma_z$
but $S_z$ only contains terms quadratic in $\gamma$. Thus, for this
path, the measurement of $S_y$ leads to the top curve in
Fig. 5A which is proportional to the rank-1 polarizability,
while the measurement of $S_z$ is much smaller and
effectively zero. For the xy trajectory, we have $\gamma_z = 0$
such that $S_z$ contains a large linear term in $\gamma_y$ but $S_y$ only
contains terms quadratic in $\gamma$. Thus, for this path, the
measurement of $S_z$ leads to the middle curve in Fig. 5A
which is proportional to the rank-2 polarizability, while
the measurement of $S_y$ is much smaller and effectively
zero. The doubling of frequency between the two dominant
curves is a direct consequence of the tensor nature
of the rank-2 term.
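The behavior described for the two trajectories can be illustrated with a short numerical sketch. It is an illustration under stated assumptions, not the analysis code used for Fig. 5: the angular dependence of the rotation vector follows Eqs. (20)-(22), but the lumped prefactors `c1` (rank-1) and `c2` (rank-2) are hypothetical small numbers standing in for $\gamma_0$ times the polarizability sums, and the sign convention of the sine terms (minus for $S_y$, plus for $S_z$) is an assumption about Eq. (26).

```python
import math

def rotation_vector(theta, phi, c1, c2):
    """Angular dependence of (gamma_x, gamma_y, gamma_z) per Eqs. (20)-(22).
    c1 and c2 are hypothetical lumped rank-1 and rank-2 prefactors."""
    s2 = math.sin(theta) ** 2
    return (c2 * s2 * math.cos(2 * phi), c2 * s2 * math.sin(2 * phi), c1 * math.cos(theta))

def rotate_x_polarized(gx, gy, gz, sx=1.0):
    """Output Stokes components for an x-polarized input rotated about gamma.
    The sign convention of the sine terms is an assumption."""
    g = math.sqrt(gx * gx + gy * gy + gz * gz)
    if g == 0.0:
        return (sx, 0.0, 0.0)
    sin_g, versine = math.sin(g), 1.0 - math.cos(g)
    sx_out = sx * (math.cos(g) + gx * gx / g ** 2 * versine)
    sy_out = sx * (-gz / g * sin_g + gx * gy / g ** 2 * versine)
    sz_out = sx * (gy / g * sin_g + gz * gx / g ** 2 * versine)
    return (sx_out, sy_out, sz_out)

if __name__ == "__main__":
    c1, c2 = 0.05, 0.01  # hypothetical values, chosen small so that gamma << 1
    steps = 8
    print("xz trajectory (phi = 0): theta, Sy, Sz")
    for k in range(steps + 1):
        theta = math.pi / 2 - math.pi * k / steps
        _, sy, sz = rotate_x_polarized(*rotation_vector(theta, 0.0, c1, c2))
        print(f"{theta:+.3f} {sy:+.5f} {sz:+.6f}")
    print("xy trajectory (theta = pi/2): phi, Sy, Sz")
    for k in range(steps + 1):
        phi = math.pi * k / steps
        _, sy, sz = rotate_x_polarized(*rotation_vector(math.pi / 2, phi, c1, c2))
        print(f"{phi:+.3f} {sy:+.6f} {sz:+.5f}")
```

In the first table $S_y$ follows $\cos\theta$ (one sign change over the sweep) while $S_z$ stays second order in the prefactors; in the second, $S_z$ follows $\sin 2\varphi$ and completes a full period over the same sweep, which is the frequency doubling noted above.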

Note that there is some structure expected in the two
curves (quadratic in $\gamma$) which are approximately zero,
but these are more polluted by technical noise and do
not reveal any essentially new information about the interaction.
For the larger curves (linear in $\gamma$), deviations
of the measured photocurrents relative to the predicted
values seen in Fig. 5A result mainly from the fact that
the model does not consider the probe-induced damping.

The predicted curves use values for the atom number,
trap volume, probe power and detuning consistent with
independent characterizations of those parameters. The
atom number and trap volume were obtained from fluorescence
detection of the MOT and a CCD image of the
atom cloud, and the resulting values, $N = 1 \times 10^9$ and
$r = 4$ mm, correspond to an optical depth, OD $\approx 7$,
which is consistent with absorption measurements that
we performed. Given our uncertainty in measuring the
number of atoms, it can be inferred that our optical
pumping efficiency in these (relatively) optically thin
atomic samples is no less than 85% (but is more likely
>90%) [16].

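The quoted optical depth can be reproduced with a few lines of arithmetic. The sketch below assumes the standard resonant cross section $\sigma_0 = 3\lambda_0^2/2\pi$ and the cross-sectional area $A = \pi r^2$; the function name and the use of SI units are choices made here for illustration.

```python
import math

def optical_depth(n_atoms, radius_m, wavelength_m):
    """On-resonance optical depth OD = N * sigma_0 / A.
    Assumes sigma_0 = 3 * lambda^2 / (2 * pi) and A = pi * r^2."""
    sigma_0 = 3.0 * wavelength_m ** 2 / (2.0 * math.pi)
    area = math.pi * radius_m ** 2
    return n_atoms * sigma_0 / area

if __name__ == "__main__":
    # Values quoted in the text: N = 1e9 atoms, r = 4 mm, lambda_0 = 852 nm.
    od = optical_depth(1e9, 4e-3, 852e-9)
    print(f"OD = {od:.2f}")
```

With the quoted parameters this evaluates to roughly 6.9, in line with the stated OD of about 7.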

2.  Relative  Scaling  of  the  Scattering  Terms  with  Probe 
Detuning 

As further verification of our scattering model, we investigated
the scaling of the rank-1 and rank-2 contributions
to the polarimeter photocurrent as a function of
the probe detuning. With reference to Fig. 5A, the
magnitudes of the vector and tensor scattering interactions
were measured from the peak amplitude of the $S_y$
measurement (for the xz-plane rotation) and the amplitude
of the $S_z$ measurement (for the xy-plane rotation)
respectively. Fig. 5B compares these measured signal
amplitudes (stars) with those predicted by our scattering
model (solid curves) for detunings (with respect to
the $(f=4) \rightarrow (f'=5)$ hyperfine transition) ranging from 150
MHz to 1.05 GHz.

The fact that multiple excited state hyperfine levels
participate in the scattering interaction is evident from
scalings which are not constant in $\Delta^{-1}$ or $\Delta^{-2}$. As supported
by our full model of the scattering interaction, we
observe no qualitative difference in the continuous measurement
for probe detunings smaller than the hyperfine
splittings. This suggests that conditional spin-squeezing
experiments can be performed with small detunings provided
that the probe intensity is weak enough that the
small decoherence requirement is satisfied.


IV.  SPIN-SQUEEZING  WITH  MULTILEVEL 
ATOMS 

Until this point we have considered only the semiclassical
evolution of the optical probe beam due to an ensemble
of atoms with a fixed atomic spin state. Now
we consider a different experimental scenario appropriate
for preparing conditional spin-squeezed states of the
atomic ensemble. As opposed to the previous situation,
we remove the adiabatic holding field, which would make
spin-squeezing impossible because it causes undesired mixing of
the squeezed and anti-squeezed components perpendicular
to the mean spin. Although the holding field may
serve to validate the previous semiclassical analysis for
longer times by eliminating the probe-induced evolution
of the atomic state, this analysis is still valid for small
times and weak interactions without a holding field. Thus
our goal is to derive the small time signal-to-noise ratio
by taking the signal strength from the previous section
and comparing this to the optical shotnoise. We then use
the signal-to-noise ratio to predict the rate of squeezing
in a typical experimental configuration where the tensor
terms can be ignored.

Considering only the relatively low optical density
limit, the measurement of $S_y$ will result in Eq. (28). Now
we wish to re-write this equation in the form of Eq. (2)
including measurement noise. It is readily shown that
all terms not linear in $F_z$ vanish in Eq. (28) provided
that $\theta = \pi/2$ and $\varphi = 0$. That is, a pure Faraday rotation
Hamiltonian is recovered when the atomic magnetization
vector is oriented along the x-axis. However,
rotating $\mathbf{F}$ in the xy-plane results in elliptically polarized
scattered probe light, and moving out of this plane
results in nonlinear atomic dephasing due to scattering
terms which are quadratic in the single-particle spin operators,
$\hat{f}_z$. These adverse effects are avoided for the
experimental geometry where $\mathbf{F}$ is collinear with the x-axis.
Fortunately, spin-squeezing experiments are easily
operated under such conditions [7].

Taking the input probe field to be in an x-polarized
optical coherent state, and considering the small $\gamma$ limit,
Equation (28) leads to a semiclassical photocurrent (with
units of optical power) of the form,

$$y_t = \eta\sqrt{S}\,F_z + \sqrt{\eta}\,\zeta_t, \qquad (30)$$

where we have made the substitution, $\hbar N f\cos\theta \rightarrow F_z$
(refer to Eq. (15)), and included the photodetector quantum
efficiency, $\eta$. Note that we have introduced $\zeta_t$, which
represents optical shotnoise. We have also introduced a
constant, $S$, the scattering strength,


that depends upon the probe intensity, $I_p = P/A$, determined
by the coherent state amplitude, $P = 2\hbar\omega|\beta|^2$, and
cross-sectional area, $A = \pi r^2$ (for a mode-matched probe
laser). It is useful to note that the scattering strength
has units of W$^2/\hbar^2$ (power squared per $\hbar^2$) and characterizes
the degree of coupling between the atoms and the
probe field; $\sqrt{S}$ quantifies the polarimeter optical power
imbalance per unit spin (as $F_z$ has units of $\hbar$).

Our expressions are similar to previous results [21, 24,
39] in that they take the form of a Faraday rotation signal. However,
our specific expressions for $\gamma_x$, $\gamma_y$ and $\gamma_z$ account
for the detailed hyperfine structure of the atomic excited
states, including the fact that the oscillator strengths and
signs of the contributions from different participating excited
states are not equal, and doing so is required for
quantitative agreement between theory and experiment.

To arrive at an expression for the measurement
strength, M, as defined in Eq. (2), we must consider the
variance, $\Delta\zeta^2$, of the white noise increments $\zeta_t$. For an
optical coherent state [40, 41], this noise variance is given
by the familiar optical shotnoise expression,

$$\Delta\zeta^2 = E[\zeta_t^2] = 2\hbar\omega P, \qquad (32)$$

which has units of W$^2$/Hz (power squared per frequency).
Comparing the semiclassical photocurrent of Eq. (30) to
the photocurrent of Eq. (2), the measurement strength is
seen to be given by the ratio


(33) 


where  we  have  defined  the  reciprocal  scattering  time  as 

2 


Ta  = 


(34) 


which  is  essentially  the  rate  that  probe  photons  are  scat¬ 
tered  by  the  atomic  system.  This  expression  is  similar  to 
that  derived  in  Ref.  [18]. 

Now consider a measurement of $F_z$ by Eq. (2). In the
small time limit where probe induced decoherence can
be neglected, the full quantum filter describing this measurement
is equivalent to a classical model in which $F_z$ is
simply a random constant on every trial drawn from a
distribution with variance equal to the quantum variance
$\langle\Delta F_z^2\rangle_0$ [42]. Then the generally complicated
full quantum filter [25] is equivalent to linear regression,
or fitting a constant to the noisy measurement record in
real time. In essence, the optimal filter serves to average
away the optical shotnoise to reveal the underlying value
of $F_z$. Under these statistical assumptions, at small times
the quantum uncertainty is given by

$$\langle\Delta F_z^2\rangle_\tau = \frac{\langle\Delta F_z^2\rangle_0}{1 + \eta\,\langle\Delta F_z^2\rangle_0\,M\tau}. \qquad (35)$$

This can be shown either with the full quantum filter
or by using the equivalent classical model combined with
Bayesian estimation (from which a Kalman filter or linear
regression can be derived).
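The classical equivalence behind Eq. (35) can be made concrete with a minimal sketch. It assumes the classical model described above (a Gaussian-distributed constant observed in white noise) and treats the product $\eta M$ as the information rate per unit time; the function names and the discretization into equal time slices are illustrative choices, not the authors' filter.

```python
def conditional_variance(var0, eta, meas_strength, tau):
    """Closed form of Eq. (35): variance of the estimate after measuring for time tau."""
    return var0 / (1.0 + eta * var0 * meas_strength * tau)

def recursive_variance(var0, eta, meas_strength, tau, steps):
    """Scalar Kalman-style update for a constant signal in white noise.
    Each slice dt adds eta * M * dt to the precision (inverse variance)."""
    dt = tau / steps
    var = var0
    for _ in range(steps):
        var = var / (1.0 + eta * meas_strength * dt * var)
    return var

if __name__ == "__main__":
    # Arbitrary illustrative numbers in consistent units.
    var0, eta, meas_strength, tau = 2.0, 0.8, 3.0, 0.5
    print(conditional_variance(var0, eta, meas_strength, tau))
    print(recursive_variance(var0, eta, meas_strength, tau, steps=1000))
```

Because each update adds a fixed amount to the inverse variance, the recursive result matches the closed form for any number of slices, which is why fitting a constant in real time reproduces the small-time behavior of the full filter.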

These concepts are illustrated by the simulated measurement
trajectory in Fig. 6. The plot begins with the
probe laser turned off, during which all necessary state
preparation of the atomic system such as atom trapping,
cooling and optical pumping into an x-polarized coherent
spin state is performed. Once the probe light is enabled
at $t = 0$, the photocurrent acquires a mean offset,
$\eta\sqrt{S}F_z$, proportional to the spin measurement outcome,
$F_z$, but this mean value is masked by photocurrent noise.
At short times, the signal is overwhelmed by local statistical
fluctuations; however, averaging the photocurrent
suppresses the uncertainty in the mean signal by integrating
away the white noise, illustrated by the dotted
lines in Fig. 6.

If we define the signal to noise ratio as

$$\mathrm{SNR}^2 = \eta\,\langle\Delta F_z^2\rangle_0\,M\tau \qquad (36)$$

we can then express the degree of squeezing (ignoring
decay of $F_x$) as

$$W = \frac{\langle\Delta F_z^2\rangle_\tau}{\langle\Delta F_z^2\rangle_0} = \frac{1}{1 + \mathrm{SNR}^2}. \qquad (37)$$


FIG. 6: Simulated photocurrent ($\eta = 1$) for a continuous
measurement of atomic spin angular momentum via balanced
polarimetry. At the onset of the measurement, $t = 0$, the
photocurrent assumes a mean offset proportional to the z-component
of the spin, but this offset is masked by white
noise due, in part, to optical shotnoise on the probe laser.
Filtering the photocurrent gradually reduces the uncertainty
in the photocurrent offset and produces spin-squeezing.


Using $\langle\Delta F_z^2\rangle_0 = \hbar^2 N f/2$, we can express the signal to
noise ratio as

$$\mathrm{SNR}^2 = \eta\,\mathrm{OD}\,\frac{\tau}{4\tau_s}. \qquad (38)$$

To keep this expression valid we must have $\tau \ll \tau_s$, so
our only recourse to creating large amounts of squeezing
in free space is to increase both the quantum efficiency $\eta$
and the optical depth OD as much as possible.
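To get a feel for the numbers, the sketch below evaluates the squeezing parameter of Eq. (37) for a given signal-to-noise ratio. The helper `snr2_free_space` assumes that Eq. (38) has the form $\mathrm{SNR}^2 = \eta\,\mathrm{OD}\,\tau/(4\tau_s)$; that reading, the function names, and the example inputs are assumptions made for illustration.

```python
import math

def squeezing_parameter(snr2):
    """Eq. (37): ratio of conditional to initial variance, W = 1 / (1 + SNR^2)."""
    return 1.0 / (1.0 + snr2)

def snr2_free_space(eta, optical_depth, tau_over_tau_s):
    """Assumed form of Eq. (38): SNR^2 = eta * OD * tau / (4 * tau_s)."""
    return eta * optical_depth * tau_over_tau_s / 4.0

if __name__ == "__main__":
    # Hypothetical inputs: ideal detector, OD = 7, measurement time of 0.1 tau_s.
    w = squeezing_parameter(snr2_free_space(1.0, 7.0, 0.1))
    print(f"W = {w:.3f} ({10 * math.log10(w):.2f} dB)")
```

Since the expression only holds for $\tau \ll \tau_s$, the ratio $\tau/\tau_s$ cannot be used to push $W$ down; under these assumptions only $\eta$ and OD remain as free parameters.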


V.  CONCLUSION 

In this work, we have derived the most useful form
of the polarizability Hamiltonian describing the realistic
measurement of an ensemble of multilevel alkali atoms
with an off-resonant probe beam. We then showed that
this model was consistent with experimental observations
in the semiclassical limit where the atomic state was adiabatically
directed with a strong magnetic field. We found
that an adequate comparison was only possible after including
all relevant hyperfine transitions, including their
relative (non-unit) oscillator strengths, in our model of
the atomic physics.

We then developed a model for describing conditional
spin-squeezing in alkali atoms. Detailed investigation
of the atom-probe scattering physics indicates that it is
possible to eliminate unwanted tensor components of the
atomic polarizability by adopting a suitable atomic and
optical polarization geometry. This includes the elimination
of dephasing due to the quadratic light shift [27]
without sacrificing a fixed laboratory coordinate system
for the measurement. Moreover, we found that conditional
spin-squeezing experiments could be performed at
small optical detunings without a substantial change in
the form of the photocurrent or filtering approach.

Acknowledgments 

The authors would like to thank Poul Jessen, Ivan
Deutsch, Andrew Silberfarb, Dima Budker and especially
Ramon van Handel and Andrew Doherty for numerous
insightful discussions. We would also like to thank Sebastian
de Echaniz, Jacob Sherson, and Eugene Polzik for
pointing out important corrections. This work was supported
by the Caltech MURI Center for Quantum Networks
(DAAD19-00-1-0374). JMG acknowledges support
from the Caltech Center for Physics of Information and
JKS acknowledges support from a Hertz Fellowship.


[1]  V.  Braginski  and  F.  Khalili,  Quantum  Measurements 
(Cambridge  University  Press,  1992). 

[2]  M.  Brune,  S.  Haroche,  V.  Lefevre,  J.  Raimond,  and 
N.  Zagury,  Phys.  Rev.  Lett.  65,  976  (1990). 

[3]  G.  Nogues,  A.  Rauschenbeutel,  S.  Osnaghi,  M.  Brune,
J.  Raimond,  and  S.  Haroche,  Nature  400,  239  (1999).

[4]  M.  Kitagawa  and  M.  Ueda,  Phys.  Rev.  A  47,  5138  (1993). 

[5]  A.  Kuzmich,  L.  Mandel,  and  N.  P.  Bigelow,  Phys.  Rev. 
Lett.  85,  1594  (2000). 

[6]  B.  Julsgaard,  A.  Kozhekin,  and  E.  S.  Polzik,  Nature  413, 
400  (2001). 

[7]  J.  Geremia,  J.  K.  Stockton,  and  H.  Mabuchi,  Science 
304,  270  (2004). 

[8]  A.  Sørensen,  L.-M.  Duan,  J.  Cirac,  and  P.  Zoller,  Nature
409,  63  (2001).

[9]  A.  Sørensen  and  K.  Mølmer,  Phys.  Rev.  Lett.  86,  4431
(2001).

[10]  D.  J.  Wineland,  J.  J.  Bollinger,  W.  M.  Itano,  and  D.  J.
Heinzen,  Phys.  Rev.  A  50,  67  (1994).

[11]  J.  Geremia,  J.  K.  Stockton,  A.  C.  Doherty,  and 
H.  Mabuchi,  Phys.  Rev.  Lett.  91,  250801  (2003). 

[12]  J.  Geremia,  J.  K.  Stockton,  and  H.  Mabuchi,  Phys.  Rev. 
Lett.  94,  203002  (2005). 

[13]  A.  André,  A.  Sørensen,  and  M.  Lukin,  Phys.  Rev.  Lett.
92,  230801  (2004).

[14]  V.  Peterson,  L.  B.  Madsen,  and  K.  Mølmer,  Phys.  Rev.
A  71,  012312  (2005).

[15]  J.  K.  Stockton,  J.  Geremia,  A.  C.  Doherty,  and 
H.  Mabuchi,  Phys.  Rev.  A  67,  022122  (2003). 

[16]  B.  Julsgaard,  J.  Sherson,  J.  Sorensen,  and  E.  Polzik,  J. 
Opt.  B:  Quantum  Semiclass.  Opt.  6,  5  (2004). 

[17]  W.  Happer,  Rev.  Mod.  Phys  44,  169  (1972). 

[18]  G.  A.  Smith,  S.  Chaudhury,  and  P.  S.  Jessen,  J.  Opt.  B: 
Quant.  Semiclass.  Opt.  5,  323  (2003). 

[19]  Y.  Takahashi,  K.  Honda,  N.  Tanaka,  K.  Toyoda,
K.  Ishikawa,  and  T.  Yabuzaki,  Phys.  Rev.  A  60,  4974
(1999).

[20]  A.  Kuzmich,  N.  Bigelow,  and  L.  Mandel,  Europhys.  Lett. 
42,  481  (1998). 

[21]  L.  K.  Thomsen,  S.  Mancini,  and  H.  M.  Wiseman,  Phys. 
Rev.  A  65,  061801  (2002). 

[22]  L.  K.  Thomsen  and  H.  Wiseman,  Phys.  Rev.  A  65  (2002). 

[23]  L.  K.  Thomsen,  S.  Mancini,  and  H.  Wiseman,  J.  Phys. 
B:  At.  Mol.  Opt.  Phys.  35,  4937  (2002). 

[24]  A.  Silberfarb  and  I.  Deutsch,  Phys.  Rev.  A  68,  013817 
(2003). 

[25]  R.  van  Handel,  J.  K.  Stockton,  and  H.  Mabuchi,  J.  Opt. 
B:  Quantum  Semiclass.  Opt.  7,  S179  (2005). 

[26]  D.  Kupriyanov,  O.  Mishina,  I.  Sokolov,  B.  Julsgaard,  and 
E.  Polzik,  Phys.  Rev.  A  71,  032348  (2005). 

[27]  G.  A.  Smith,  S.  Chaudhury,  A.  Silberfarb,  I.  H.  Deutsch, 


and  P.  S.  Jessen,  Phys.  Rev.  Lett.  93,  163602  (2004). 

[28]  A.  Silberfarb,  P.  S.  Jessen,  and  I.  H.  Deutsch  (2005), 
quant-ph/0412032. 

[29]  W.  Happer  and  B.  Mathur,  Phys.  Rev.  Lett.  163,  12 
(1967). 

[30]  I.  H.  Deutsch  and  P.  Jessen,  Phys.  Rev.  A  57,  1972 
(1998). 

[31]  H.  Nha  and  H.  Carmichael,  Phys.  Rev.  A  71,  013805 
(2005). 

[32]  J.  Mueller,  P.  Petrov,  D.  Oblak,  C.  G.  Alzar,  S.  de  Echaniz,
and  E.  Polzik,  Phys.  Rev.  A  71,  033803  (2005).

[33]  L.  M.  Duan,  J.  I.  Cirac,  and  P.  Zoller,  Phys.  Rev.  A  66,
023818  (2002).

[34]  I.  Bouchoule  and  K.  Mølmer,  Phys.  Rev.  A  66,  043811
(2002).

[35]  C.  M.  Caves,  Phys.  Rev.  D  35,  1815  (1987). 

[36]  C.  Cohen-Tannoudji,  J.  Dupont-Roc,  and  G.  Grynberg,
Atom-Photon  Interactions  (Wiley-Interscience,
New  York,  1992).

[37]  I.  Carusotto  and  E.  J.  Mueller,  J.  Phys.  B  37,  S115
(2004).

[38]  P.  L.  Gould  and  H.  J.  Metcalf,  Phys.  Rev.  Lett.  61,  169 
(1988). 

[39]  D.  Oblak,  J.  K.  Mikkelsen,  W.  Tittel,  A.  K.  Vershovski, 
J.  L.  Sorensen,  P.  G.  Petrov,  C.  L.  G.  Alzar,  and  E.  S. 
Polzik,  Phys.  Rev.  A  71,  043807  (2005). 

[40]  R.  Glauber,  Phys.  Rev.  131,  2766  (1963).

[41]  L.  Mandel  and  E.  Wolf,  Optical  Coherence  and  Quantum 
Optics  (Cambridge  University  Press,  1995). 

[42]  J.  K.  Stockton,  J.  Geremia,  A.  C.  Doherty,  and 
H.  Mabuchi,  Phys.  Rev.  A  69,  032109  (2004). 

[43]  G.  Baym,  Lectures  on  Quantum  Mechanics  (W.  A.  Benjamin,
Inc.,  1969).

[44]  J.  J.  Sakurai,  Modern  Quantum  Mechanics  (Addison-Wesley
Publishing  Company,  1994).

[45]  D.  A.  Varshalovich,  A.  N.  Moskalev,  and  V.  K.  Khersonskii,
Quantum  Theory  of  Angular  Momentum  (World
Scientific,  1988).


APPENDIX  A:  THE  IRREDUCIBLE 
REPRESENTATION  OF  THE  POLARIZABILITY 
HAMILTONIAN 

In this appendix, we derive the irreducible components
of the polarizability Hamiltonian, Eqs. (11, 12, 13), from
the less useful form of Eq. (6). We begin by clarifying
notation used for the spherical basis and the spin
states of the alkali atoms. Then we discuss properties of
the polarizability Hamiltonian and the dipole operator
before detailing the decomposition and re-formatting of
the Hamiltonian into its irreducible form.


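1. Spherical basis

The spherical basis is the preferred basis when dealing
with atomic transitions due to its symmetry properties.
The basis is defined by the transformation from Cartesian
coordinates

$$\mathbf{e}_+ = -(\mathbf{e}_x + i\mathbf{e}_y)/\sqrt{2} \qquad (A1)$$
$$\mathbf{e}_- = (\mathbf{e}_x - i\mathbf{e}_y)/\sqrt{2}$$
$$\mathbf{e}_0 = \mathbf{e}_z$$

Thus elements in the spherical basis have the properties

$$\mathbf{e}_q^* = \mathbf{e}_{-q}(-1)^q \qquad (A2)$$
$$\mathbf{e}_q \cdot \mathbf{e}_{q'}^* = \delta_{q,q'}$$

and for an arbitrary vector $\mathbf{A}$ we have $A_q = \mathbf{e}_q \cdot \mathbf{A}$ so
that $\mathbf{A} = \sum_q A_q \mathbf{e}_q^* = \sum_q (-1)^q A_q \mathbf{e}_{-q}$.


2. Alkali spin states

We represent the internal state of the atom in terms of
the (Zeeman degenerate) atomic hyperfine states, $|f, m\rangle$.
Here $f$ and $f'$ are the total spin quantum numbers for the
ground and excited hyperfine levels respectively while $m$
and $m'$ are their projections on the z-axis. That is to
say, $|f, m\rangle$ are eigenstates of the total atomic angular
momentum,

$$\hat{\mathbf{f}} = \hat{\mathbf{s}} \otimes \mathbb{1}_l \otimes \mathbb{1}_i + \mathbb{1}_s \otimes \hat{\mathbf{l}} \otimes \mathbb{1}_i + \mathbb{1}_s \otimes \mathbb{1}_l \otimes \hat{\mathbf{i}} \qquad (A3)$$

where $\hat{\mathbf{s}}$, $\hat{\mathbf{l}}$, and $\hat{\mathbf{i}}$ are respectively the electron spin, orbital
angular momentum, and the nuclear spin. The quantum
numbers, $f$ and $m$, are defined in the usual manner,

$$\hat{\mathbf{f}}^2|f,m\rangle = \hbar^2 f(f+1)|f,m\rangle \qquad (A4)$$
$$\hat{f}_z|f,m\rangle = \hbar m|f,m\rangle$$

We use the notation that $\hat{f}_\pm$ are in the spherical basis

$$\hat{f}_\pm = \mp(\hat{f}_x \pm i\hat{f}_y)/\sqrt{2}. \qquad (A5)$$

It will also be useful to define a projector onto the ground
state $f$

$$\hat{P}_f = \sum_m |f,m\rangle\langle f,m| \qquad (A6)$$

and a projector onto the excited state $f'$

$$\hat{P}_{f'} = \sum_{m'} |f',m'\rangle\langle f',m'| \qquad (A7)$$


3. Hamiltonian approximation

We begin with the single-particle dipole Hamiltonian
$\hat{H} = -\hat{\mathbf{d}} \cdot \hat{\mathbf{E}}$. The dipole operator $\hat{\mathbf{d}} = e\,\hat{\mathbf{r}}_e$ can be split
into its raising and lowering components

$$\hat{\mathbf{d}} = \hat{\mathbf{d}}^{(-)} + \hat{\mathbf{d}}^{(+)} \qquad (A8)$$
$$\hat{\mathbf{d}}^{(-)} = \sum_{f,f'} \hat{P}_f\,\hat{\mathbf{d}}\,\hat{P}_{f'}$$
$$\hat{\mathbf{d}}^{(+)} = \sum_{f,f'} \hat{P}_{f'}\,\hat{\mathbf{d}}\,\hat{P}_f$$

and the electric field operator can be split into rotating
and counter-rotating terms

$$\hat{\mathbf{E}} = \hat{\mathbf{E}}^{(-)} + \hat{\mathbf{E}}^{(+)} \qquad (A9)$$
$$\hat{\mathbf{E}}^{(-)} = \sqrt{\hbar g}\left[\hat{a}_-^\dagger\mathbf{e}_-^* + \hat{a}_+^\dagger\mathbf{e}_+^*\right]$$
$$\hat{\mathbf{E}}^{(+)} = \sqrt{\hbar g}\left[\hat{a}_-\mathbf{e}_- + \hat{a}_+\mathbf{e}_+\right]$$

After using the rotating wave approximation and one of
many available perturbation expansion techniques (e.g.,
adiabatic elimination) we arrive at the familiar polarizability
Hamiltonian [17, 30, 36],

$$\hat{H} = \sum_{f,f'} \hat{\mathbf{E}}^{(-)} \cdot \frac{\hat{\boldsymbol{\alpha}}_{f,f'}}{\hbar\Delta_{f,f'}} \cdot \hat{\mathbf{E}}^{(+)} \qquad (A10)$$

where the atomic polarizability between a particular
ground state ($f$) and excited state ($f'$) is defined as

$$\hat{\boldsymbol{\alpha}}_{f,f'} = \hat{P}_f\,\hat{\mathbf{d}}\,\hat{P}_{f'}\,\hat{\mathbf{d}}^\dagger\,\hat{P}_f \qquad (A11)$$
$$= \sum_m \sum_{m'} \sum_{m''} |f,m''\rangle\langle f,m''|\hat{\mathbf{d}}|f',m'\rangle\langle f',m'|\hat{\mathbf{d}}^\dagger|f,m\rangle\langle f,m|. \qquad (A12)$$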
This  expanded  expression  involves  dipole  operator  ma¬ 
trix  elements  of  the  form,  (/',  m'\dq\f,  mn)  where  |/,  m)  is 
a  Zeeman  sub-level  in  the  ground-state  hyperfine  man¬ 
ifold,  |  is  a  virtual  state  in  the  excited  hyperfine 

manifold,  and  q  =  0,  ±1  labels  the  helicity  of  the  electro¬ 
magnetic  field. 

The  above  notation  is  complete,  but  for  the  rest  of  this 
appendix  we  work  with  only  one  particular  /,  f  combi¬ 
nation  and  remove  the  subscripts  with  the  simplifying 
notation  change 

Pfd^Pf,  ->•  d  (A13) 

Pfd^Pf  -*•  df 

However,  when  the  complete  Hamiltonian  is  considered, 
the  summation  over  all  possible  /,  f  combinations  is  re¬ 
established. 
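4. Matrix element decomposition

In order to work with the above expressions, it is advantageous
to simplify the dipole matrix elements as
far as possible. By employing the Wigner-Eckart theorem,
the angular dependence of the matrix element,
$\langle f', m'|\mathbf{d}|f, m\rangle$, can be factored into the product of a
Clebsch-Gordan coefficient and a reduced matrix element,

$$ \langle f, m|d_q|f', m'\rangle = \langle f, m|1, q; f', m - q\rangle \langle f\|\mathbf{d}\|f'\rangle. \tag{A14} $$

Since the dipole operator acts only on electronic degrees
of freedom, it is further possible to factor out the nuclear
spin degrees of freedom via the explicit coupling,

$$ \langle f\|\mathbf{d}\|f'\rangle = (-1)^{f'+j+i+1} \sqrt{(2f'+1)(2j+1)} \begin{Bmatrix} 1 & j & j' \\ i & f' & f \end{Bmatrix} \langle j\|\mathbf{d}_e\|j'\rangle \tag{A15} $$

where $i$ is the nuclear spin quantum number, $j$ and $j'$
are the ground and excited state fine structure quantum
numbers, and $\mathbf{d}_e$ is the dipole operator with respect to
the electronic degrees of freedom.


5. Tensor decomposition

From Eq. (17-89) of reference [43] we see that we can
form an irreducible tensor, $Z_m^{(j)}$, from a linear combination
of tensor operators $U_q^{(k)}$ and $V_{q'}^{(k')}$ via the definition

$$ Z_m^{(j)} = \sum_{q,q'} U_q^{(k)} V_{q'}^{(k')} \langle k, q; k', q'|j, m\rangle \tag{A16} $$

where $\langle k, q; k', q'|j, m\rangle$ are Clebsch-Gordan coefficients.
This expression can then be inverted using

$$ U_q^{(k)} V_{q'}^{(k')} = \sum_{j,m} Z_m^{(j)} \langle k, q; k', q'|j, m\rangle. \tag{A17} $$

We now specialize to the case where $Z_m^{(j)} = T_m^{(j)}$, $U = \mathbf{d}$,
and $V = \mathbf{d}^\dagger$. Because we are creating a dyad (with
two vectors), we have $k = k' = 1$. Inserting these above
gives the definition

$$ T_m^{(j)} = \sum_{q,q'} d_q d_{q'}^\dagger \langle 1, q; 1, q'|j, m\rangle \tag{A18} $$

and the inverse

$$ d_q d_{q'}^\dagger = \sum_{j,m} T_m^{(j)} \langle 1, q; 1, q'|j, m\rangle. \tag{A19} $$

We can use this latter expression to write the polarizability
as

$$ \alpha = \mathbf{d}\mathbf{d}^\dagger \tag{A20} $$

$$ \phantom{\alpha} = \sum_{q,q'} \mathbf{e}_q^* \mathbf{e}_{q'}^* d_q d_{q'}^\dagger \tag{A21} $$

$$ \phantom{\alpha} = \sum_{j,m} \sum_{q,q'} \mathbf{e}_q^* \mathbf{e}_{q'}^* T_m^{(j)} \langle 1, q; 1, q'|j, m\rangle \tag{A22} $$

$$ \phantom{\alpha} = \alpha^{(0)} \oplus \alpha^{(1)} \oplus \alpha^{(2)} \tag{A23} $$

where

$$ \alpha^{(j)} = \sum_{m=-j}^{j} \sum_{q,q'} \mathbf{e}_q^* \mathbf{e}_{q'}^* T_m^{(j)} \langle 1, q; 1, q'|j, m\rangle. \tag{A24} $$

Filling in these Clebsch-Gordan coefficients explicitly,
we get

$$ \begin{aligned}
\alpha^{(0)} &= T_0^{(0)} \left[ -\tfrac{1}{\sqrt{3}} \mathbf{e}_0^* \mathbf{e}_0^* + \tfrac{1}{\sqrt{3}} \mathbf{e}_+^* \mathbf{e}_-^* + \tfrac{1}{\sqrt{3}} \mathbf{e}_-^* \mathbf{e}_+^* \right] \\
\alpha^{(1)} &= T_0^{(1)} \left[ \tfrac{1}{\sqrt{2}} \mathbf{e}_+^* \mathbf{e}_-^* - \tfrac{1}{\sqrt{2}} \mathbf{e}_-^* \mathbf{e}_+^* \right]
 + T_{+1}^{(1)} \left[ -\tfrac{1}{\sqrt{2}} \mathbf{e}_0^* \mathbf{e}_+^* + \tfrac{1}{\sqrt{2}} \mathbf{e}_+^* \mathbf{e}_0^* \right]
 + T_{-1}^{(1)} \left[ \tfrac{1}{\sqrt{2}} \mathbf{e}_0^* \mathbf{e}_-^* - \tfrac{1}{\sqrt{2}} \mathbf{e}_-^* \mathbf{e}_0^* \right] \\
\alpha^{(2)} &= T_0^{(2)} \left[ \tfrac{1}{\sqrt{6}} \mathbf{e}_+^* \mathbf{e}_-^* + \tfrac{2}{\sqrt{6}} \mathbf{e}_0^* \mathbf{e}_0^* + \tfrac{1}{\sqrt{6}} \mathbf{e}_-^* \mathbf{e}_+^* \right]
 + T_{+1}^{(2)} \left[ \tfrac{1}{\sqrt{2}} \mathbf{e}_0^* \mathbf{e}_+^* + \tfrac{1}{\sqrt{2}} \mathbf{e}_+^* \mathbf{e}_0^* \right] \\
 &\quad + T_{-1}^{(2)} \left[ \tfrac{1}{\sqrt{2}} \mathbf{e}_0^* \mathbf{e}_-^* + \tfrac{1}{\sqrt{2}} \mathbf{e}_-^* \mathbf{e}_0^* \right]
 + T_{+2}^{(2)} \left[ \mathbf{e}_+^* \mathbf{e}_+^* \right]
 + T_{-2}^{(2)} \left[ \mathbf{e}_-^* \mathbf{e}_-^* \right]
\end{aligned} \tag{A25} $$

Furthermore, using the definition of $T_m^{(j)}$ and filling in
the Clebsch-Gordan coefficients explicitly, we get

$$ \begin{aligned}
T_0^{(0)} &= -\tfrac{1}{\sqrt{3}} \left( d_0 d_0^\dagger - d_+ d_-^\dagger - d_- d_+^\dagger \right) \\
T_0^{(1)} &= \tfrac{1}{\sqrt{2}} \left( d_+ d_-^\dagger - d_- d_+^\dagger \right) \\
T_{+1}^{(1)} &= \tfrac{1}{\sqrt{2}} \left( -d_0 d_+^\dagger + d_+ d_0^\dagger \right) \\
T_{-1}^{(1)} &= \tfrac{1}{\sqrt{2}} \left( d_0 d_-^\dagger - d_- d_0^\dagger \right) \\
T_0^{(2)} &= \tfrac{1}{\sqrt{6}} \left( d_+ d_-^\dagger + 2 d_0 d_0^\dagger + d_- d_+^\dagger \right) \\
T_{+1}^{(2)} &= \tfrac{1}{\sqrt{2}} \left( d_0 d_+^\dagger + d_+ d_0^\dagger \right) \\
T_{-1}^{(2)} &= \tfrac{1}{\sqrt{2}} \left( d_0 d_-^\dagger + d_- d_0^\dagger \right) \\
T_{+2}^{(2)} &= d_+ d_+^\dagger \\
T_{-2}^{(2)} &= d_- d_-^\dagger
\end{aligned} \tag{A26} $$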
Note that several standard references (including references
[43, 44]) contain an error in the prefactor of the
$j = 0$ term and in the sign of the $j = 1$ terms. However,
the fundamental definitions of $T_m^{(j)}$ and its inverse above
are valid.
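As a quick consistency check on the coefficients used in Eqs. (A18) and (A19), the following sketch tabulates the Clebsch-Gordan coefficients $\langle 1, q; 1, q'|j, m\rangle$ in the standard (Condon-Shortley) convention and verifies that they form an orthogonal 9 × 9 matrix, which is what allows the definition to be inverted. The table layout and the names `CG`, `C`, and `is_orthogonal` are illustrative choices, not part of the original derivation.

```python
import numpy as np

s2, s3, s6 = np.sqrt(2), np.sqrt(3), np.sqrt(6)

# Clebsch-Gordan coefficients <1,q; 1,q' | j,m>, keyed by (j, m) -> {(q, q'): value}.
# Values follow the Condon-Shortley convention (an assumption of this sketch).
CG = {
    (0, 0):  {(0, 0): -1 / s3, (1, -1): 1 / s3, (-1, 1): 1 / s3},
    (1, 0):  {(1, -1): 1 / s2, (-1, 1): -1 / s2},
    (1, 1):  {(1, 0): 1 / s2, (0, 1): -1 / s2},
    (1, -1): {(0, -1): 1 / s2, (-1, 0): -1 / s2},
    (2, 0):  {(1, -1): 1 / s6, (0, 0): 2 / s6, (-1, 1): 1 / s6},
    (2, 1):  {(1, 0): 1 / s2, (0, 1): 1 / s2},
    (2, -1): {(0, -1): 1 / s2, (-1, 0): 1 / s2},
    (2, 2):  {(1, 1): 1.0},
    (2, -2): {(-1, -1): 1.0},
}

# Column index for each (q, q') pair; rows run over the nine (j, m) labels.
pairs = [(q, qp) for q in (1, 0, -1) for qp in (1, 0, -1)]
C = np.zeros((9, 9))
for row, coeffs in enumerate(CG.values()):
    for (q, qp), value in coeffs.items():
        C[row, pairs.index((q, qp))] = value


def is_orthogonal(M):
    """True if M is a real orthogonal matrix (so that M^T inverts M)."""
    eye = np.eye(len(M))
    return np.allclose(M @ M.T, eye) and np.allclose(M.T @ M, eye)


print(is_orthogonal(C))
```

Orthogonality of `C` means that the same coefficients appear in both the forward and the inverse relation, as stated above.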

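Using recursion relations for the Clebsch-Gordan coefficients
we can recast the tensor operators in terms of
more intuitive $\mathbf{f}$ operators [26, 45]

$$ \begin{aligned}
T_0^{(0)} &= -\alpha^{(0)}_{f,f'} \mathbb{1}_f / \sqrt{3} \\
T_0^{(1)} &= +\alpha^{(1)}_{f,f'} f_z / \sqrt{2} \\
T_{+1}^{(1)} &= +\alpha^{(1)}_{f,f'} f_+ / \sqrt{2} \\
T_{-1}^{(1)} &= +\alpha^{(1)}_{f,f'} f_- / \sqrt{2} \\
T_0^{(2)} &= -\alpha^{(2)}_{f,f'} \left( 3 f_z^2 - f(f+1) \mathbb{1}_f \right) / \sqrt{6} \\
T_{+1}^{(2)} &= -\alpha^{(2)}_{f,f'} \sqrt{2} f_+ \left( f_z + \mathbb{1}_f / 2 \right) \\
T_{-1}^{(2)} &= -\alpha^{(2)}_{f,f'} \sqrt{2} f_- \left( f_z - \mathbb{1}_f / 2 \right) \\
T_{+2}^{(2)} &= -\alpha^{(2)}_{f,f'} f_+^2 \\
T_{-2}^{(2)} &= -\alpha^{(2)}_{f,f'} f_-^2
\end{aligned} \tag{A27} $$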
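Here we have defined

$$ \alpha^{(0)}_{f,f'} = \alpha_f^{f'} \left( (2f-1) \delta^{f'}_{f-1} + (2f+1) \delta^{f'}_{f} + (2f+3) \delta^{f'}_{f+1} \right) \tag{A28} $$

$$ \alpha^{(1)}_{f,f'} = \alpha_f^{f'} \left( -\frac{2f-1}{f} \delta^{f'}_{f-1} - \frac{2f+1}{f(f+1)} \delta^{f'}_{f} + \frac{2f+3}{f+1} \delta^{f'}_{f+1} \right) \tag{A29} $$

$$ \alpha^{(2)}_{f,f'} = \alpha_f^{f'} \left( \frac{1}{f} \delta^{f'}_{f-1} - \frac{2f+1}{f(f+1)} \delta^{f'}_{f} + \frac{1}{f+1} \delta^{f'}_{f+1} \right) \tag{A30} $$

These definitions have been chosen to make each of
the quantities

$$ \sum_{f'} \frac{\alpha^{(j)}_{f,f'}}{\alpha_0 \Delta_{f,f'}} > 0 \tag{A31} $$

for $\Delta_{f,f'} \gg 0$ for each term $j$. We have defined the
polarizability constants

$$ \alpha_f^{f'} = \alpha_0 \frac{(2j'+1)^2}{(2j+1)} \left| \begin{Bmatrix} 1 & j & j' \\ i & f' & f \end{Bmatrix} \right|^2 \tag{A32} $$

and

$$ \alpha_0 = \frac{3 \epsilon_0 \hbar \Gamma \lambda_0^3}{8 \pi^2} = \left| \langle j'\|\mathbf{d}\|j\rangle \right|^2 \frac{(2j+1)}{(2j'+1)} \tag{A33} $$

which involves the atomic spontaneous emission rate,
$\Gamma$, and transition wavelength, $\lambda_0$.

Now, to complete the derivation, insert Eqs. (A27) into
the polarizability components of Eqs. (A25), then insert
this and the definition of the electric field, Eq. (A9),
into the Hamiltonian, Eqs. (9-10). Expanding, using the
properties of the spherical dot product and the Stokes
component definitions (Eqs. 5), and summing over the
$f'$, we get the final expressions used in the text (Eqs. 11,
12, 13).


APPENDIX B: ARBITRARY VECTOR
OPERATOR ROTATIONS


Here we are interested in evaluating the general operation
of rotating a vector about an arbitrary direction by
an arbitrary amount in order to determine the semiclassical
evolution of the probe light as used in Eqs. (26).

Consider the rotation of the vector spin operator

$$ \mathbf{S} = [S_x, S_y, S_z]^T \tag{B1} $$

in Cartesian coordinates about an arbitrary direction
$\mathbf{n} = [\gamma_x, \gamma_y, \gamma_z]/\gamma$ by the angle $\gamma = \sqrt{\gamma_x^2 + \gamma_y^2 + \gamma_z^2}$. This
rotation can be represented in the Heisenberg picture as

$$ S_i' = U S_i U^\dagger \tag{B2} $$

where

$$ U = \exp[-i \gamma \mathbf{S} \cdot \mathbf{n}] = \exp[-i(\gamma_x S_x + \gamma_y S_y + \gamma_z S_z)]. \tag{B3} $$

The $S_i'$ can be derived explicitly using the following equation
for the arbitrary rotation of any vector, where $\mathbf{i}$ is the unit
vector along Cartesian axis $i$:

$$ S_i' = (\mathbf{S} \cdot \mathbf{i}) \cos\gamma + (\mathbf{n} \cdot \mathbf{i})(\mathbf{n} \cdot \mathbf{S})(1 - \cos\gamma) + \left[ (\mathbf{n} \times \mathbf{i}) \cdot \mathbf{S} \right] \sin\gamma. \tag{B4} $$

Expanding and rearranging terms we get

$$ S_x' = S_x \left( \frac{\gamma_x^2}{\gamma^2}(1 - \cos\gamma) + \cos\gamma \right) + S_y \left( \frac{\gamma_x \gamma_y}{\gamma^2}(1 - \cos\gamma) + \frac{\gamma_z}{\gamma} \sin\gamma \right) + S_z \left( \frac{\gamma_x \gamma_z}{\gamma^2}(1 - \cos\gamma) - \frac{\gamma_y}{\gamma} \sin\gamma \right) \tag{B5} $$

$$ S_y' = S_x \left( \frac{\gamma_x \gamma_y}{\gamma^2}(1 - \cos\gamma) - \frac{\gamma_z}{\gamma} \sin\gamma \right) + S_y \left( \frac{\gamma_y^2}{\gamma^2}(1 - \cos\gamma) + \cos\gamma \right) + S_z \left( \frac{\gamma_y \gamma_z}{\gamma^2}(1 - \cos\gamma) + \frac{\gamma_x}{\gamma} \sin\gamma \right) \tag{B6} $$

$$ S_z' = S_x \left( \frac{\gamma_x \gamma_z}{\gamma^2}(1 - \cos\gamma) + \frac{\gamma_y}{\gamma} \sin\gamma \right) + S_y \left( \frac{\gamma_y \gamma_z}{\gamma^2}(1 - \cos\gamma) - \frac{\gamma_x}{\gamma} \sin\gamma \right) + S_z \left( \frac{\gamma_z^2}{\gamma^2}(1 - \cos\gamma) + \cos\gamma \right) \tag{B7} $$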

These  equations  can  be  specialized  to  Eqs.  (26),  which 
describe  the  experimental  situation  considered  in  this 
work. 
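The rotation formula of this appendix can be checked numerically. The sketch below assumes spin-1/2 matrices with $\hbar = 1$ (any spin representation should behave the same way) and compares the operator conjugation $U S_i U^\dagger$ with the vector rotation formula for an arbitrary, nonzero rotation vector. The function names are illustrative and not taken from the original work.

```python
import numpy as np

# Spin-1/2 operators (hbar = 1), chosen here only as a convenient test representation.
Sx = 0.5 * np.array([[0, 1], [1, 0]], dtype=complex)
Sy = 0.5 * np.array([[0, -1j], [1j, 0]], dtype=complex)
Sz = 0.5 * np.array([[1, 0], [0, -1]], dtype=complex)
S = [Sx, Sy, Sz]


def rotate_exact(gamma_vec):
    """Return [U S_i U^dagger] with U = exp(-i gamma . S), via eigendecomposition."""
    A = sum(g * Si for g, Si in zip(gamma_vec, S))  # Hermitian generator gamma . S
    w, V = np.linalg.eigh(A)
    U = V @ np.diag(np.exp(-1j * w)) @ V.conj().T
    return [U @ Si @ U.conj().T for Si in S]


def rotate_formula(gamma_vec):
    """Return S_i' from the vector rotation formula; gamma_vec must be nonzero."""
    gamma_vec = np.asarray(gamma_vec, dtype=float)
    gamma = np.linalg.norm(gamma_vec)
    n = gamma_vec / gamma
    n_dot_S = sum(nj * Sj for nj, Sj in zip(n, S))
    rotated = []
    for i in range(3):
        cross = np.cross(n, np.eye(3)[i])  # n x (unit vector along axis i)
        cross_dot_S = sum(cj * Sj for cj, Sj in zip(cross, S))
        rotated.append(S[i] * np.cos(gamma)
                       + n[i] * n_dot_S * (1 - np.cos(gamma))
                       + cross_dot_S * np.sin(gamma))
    return rotated


def max_difference(gamma_vec):
    """Largest entrywise mismatch between the two ways of computing S_i'."""
    exact, formula = rotate_exact(gamma_vec), rotate_formula(gamma_vec)
    return max(np.abs(a - b).max() for a, b in zip(exact, formula))


print(max_difference([0.3, -1.1, 0.7]))
```

For a rotation about the z-axis alone, the formula reduces to $S_x' = S_x \cos\gamma + S_y \sin\gamma$, which is a convenient special case to check by hand.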
1.  Introduction 

In  recent  years,  advances  in  technology  have  enabled  a  proliferation  of  experiments 
where  objects  can  be  probed  and  manipulated  near  the  fundamental  quantum  limits 
of  performance.  The  manipulation  and  readout  of  single  qubits  with  unprecedented 
coherence  times  both  in  condensed  matter  and  in  atomic  setups,  the  reliable  trapping, 
cooling  and  shot-noise  limited  continuous  observation  of  single  atoms  in  high-finesse 
optical  cavities,  and  the  production  of  various  nonclassical  states  of  light  and  of  atomic 
ensembles  are  only  a  subset  of  recent  achievements.  The  large  degree  of  control  that  can 
be  exerted  at  the  quantum  level  suggests  that  classical  engineering  methodology  can 
be  fruitfully  adapted  to  this  new  setting.  In  particular,  it  seems  that  the  concept  of 
feedback  control  should  be  of  central  importance  in  the  engineering  of  reliable  quantum 
technologies,  as  in  the  classical  case. 

This  article  is  intended  as  an  introduction  to  the  theoretical  description  of 
quantum  feedback  control  systems.  We  concentrate  on  a  scenario  that  is  common 
in  quantum  optical  experiments,  where  the  system  to  be  controlled  is  brought  in  weak 
interaction  with  an  external  probe  field  which  is  subsequently  detected.  The  detected 
signal  can  then  be  processed  and  fed  back  to  the  system  through  some  actuator.  There 
are  various  theoretical  challenges  in  describing  such  a  system: 

•  How  does  one  model  the  system-probe  interaction? 

•  How  does  one  model  a  continuous  measurement  of  the  probe? 

•  How  does  one  infer  information  on  the  system  from  the  probe  measurements? 

•  How  does  one  design  a  feedback  law  that  utilizes  this  information  to  achieve  a 
particular  control  goal? 
In  the  following  we  will  address  each  of  these  questions  in  turn.  Needless  to  say,  it 
would  be  impossible  to  cover  every  aspect  and  intricacy  of  each  of  these  questions 
within  the  scope  of  this  article;  rather,  we  aim  to  give  a  sufficiently  detailed  discussion 
to  keep  the  article  (mostly)  self-contained,  and  refer  to  the  bibliography  for  complete 
treatments. 

As  an  example  throughout  the  article,  we  discuss  the  preparation  of  entangled 
states  of  an  atomic  ensemble  using  feedback  control.  The  model  consists  of  an  ensemble 
of  atomic  spins  interacting  dispersively  with  an  optical  probe,  which  is  subjected  to 
homodyne  detection.  Several  recent  experiments  have  exploited  a  similar  setup  to 
produce  spin-squeezed  states  (SSS)  [28,30,47]  which  have  applications  in  a  variety  of 
metrology  tasks,  including  magnetometers  [29,65]  and  atomic  clocks  [50,59,76]. 

We  consider  this  model  because  it  is  illustrative  in  several  respects.  First,  the 
model  spans  two  quite  different  and  interesting  regimes.  At  short  times  the  dynamics 
are  approximately  linear  [65]  and  the  model  describes  the  production  of  spin  squeezing. 
However,  at  long  times  the  linear  description  is  no  longer  valid,  and  we  will  show  that 
then  an  eigenstate  of  the  collective  angular  momentum  of  the  ensemble  (a  Dicke  state) 
is  obtained.  Although  the  long  time  limit  described  by  this  model  is  difficult  to  realize 
experimentally  at  this  time,  the  consideration  of  the  substantial  differences  between 
the  regimes  clearly  demonstrates  the  challenges  of  quantum  control.  Second,  the  model 
is  a  convenient  example  to  demonstrate  the  modeling  of  a  quantum  control  system 
from  first  principles.  Ultimately,  by  approaching  the  entire  problem — from  physical 
modeling  to  inference  to  control — in  a  systematic  manner,  we  hope  to  provide  a  unified 
outline  for  future  modeling  efforts. 

The  article  is  roughly  divided  into  two  parts.  The  first  part,  consisting  of  sections 
2,  3  and  4,  is  somewhat  technical  in  nature.  Its  goal  is  to  obtain  from  first  principles, 
using  a  simplified  field-theoretical  model  of  the  interaction  of  an  atomic  ensemble  with 
a  probe  field,  the  quantum  filtering  equation  (52).  To  this  end,  we  begin  by  reviewing 
in  section  2  the  statistical  inference  of  quantum  states.  In  section  3  we  introduce  a 
field-theoretical  model  of  an  atomic  ensemble  coupled  to  an  electromagnetic  probe 
field,  and  we  discuss  how  it  can  be  reduced  to  a  stochastic  equation.  In  section  4  we 
detail  how  to  properly  condition  the  ensemble  state  upon  the  results  from  continuous 
optical  measurements  in  the  field. 

The  second  part,  section  5,  presents  general  principles  of  feedback  control  and 
demonstrates  how  they  can  be  applied  to  enable  quantum  state  preparation.  This 
procedure  is  discussed  in  both  the  short  time  limit,  where  a  linear  approximation  is 
valid,  and  in  the  long  time  limit,  where  a  more  complete  description  is  required  [66]. 
Section  5  is  fairly  independent  from  the  first  part  of  the  article,  and  a  reader  who 
has  some  familiarity  with  the  filtering  equation,  Eq.  (52),  could  skip  directly  ahead  to 
this  section.  We  have  attempted,  however,  to  give  in  sections  2-5  a  unified  picture  of 
quantum  feedback  control  design,  from  the  elementary  physical  interactions  through 
feedback-enabled  state  preparation. 

As  we  proceed,  we  attempt  to  review  the  literature  concerning  measurement  and 
feedback  control  of  atomic  ensembles,  while  also  putting  into  context  related,  but 
more  mathematical,  works  concerning  estimation  and  control.  In  the  end,  we  hope 
to  inspire  further  development  in  this  field  by  highlighting  the  numerous  connections 
between  the  problems  of  quantum  control  and  problems  considered  in  the  culturally 
distinct  context  of  the  mathematics  and  control  communities  [4,18,71]. 
2.  What  is  a  quantum  state? 

Quantum  mechanics  describes  the  statistics  of  observable  quantities,  very  much  like 
classical  probability  theory.  In  fact,  the  foundation  of  quantum  mechanics  is  just  an 
extension  of  probability  theory,  as  we  will  discuss  in  this  section.  Such  a  point  of  view 
allows  us  to  apply  classical  constructions  of  probability  theory  directly  to  quantum 
models.  Though  this  section  contains  no  surprises,  we  aim  to  clarify  the  concepts  and 
terminology  used  in  the  remainder  of  the  article.  We  will  pay  particular  attention  to 
what  is  meant  by  a  “quantum  state”,  an  issue  that  must  be  resolved  before  we  can 
discuss  state  preparation. 

2.1.  Classical  probability 

To set the stage for quantum probability we first discuss some of the elements of
classical probability theory [75]. As an illustration, consider throwing two dice. The
first ingredient we need in our theory is the sample space, usually denoted by $\Omega$. This
is just a set which describes all the “microstates” of the system; in our case, it is the
set of $6^2 = 36$ possible outcomes of a throw 11, 12, ..., 21, 22, ..., 65, 66. A random
variable $f$ is now a map $f : \Omega \to \mathbb{R}$. For example, we could define a random variable
$X$ that describes the sum of the two outcomes, i.e. $X(11) = 2$, $X(53) = 8$, etc.

To complete the picture we need to introduce an object that can provide answers
to questions such as "what is the probability of having thrown 66?", or "what is the
probability of having thrown at least one three?" This is exactly provided by the notion
of a probability measure. Note that we can represent any question as a subset of $\Omega$;
e.g., our first question is represented by the set $\{66\}$, while the second is represented
by $\{31, 32, \ldots, 36, 13, 23, \ldots, 63\}$. These sets (and the questions they represent) are
called events. The probability measure $\mathbb{P}$ is a map that associates to every event a
probability.

We can compose new events as follows. Given two events $A, B \subset \Omega$, the question
"$A$ or $B$?" is represented by $A \cup B$, whereas "$A$ and $B$?" corresponds to $A \cap B$. In particular,
the latter operation defines the joint probability $\mathbb{P}(A \cap B)$ of $A$ and $B$. The probability
measure needs to be consistently defined with respect to these operations in the sense
that $\mathbb{P}(A \cup B) = \mathbb{P}(A) + \mathbb{P}(B)$ if $A \cap B = \emptyset$, i.e. if $A$ and $B$ are mutually exclusive.
Furthermore $\mathbb{P}(\emptyset) = 0$ and $\mathbb{P}(\Omega) = 1$. In our example there is an equal probability
of having thrown any combination; hence $\mathbb{P}(A) = 1/36$ for any event $A$ with a single
element. Any other event can be constructed as a union of these “elementary” events
and its probability can be found using the formula for $\mathbb{P}(A \cup B)$.

Now suppose we wish to perform a particular observation on the system; we have
already defined such observations (random variables) as maps on $\Omega$. To obtain the
probability of a particular observation, we simply invert the corresponding map. For
example, the probability that we throw a combination that sums to 4 is $\mathbb{P}(X = 4) =
\mathbb{P}(X^{-1}(4)) = \mathbb{P}(\{13, 22, 31\}) = 1/12$. Hence the probability measure contains all the
information available on the outcome of any observation, i.e., $\mathbb{P}$ represents the state of
the system. The philosophy behind this choice of terminology is that physical theories
exist to model the outcomes of observations; the “state” is the object of the theory
that gives rise to the statistics of any such observation.
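The dice example is small enough to enumerate directly. The following sketch builds the sample space, the uniform probability measure, and the random variable $X$, and evaluates the probability of an observation by inverting the map, as described above. The helper names `P`, `X`, and `preimage` are illustrative.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 ordered outcomes of throwing two dice.
omega = list(product(range(1, 7), repeat=2))


def P(event):
    """Uniform probability measure: the fraction of outcomes contained in the event."""
    return Fraction(len(set(event)), len(omega))


def X(w):
    """Random variable: the sum of the two dice."""
    return w[0] + w[1]


def preimage(rv, value):
    """The event {w in omega : rv(w) = value}."""
    return {w for w in omega if rv(w) == value}


at_least_one_three = {w for w in omega if 3 in w}

print(P(preimage(X, 4)))      # probability that the dice sum to 4
print(P({(6, 6)}))            # probability of having thrown 66
print(P(at_least_one_three))  # probability of at least one three
```

Exact fractions are used instead of floating-point numbers so that the results can be compared directly with the values quoted in the text.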

Let us now consider classical state preparation. The physical mechanism that
prepares the state of the dice, i.e. that causes every combination to have equal
probability, is the throwing process. Suppose we want to prepare a different state,
for example a state that has a high probability of obtaining two sixes. We could
obtain such a state by modifying the physical process that creates it. For example, we
could engineer dice with a nonuniform mass distribution, so the sixth face is lighter
than the other faces; then the rolling of the dice is more likely to terminate with the
sixth face facing up.

There is a different way in which we can change the state. The conditional
probability of event $A$ given that we have measured event $B$ is

$$ \mathbb{P}(A|B) = \frac{\mathbb{P}(A \cap B)}{\mathbb{P}(B)}. \tag{1} $$

Suppose we observe $X = 12$. Then the conditional probability of having thrown 66
is $\mathbb{P}(\{66\}|X = 12) = 1$, whereas without conditioning $\mathbb{P}(\{66\}) = 1/36$. However, if we
happened to measure $X \neq 12$ then $\mathbb{P}(\{66\}|X \neq 12) = 0$. This corresponds to the
intuitive notion that if we see that we have thrown 66, then the probability that we
have thrown 66 is one, no matter what its probability was before we had gained that
information. However, the probability that we would see 66 in the first place is only
1/36. Hence we can create states by conditioning a “prior” state on a measurement, but
only very inefficiently: to prepare a state with high probability of obtaining 66, we
have to keep throwing the dice until we happen to observe $X = 12$.
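Conditioning on the dice example can be sketched in a few lines, with events represented as Python sets. The function name `P_given` and the event names are illustrative; the uniform measure is the one assumed in the text.

```python
from fractions import Fraction
from itertools import product

omega = set(product(range(1, 7), repeat=2))  # 36 equally likely outcomes


def P(event):
    """Uniform probability measure on subsets of omega."""
    return Fraction(len(event), len(omega))


def P_given(A, B):
    """Conditional probability P(A|B) = P(A and B) / P(B), as in Eq. (1)."""
    return P(A & B) / P(B)


double_six = {(6, 6)}
sum_is_12 = {w for w in omega if sum(w) == 12}
sum_not_12 = omega - sum_is_12

print(P(double_six))                    # prior probability of 66
print(P_given(double_six, sum_is_12))   # after observing X = 12
print(P_given(double_six, sum_not_12))  # after observing X != 12
```

Because the conditioning event $X = 12$ has probability 1/36, preparing the state this way takes 36 throws on average, which illustrates why conditioning alone is an inefficient preparation method.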

There  is  a  final  possibility  which  combines  the  two  methods  of  state  preparation. 
Suppose  that  we  perform  an  observation  not  after  the  throw  has  completed,  but  while 
it  is  still  in  progress.  Moreover,  we  allow  ourselves  to  interfere  with  the  dice:  if  the 
rolling  dice  threaten  to  terminate  with  a  low  value  of  X,  we  give  them  a  shove  so  they 
keep  rolling.  This  way  the  probability  of  throwing  high  numbers  is  elevated.  In  other 
words,  we  prepare  the  state  of  our  choice  by  performing  observation  and  applying 
feedback  to  the  system  dynamics.  This  crude  example  represents  the  type  of  state 
preparation  that  we  consider  in  this  article  for  quantum  systems. 

We conclude this section by introducing expectations and conditional
expectations. If $\Omega$ is a finite countable set (which we have implicitly assumed in
this section) then we may always decompose a random variable $f : \Omega \to \mathbb{R}$ as follows.
The map $f$ takes the values $f_i \in \mathbb{R}$ on disjoint subsets $S_i = f^{-1}(f_i) \subset \Omega$ such that
$\bigcup_i S_i = \Omega$. Hence we can write

$$ f(\omega) = \sum_i f_i \chi_{S_i}(\omega) \tag{2} $$

where $\chi_{S_i}$ is the indicator function of $S_i$, i.e. $\chi_{S_i}(\omega) = 1$ if $\omega \in S_i$, 0 otherwise. The
expectation of $f$ is given by

$$ \mathbb{E} f = \sum_i f_i \mathbb{P}(S_i) \tag{3} $$

and represents the value that $f$ takes on “on average”. Note that the state $\mathbb{P}$ uniquely
determines $\mathbb{E}$, but the converse is also true as by construction $\mathbb{E}\chi_S = \mathbb{P}(S)$ for any
event $S \subset \Omega$. Hence we can equivalently define the state of the system by specifying
the expectation of every system observable.

Similarly, we can define the conditional expectation of $f = \sum_j f_j \chi_{S_j}$ given that
we have measured $g = \sum_i g_i \chi_{T_i}$:

$$ \mathbb{E}(f|g)(\omega) = \sum_i \sum_j f_j \mathbb{P}(S_j|T_i) \chi_{T_i}(\omega). \tag{4} $$

Now $\mathbb{E}(\chi_S|g)(\omega) = \mathbb{P}(S|g = g(\omega))$. Hence we can equivalently define the state of the
system, conditioned on a measurement of $g$, by specifying the conditional expectation
of every system observable with respect to $g$.

Though entirely natural from a probabilist’s point of view, it is not customary
in physics to think of the conditional expectation as a random variable. One way to
interpret Eq. (4) is that the random variable $\mathbb{E}(f|g)$ is the best estimate of $f$ given
$g$ [60,75]. To see this, first note that $\mathbb{E}(f|g)$ is by construction a function of $g$:
$\mathbb{E}(f|g)(\omega) = \lambda(g(\omega))$ where we define $\lambda : g_i \mapsto \sum_j f_j \mathbb{P}(S_j|T_i)$. It is not difficult
to show that of all functions $\lambda'$, the one that minimizes the least-squares criterion
$\mathbb{E}[(f - \lambda'(g))^2]$ is exactly $\lambda' = \lambda$. This is precisely what we mean by $\mathbb{E}(f|g)$ being
the best estimate of $f$ given $g$. Evidently this idea is equivalent, or in some sense dual,
to the notion of a conditional state that we introduced earlier.
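The least-squares interpretation can be illustrated on the dice example. In the sketch below, $f$ is the sum of both dice and $g$ is the value of the first die alone; both choices, and the competing estimator "twice the first die", are hypothetical examples introduced only for this illustration. The code computes $\mathbb{E}(f|g)$ as a function of $g$ under the uniform measure and compares mean-square errors.

```python
from fractions import Fraction
from itertools import product

omega = list(product(range(1, 7), repeat=2))
p = Fraction(1, len(omega))  # uniform measure: each outcome has probability 1/36


def expect(rv):
    """Expectation of a random variable under the uniform measure."""
    return sum(p * rv(w) for w in omega)


def cond_expect(f, g):
    """Return E(f|g) as a random variable on omega; it depends on w only through g(w).

    Assumes the uniform measure, so P(S_j|T_i) reduces to counting within each cell T_i.
    """
    table = {}
    for value in {g(w) for w in omega}:
        cell = [w for w in omega if g(w) == value]  # the event T_i = {g = g_i}
        table[value] = Fraction(sum(f(w) for w in cell), len(cell))
    return lambda w: table[g(w)]


f = lambda w: w[0] + w[1]  # sum of both dice
g = lambda w: w[0]         # only the first die is observed

best = cond_expect(f, g)

# Mean-square error of the conditional expectation and of a competing estimator.
mse_best = expect(lambda w: (f(w) - best(w)) ** 2)
mse_other = expect(lambda w: (f(w) - 2 * g(w)) ** 2)

print(best((2, 5)), mse_best, mse_other)
```

Here the conditional expectation is the observed die plus 7/2, and its mean-square error equals the variance of the unobserved die, which is smaller than the error of the competing guess.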

2.2.  Quantum  probability 

We will now formulate quantum mechanics in the same language as the classical
case [53,68]. An observable (random variable) in quantum theory is given by a
self-adjoint operator $F$ on some complex Hilbert space $\mathcal{H}$. Assuming $\mathcal{H}$ is finite-dimensional,
we always have the spectral decomposition

$$ F = \sum_i f_i P_i \tag{5} $$

where $f_i \in \mathbb{R}$ are the eigenvalues of $F$ and $P_i = P_i^\dagger = P_i^2$ are projection operators
onto the corresponding eigenspaces. The picture is completed by introducing a map
$\mathbb{E} : \cdot \mapsto \mathrm{Tr}[\cdot\,\rho]$ with some $\rho = \rho^\dagger \geq 0$, $\mathrm{Tr}\,\rho = 1$. Then $\mathbb{E} F$ is the expectation of the
observable $F$. In terms of the spectral decomposition

$$ \mathbb{E} F = \sum_i f_i \mathbb{E} P_i. \tag{6} $$

Clearly the projectors $P_i$ play the role of events $\chi_{S_i}$ in the classical theory. Indeed,
a measurement of $F$ yields the outcome $f_i$ with probability $\mathbb{E} P_i$. Thus any quantum
observable is identical to a classical random variable.

We can make the correspondence explicit in the following way. As we are free
to choose any basis in the Hilbert space, we may always choose a basis in which
$F$ is diagonal. We can then interpret the diagonal elements of $F$ as the values of
the random variable $f$, where $\Omega$ is just the set of diagonal entries: $f : i \mapsto F_{ii}$.
The $P_i$ now correspond exactly to indicator functions on $\Omega$ and $\mathbb{P}(S) = \sum_i \chi_S(i) \rho_{ii}$.
Note that the underlying Hilbert space plays a passive role in the theory, just like
the sample space $\Omega$ in classical probability: the central element of the theory is the
set of observables we are interested in. As long as we are interested in a set of
observables that all commute with each other, then quantum and classical probability
are identical theories: commuting observables can be simultaneously diagonalized, so
we can follow the above “recipe” to transform between the classical and quantum
descriptions. In other words, classical probability theory is a special case of quantum
probability theory.

The embedding of classical in quantum probability allows us to carry over directly
concepts from classical probability to sets of commuting quantum observables. For
example, in the classical case we defined the joint probability of two events $A$ and
$B$ as $\mathbb{P}(A \cap B) = \mathbb{E}(\chi_A \chi_B)$. This carries over directly to the quantum case for two
quantum events $P$, $Q$ as long as they commute: i.e. the joint probability of $P$ and $Q$
is $\mathbb{E}(PQ) = \mathrm{Tr}[PQ\rho]$. Similarly, we obtain an expression for conditional expectation
for two commuting quantum observables $F = \sum_i f_i P_i$ and $G = \sum_j g_j Q_j$:

$$ \mathbb{E}(F|G) = \sum_i \sum_j f_i \frac{\mathrm{Tr}[P_i Q_j \rho]}{\mathrm{Tr}[Q_j \rho]} Q_j \tag{7} $$

which is itself an observable as in the classical case, interpreted as the best estimate
(in the least mean square sense) of $F$ given $G$. Note that these are not even
“quantum analogs” of classical concepts; these are entirely classical operations. We
can obtain these expressions by writing the commuting set of events in the diagonal
basis, transforming to the classical picture, applying the classical operation, and
transforming back to the quantum picture in the original basis.
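The conditional expectation of Eq. (7) for commuting observables can be sketched numerically. The example below is a hypothetical illustration, not taken from the article: two spin-1/2 systems, with $G = \sigma_z \otimes 1$ observed and $F = 1 \otimes \sigma_z$ estimated, in a partially mixed correlated state. The function names are assumptions of this sketch.

```python
import numpy as np


def spectral_projectors(A):
    """Eigenvalues and eigenprojectors of a Hermitian matrix (degeneracies grouped)."""
    w, V = np.linalg.eigh(A)
    values = np.unique(np.round(w, 10))
    projectors = []
    for val in values:
        cols = V[:, np.isclose(w, val)]
        projectors.append(cols @ cols.conj().T)
    return values, projectors


def cond_expect(F, G, rho):
    """E(F|G) = sum_ij f_i Tr[P_i Q_j rho] / Tr[Q_j rho] Q_j; requires [F, G] = 0."""
    assert np.allclose(F @ G, G @ F), "F and G must commute"
    f, P = spectral_projectors(F)
    _, Q = spectral_projectors(G)
    out = np.zeros_like(rho, dtype=complex)
    for Qj in Q:
        pj = np.trace(Qj @ rho).real
        if pj < 1e-12:
            continue  # outcome j has zero probability; it contributes nothing
        for fi, Pi in zip(f, P):
            out += fi * np.trace(Pi @ Qj @ rho).real / pj * Qj
    return out


sz = np.diag([1.0, -1.0])
I2 = np.eye(2)
G = np.kron(sz, I2)  # observed quantity
F = np.kron(I2, sz)  # quantity to be estimated; commutes with G

# Hypothetical state: (|00> + |11>)/sqrt(2) mixed with the maximally mixed state.
bell = np.zeros(4)
bell[0] = bell[3] = 1 / np.sqrt(2)
rho = 0.8 * np.outer(bell, bell) + 0.2 * np.eye(4) / 4

est = cond_expect(F, G, rho)
print(np.round(est.real, 3))
```

For this state the best estimate of $F$ is $0.8\,G$: the two spins are correlated, so observing the first gives partial information about the second.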

What makes quantum probability different from classical probability is the
existence of noncommuting observables. For events or observables that do not
commute the classical probabilistic concepts do not make any sense: for example,
the joint probability of $P$, $Q$ with $[P, Q] \neq 0$ cannot be unambiguously defined as
$\mathbb{E}(PQ) \neq \mathbb{E}(QP)$. Similarly $\mathbb{E}(F|G)$ cannot be defined for $[F, G] \neq 0$. Hence we do not
allow simultaneous measurement or statistical inference of noncommuting observables.
The fact that noncommuting observables are inherent to quantum models restricts the
amount of information that can be obtained from the system by measurement.
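A two-level example makes the ambiguity concrete. The sketch below uses projectors onto spin-up along z and along x, and a state with spin-up along y; these particular choices are hypothetical and serve only to show that the two operator orderings give different, and not even real, values.

```python
import numpy as np

# Projectors onto spin-up along z and spin-up along x for a spin-1/2 system.
P = np.array([[1, 0], [0, 0]], dtype=complex)        # |+z><+z|
Q = 0.5 * np.array([[1, 1], [1, 1]], dtype=complex)  # |+x><+x|

# Hypothetical state: spin-up along y.
psi = np.array([1, 1j]) / np.sqrt(2)
rho = np.outer(psi, psi.conj())

E_PQ = np.trace(P @ Q @ rho)
E_QP = np.trace(Q @ P @ rho)
commutator = P @ Q - Q @ P

print(E_PQ, E_QP)
```

Neither ordering yields a valid probability here, so no joint probability can be assigned to the two events.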

Once we have fixed a commuting family of observables to measure, however, the
measurement process is reduced to straightforward application of classical probability
theory. In particular, even if we are interested in modeling a pair of observables $F$
and $G$ that do not commute, we can still perform statistical inference as long as both
observables commute with the observation $H$. After all, by Eq. (7), $\mathbb{E}(F|H)$ and
$\mathbb{E}(G|H)$ are both functions of $H$; they therefore commute and can be measured
simultaneously, even though $F$ and $G$ do not commute. We will repeatedly exploit
this fact throughout this article.

To illustrate these ideas, consider the example of a single spin-1/2 atom, and
suppose we are interested in controlling the spin observables (Pauli matrices)
$\sigma_x, \sigma_y, \sigma_z$. We run into problems if we try to directly measure $\sigma_z$, as this does not
commute with $\sigma_x$ and $\sigma_y$. Because the best estimate of $\sigma_x$ or $\sigma_y$ with respect to $\sigma_z$ is
undefined, it is unclear in what sense one could control $\sigma_x$ and $\sigma_y$ if we keep observing
$\sigma_z$.

We have already hinted at the solution to this problem: we must observe a fourth
observable $X$ that commutes with $\sigma_x, \sigma_y, \sigma_z$. Then all three conditional expectations
are well defined. A famous example of this procedure is the Stern-Gerlach apparatus:
in this case the atom passes through a strong magnetic gradient which correlates the
spin observables $\sigma_{x,y,z}$ with the spatial position $X$ of the atom. By measuring $X$, which
commutes with $\sigma_{x,y,z}$, we can form best estimates of the latter three observables, and
thus at least conceptually these can be controlled.

In  practice  the  Stern-Gerlach  device  is  not  a  good  system  for  controlling  the  spin, 
as  the  observable  X  is  a  different  degree  of  freedom  of  the  same  atom  that  carries 
the  spin.  When  the  atom  hits  the  screen,  enacting  a  measurement  of  X,  the  atom  is 
effectively  destroyed  and  there  is  no  point  in  updating  the  spin  state  for  further  control. 
The  approach  we  take  in  this  article  is  a  realistic,  though  conceptually  identical,  version 
of  this  example.  Instead  of  coupling  the  atomic  spin  to  the  atomic  position,  the  spin 
interacts  with  an  external  electromagnetic  field.  Even  though  photodetection  of  the 
field  is  destructive  this  will  not  affect  the  atom  itself. 

The  quantum  state  is  an  object  that  associates  an  expectation  to  the  relevant  set 
of  observables.  We  refrain  from  defining  the  quantum  state  as  the  density  matrix  p. 
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The  properties  that  any  expectation  map  must  obey  imply  that  we  can  always  find 
a  density  matrix  p  such  that  the  expectation  can  be  expressed  as  E F  =  Tr  [pF]  for 
the  relevant  set  of  observables  F.  What  this  relevant  set  is,  however,  depends  on  the 
context. 

To illustrate this subtle distinction let us consider again the Stern-Gerlach
example. Before conditioning we consider the four observables $\sigma_{x,y,z}$ and X. Hence
we naturally express the state as a density matrix on $\mathcal{H}_s \otimes \mathcal{H}_q$, the tensor product
of the atomic spin and position Hilbert spaces. However, we can only condition
observables on X that commute with X. Hence after conditioning a spin-position
density matrix is no longer meaningful, as many observables on the position Hilbert
space (e.g. momentum) will have an undefined conditional expectation†.

To find the natural state after conditioning, recall that the $\mathbb{E}(\sigma_{x,y,z}|X)$ all commute.
Hence we can describe them as classical random variables $s_{x,y,z}(\omega)$ on some probability
space $\Omega$. To express the state as a density matrix, then, we must also make it random:
we define $\rho(\omega)$ on $\mathcal{H}_s$ through $s_{x,y,z}(\omega) = \mathrm{Tr}[\rho(\omega)\sigma_{x,y,z}]$. This conforms to the intuitive
idea that after measurement, the conditional state is itself a classical random variable,
where $\Omega$ is simply the set of possible outcomes of X. It also highlights, however,
that in order to talk sensibly about state preparation we must carefully select which
observables we wish to specify. Though the “dual” description in terms of a density
matrix is often more economical, we will often find it both conceptually and technically
simpler to obtain results by considering conditional expectations to be observables on
$\mathcal{H}_s \otimes \mathcal{H}_q$.
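As a rough numerical illustration of the random conditional state, the following Python sketch builds a toy Stern-Gerlach model. It is only a sketch under stated assumptions: the position is reduced to two screen regions, and the overlap angle `theta` and all variable names are hypothetical. It computes $\mathbb{E}(\sigma_z|X)$ as a classical conditional expectation and then checks that the random density matrix $\rho(\omega)$ on the spin space reproduces it.

```python
import numpy as np

# Pauli z and identity on the spin space H_s
sz = np.array([[1.0, 0.0], [0.0, -1.0]])
I2 = np.eye(2)

# Toy joint state on H_s (x) H_q: spin up/down is imperfectly correlated
# with two screen regions; theta (hypothetical) sets the overlap.
theta = 0.3
up, dn = np.array([1.0, 0.0]), np.array([0.0, 1.0])
phi_up = np.array([np.cos(theta), np.sin(theta)])
phi_dn = np.array([np.sin(theta), np.cos(theta)])
psi = (np.kron(up, phi_up) + np.kron(dn, phi_dn)) / np.sqrt(2.0)
rho = np.outer(psi, psi)

def expect(op):
    """Expectation Tr[rho op] in the joint state."""
    return float(np.trace(rho @ op))

# Spectral projectors of X, one per outcome omega
projectors = [np.diag([1.0, 0.0]), np.diag([0.0, 1.0])]

prob, s_z, rho_cond = [], [], []
for P in projectors:
    IP = np.kron(I2, P)
    p = expect(IP)                          # probability of outcome omega
    prob.append(p)
    s_z.append(expect(np.kron(sz, P)) / p)  # E(sigma_z | X)(omega)
    # Random density matrix rho(omega) on H_s: trace out the position
    block = (IP @ rho @ IP).reshape(2, 2, 2, 2)
    rho_cond.append(np.trace(block, axis1=1, axis2=3) / p)

for w in range(2):
    print(w, prob[w], s_z[w], np.trace(rho_cond[w] @ sz))
```

Averaging `s_z` over the outcome probabilities recovers the unconditional expectation of $\sigma_z$, as it must for a conditional expectation.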

The  three  methods  of  state  preparation  discussed  in  the  previous  section  carry 
over  directly  to  the  quantum  case.  All  these  methods  have  been  discussed  to  various 
extent  in  the  literature;  references  to  their  various  experimental  implementations  will 
be  given  in  section  5.  The  first  method  corresponds  to  designing  a  Hamiltonian  whose 
time  evolution  generates  the  desired  state.  The  drawback  of  this  method  is  that  such 
a  Hamiltonian  may  be  highly  nonlinear  and  difficult  to  engineer  in  practice. 

The  second  method  corresponds  to  conditioning.  As  we  saw  in  the  example 
above,  to  do  this  we  must  “open”  the  system  by  introducing  another  observable. 
We  emphasize,  however,  that  there  is  no  physical  “collapse”  associated  to  the  actual 
measurement:  we  just  use  classical  conditioning  to  update  our  state  of  knowledge. 
The  drawback  of  this  method  is  that  the  outcome  of  the  measurement  is  random  and 
will  not  always  result  in  the  desired  state;  particularly  in  cases  where  the  state  is 
prepared  with  low  probability,  this  may  not  be  a  desirable  option. 

The  third  method,  which  is  the  main  topic  of  this  article,  is  that  of  conditioning 
with  feedback.  The  advantage  of  such  a  method  is  that  it  can  be  implemented 
with  simple  Hamiltonians,  while  it  does  not  suffer  from  the  indeterminism  of  pure 
conditioning.  The  method  can  also  be  more  robust  than  simple  Hamiltonian  evolution, 
as  it  is  not  as  sensitive  to  e.g.  timing  errors  or  precise  knowledge  of  experimental 
parameters [65,70]. However, successful implementation of such a method requires
sensitive,  continuous-time  quantum-limited  measurements  and  fast  in-line  signal 
processing,  techniques  that  have  only  recently  become  available. 

We separate the development of quantum feedback control into three parts. In
order to interpret the measurement current and feedback we must develop a physical
model of the system and its interaction with the environment. This first step, the
physical modeling step, embodies the “physical content” of the problem. In the second
step we condition the system dynamics based on an observation of the environment.
This statistical inference step is, as we have discussed, entirely classical in nature. The
third step is the control problem: finding a control law that will prepare the desired
state. In the following sections we consider each of these problems separately.

† Of course if we were interested in both position and momentum, we could couple to yet another
observable that commutes with $\sigma_{x,y,z}$ as well as position and momentum. This way we move further
and further down the “Heisenberg chain”. Ultimately, however, we have to make an observation,
which will rule out some incompatible observables.

Note that all the constructions in this section can be generalized to infinite-dimensional
Hilbert spaces in the quantum case, and to infinite or continuous $\Omega$ in
the classical case. However, a rigorous discussion of the associated mathematics is
beyond  the  scope  of  this  article.  Though  conceptually  the  finite  and  infinite  cases  are 
very  similar,  we  will  need  to  extend  the  finite  techniques  somewhat  in  section  4.2  in 
order  to  deal  with  continuous  systems.  For  lucid  introductions  to  the  general  theories 
of  classical  and  quantum  probability  we  refer  to  [75]  and  [53],  respectively. 

3.  The  physical  model:  from  QED  to  stochastic  equations 

In  this  section  we  will  describe  a  microscopic  model  for  the  class  of  systems  we 
consider.  The  model  consists  of  an  atomic  ensemble  coupled  weakly  to  an  external 
electromagnetic  field  which  is  ultimately  detected. 

3.1.  System  model  from  quantum  electrodynamics 

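It is well known from quantum electrodynamics [13, 54] that the observable for the
free electric field is given by

$$\mathbf{E}(\mathbf{r},t) = \sum_s \int \sqrt{\frac{\hbar\omega}{2(2\pi)^3\varepsilon_0}}\,\left(i\,a_{\mathbf{k},s}\,\boldsymbol{\epsilon}_{\mathbf{k},s}\,e^{i(\mathbf{k}\cdot\mathbf{r}-\omega t)} + \mathrm{h.c.}\right) d^3k \tag{8}$$

where $\omega = c|\mathbf{k}|$, $\boldsymbol{\epsilon}_{\mathbf{k},s}$ are polarization vectors and $a_{\mathbf{k},s}$ are plane wave (Fourier)
mode annihilation operators that satisfy the commutation relations $[a_{\mathbf{k},s}, a^\dagger_{\mathbf{k}',s'}] = \delta^3(\mathbf{k}-\mathbf{k}')\,\delta_{ss'}$.
We assume that the atomic ensemble (centered at the origin) interacts
with the field predominantly through its collective dipole moment; i.e., the interaction
Hamiltonian will be of the form $H_I(t) = -\mathbf{d}(t)\cdot\mathbf{E}(0,t)$ where $\mathbf{d}(t)$ is the ensemble
dipole operator. In practice there will be some ultraviolet cutoff, which we can obtain
e.g. by averaging the electric field over the volume of the cloud of atoms instead of
evaluating it at the origin. We will write

$$\mathbf{E}^{(+)}(\mathbf{r},t) = [\mathbf{E}^{(-)}(\mathbf{r},t)]^\dagger = \sum_s \int g(\mathbf{k})\,a_{\mathbf{k},s}\,\boldsymbol{\epsilon}_{\mathbf{k},s}\,e^{i(\mathbf{k}\cdot\mathbf{r}-\omega t)}\,d^3k \tag{9}$$

where $\mathbf{E} = i(\mathbf{E}^{(+)} - \mathbf{E}^{(-)})$ and $g(\mathbf{k})$ is the mode function; e.g. $g(\mathbf{k})$ acquires a Gaussian
cutoff in $|\mathbf{k}|$ if we average $\mathbf{E}$ over a spatial Gaussian distribution.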

The full interaction is sketched in cartoon form in Fig. 1a. The atomic ensemble
interacts through its dipole moment with all plane wave modes in three dimensions. A
strong, focused laser beam at frequency $\omega_0$ is modeled by bringing the corresponding
modes into a large-amplitude coherent state. The drive is scattered predominantly
in the forward direction, and is ultimately detected. The remaining modes are in
the vacuum state and drive spontaneous emission of the ensemble in all directions.
This essentially complete description of the interaction embodies all the physics of
the problem, and thus allows one to predict quantities such as the spontaneous
emission rate. The full picture is also very complicated, however, as it requires a
detailed analysis of the atomic structure, a partitioning of the field into observed and
unobserved modes, etc. Such an analysis is beyond the scope of this article.

Figure 1. (a) Schematic of the full interaction Hamiltonian, where all plane wave modes in three
dimensions interact with an atomic ensemble. One of the incoming modes is coherently excited
with frequency $\omega_0$ and large amplitude; the coherent excitation scatters mainly in the forward
direction. The remaining incoming modes are in the vacuum state and drive spontaneous emission.
(b) Simplified one-dimensional model, where plane waves are scattered off a single-mode cavity in
one direction only. The strong driving field is treated semiclassically and coherently excites the
cavity mode, which has frequency $\omega_0$. Spontaneous emission can be added to the model in a
phenomenological manner.

Instead, we will investigate a highly simplified model (Fig. 1b) that is widely
used  in  quantum  optics  [27,78,83].  To  justify  such  a  model,  we  claim  that  most  of 
the  interesting  physics  occurs  in  the  direction  of  the  driving  laser,  as  most  of  the 
light  is  scattered  forward  and  observed  in  this  direction.  Hence  we  can  approximate 
the  system  by  a  one-dimensional  model  where  only  the  forward  modes  are  treated 
exactly  and  the  strong  drive  is  treated  semiclassically.  Spontaneous  emission  into  the 
eliminated  modes  is  unobserved,  and  hence  we  could  include  it  phenomenologically  by 
adding  decoherence.  Finally,  to  simplify  the  interaction  with  the  ensemble,  we  place 
the  ensemble  into  a  leaky  single  mode  cavity.  This  allows  us  to  treat  the  interaction 
between  the  ensemble  and  the  field  for  a  single  frequency  only,  that  of  the  cavity  mode, 
which is chosen to be at the laser driving frequency $\omega_0$. The cavity dynamics is then
adiabatically  eliminated  to  give  an  effective  interaction  between  the  ensemble  and  the 
external  field. 

Let us systematically work out this simplified model. We begin by treating the
one-dimensional external field that is ultimately detected. We can obtain an expression
for the field by integrating Eq. (8) over a transverse area $A$ [27], or alternatively by
directly quantizing the wave equation in one dimension [78]. We obtain

$$E(z,t) = \int_0^\infty \sqrt{\frac{\hbar\omega}{4\pi\varepsilon_0 cA}}\,\left(i\,a_\omega\,e^{-i\omega(t-z/c)} + \mathrm{h.c.}\right) d\omega \tag{10}$$

for the electric field intensity in a single polarization state (we will assume polarized
light), where $[a_\omega, a^\dagger_{\omega'}] = \delta(\omega-\omega')$. The annihilators $a_\omega$ correspond to plane wave
modes in the z-direction, $\mathbf{k} = (\omega/c)\hat{\mathbf{z}}$, where positive z is defined to be on the left
in Fig. 1. Thus the field for z < 0 is propagating towards the cavity mirror in time,
whereas z > 0 propagates away from the mirror. In practice the cavity mirror will
reverse the propagation direction, so we can reinterpret z > 0 as the component of the
field propagating toward the detector, whereas z < 0 is the incident part of the field§.

We now introduce the cavity mode with annihilation operator $b(t) = b\,e^{-i\omega_0 t}$. The
interaction Hamiltonian between the cavity mode and the external field is given by

$$H_{CF}(t) = i\hbar\,(b(t) + b(t)^\dagger)\,(E^{(-)}(0,t) - E^{(+)}(0,t)) \tag{11}$$

where we have used

$$E^{(+)}(z,t) = [E^{(-)}(z,t)]^\dagger = \int_0^\infty \kappa(\omega)\,a_\omega\,e^{-i\omega(t-z/c)}\,d\omega \tag{12}$$

Here $\kappa(\omega)$ depends not only on the external field but also on the frequency-dependent
transmission of the cavity mirror, and is unitless. An interaction
Hamiltonian of this form can be obtained from the QED Hamiltonian by expanding it
into “quasi-modes” corresponding to either inside or outside the cavity; see [15].

We will briefly describe the remaining Hamiltonians. The interaction Hamiltonian
of the cavity mode with a resonant classical drive is given by

$$H_D = \hbar\mathcal{E}\,(b + b^\dagger) \tag{13}$$

where $\mathcal{E}$ is the drive amplitude. Spontaneous emission is treated by introducing
another field $E_\perp$ with annihilation operators $c_\omega$, called the side channel, and adding
another Hamiltonian $H_\perp = -d(t)\,E_\perp(0,t)$ where $d(t)$ is an atomic dipole operator.
Unlike the forward channel $E$, which we will ultimately observe, the side channel is
left unobserved. This is a simple but effective way to model the partitioning of the
full three-dimensional field $\mathbf{E}(\mathbf{r},t)$ into observed and unobserved modes.

The atomic Hamiltonian $H_A$ and the ensemble-cavity mode interaction $H_{AC}$ are
more  variable,  as  they  depend  on  the  structure  of  the  atoms  in  the  ensemble.  In 
particular,  we  get  drastically  different  behavior  when  the  atoms  have  a  transition  that 
is  resonant  with  the  cavity  mode  than  in  the  far  detuned  case.  We  will  consider  a 
specific  example  in  section  3.3. 

3.2.  Quantum  noise  and  the  Markov  limit 

The  discussion  in  the  previous  section  was  based  entirely  on  “mechanical”  arguments; 
i.e.,  the  electric  field  emerged  naturally  by  quantization  of  Maxwell’s  equations  and 
the  dipole  coupling  to  matter.  Any  physical  model  ultimately  has  its  roots  in  this 
level  of  description.  However,  we  have  already  discussed  that  the  foundations  of 
quantum  theory  are  essentially  a  glorified  probability  theory,  where  any  observable  is 
equivalent  to  a  random  variable  on  some  probability  space.  As  we  will  be  interested 
in  observations  of  the  field,  it  is  essential  to  make  the  connection  between  the  physical 
model  and  its  manifestation  as  a  (quantum)  probabilistic  dynamical  system. 

§ In a full three-dimensional description the cavity mirror would be modeled by an interaction
Hamiltonian that scatters into the backward propagating modes. In
the one-dimensional case, however, we can simply absorb this reflection into the definition of the
field.

We can consider Eq. (12) as the Fourier transform of the operator distribution
$\kappa(\omega)\theta(\omega)a_\omega$, where $\theta$ is the step function. We will always take the incoming field to be
in the vacuum state as in Fig. 1. Each $a_\omega$ can be thought of as an independent quantum
“complex Gaussian” random variable, in the sense that its “real” and “imaginary”
parts $a_\omega + a_\omega^\dagger$ and $i a_\omega^\dagger - i a_\omega$ are precisely Gaussian random variables. Hence $E^{(+)}(0,t)$
will be some sort of quantum complex Gaussian noise. Note that the two quadratures
$E^{(+)} + E^{(-)}$ and $iE^{(-)} - iE^{(+)}$ do not commute, so we cannot interpret $E^{(+)}(0,t)$ as
a classical complex noise. We would now like to consider the Heisenberg equation (in
the interaction picture with respect to the field dynamics)

$$\dot{X}_t = \frac{i}{\hbar}\,[H_\perp + H_A + H_{AC} + H_D + H_{CF}(t),\, X_t] \tag{14}$$

as being driven by the noise $iE^{(-)} - iE^{(+)}$, together with an observation of the field
which need not commute with the driving noise. Then the statistical inference step
can be formulated as finding the best estimate of the noisy time evolution of atomic
observables given noisy observations of the field.
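The statement that each quadrature is Gaussian on its own while the two quadratures fail to commute can be checked numerically for a single mode. The sketch below is illustrative only: it uses one truncated harmonic-oscillator mode as a stand-in for a single $a_\omega$ (the truncation dimension `dim` is an arbitrary assumption) and evaluates vacuum moments and the commutator.

```python
import numpy as np

dim = 20                                        # Fock-space truncation (assumption)
a = np.diag(np.sqrt(np.arange(1, dim)), k=1)    # truncated annihilation operator
ad = a.conj().T

x = a + ad               # "real" quadrature
y = 1j * ad - 1j * a     # "imaginary" quadrature

vac = np.zeros(dim)
vac[0] = 1.0             # vacuum state

def vev(op):
    """Vacuum expectation value <0|op|0>."""
    return vac @ op @ vac

m1 = vev(x)                  # mean
m2 = vev(x @ x)              # variance
m4 = vev(x @ x @ x @ x)      # fourth moment; a Gaussian has m4 = 3 * m2**2
comm = vev(x @ y - y @ x)    # nonzero: x and y are not jointly classical

print(m1, m2, m4, comm)
```

The low moments of `x` match those of a unit-variance Gaussian, yet the commutator of `x` and `y` does not vanish, which is why the pair cannot be modeled as one classical complex noise.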

Similar  problems  have  been  studied  in  classical  probability  for  about  a  century, 
and  the  main  lessons  learned  there  appear  to  carry  over  to  the  quantum  case.  In 
particular: 

(i)  Statistical  inference  of  continuous-time  processes  is  essentially  intractable  unless 
we  approximate  the  noise  process  by  a  white  noise.  In  this  case,  the  time  evolution 
of  the  system  is  Markovian  [60]  (i.e.  the  distribution  of  future  system  states 
depends  only  on  the  present  state  and  not  on  past  history)  and  statistical  inference 
is  described  by  the  elegant  theory  of  Markov  nonlinear  filtering  [16,51]. 

(ii)  Dealing  with  white  noise  directly  is  possible,  but  the  resulting  theory  is  very 
technical  due  to  the  fact  that  white  noise  is  an  extremely  singular  object 
[35,43,45].  It  is  much  easier  to  build  a  theory  from  a  Wiener  process,  the  integral 
of  white  noise,  which  is  at  least  continuous  [60]. 

We will follow a similar program below for quantum systems; i.e., we will first find
a  Markov  approximation  of  the  full  field-theoretic  model  described  previously,  then 
develop  a  theory  of  quantum  Markov  filtering. 

Before  we  embark  on  this  path,  it  should  be  mentioned  that  the  problem  with 
colored  noise  takes  on  an  even  more  severe  form  in  the  quantum  case.  In  the  classical 
case  the  problem  is  mainly  technical;  there  is  no  conceptual  problem  associated 
to  statistical  inference  with  colored  noise,  but  it  is  not  possible  to  obtain  filtering 
equations  in  a  recursive  form  [14].  In  the  quantum  case,  however,  it  is  not  even  clear 
what  we  mean  by  an  observation  of  colored  noise,  let  alone  the  associated  statistical 
inference  problem,  as  the  field  operators  may  not  commute  with  themselves  at  different 
times  or  with  the  system  [25].  There  is  as  of  yet  no  satisfactory  solution  to  this 
problem;  in  particular,  a  satisfactory  theory  of  quantum  non-Markovian  continuous 
measurement  has  yet  to  be  developed.  As  we  will  see,  however,  these  problems  do  not 
appear  in  the  Markov  case. 

3.2.1. Classical and quantum stochastic differential equations

Let us briefly review
the classical concept of a stochastic differential equation (SDE) [60]. We denote by
$W_t$ a one-dimensional Wiener process. It is defined on a probability space $\Omega$ where
each $\omega \in \Omega$ corresponds to a single sample path $\{W_t(\omega)\}$ of the Wiener process.
Heuristically the time derivative $\dot{W}_t$ would be white noise, so we wish to give meaning
to a differential equation of the form

$$\frac{dX_t(\omega)}{dt} = f(X_t(\omega)) + \sigma(X_t(\omega))\,\frac{dW_t(\omega)}{dt} \tag{15}$$

However, this equation makes no mathematical sense as $W_t$ is differentiable with
probability zero. The solution is to rewrite it as an integral equation

$$X_t = x_0 + \int_0^t f(X_s)\,ds + \int_0^t \sigma(X_s)\cdot dW_s \tag{16}$$

and then to define the stochastic integral. As a notational analogy with ordinary
differential equations we will also write

$$dX_t = f(X_t)\,dt + \sigma(X_t)\cdot dW_t \tag{17}$$

which is equivalent to (16) by definition.

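Ito defined a stochastic integral in the following way:

$$\int_{t_0}^{t_n} f_s\,dW_s = \lim_{|t_{i+1}-t_i|\to 0}\ \sum_{k=0}^{n-1} f_{t_k}\,(W_{t_{k+1}} - W_{t_k}) \tag{18}$$

Precisely in which sense the limit is taken is a central construction in Ito’s theory
which we gloss over. A different definition, due to Stratonovich, is

$$\int_{t_0}^{t_n} f_s\circ dW_s = \lim_{|t_{i+1}-t_i|\to 0}\ \sum_{k=0}^{n-1} \tfrac{1}{2}\,(f_{t_{k+1}} + f_{t_k})\,(W_{t_{k+1}} - W_{t_k}) \tag{19}$$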

It  is  a  signature  of  the  singularity  of  the  problem  that  these  two  integrals  do  not  give 
the  same  answer;  such  integrals  would  necessarily  be  the  same  if  we  could  interpret 
them  in  the  Riemann-Stieltjes  sense.  It  is  now  ambiguous,  however,  how  we  should 
interpret  Eq.  (16). 
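The discrepancy between the two definitions is easy to see numerically. The following Python sketch (an illustration, not part of the original development; the seed, horizon `T` and step count `n` are arbitrary choices) evaluates the discrete sums of Eqs. (18) and (19) for the integrand $f_s = W_s$ on one simulated Wiener path.

```python
import numpy as np

rng = np.random.default_rng(0)
T, n = 1.0, 100_000
dW = rng.normal(0.0, np.sqrt(T / n), size=n)    # Wiener increments
W = np.concatenate(([0.0], np.cumsum(dW)))      # sample path W_{t_k}

# Ito sum: integrand evaluated at the left endpoint of each interval
ito = np.sum(W[:-1] * dW)

# Stratonovich sum: integrand averaged over both endpoints
strat = np.sum(0.5 * (W[1:] + W[:-1]) * dW)

print("Ito          :", ito, " vs (W_T^2 - T)/2 =", 0.5 * (W[-1] ** 2 - T))
print("Stratonovich :", strat, " vs W_T^2/2 =", 0.5 * W[-1] ** 2)
print("difference   :", strat - ito, " vs T/2 =", T / 2)
```

The two sums differ by half the accumulated squared increments, which approaches T/2 as the partition is refined rather than vanishing.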

A major difference between the two integrals is their transformation property.
Ordinary Riemann-Stieltjes integrals obey the Leibnitz rule $d(X_tY_t) = Y_t\,dX_t + X_t\,dY_t$
(we use the shorthand notation of Eq. (17)). It turns out that this property is also
obeyed by the Stratonovich integral (19). The Ito integral, on the other hand, obeys
the modified transformation property $d(X_tY_t) = Y_t\,dX_t + X_t\,dY_t + dX_t\,dY_t$, where
we use the Ito rules $dW_t^2 = dt$, $dt^2 = dW_t\,dt = 0$ to evaluate the rightmost term.
Similarly, the Ito transformation rule for arbitrary functions becomes

$$dg(X_t) = g'(X_t)\,dX_t + \tfrac{1}{2}\,g''(X_t)\,dX_t^2 \tag{20}$$

Note how the shorthand notation of Eq. (17) allows us to express these deep results
in a compact way. The power of the Ito calculus lies in the fact that complicated
transformations can be performed using only simple symbolic manipulations.
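As a minimal worked example of such a symbolic manipulation (added here for illustration), take $g(x) = x^2$ and $X_t = W_t$ in the Ito transformation rule, and evaluate the second-order term with $dW_t^2 = dt$:

```latex
\begin{align*}
d(W_t^2) &= 2\,W_t\,dW_t + \tfrac{1}{2}\cdot 2\,dW_t^2
          = 2\,W_t\,dW_t + dt, \\
\int_0^t W_s\,dW_s &= \tfrac{1}{2}\left(W_t^2 - t\right).
\end{align*}
```

The ordinary chain rule would give $W_t^2/2$ instead; the extra term $-t/2$ comes entirely from the rule $dW_t^2 = dt$.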

The fact that the Stratonovich integral obeys the Leibnitz rule suggests that
physical systems should be described by a Stratonovich SDE; after all, if we take a
physical system with a smooth driving force, and add some noise to this force, we do
not expect the transformation properties of the system to change. We will investigate
this further in the next section. On the other hand, the Ito integral has the nice
property that its expectation vanishes‖, which suggests that Ito SDE are natural
from the point of view of statistical inference. Fortunately we can have it both ways,
as there is a conversion formula between Ito and Stratonovich SDE: the solution of
$dX_t = f(X_t)\,dt + \sigma(X_t)\circ dW_t$ is equivalent to the solution of

$$dX_t = f(X_t)\,dt + \tfrac{1}{2}\,\sigma(X_t)\cdot\nabla\sigma(X_t)\,dt + \sigma(X_t)\,dW_t \tag{21}$$

We see that in the mean, the Stratonovich noise results in an effective drift. This
additional term is known as the Ito correction.

‖ The Ito integral is only defined for nonanticipative integrands, i.e. $f_t$ must be independent from
any increment $W_{t_2} - W_{t_1}$ with $t_2 > t_1 > t$. It follows immediately from (18) that the integral has
vanishing expectation.

Let us return to the quantum case. Define

$$a_t = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} a_\omega\,e^{-i\omega t}\,d\omega \tag{22}$$

where we have extended $a_\omega$ to negative frequencies. In the vacuum state, the two
quadratures $x_t = a_t + a_t^\dagger$ and $y_t = i a_t - i a_t^\dagger$ have zero mean and delta-correlated
covariance, e.g. $\mathbb{E}(x_t) = 0$ and $\mathbb{E}[x_t x_s] = \mathbb{E}[a_t a_s^\dagger] = \delta(t-s)$. Moreover, it is easily
verified that $[x_t, x_s] = 0$ for $t \neq s$ (and similarly for $y_t$); this is important, as it
means that we can interpret $x_t$ as a classical random process. Indeed, following the
procedure of section 2.2, we can simultaneously diagonalize the operators $x_t$ at all
times and transform to a classical probability space. We find that both $x_t$ and $y_t$ are
entirely identical to classical white noise. We will thus call the field $a_t$ quantum white
noise.

Note that the noise $E^{(+)}(0,t)$ that drives Eq. (14) is not white. However, if the
system response has a sufficiently narrow bandwidth we would expect the noise to
“look” white on the slow timescale of the system, as $\kappa(\omega)$ is locally flat. Equation
(22),  and  the  associated  introduction  of  negative  frequencies,  should  be  seen  purely 
as  a  mathematical  construction  that  corresponds  to  noncommutative  white  noise.  In 
the  next  section  we  will  make  these  ideas  more  precise  by  showing  in  what  sense  the 
physical  model  (14)  can  be  approximated  using  noises  of  this  form. 

We now proceed as in the classical case. Define the quantum Wiener process

$$A_t = \int_0^t a_s\,ds \tag{23}$$

We can now introduce the quantum Ito integral [10,26,37,55]

$$\int_{t_0}^{t_n} X_s\,dA_s = \lim_{|t_{i+1}-t_i|\to 0}\ \sum_{k=0}^{n-1} X_{t_k}\,(A_{t_{k+1}} - A_{t_k}) \tag{24}$$

for nonanticipative $X_t$ (i.e., $X_t$ is independent of any increment $A_u - A_v$, $u > v > t$).
It immediately follows that the integral has vanishing expectation in the vacuum
state. Moreover, as $X_{t_k}$ is independent from $A_{t_{k+1}} - A_{t_k}$, the process and increment
commute: hence $X_t\,dA_t = dA_t\,X_t$. The quantum Ito rules are $dA_t\,dA_t^\dagger = dt$,
$dA_t\,dt = dA_t^\dagger\,dA_t = dA_t^2 = 0$.

Similarly, we can define a quantum Stratonovich integral [26,31,32]

$$\int_{t_0}^{t_n} X_s\circ dA_s = \lim_{|t_{i+1}-t_i|\to 0}\ \sum_{k=0}^{n-1} \tfrac{1}{2}\,(X_{t_{k+1}} + X_{t_k})\,(A_{t_{k+1}} - A_{t_k}) \tag{25}$$

which  obeys  the  Leibnitz  rule  but  does  not  have  vanishing  expectation.  Additionally, 
in  this  case  Xt  does  not  commute  with  the  noise  increment,  so  Xt  o  dAt  and  dAt  o  Xt 
are  two  distinct  forms  of  the  Stratonovich  integral. 

Note that the above discussion is entirely heuristic; the mathematical objects
we are using are extremely singular and require careful definition. The quantum Ito
theory was introduced in a rigorous way by Hudson and Parthasarathy [37]; a more
heuristic  treatment  can  be  found  in  Gardiner  and  Collett  [26].  More  recently  the 
relations  between  the  quantum  Ito,  Stratonovich  and  white  noise  formalisms  were 
investigated  by  Gough  [31,32].  We  refer  to  these  references  for  a  detailed  treatment. 

3.2.2. The Wong-Zakai theorem and the Markov limit

Mathematically, (quantum)
stochastic differential equations are rather peculiar objects: strictly speaking they are
not even differential equations, but integral equations. Nonetheless SDE are widely
used to model physical phenomena. The reason that this is so successful stems from
an important result, originally due to Wong and Zakai [82], which can be stated as
follows. Suppose we have an ordinary differential equation of the form

$$\frac{dx^\lambda(t)}{dt} = f(x^\lambda(t)) + \sigma(x^\lambda(t))\,\xi^\lambda(t) \tag{26}$$

where $\xi^\lambda(t)$ is some piecewise smooth random process that converges to white noise
in some appropriate sense as $\lambda \to 0$. Then the solution $x^\lambda(t)$ of Eq. (26) converges as
$\lambda \to 0$ to the solution of

$$dX_t = f(X_t)\,dt + \sigma(X_t)\circ dW_t \tag{27}$$

This  result  tells  us  that  the  behavior  of  a  “real”  physical  system  is  well  approximated 
by  the  solution  of  an  SDE  as  long  as  the  noise  is  sufficiently  wideband.  Additionally 
our  notion  that  physical  systems  are  well  described  by  Stratonovich  equations  is  now 
rigorously  justified.  In  the  remainder  of  this  section  we  will  give  a  simple  introduction 
to  the  quantum  analog  of  the  Wong-Zakai  procedure.  For  a  rigorous  treatment,  we 
refer  to  [1,33]. 
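The classical statement can be illustrated with a small simulation. The sketch below is a toy check under explicit assumptions, not a proof: it takes $f = 0$ and $\sigma(x) = x$, replaces the Wiener path by a piecewise-linear interpolation so that the driving noise is piecewise constant, and integrates the resulting ordinary differential equation with many small Euler substeps. The parameters `n`, `m` and the seed are arbitrary. The result is compared with the known solutions $e^{W_T}$ of the Stratonovich equation $dX_t = X_t \circ dW_t$ and $e^{W_T - T/2}$ of the Ito equation $dX_t = X_t\,dW_t$.

```python
import numpy as np

rng = np.random.default_rng(1)
T, n, m = 1.0, 2000, 200        # n noise intervals, m ODE substeps per interval
dW = rng.normal(0.0, np.sqrt(T / n), size=n)
W_T = dW.sum()

# Smooth approximation: on each interval the noise is the constant
# xi = dW / dt, so dx/dt = x * xi is an ordinary ODE. One explicit Euler
# substep of size dt/m multiplies x by (1 + xi * dt / m) = (1 + dW / m).
x_ode = np.prod((1.0 + dW / m) ** m)

# Ito Euler-Maruyama scheme for dX = X dW on the same path
x_ito = np.prod(1.0 + dW)

print("ODE with smoothed noise:", x_ode)
print("Stratonovich solution  :", np.exp(W_T))
print("Ito Euler-Maruyama     :", x_ito)
print("Ito solution           :", np.exp(W_T - T / 2))
```

On a given path the smoothed ordinary differential equation tracks the Stratonovich solution, while the Ito scheme differs from it by the factor $e^{-T/2}$ produced by the Ito correction.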

As  a  first  step  we  partition  our  system  into  fast  and  slow  timescales.  The 
electromagnetic  noise  and  the  high-frequency  oscillation  of  the  cavity  mode  operate 
on  the  fast  timescale,  whereas  the  driving  field  and  the  coupling  to  the  atoms  operate 
on  a  much  slower  timescale.  In  order  to  study  the  Wong-Zakai  limit  we  completely 
ignore  the  slow  interactions  by  turning  them  off — a  very  good  approximation  if  the 
correlation  time  of  the  noise  is  short.  This  is  equivalent  to  the  assumptions  mentioned 
in the previous section: a short correlation time implies that $\kappa(\omega)$ is slowly varying,
whereas  ignoring  the  slow  interactions  assumes  that  these  do  not  significantly  shift 
the  resonance  frequency  of  the  cavity. 

What remains is the fast dynamics, which we write in propagator form

$$\frac{dU_t}{dt} = -\frac{i}{\hbar}\,H_{CF}\,U_t = (b(t) + b(t)^\dagger)\,(E^{(-)}(0,t) - E^{(+)}(0,t))\,U_t \tag{28}$$

The key physical assumption we must make to obtain the white noise limit is that the
cavity is weakly coupled to the external field. Naively one would expect that we could
implement this limit by solving the equation $dU_t/dt = -i\lambda H_{CF}U_t/\hbar$ and then taking
the limit $\lambda \to 0$. This clearly doesn’t work, however, as this would just turn off the
interaction between the cavity and the field. The problem is that $\lambda$ not only changes
the coupling strength, but also the timescale of the interaction dynamics.

The effect that we are trying to capture in the weak coupling limit is not a precise
description of fast dynamics, but the effective contribution of the noise to the slow
dynamics. We saw in the classical case, Eq. (21), that the noise causes an effective
drift in the system dynamics. If we replace $\sigma(x) \mapsto \lambda\sigma(x)$, we infer from (21) that this
drift occurs on a timescale $t/\lambda^2$. This suggests that we can make the substitution
$H_{CF} \mapsto \lambda H_{CF}$ and let $\lambda \to 0$, but we will only obtain the weak coupling limit if we
simultaneously rescale time as $t \mapsto t/\lambda^2$. This idea was originally suggested in the
context of classical stochastic approximations by Stratonovich [67] and was introduced
independently in the physics literature by Van Hove [72].

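After performing these rescalings we obtain

$$\begin{aligned}
\frac{dU_t^\lambda}{dt} &= \frac{1}{\lambda}\,\Big(b(t/\lambda^2) + b(t/\lambda^2)^\dagger\Big)\Big(E^{(-)}(0,t/\lambda^2) - E^{(+)}(0,t/\lambda^2)\Big)\,U_t^\lambda \\
&= \Big[a_\lambda^\dagger(t)\,b + \tilde{a}_\lambda^\dagger(t)\,b^\dagger - a_\lambda(t)\,b^\dagger - \tilde{a}_\lambda(t)\,b\Big]\,U_t^\lambda
\end{aligned} \tag{29}$$

Let us investigate the behavior of the rescaled noise

$$a_\lambda(t) = \frac{1}{\lambda}\int_0^\infty \kappa(\omega)\,a_\omega\,e^{-i(\omega-\omega_0)t/\lambda^2}\,d\omega \tag{30}$$

as $\lambda \to 0$. In particular, we obtain for the correlation function

$$\mathbb{E}[a_\lambda(t)\,a_\lambda^\dagger(s)] = \frac{1}{\lambda^2}\int_0^\infty \kappa(\omega)^2\,e^{-i(\omega-\omega_0)(t-s)/\lambda^2}\,d\omega \ \xrightarrow{\lambda\to 0}\ \gamma'\,\delta(t-s) \tag{31}$$

with $\gamma' = 2\pi\kappa(\omega_0)^2$, where we have used $\lim_{\lambda\to 0} e^{-i\omega t/\lambda^2}/\lambda^2 = 2\pi\,\delta(\omega)\,\delta(t)$ (in the
sense of Schwartz distributions). Hence in the weak coupling limit the resonant terms
converge to white noise driving terms. However, for the rescaled noise

$$\tilde{a}_\lambda(t) = \frac{1}{\lambda}\int_0^\infty \kappa(\omega)\,a_\omega\,e^{-i(\omega+\omega_0)t/\lambda^2}\,d\omega \tag{32}$$

we obtain

$$\mathbb{E}[\tilde{a}_\lambda(t)\,\tilde{a}_\lambda^\dagger(s)] = \frac{1}{\lambda^2}\int_0^\infty \kappa(\omega)^2\,e^{-i(\omega+\omega_0)(t-s)/\lambda^2}\,d\omega \ \xrightarrow{\lambda\to 0}\ 0 \tag{33}$$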

Hence  the  nonresonant  terms  vanish  in  the  weak  coupling  limit.  We  see  that  the  weak 
coupling  limit  gives  us  the  commonly  used  rotating  wave  approximation  for  free. 
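The different fates of the resonant and nonresonant terms can be checked numerically. The sketch below is illustrative only: it assumes a Gaussian profile for $\kappa(\omega)^2$ normalized so that $\kappa(\omega_0) = 1$, and the values of `omega0`, `s` and the window `T` are hypothetical. It integrates each correlation function over the time window $|t - s| < T$, which should approach $\gamma' = 2\pi\kappa(\omega_0)^2$ for the resonant noise and zero for the nonresonant noise as $\lambda \to 0$.

```python
import numpy as np

omega0, s = 10.0, 2.0     # hypothetical cavity frequency and coupling bandwidth
T = 1.0                   # half-width of the time window
omega = np.arange(0.0, 30.0, 1e-3)
d_omega = omega[1] - omega[0]
kappa2 = np.exp(-(omega - omega0) ** 2 / (2.0 * s ** 2))   # kappa(omega)^2

def integrated_correlation(lam, sign):
    """Time integral over |t| < T of
    (1/lam^2) * int kappa(omega)^2 exp(-i (omega + sign*omega0) t / lam^2) d omega.
    sign = -1 is the resonant case, sign = +1 the nonresonant case."""
    big = T / lam ** 2
    x = omega + sign * omega0
    # The time integral of the exponential gives 2 sin(big x) / x,
    # written with np.sinc so that it stays finite at x = 0.
    kernel = 2.0 * big * np.sinc(big * x / np.pi)
    return np.sum(kappa2 * kernel) * d_omega

for lam in (1.0, 0.3, 0.1):
    print(lam, integrated_correlation(lam, -1), integrated_correlation(lam, +1))
print("2 pi kappa(omega0)^2 =", 2.0 * np.pi)
```

As $\lambda$ decreases the resonant weight settles at $2\pi$ while the nonresonant weight is suppressed, consistent with the rotating wave approximation emerging from the limit.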

Studying the convergence of $U_t^\lambda$ is more complicated, but can be performed by
investigating the convergence of each term in the associated Dyson series [1,33]. The
result is, however, not surprising: Eq. (29) converges to the Stratonovich equation
[31,32]

$$dU_t = \sqrt{\gamma'}\,\big(dA_t^\dagger \circ b\,U_t - dA_t \circ b^\dagger\,U_t\big) \tag{34}$$

which is essentially the quantum version of the Wong-Zakai theorem [33]. We can
equivalently express the result in the Ito form as

$$dU_t = \Big[\sqrt{\gamma'}\,b\,dA_t^\dagger - \sqrt{\gamma'}\,b^\dagger\,dA_t - \tfrac{1}{2}\,\gamma'\,b^\dagger b\,dt\Big]\,U_t \tag{35}$$

where an Ito correction term emerges as in the classical case.
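As a consistency check (a short derivation added for illustration, assuming the Ito form $dU_t = [\sqrt{\gamma'}\,b\,dA_t^\dagger - \sqrt{\gamma'}\,b^\dagger dA_t - \tfrac12\gamma' b^\dagger b\,dt]\,U_t$), the quantum Ito rules show that the correction term is exactly what keeps $U_t$ unitary:

```latex
\begin{align*}
dU_t^\dagger &= U_t^\dagger\Big[\sqrt{\gamma'}\,b^\dagger dA_t - \sqrt{\gamma'}\,b\,dA_t^\dagger
               - \tfrac{1}{2}\gamma'\,b^\dagger b\,dt\Big], \\
d(U_t^\dagger U_t) &= dU_t^\dagger\,U_t + U_t^\dagger\,dU_t + dU_t^\dagger\,dU_t, \\
dU_t^\dagger\,U_t + U_t^\dagger\,dU_t &= -\gamma'\,U_t^\dagger\,b^\dagger b\,U_t\,dt
  \quad\text{(the noise terms cancel)}, \\
dU_t^\dagger\,dU_t &= \gamma'\,U_t^\dagger\,b^\dagger b\,U_t\,dA_t\,dA_t^\dagger
  = \gamma'\,U_t^\dagger\,b^\dagger b\,U_t\,dt, \\
d(U_t^\dagger U_t) &= 0.
\end{align*}
```

Only the product $dA_t\,dA_t^\dagger = dt$ survives in the second-order term; without the drift $-\tfrac12\gamma' b^\dagger b\,dt$ the two contributions would not cancel.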

In addition to the emerging quantum stochastic equation, a detailed treatment of
the quantum Wong-Zakai limit usually results in an additional small energy shift to
the system Hamiltonian [1,33]. This energy shift can be normalized away by a proper
choice of the system Hamiltonian.

Now that we have made a Markovian approximation to the interaction, it remains to
add the slow dynamics back in. We do this simply by adding the corresponding
Hamiltonians. As discussed before, spontaneous emission is modeled by coupling
the atoms directly to an unobserved field $E_\perp$ through their dipole moment
$d(t) = \sigma e^{-i\omega_d t} + \sigma^\dagger e^{i\omega_d t}$ (here $\sigma$ is an atomic decay operator and $\omega_d$ is the dipole rotation
frequency). Through a similar analysis as the one performed above, we obtain our
complete physical model:

$$\begin{aligned}
dU_t = \Big[&\sqrt{\gamma'}\,b\,dA_t^\dagger - \sqrt{\gamma'}\,b^\dagger\,dA_t + \sqrt{\gamma_\perp}\,\sigma\,dA_t^{\perp\dagger} - \sqrt{\gamma_\perp}\,\sigma^\dagger\,dA_t^\perp \\
&- \tfrac{1}{2}\gamma'\,b^\dagger b\,dt - \tfrac{1}{2}\gamma_\perp\,\sigma^\dagger\sigma\,dt - \tfrac{i}{\hbar}\,(H_A + H_{AC} + H_D)\,dt\Big]\,U_t
\end{aligned} \tag{36}$$

Here $A_t^\perp$ denotes the quantum Wiener process of the side channel.

Before we specialize to the particular model that will be used in the remainder of the
paper, let us digress for a moment and calculate the Heisenberg evolution $X_t = U_t^\dagger X U_t$
of an arbitrary observable $X$ of the atom or cavity mode, as expressed symbolically
in Eq. (14). Using the quantum Ito rules we easily obtain

$$\begin{aligned} dX_t ={}& \tfrac{i}{\hbar}[H_a + H_{ac} + H_d, X_t]\,dt + \gamma'\,\mathcal{L}_{b_t}X_t\,dt + \gamma_\perp\,\mathcal{L}_{\sigma_t}X_t\,dt + \sqrt{\gamma'}\,[b_t^\dagger, X_t]\,dA_t \\ &+ \sqrt{\gamma'}\,[X_t, b_t]\,dA_t^\dagger + \sqrt{\gamma_\perp}\,[\sigma_t^\dagger, X_t]\,d\tilde A_t + \sqrt{\gamma_\perp}\,[X_t, \sigma_t]\,d\tilde A_t^\dagger \end{aligned} \tag{37}$$

where $\mathcal{L}_c X = c^\dagger X c - \tfrac12(c^\dagger c X + X c^\dagger c)$ is the well-known Lindblad term. As the
expectations of Ito integrals vanish, clearly averaging away the noise terms ("tracing
over the bath") results in a Lindblad-type master equation in the Heisenberg picture,
which is ubiquitous in the description of quantum open systems (see e.g. [26]). In the
language of quantum probability, the unitary solution $U_t$ of the quantum Ito equation
provides a unitary dilation of the associated Lindblad equation [36].
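As a small numerical illustration of the Lindblad term, the sketch below evaluates $\mathcal{L}_c X = c^\dagger X c - \tfrac12(c^\dagger c X + X c^\dagger c)$ for a single two-level atom. The two-level example, the basis ordering and all function names are assumptions made here for illustration; they are not taken from the model above.

```python
import numpy as np

def dagger(a):
    """Hermitian conjugate of a matrix."""
    return a.conj().T

def lindblad_heisenberg(c, X):
    """Heisenberg-picture Lindblad term: c^† X c - (c^† c X + X c^† c)/2."""
    cd = dagger(c)
    return cd @ X @ c - 0.5 * (cd @ c @ X + X @ cd @ c)

# Hypothetical two-level atom in the basis (|e>, |g>); sigma = |g><e| lowers |e> to |g>.
sigma = np.array([[0, 0], [1, 0]], dtype=complex)
excited_population = dagger(sigma) @ sigma      # projector onto |e>
identity = np.eye(2, dtype=complex)

# The identity is conserved (the averaged dynamics preserves normalization) ...
gen_identity = lindblad_heisenberg(sigma, identity)
# ... while the excited-state population observable decays.
gen_population = lindblad_heisenberg(sigma, excited_population)
```

In this example the generator annihilates the identity and maps the excited-state projector to minus itself, which is the familiar exponential decay of the averaged population.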

We  now  introduce  a  highly  simplified  model  of  an  atomic  ensemble  interacting 
with  an  electromagnetic  field  [69,70].  Consider  an  atomic  ensemble  consisting  of  a  set 
of  N  atoms  with  a  degenerate  two-level  ground  state.  We  will  assume  that  all  atomic 
transitions  are  far  detuned  from  the  cavity  resonance,  so  the  interaction  between  the 
atoms and the cavity is well described by the dispersive Hamiltonian $H_{ac} = \hbar\chi F_z b^\dagger b$,
where $F_z$ is the collective dipole moment of the ensemble, i.e. it is a spin-$N/2$ angular
momentum operator, and $\chi$ determines the coupling strength. Such a Hamiltonian can
be obtained, for example, by considering the full dipole coupling and then adiabatically
eliminating all the excited states. We furthermore consider the atomic Hamiltonian
$H_a = \hbar\Delta F_z + \hbar h(t)F_y$, where $\Delta$ is the atomic detuning and $h(t)$ is the strength of
a magnetic field in the $y$-direction. The latter will allow us to apply feedback to the
system by varying the external magnetic field. We obtain

$$\begin{aligned} dU_t = \Bigl[&\sqrt{\gamma'}\,b\,dA_t^\dagger - \sqrt{\gamma'}\,b^\dagger dA_t + \sqrt{\gamma_\perp}\,\sigma\,d\tilde A_t^\dagger - \sqrt{\gamma_\perp}\,\sigma^\dagger d\tilde A_t - \tfrac12\gamma_\perp\,\sigma^\dagger\sigma\,dt \\ &- \tfrac12\gamma'\,b^\dagger b\,dt - i\bigl(\Delta F_z + h(t)F_y + \chi F_z b^\dagger b + \mathcal{E}(b + b^\dagger)\bigr)\,dt\Bigr]U_t \end{aligned} \tag{38}$$

Adiabatically eliminating the cavity [19,24,74], assuming that $\gamma'$ and $\mathcal{E}$ are sufficiently
large so this is a good approximation, yields

$$\begin{aligned} dU_t = \Bigl[&\sqrt{\gamma_\perp}\,\sigma\,d\tilde A_t^\dagger - \sqrt{\gamma_\perp}\,\sigma^\dagger d\tilde A_t + \sqrt{M}\,F_z\,(dA_t^\dagger - dA_t) \\ &- \tfrac12\gamma_\perp\,\sigma^\dagger\sigma\,dt - \tfrac12 M F_z^2\,dt - i\Bigl(\frac{4\chi\mathcal{E}^2}{(\gamma')^2} + \Delta\Bigr)F_z\,dt - ih(t)F_y\,dt\Bigr]U_t \end{aligned} \tag{39}$$

where $M = 16\chi^2\mathcal{E}^2/(\gamma')^3$ is the effective interaction strength. It is convenient to
choose the atomic detuning $\Delta = -4\chi\mathcal{E}^2/(\gamma')^2$, and we will henceforth assume that
this is the case (experimentally we can always fix the detuning by applying a magnetic
field in the $z$-direction).
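The arithmetic behind the effective parameters can be sketched in a few lines. The function name and the numerical values below are purely illustrative assumptions; only the two formulas for $M$ and $\Delta$ come from the text.

```python
def effective_parameters(chi, drive, gamma_cav):
    """Effective measurement strength M and compensating detuning Delta.

    chi       : dispersive coupling strength
    drive     : cavity drive amplitude (the symbol E in the text)
    gamma_cav : cavity decay rate (gamma' in the text)
    """
    M = 16.0 * chi**2 * drive**2 / gamma_cav**3
    delta = -4.0 * chi * drive**2 / gamma_cav**2
    return M, delta

# Illustrative numbers only (not experimental values).
M, delta = effective_parameters(chi=1.0, drive=1.0, gamma_cav=2.0)

# With this choice of detuning the residual F_z rotation in Eq. (39) cancels.
residual_shift = 4.0 * 1.0 * 1.0**2 / 2.0**2 + delta
```

The last line makes the purpose of the detuning choice explicit: the drive-induced shift of $F_z$ and the detuning cancel, leaving only the measurement and feedback terms.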

Finally, we will for simplicity neglect spontaneous emission by setting $\gamma_\perp = 0$, a
good approximation if $\gamma_\perp \ll M$ (in this case the interesting system dynamics takes
place long before spontaneous emission sets in). This gives

$$dU_t = \Bigl[\sqrt{M}\,F_z\,(dA_t^\dagger - dA_t) - \tfrac12 M F_z^2\,dt - ih(t)F_y\,dt\Bigr]U_t \tag{40}$$


We  will  use  this  highly  simplified  model  as  an  example  throughout  the  remainder  of 
the  article. 
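For numerical experiments with this model one needs matrix representations of the collective spin operators. The following is a minimal sketch, assuming the standard angular momentum ladder construction in the basis $m = F, F-1, \ldots, -F$; the function name and basis ordering are choices made here, not part of the model.

```python
import numpy as np

def spin_operators(F):
    """Return (Fx, Fy, Fz) for spin F in the basis m = F, F-1, ..., -F."""
    dim = int(round(2 * F)) + 1
    m = F - np.arange(dim)                       # magnetic quantum numbers
    Fz = np.diag(m).astype(complex)
    # Raising operator: <m+1| F_+ |m> = sqrt(F(F+1) - m(m+1))
    Fplus = np.zeros((dim, dim), dtype=complex)
    for k in range(1, dim):
        Fplus[k - 1, k] = np.sqrt(F * (F + 1) - m[k] * (m[k] + 1))
    Fminus = Fplus.conj().T
    Fx = 0.5 * (Fplus + Fminus)
    Fy = -0.5j * (Fplus - Fminus)
    return Fx, Fy, Fz

# N = 10 two-level atoms correspond to spin N/2 = 5.
Fx, Fy, Fz = spin_operators(5)
```

The operators satisfy $[F_x, F_y] = iF_z$ and $F_x^2 + F_y^2 + F_z^2 = F(F+1)$, which is a convenient sanity check before using them in Eq. (40).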


4.  Conditioning:  classical  probability  and  quantum  filtering 

In  the  previous  section  we  considered  in  detail  the  physical  interactions  between  an 
atomic  ensemble  and  the  electromagnetic  field,  which,  after  many  simplifications, 
were  condensed  into  Eq.  (40).  This  expression  contains  all  the  physical  dynamics 
of  our  model.  We  now  start  the  second  step  in  our  program,  in  which  we  perform 
statistical  inference  of  the  atomic  dynamics  based  on  an  observation  of  the  field.  Our 
approach  [71]  is  inspired  by  [5,7]. 

4.1.  Optical  detection 

Before  we  can  derive  a  filtering  equation  we  must  specify  what  measurement  is 
performed.  We  will  consider  the  case  of  (balanced)  homodyne  detection,  which 
measures  a  quadrature  of  the  outgoing  field.  The  principles  of  this  method  are 
discussed  in  many  textbooks  [62,  73]  and  a  continuous  time  description  in  terms 
of  quantum  stochastic  calculus  can  be  found  in  [3].  Homodyne  detection  has 
the advantage that it gives rise to a continuous, Wiener-process-type integrated
photocurrent,  which  is  particularly  convenient  for  continuous  time  feedback  control. 

Other  types  of  detection  may  be  convenient  in  different  situations  depending  on 
the  experimental  setup.  For  example,  the  spin  squeezing  experiment  [30]  makes  use 
of  polarimetry,  which  can  be  modeled  in  a  very  similar  way  as  homodyne  detection. 
Though  photon  counting  detection  also  has  a  continuous  time  description  in  terms  of 
quantum  stochastic  calculus,  it  gives  rise  to  a  discrete  jump  process  which  is  much 
less  convenient  for  the  purpose  of  feedback  control. 

Heuristically, consider Eq. (40) as being driven by the white noise $a_t$, the
"derivative" of $A_t$. An ideal wide-band homodyne detector will measure the field
observable $a_t + a_t^\dagger$ after the field has interacted with the ensemble; i.e., we observe
the photocurrent $I(t) = U_t^\dagger(a_t + a_t^\dagger)U_t$. As usual, mathematically rigorous results are
much more easily obtained in integrated form; hence we define as our observation the
integrated photocurrent

$$Y_t = U_t^\dagger(A_t + A_t^\dagger)U_t \tag{41}$$

where  I(t)  can  be  considered  the  “derivative”  of  Yt.  For  a  rigorous  treatment  directly 
from  the  quantum  stochastic  description  we  refer  to  [3] . 

Finding  an  explicit  expression  for  Yt  is  a  straightforward  exercise  in  the  use  of 
the  quantum  Ito  rules.  From  Eqs.  (40)  and  (41)  we  directly  obtain 

$$dY_t = 2\sqrt{M}\,U_t^\dagger F_z U_t\,dt + dA_t + dA_t^\dagger \tag{42}$$


Thus  clearly  homodyne  detection  of  the  field  provides  a  measurement  of  the  system 
observable  Fz(t)  corrupted  by  the  incident  field  noise. 
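The computation behind Eq. (42) can be sketched as follows. The abbreviations $L = \sqrt{M}F_z$ and $Z_t = A_t + A_t^\dagger$ are introduced here only for this sketch, and we assume the usual quantum Ito table in which $dA_t\,dA_t^\dagger = dt$ is the only nonvanishing product of increments.

```latex
\begin{align*}
dU_t &= \bigl[L\,(dA_t^\dagger - dA_t) - \tfrac12 L^2\,dt - ih(t)F_y\,dt\bigr]U_t, \\
dY_t &= dU_t^\dagger\,Z_t U_t + U_t^\dagger Z_t\,dU_t + dU_t^\dagger\,Z_t\,dU_t
        && \text{(these cancel: $Z_t$ commutes with $L$, $F_y$ and the increments)} \\
     &\quad + U_t^\dagger\,dZ_t\,U_t
        && \text{($= dA_t + dA_t^\dagger$, as future increments commute with $U_t$)} \\
     &\quad + dU_t^\dagger\,dZ_t\,U_t + U_t^\dagger\,dZ_t\,dU_t
        && \text{(each equals $U_t^\dagger L U_t\,dt$ by $dA_t\,dA_t^\dagger = dt$)} \\
     &= 2\sqrt{M}\,U_t^\dagger F_z U_t\,dt + dA_t + dA_t^\dagger .
\end{align*}
```

The signal term thus arises entirely from the Ito cross terms between the field increments in $dZ_t$ and those in $dU_t$, $dU_t^\dagger$.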

We  will  extend  our  observation  model  a  little  further.  We  assumed  in  the  above 
analysis  that  the  detection  efficiency  is  perfect.  In  practice  there  will  always  be  some 
technical  noise  added  to  the  signal,  either  due  to  the  intrinsic  loss  mechanisms  in  the 
photodetectors  or  due  to  noise  in  the  detection  electronics  (e.g.  amplifier  noise).  We 
will model these effects by the addition of an uncorrelated white noise term $dW_t'$ to
the observation current; i.e.

$$dY_t = 2\sqrt{M\eta}\,U_t^\dagger F_z U_t\,dt + \sqrt{\eta}\,(dA_t + dA_t^\dagger) + \sqrt{1-\eta}\,dW_t' \tag{43}$$

where $\eta \in (0, 1]$ determines the relative strength of the technical noise ($\eta = 1$ is perfect
detection). We can interpret the white noise $dW_t'$ as an operator process by embedding
it in a quantum probability space, e.g. $dW_t' = dB_t + dB_t^\dagger$ for some uncorrelated field
$B_t$ that does not interact with the system. Note that we have rescaled the current
$Y_t$ so that the total corrupting noise has unit variance, i.e. $dY_t^2 = dt$; this gives a
convenient normalization of the photocurrent. Experimentally the observed current
will have some arbitrary amplification.
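The unit-variance normalization can be checked in one line. This is a sketch using the Ito rules $dA_t\,dA_t^\dagger = dt$ and $(dW_t')^2 = dt$, with the cross terms vanishing because the two noises are uncorrelated and the drift contributing only at higher order in $dt$:

```latex
\begin{equation*}
(dY_t)^2 = \eta\,(dA_t + dA_t^\dagger)^2 + (1-\eta)\,(dW_t')^2
         = \eta\,dt + (1-\eta)\,dt = dt .
\end{equation*}
```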

In order to make sense as an observed current, $Y_t$ must be a classical stochastic
process, i.e. $[Y_t, Y_s] = 0$ $\forall s \neq t$; clearly any sample path recorded in the laboratory is
classical. From Eq. (43), however, it is not at all obvious that this is the case. Once
again we resort to a heuristic argument which can be made rigorous in a detailed
treatment of quantum stochastic calculus. Eq. (40) implies that the observable $a_t$
only interacts with the system at time $t$. As $a_t$ is independent from $a_s$ when $t \neq s$, it
follows that $U_t^\dagger a_s U_t = U_s^\dagger a_s U_s$ $\forall t > s$. But then
$[I(t), I(s)] = U_t^\dagger[a_t + a_t^\dagger, a_s + a_s^\dagger]U_t = 0$,
as we have already established that $a_t + a_t^\dagger$ is entirely classical white noise. Hence $Y_t$,
the integral of $I(t)$ plus technical noise, is also a classical stochastic process.

There  is  another  property  of  the  observation,  called  the  nondemolition  property 
by  Belavkin  [6],  that  is  essential  in  what  follows.  Let  X  be  some  observable  of  the 
atomic  ensemble.  Then  it  is  easy  to  show,  in  exactly  the  same  way  we  showed  that 
$I(t)$ is a classical process, that $[U_t^\dagger X U_t, Y_s] = 0$ $\forall s \le t$; i.e., any system observable at
time  t  commutes  with  all  prior  observations.  This  means,  as  we  saw  in  section  2.2, 
that  finding  the  best  estimate  of  a  system  observable  given  all  prior  observations  is 
an  entirely  classical  statistical  inference  problem.  We  will  find  the  explicit  solution  to 
this  problem,  the  quantum  filtering  equation,  in  the  next  section. 

4.2. The quantum filter

Let us begin by establishing some notation. If $X$ is an atomic ensemble observable,
denote by $j_t(X) = U_t^\dagger X U_t$ its Heisenberg evolution at time $t$. Using Eq. (40) and the
quantum Ito rules we easily obtain

$$dj_t(X) = j_t(\mathcal{L}[X])\,dt + \sqrt{M}\,j_t([X, F_z])\,(dA_t^\dagger - dA_t) \tag{44}$$

where $\mathcal{L}[X] = ih(t)[F_y, X] + M F_z X F_z - \tfrac12 M(F_z^2 X + X F_z^2)$. We have already
established the observation equation

$$dY_t = 2\sqrt{M\eta}\,j_t(F_z)\,dt + \sqrt{\eta}\,(dA_t + dA_t^\dagger) + \sqrt{1-\eta}\,dW_t' \tag{45}$$

Together, Eqs. (44) and (45) form the system-observation pair of our model. Eq. (44)
describes the time evolution of any system observable, whereas Eq. (45) describes
the observed current. The goal of the filtering problem is to find an expression
for $\pi_t(X) = \mathbb{E}[j_t(X)\,|\,Y_{s\le t}]$, the (least mean square) best estimate of the observable
$X$ given the prior observations $Y_{s\le t}$. An essential point is that the conditional
expectations $\pi_t(X)$ are guaranteed to be well-defined by the nondemolition property.

Due to the nondemolition property we could in principle simultaneously
diagonalize $j_t(X)$ and $Y_s$, $s \le t$, for every $X$, drop down to the associated classical
probability space, and calculate the classical conditional expectation $\pi_t(X)$. This is
not a very practical course of action, however, so we will need a shortcut. Moreover,
our description of the conditional expectation in section 2.1 was rather limited: we
only defined the conditional expectation with respect to one discrete random variable,
whereas $Y_{s\le t}$ is a continuous family of continuous random variables. To manipulate
such continuous quantities one needs the mathematical machinery of real analysis.

We take the following approach. From the definition of conditional expectation
in section 2.1 we can extract the following properties:

(i) $\mathbb{E}[X|Y]$ is a function of $Y$.

(ii) For any random variable $Z$ that is a function of $Y$, we must have
$\mathbb{E}[\mathbb{E}[X|Y]\,Z] = \mathbb{E}[XZ]$.

It  is  easy  to  see  that  the  definition  of  section  2.1  implies  these  properties,  and  it  is 
not  hard  to  show  that  the  converse  is  also  true.  In  the  continuous  case  we  just  take 
these  properties  as  the  definition  of  conditional  expectation.  This  is  precisely  the  real 
analytic  definition,  where  the  intuitive  idea  of  being  “a  function  of  Y”  is  replaced  by 
the  notion  of  measurability  [75]. 
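The two defining properties are easy to verify numerically in the discrete case. The sketch below uses a small joint distribution invented for illustration (the values of `x_values`, `y_values` and `p` are assumptions) and checks property (ii) for an arbitrary function $Z = f(Y)$.

```python
import numpy as np

# Hypothetical joint distribution p[x_index, y_index] of two discrete variables.
x_values = np.array([-1.0, 0.0, 2.0])
y_values = np.array([0.0, 1.0])
p = np.array([[0.10, 0.20],
              [0.30, 0.10],
              [0.05, 0.25]])

p_y = p.sum(axis=0)                    # marginal distribution of Y
# Property (i): E[X|Y] is a function of Y, here one number per value of Y.
cond_exp = (x_values @ p) / p_y

def check_property_ii(f):
    """Return (E[E[X|Y] Z], E[X Z]) for the random variable Z = f(Y)."""
    z = f(y_values)
    lhs = np.sum(cond_exp * z * p_y)
    rhs = np.sum(x_values[:, None] * z[None, :] * p)
    return lhs, rhs

lhs, rhs = check_property_ii(lambda y: np.exp(0.7 * y) + y**2)
```

The two returned numbers agree for any choice of `f`, which is exactly the statement that $\mathbb{E}[X|Y]$ reproduces all correlations of $X$ with functions of $Y$.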

We are now ready to take our shortcut. By property (i), $\pi_t(X)$ must be a function
of $Y_{s\le t}$. Introduce the ansatz

$$d\pi_t(X) = C_t(X)\,dt + D_t(X)\,dY_t \tag{46}$$

where $C_t(X)$, $D_t(X)$ are functions of $Y_{s\le t}$ to be determined. If we can determine $C_t$
and $D_t$, the filtering problem has been solved.

To implement property (ii) we use the following trick. We require that

$$\mathbb{E}\Bigl[\pi_t(X)\,e^{\int_0^t g(s)\,dY_s}\Bigr] = \mathbb{E}\Bigl[j_t(X)\,e^{\int_0^t g(s)\,dY_s}\Bigr] \tag{47}$$

for any function $g(t)$. The idea behind this is the same as that of a moment generating
function: we can generate any (analytic) function of $Y_{s\le t}$ by using an appropriate $g(t)$
and taking derivatives. Hence, if we have proved the relation (47) then we have
essentially satisfied property (ii).

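What remains is mostly a direct application of the Ito rules. For convenience we
multiply both sides of (47) by $\exp\bigl(-\tfrac12\int_0^t g(s)^2\,ds\bigr)$. Define

$$e_t^g = e^{\int_0^t g(s)\,dY_s - \frac12\int_0^t g(s)^2\,ds}, \qquad de_t^g = g(t)\,e_t^g\,dY_t \tag{48}$$

It is now straightforward to evaluate

$$\begin{aligned} \frac{d}{dt}\,\mathbb{E}[e_t^g\,\pi_t(X)] = \mathbb{E}\bigl[&e_t^g\,\bigl(C_t(X) + 2\sqrt{M\eta}\,j_t(F_z)\,D_t(X)\bigr) \\ &+ g(t)\,e_t^g\,\bigl(D_t(X) + 2\sqrt{M\eta}\,j_t(F_z)\,\pi_t(X)\bigr)\bigr] \end{aligned} \tag{49}$$

$$\frac{d}{dt}\,\mathbb{E}[e_t^g\,j_t(X)] = \mathbb{E}\bigl[e_t^g\,j_t(\mathcal{L}[X]) + \sqrt{M\eta}\,g(t)\,e_t^g\,j_t(F_z X + X F_z)\bigr] \tag{50}$$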

We now invoke Eq. (47) and attempt to find $C_t(X)$, $D_t(X)$ by comparing (49) and
(50) term by term. We run into a snag, however, as a naive comparison would yield
$C_t$ and $D_t$ in terms of $j_t(F_z)$, etc., which are not functions of $Y_{s\le t}$. Fortunately we
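can use property (ii) of conditional expectations to change all the $j_t$ terms in (49) and
(50) to the corresponding $\pi_t$ terms ($\mathbb{E}[j_t(X)] = \mathbb{E}[\pi_t(X)]$, etc.). This gives immediately

$$\begin{aligned} d\pi_t(X) ={}& \pi_t(\mathcal{L}[X])\,dt + \sqrt{M\eta}\,\bigl(\pi_t(F_z X + X F_z) \\ &- 2\pi_t(F_z)\,\pi_t(X)\bigr)\bigl(dY_t - 2\sqrt{M\eta}\,\pi_t(F_z)\,dt\bigr) \end{aligned} \tag{51}$$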

which  is  the  quantum  filtering  equation  for  our  model. 
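The term-by-term comparison can be spelled out as follows. This is a sketch of the intermediate algebra, with the $j_t$ terms already replaced by the corresponding $\pi_t$ terms under the expectation; matching the coefficients of $e_t^g$ and of $g(t)\,e_t^g$ in (49) and (50) gives two equations for the two unknowns:

```latex
\begin{align*}
C_t(X) + 2\sqrt{M\eta}\,\pi_t(F_z)\,D_t(X) &= \pi_t(\mathcal{L}[X]), \\
D_t(X) + 2\sqrt{M\eta}\,\pi_t(F_z)\,\pi_t(X) &= \sqrt{M\eta}\,\pi_t(F_z X + X F_z), \\[4pt]
\Rightarrow\quad D_t(X) &= \sqrt{M\eta}\,\bigl(\pi_t(F_z X + X F_z) - 2\pi_t(F_z)\,\pi_t(X)\bigr), \\
C_t(X) &= \pi_t(\mathcal{L}[X]) - 2\sqrt{M\eta}\,\pi_t(F_z)\,D_t(X).
\end{align*}
```

Substituting $C_t(X)$ and $D_t(X)$ into the ansatz (46) and collecting the terms proportional to $D_t(X)$ reproduces Eq. (51).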

It  is  instructive  to  recall  the  example  of  section  2.2.  There  a  simple  filtering 
scenario  was  constructed  by  coupling  a  spin  to  a  commuting  observable,  then 
conditioning  the  spin  observables  on  the  commuting  observable.  This  gave  rise  to  a  set 
of  classical  random  variables,  representing  the  conditioned  spin  observables.  Similarly, 
we  have  coupled  an  atomic  ensemble  to  an  optical  mode  and  conditioned  the  atomic 
observables  on  a  homodyne  measurement  in  the  field.  This  gave  rise  to  a  classical  Ito 
equation  (51)  for  the  conditioned  atomic  observables,  driven  by  the  observations  Yt. 

As in section 2.2 we will find it useful to represent the filter in its adjoint (density)
form. To this end, we define the conditional atomic density matrix $\rho_t$ as the random
matrix that satisfies $\pi_t(X) = \mathrm{Tr}[\rho_t X]$. Eq. (51) gives

$$d\rho_t = -ih(t)[F_y, \rho_t]\,dt + M\,\mathcal{D}[F_z]\rho_t\,dt + \sqrt{M\eta}\,\mathcal{H}[F_z]\rho_t\,dW_t \tag{52}$$

where we have used the notation

$$\mathcal{D}[c]\rho = c\rho c^\dagger - (c^\dagger c\rho + \rho c^\dagger c)/2 \tag{53}$$

$$\mathcal{H}[c]\rho = c\rho + \rho c^\dagger - \mathrm{Tr}[(c + c^\dagger)\rho]\,\rho \tag{54}$$

and we have defined the innovations process

$$dW_t = dY_t - 2\sqrt{M\eta}\,\mathrm{Tr}[\rho_t F_z]\,dt \tag{55}$$


An important result in filtering theory is that the innovations process $W_t$ is in fact a
Wiener process [6,12]. Though we have not introduced sufficient technical machinery
to prove this fact, we can give a simple interpretation. We can write $W_t$ in the
form

$$dW_t = 2\sqrt{M\eta}\,\bigl(j_t(F_z) - \pi_t(F_z)\bigr)\,dt + \sqrt{\eta}\,(dA_t + dA_t^\dagger) + \sqrt{1-\eta}\,dW_t' \tag{56}$$

This  expression  consists  of  two  parts:  the  last  two  terms  are  white  noise  terms,  whereas 
the  first  term  is  the  difference  between  an  atomic  observable  and  our  best  estimate  of 
that  observable,  i.e.  it  represents  the  new  information  (the  “innovation” )  contained  in 
the  measurement. 
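The superoperators of Eqs. (53) and (54) translate directly into code. The sketch below, with function names and a spin-1/2 test state chosen here for illustration, checks two structural facts: both terms are traceless for a normalized state (so Eq. (52) preserves the trace), and an $F_z$ eigenstate is a fixed point of the open-loop dynamics.

```python
import numpy as np

def dagger(a):
    """Hermitian conjugate of a matrix."""
    return a.conj().T

def D(c, rho):
    """Eq. (53): c rho c^† - (c^† c rho + rho c^† c)/2."""
    cd = dagger(c)
    return c @ rho @ cd - 0.5 * (cd @ c @ rho + rho @ cd @ c)

def H(c, rho):
    """Eq. (54): c rho + rho c^† - Tr[(c + c^†) rho] rho."""
    return c @ rho + rho @ dagger(c) - np.trace((c + dagger(c)) @ rho) * rho

# Hypothetical spin-1/2 example: Fz and a state polarized along +x.
Fz = np.diag([0.5, -0.5]).astype(complex)
plus_x = np.array([1, 1], dtype=complex) / np.sqrt(2)
rho = np.outer(plus_x, plus_x.conj())
up = np.diag([1, 0]).astype(complex)          # Fz eigenstate

trace_D = np.trace(D(Fz, rho))
trace_H = np.trace(H(Fz, rho))
```

For the eigenstate `up`, both `D(Fz, up)` and `H(Fz, up)` vanish identically, consistent with the collapse onto $F_z$ eigenstates seen at long times.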

4.3. Conditional spin dynamics

Before  we  add  control  to  the  picture  it  is  interesting  to  take  a  look  at  the  open-loop 
properties  of  the  filtering  equation  (52),  i.e.  without  feedback,  by  setting  h(t)  =  0. 
The  equation  propagates  a  density  matrix,  defined  as  the  adjoint  of  a  set  of  classical 
conditional  expectations,  which  carries  the  interpretation  of  the  “statistically  inferred” 
density  matrix  of  the  ensemble  given  the  observations  in  the  probe  field.  One 
might  wonder  how  such  a  picture  is  related  to  the  traditional  picture  of  quantum 
measurements. 

To illustrate the filtering process we have simulated Eq. (52) for a spin $F = 5$
ensemble (e.g., 10 two-level atoms) [66]. Such simulations are highly simplified by the
fact that the innovations process is a Wiener process. This means that we do not
have to simulate the full quantum-mechanical model, Eqs. (44) and (45), to obtain
a photocurrent $Y_t$ to drive (52). Instead, we just plug in a Wiener process for the
innovations, for which straightforward numerical methods are available. The results
are shown in Fig. 2.

Figure 2. (a) Time evolution of the conditional state from a coherent spin state at $t = 0$ to an
eigenstate at long times. The graph shows the population of each $F_z$ eigenstate. (b) 100 sample
paths of $\pi_t(F_z)$, with $M = \eta = 1$. The dark line is the sample path shown in (a), resulting in
$m_z = 1$. Von Neumann projection is clearly visible at long times.
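The simulation strategy described above can be sketched with a simple Euler-Maruyama scheme. This is a minimal illustration and not the method of [66]: the integrator, the step size, the renormalization after each step, and the choice of spin $F = 1$ (to keep the run short) are all assumptions made here.

```python
import numpy as np

def spin_xz(F):
    """Fx and Fz for spin F in the basis m = F, F-1, ..., -F."""
    dim = int(round(2 * F)) + 1
    m = F - np.arange(dim)
    Fplus = np.zeros((dim, dim), dtype=complex)
    for k in range(1, dim):
        Fplus[k - 1, k] = np.sqrt(F * (F + 1) - m[k] * (m[k] + 1))
    return 0.5 * (Fplus + Fplus.conj().T), np.diag(m).astype(complex)

def fz_variance(Fz, rho):
    """Conditional variance of Fz in the state rho."""
    mean = np.trace(Fz @ rho).real
    return np.trace(Fz @ Fz @ rho).real - mean**2

def simulate_open_loop(F=1, M=1.0, eta=1.0, t_final=20.0, dt=1e-3, seed=0):
    """Euler-Maruyama integration of Eq. (52) with h(t) = 0."""
    rng = np.random.default_rng(seed)
    Fx, Fz = spin_xz(F)
    # Coherent spin state along +x: eigenvector of Fx with the largest eigenvalue.
    _, vecs = np.linalg.eigh(Fx)
    psi = vecs[:, -1]
    rho = np.outer(psi, psi.conj())
    rho0 = rho.copy()
    for _ in range(int(round(t_final / dt))):
        dW = rng.normal(0.0, np.sqrt(dt))      # innovation = Wiener increment
        mean = np.trace(Fz @ rho).real
        drift = M * (Fz @ rho @ Fz - 0.5 * (Fz @ Fz @ rho + rho @ Fz @ Fz))
        gain = np.sqrt(M * eta) * (Fz @ rho + rho @ Fz - 2.0 * mean * rho)
        rho = rho + drift * dt + gain * dW
        rho = 0.5 * (rho + rho.conj().T)       # remove numerical anti-Hermitian part
        rho = rho / np.trace(rho).real         # remove numerical trace drift
    return Fz, rho0, rho

Fz, rho_initial, rho_final = simulate_open_loop()
var_initial = fz_variance(Fz, rho_initial)
var_final = fz_variance(Fz, rho_final)
```

Comparing `var_initial` with `var_final` shows the narrowing of the $F_z$ distribution along a single sample path; higher-order or positivity-preserving integrators would be preferable for quantitative work.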

At long times the conditional state is clearly driven to one of the eigenstates of $F_z$,
i.e. Dicke states [17], just as predicted by the Von Neumann projection postulate. In
fact, it can be rigorously proved that the $t \to \infty$ limit of Eq. (52) is exactly identical
to the projection postulate, i.e. the probability of collapse onto each eigenstate is
predicted correctly by the filtering equation [2,66,71].

Note that we have not previously mentioned the projection postulate in this
article. As this result follows from our theory we do not need to postulate it: instead,
we have "derived" it using quantum dynamics and classical statistics¶. In some
sense the filtering process exposes the anatomy of a quantum measurement. We
have explicitly modeled the coupling between the probe field and the system under
measurement, Eq. (44), and we considered separately a step that involved purely the
gain of information. Both processes conspire to bring about the traditional projection
of the system state in the long-time limit.

At intermediate times, $t < \infty$, the conditional state gradually collapses onto
the $F_z$ eigenstates. This process, for a single sample path, is shown in Fig. 2a.
Whereas a Von Neumann measurement would take the state discontinuously from
the initial state to the final collapsed state, the filtering process continuously narrows
the distribution over the eigenstates until only one remains. Aside from giving a more
realistic description of continuous optical measurements, this description creates an
opportunity that has no analog with projective measurements: we can interfere with
the collapse process while it is occurring by applying real-time feedback.

¶ The reader should not get the impression, however, that we have now reduced all the peculiarities
of quantum measurement to pure classical probability. In particular, we cannot derive why the
measurement of an observable rules out the measurement of noncommuting observables, which has
no counterpart in classical probability. Only the conditioning, which takes place after a measurement
has been performed and the measurement result has been obtained, can be given a purely classical
interpretation in this way as a statistical inference procedure. On the other hand, the "back action"
on the system is caused by the quantum dynamics of the interaction between the system and the
probe, which we have explicitly modeled by a quantum stochastic differential equation.

Finally, we should remark that not all filtering equations give rise to Von Neumann-type
collapse. For example, homodyne detection of spontaneously emitted
photons,  or  an  atomic  ensemble  resonantly  interacting  with  the  probe  field,  will  result 
in  continuous  decay  of  the  conditional  state  into  the  ground  state.  Projective  dynamics 
is  obtained  in  our  case  because  of  the  dispersive  (off-resonant)  interaction  of  the 
ensemble  with  the  probe  and  the  neglect  of  spontaneous  emission.  The  latter  can 
be  justified,  however,  if  there  is  a  large  separation  of  time  scales  between  the  time 
of  collapse  and  the  time  at  which  the  spontaneous  emission  sets  in.  In  this  case,  the 
intermediate  regime  will  be  very  similar  to  the  long-time  limit  of  our  model. 

The  range  of  dynamics  emerging  from  filtering  equations  highlights  the  need  for 
the separate modeling of the system-probe interaction. Though we have only presented
a  very  simple  model,  we  have  outlined  a  bottom-up  approach  in  which  the  system- 
probe  interaction  is  modeled  from  first  principles  using  quantum  electrodynamics. 
The  detailed  modeling  of  realistic  experimental  configurations  will  be  invaluable  for 
quantitative  comparison  of  theoretical  predictions  and  experimental  data  [28] . 

5.  Feedback  control  and  quantum  state  preparation 

The  intrinsic  randomness  of  quantum  measurement  should  not  dissuade  the  capable 
observer  from  trying  to  control  the  dynamics  of  a  system.  In  fact,  it  should  do  just 
the  opposite.  The  inherent  uncertainty  in  observation  is  the  inspiration  for  the  use  of 
feedback  control,  and  promotes  it  to  the  status  of  fundamental. 

Although  the  physical  constraints  imposed  by  quantum  mechanics  are 
performance  limiting,  quantum  feedback  control  problems  are  well  defined  and  worth 
pursuing  for  all  of  the  same  reasons  engineers  use  control  on  classical  systems. 
Furthermore,  quantum  feedback  control,  while  technically  difficult,  is  simply  a  branch 
of  traditional  control  and  amenable  to  the  techniques  developed  therein  [4,18,71]. 
Far  from  introducing  an  entirely  new  kind  of  problem,  the  challenges  presented  here 
highlight  and  motivate  the  extension  of  mathematical  methods  already  in  development 
elsewhere. 

In  this  section  we  begin  by  discussing  the  types  of  problems  and  structure 
encountered  in  a  typical  quantum  feedback  control  scenario,  building  upon  the 
formalism  developed  above.  Here  we  use  language  from  classical  control  theory,  and 
discuss  the  possible  application  of  optimal  and  robust  control  theories  to  the  quantum 
setting.  We  also  emphasize  experimental  constraints  which  motivate  simplifications 
of  desired  controls  through  model  reduction.  Next  we  demonstrate  the  utility  of 
feedback  in  a  review  of  applications  to  atomic  ensemble  experiments.  We  finish  by 
focusing  on  the  particular  theoretical  example  of  deterministically  preparing  a  state 
with  continuous  measurement  and  control. 

5.1.  Defining  feedback  control 

The  term  “quantum  feedback  control”  as  used  in  this  article  refers  to  a  particular  class 
of  problems  that  should  be  distinguished  from  other  types  of  control  with  quantum 
systems.  The  class  we  consider  involves  the  measurement  of  a  quantum  system  by 
interaction  with  a  quantum  field.  The  field  is  destructively  measured  resulting  in  a 
classical  measurement  record.  That  measurement  record  is  then  processed  and  fed 
back  to  Hamiltonian  parameters  affecting  the  same  system. 
The rest of this article is concerned with problems of this kind. However, it
should  be  noted  that  there  exist  further  types  of  control  with  this  arrangement  that 
we  will  not  discuss.  Aside  from  actuating  Hamiltonian  parameters  of  the  system  with 
feedback,  the  observer  may  possess  the  ability  to  adaptively  change  the  measurement 
itself  according  to  the  measurement  record.  This  leads  to  different  “unravellings”  of 
the  dynamics  [79].  By  the  nature  of  the  measurement,  the  ensemble  average  behavior 
of  the  system  will  be  the  same  for  any  chosen  unravelling  or  adaptive  measurement 
scheme.  Of  course,  the  same  will  not  be  true  for  the  average  trajectory  behavior  of 
the  system  under  different  Hamiltonian  control  laws. 

Additionally,  there  exist  completely  different  types  of  control  with  quantum 
systems  bearing  little  resemblance  to  the  measurement  techniques  discussed  here.  For 
instance,  one  can  imagine  doing  a  type  of  feedback  experiment  where,  instead  of 
destructively  measuring  the  ancilla  system,  it  is  returned  to  interact  with  the  system 
of  interest  again,  and  possibly  repeatedly.  For  the  case  of  the  usual  optical  ancilla 
system,  this  has  been  referred  to  as  “all-optical  feedback”  to  distinguish  it  from  the 
electrical  measurement  signal  alternatively  produced  [81].  In  certain  cases  this  kind 
of  “coherent  control”  [52]  can  achieve  state  preparation  goals  with  minimal  processing 
overhead  and  delay.  In  the  formalism  presented  here,  one  could  describe  such  a  process 
completely  at  the  quantum  stochastic  level  of  section  3. 

Finally,  the  term  “quantum  control”  is  also  used  in  the  literature  to  refer  to 
yet  another  scenario,  with  not  one  system,  but  an  ensemble  of  identically  prepared 
systems.  Here  a  system  is  driven  with  a  pulse,  then  the  result  is  measured. 
Subsequently,  another  system  is  prepared,  another  pulse  is  used  to  drive  it,  the  result 
is  again  measured,  and  so  on.  In  between  trials,  the  pulse  shape  is  changed  based 
on  the  previous  measurements  in  some  algorithmic  way  to  optimize  the  effect  of  the 
pulse  [61].  This  procedure  is  a  type  of  “learning  control”  and,  unlike  in  the  examples 
we  study,  no  feedback  occurs  during  the  lifetime  of  an  individual  system. 

5.2.  Separation  structure 

Generally  speaking,  the  control  problem  consists  of  finding  a  mapping  of  the 
measurement  record  onto  the  actuation  variables  such  that  some  pre-defined  task 
is  achieved.  When  stated  in  this  way  the  problem  is  very  difficult  to  solve;  after  all, 
when  we  allow  any  functional  from  the  photocurrent  history  to  the  control  variables, 
it  is  hard  to  know  where  to  start. 

Fortunately  we  can  simplify  the  problem  description  considerably  using  what  is 
sometimes referred to as the separation principle or the information state approach,
originally  introduced  in  classical  control  theory  by  Mortensen  [57].  The  basic  idea 
behind  this  approach  is  that  we  can  never  control  the  system  more  precisely  than 
the  precision  with  which  the  system  state  can  be  inferred  from  the  observations.  In 
many  ways  this  is  a  statement  of  the  obvious:  for  example  if  we  know  that  the  system 
is  controlled  to  within  some  bound,  then  clearly  we  can  infer  that  the  system  state 
is  within  that  bound.  As  a  consequence,  the  best  we  can  do  is  to  control  the  best 
estimate  of  the  system  state,  i.e.  the  conditional  state. 

The advantage of this approach is that we have converted the output feedback
control problem into a state feedback control problem for the filter. Operationally, we
then consider the filtering equation (52) as our new "effective" dynamical equation
to be controlled, where the feedback $h(t)$ can now be taken to be a function of the
conditional state $\rho_t$ as opposed to the measurement record. This is a less constrained
problem than the output feedback problem and hence often easier to solve. Control
design is further simplified by the fact that the innovation, Eq. (55), is white. This
means we can consider Eq. (52) as an ordinary Ito equation to be controlled, without
separately modeling the statistics of the photocurrent driving noise.

Figure 3. Schematic of the entire feedback control problem for an atomic ensemble. On the
experimental level an ensemble interacts with a probe field, as described by Eq. (44). Homodyne
detection gives rise to the photocurrent (45), which is processed by a digital controller. A magnetic
field is used for feedback. On the controller level, the photocurrent drives the quantum filter (51)
which updates recursively the best estimate of the atomic state. The control law is a functional of
the current conditional state. The innovations structure (55) allows the control design to be based
directly on the filtering dynamics.

The  structure  of  the  entire  control  setup,  in  the  context  of  the  model  discussed 
in  the  previous  sections,  is  shown  in  Fig.  3.  The  atomic  ensemble  and  its  interaction 
with the optical probe field and the magnetic control field were modeled in section 3.
Homodyne  detection  was  the  subject  of  section  4.1.  The  photocurrent  is  processed 
by  a  digital  control  circuit  which  produces  the  feedback  signal.  Inside  the  controller, 
the  “whitened”  photocurrent  drives  the  quantum  filter,  as  described  in  section  4.2. 
The  control  law  is  a  function  of  the  best  estimate  of  the  system  state.  To  design  the 
control  law,  however,  we  only  need  to  consider  the  “internal”  feedback  loop  inside  the 
digital  circuit.  From  the  controller’s  effective  perspective,  the  only  role  of  the  physical 
experiment  is  to  provide  the  innovation  dWt,  which  is  white  by  construction. 
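The controller-side loop described above can be sketched as a single recursive update: the photocurrent increment is whitened into the innovation, the filter state is advanced, and the feedback is computed from the conditional state alone. The Euler discretization, the function names, and in particular the `control_law` shown here are illustrative assumptions; the control law is a placeholder and not a law proposed in this article.

```python
import numpy as np

def filter_step(rho, dY, h, dt, Fy, Fz, M=1.0, eta=1.0):
    """One Euler step of the filter, Eq. (52), driven by the photocurrent increment dY."""
    mean_fz = np.trace(rho @ Fz).real
    dW = dY - 2.0 * np.sqrt(M * eta) * mean_fz * dt        # innovation, Eq. (55)
    comm = Fy @ rho - rho @ Fy
    dissip = Fz @ rho @ Fz - 0.5 * (Fz @ Fz @ rho + rho @ Fz @ Fz)
    info = Fz @ rho + rho @ Fz - 2.0 * mean_fz * rho
    rho = rho - 1j * h * comm * dt + M * dissip * dt + np.sqrt(M * eta) * info * dW
    rho = 0.5 * (rho + rho.conj().T)                        # keep the estimate Hermitian
    return rho / np.trace(rho).real                         # and normalized

def control_law(rho, Fz, gain=1.0):
    """Hypothetical placeholder: any function of the conditional state is admissible."""
    return -gain * np.trace(rho @ Fz).real

# Spin-1/2 example (illustrative values only).
Fy = np.array([[0, -0.5j], [0.5j, 0]])
Fz = np.diag([0.5, -0.5]).astype(complex)
plus_x = np.array([1, 1], dtype=complex) / np.sqrt(2)
rho = np.outer(plus_x, plus_x.conj())

h = control_law(rho, Fz)                                    # feedback from the current estimate
rho_next = filter_step(rho, dY=0.01, h=h, dt=1e-3, Fy=Fy, Fz=Fz)
mean_fz_next = np.trace(rho_next @ Fz).real
```

In a real controller this pair of calls would run once per sampling interval, with `dY` supplied by the detector; for control design, `dY` can instead be generated from a Wiener increment as in the open-loop simulation.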

5.3.  Defining  an  objective 

We  have  separated  the  control  design  into  an  estimation  problem,  which  was  the 
subject  of  section  4.2,  and  a  control  problem.  The  control  problem  is  undefined, 
however,  until  we  state  a  goal  that  our  controller  should  achieve. 

As an example, an experimentalist may want to minimize some functional of the
system and control variables, e.g.,

$$C[h(t)] = \mathbb{E}\int_0^T \bigl(j_t(F_z^2) + \mu\,h(t)^2\bigr)\,dt \tag{57}$$

where $\mu$ is a parameter that limits the degree that the control input is applied. To
apply the separation principle to this case we must first convert the cost function
$C$ into a form which is only a function of the filter state. This is straightforward,
however, due to the property $\mathbb{E}\,j_t(\cdot) = \mathbb{E}\,\pi_t(\cdot)$ of conditional expectations: we obtain

$$C[h(t)] = \mathbb{E}\int_0^T \bigl(\pi_t(F_z^2) + \mu\,h(t)^2\bigr)\,dt \tag{58}$$

As expected, the control goal depends only on the conditional state, i.e., the filter
state is a sufficient statistic for this control problem.
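The step from the system observable to the filter state in the cost is an instance of property (ii) of conditional expectations with $Z = 1$. As a brief sketch, and noting that the control $h(t)$ is itself a function of the observations so that its term is unaffected:

```latex
\begin{equation*}
\mathbb{E}\bigl[j_t(F_z^2)\bigr]
  = \mathbb{E}\bigl[\,\mathbb{E}[\,j_t(F_z^2)\,|\,Y_{s\le t}\,]\,\bigr]
  = \mathbb{E}\bigl[\pi_t(F_z^2)\bigr].
\end{equation*}
```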

In  principle,  the  minimization  of  (58)  using  the  dynamics  of  the  filtering  equation 
would  produce  a  control  law  which  is  the  optimal  time-dependent  mapping  of  the 
conditional  state  onto  the  control  parameters.  This  type  of  problem  is  known  as 
“optimal  control”,  and  is  one  of  the  primary  modes  of  thought  in  classical  control 
theory  [8,20,38].  As  is  apparent  from  Eq.  (52),  the  general  form  of  the  filtering 
equation  is  non-linear  in  the  state  and,  as  an  unfortunate  result,  the  optimal  control 
solution  is  extremely  difficult  to  find.  Although  both  non-linear  and  stochastic  control 
theories  are  well  developed  fields  classically,  there  is  still  much  work  to  be  done  in  their 
intersection. 

Fortunately,  there  are  alternative  methods  for  gaining  ground  on  the  quantum 
feedback  control  problem.  First,  in  some  instances,  it  is  possible  to  linearize  the 
dynamics  of  the  filtering  equation  via  moment  expansions.  In  this  case,  one  can 
readily  adopt  “LQG”  techniques  from  classical  control  [8,38]  for  linear  systems  (L), 
a  cost  function  quadratic  in  linear  observables  and  control  variables  as  above  (Q), 
and  Gaussian  dynamics  (G),  to  solve  the  problem  completely  [4,18,19,65].  In  any 
given  example,  the  needed  linearization  may  only  work  for  particular  initial  states 
and  limited  periods  of  time,  but  the  LQG  results  can  still  be  remarkably  far  reaching. 
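
As a reminder of what the "LQ" part of LQG delivers in the simplest classical setting, the following sketch solves the steady-state problem for a scalar linear system. It is a textbook illustration, not the quantum LQG solution of the cited references: the model dx/dt = a*x + b*u, the weights `q` and `mu`, and the function name `scalar_lqr_gain` are all assumptions introduced for this example.

```python
import math


def scalar_lqr_gain(a, b, q, mu):
    """Steady-state LQR for dx/dt = a*x + b*u with running cost q*x**2 + mu*u**2.

    Returns (P, K), where P is the positive root of the algebraic Riccati
    equation 2*a*P - (b**2 / mu) * P**2 + q = 0 and K is the gain in u = -K*x.
    """
    if b == 0 or q <= 0 or mu <= 0:
        raise ValueError("need b != 0, q > 0 and mu > 0")
    # Positive root of the quadratic in P.
    P = mu * (a + math.sqrt(a * a + b * b * q / mu)) / (b * b)
    K = b * P / mu
    return P, K


# A pure integrator (a = 0) with a control penalty mu = 4.
P, K = scalar_lqr_gain(a=0.0, b=1.0, q=1.0, mu=4.0)
print(P, K)
```

A larger `mu` penalizes control effort more heavily and therefore lowers the gain `K`, which is the same role the parameter plays in the cost function (57).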

Second,  we  can  choose  to  be  less  demanding  of  our  controller,  and  instead 
formulate a non-optimal goal. For instance, suppose we are interested in preparing
the quantum state $\rho_c$ at long times. The control goal can then be formulated as: find
a control law $h(t)$ so that $\mathbb{E}[j_t(X)] \to \mathrm{Tr}[X\rho_c]$ as $t \to \infty$ for any system observable $X$.
As above, it is easy to see that the filter state is a sufficient statistic and hence we can
directly apply the separation principle. In particular, if we can find a controller that
makes $\rho_c$ a globally (stochastically) stable state for the filter dynamics, the eventual
preparation of $\rho_c$ is ensured. Although the state might not be prepared as fast as is
physically  possible,  it  is  an  accomplishment  to  know  that  it  will  eventually  be  prepared 
with  unit  probability.  Here  there  is  much  work  to  be  done  on  constructively  generating 
controllers  and  methods  for  proving  the  stability,  but  progress  has  been  made  for  some 
simple  problems  [71]. 

5.4. Robustness and model reduction

If  given  the  choice  between  a  controller  that  works  optimally  under  one  set  of  ideal 
circumstances  and  a  controller  that  works  sub-optimally,  but  adequately,  over  a  wide 
set  of  possible  conditions,  the  wise  experimentalist  would  always  choose  the  latter. 
Due  to  unexpected  modeling  uncertainties  and  exogenous  noise  sources,  the  optimal 
control  approach  has  the  potential  to  fail  catastrophically  in  realistic  environments,  a 
possibility that has motivated the development of “robust control” for many years [84].
One  could  say  the  reason  experiments  are  performed  at  all  is  to  test  the  robustness  of 
our  model  and  control  design. 

The  concept  of  robust  control  has  been  extensively  studied  in  the  classical 
deterministic  setting,  but  the  same  logic  holds  true  for  quantum  applications.  Even  as 
quantum technology reaches its limits, there will always be some degree of non-intrinsic
system uncertainty to which the system should be robust. Not surprisingly, quantum
feedback techniques have been shown to enable robustness to model uncertainty in
metrology applications [65]. Of course one need not draw the line too sharply between
optimal  and  robust  control,  as  there  exist  types  of  risk-sensitive  optimal  quantum 
control  that  inherently  consider  certain  kinds  of  robustness  [39,40]. 

Another  practical  reason  why  optimal  control  may  not  be  ultimately  relevant  is 
that  real-time  information  processing  takes  time.  Even  if  the  modeling  is  perfect  and 
there  are  no  excess  noise  sources,  an  optimal  controller  may  not  work  due  to  processing 
constraints.  If  the  processing  delay  of  the  actual  controller  is  large  compared  to  the 
relevant  time-scale  of  the  filtering  dynamics,  then  another  approach  will  be  needed. 
Despite  the  improving  performance  of  programmable  logic  devices  that  might  best 
implement  the  optimal  control,  there  are  few  experiments  with  slow  enough  time- 
scales  that  modern  electronics  can  be  optimally  effective  at  real-time  estimation  and 
control  [64]. 

Clearly  it  is  of  significant  interest  to  be  able  to  derive  a  controller  that  works 
without  having  to  evolve  the  full  filtering  equation  in  real  time.  Recognizing  this, 
physicists  have  proposed  and  used  controllers  for  quantum  feedback  applications  that 
use  a  simplified  control  law  which  bypasses  the  full  state  estimation.  Even  more  easily, 
one  can  sometimes  feed  the  measurement  record  directly  back  to  the  system  with  a 
gain  tailored  intelligently  in  time  [69,77,80].  However,  with  any  of  these  approaches, 
one  must  be  cognizant  of  realistic  gain  and  bandwidth  constraints.  For  example,  one 
cannot  realistically  feed  pure  white  noise  back  into  a  system,  as  this  would  imply 
infinite  sensor  and  detection  bandwidths.  Although  most  of  the  simplified  quantum 
controllers  suggested  in  the  literature  have  been  constructed  through  more  or  less 
heuristic  means,  we  expect  the  continuing  development  of  these  techniques  to  resort 
to  more  mathematical  notions  of  model  reduction,  where  the  degree  of  approximation 
and  its  effect  on  the  feedback  performance  can  be  more  explicitly  quantified. 

5.5.  Measurement  and  Feedback  in  Atomic  Ensembles 

When  considering  systems  with  the  potential  for  interesting  applications  related 
to quantum information processing, there exists a natural tendency among many
physicists to consider conceptually simplified systems, e.g., a single atom or ion. While
much  progress  has  been  made  in  trapping,  measuring,  and  controlling  single  particles, 
it  has  also  been  realized  for  some  time  that  the  use  of  atomic  ensembles  does  not 
preclude  the  observation  of  uniquely  quantum  effects  nor  a  simple  description.  As 
compared  to  alternative  systems,  ensembles  are  experimentally  convenient  and,  by  the 
sheer  number  of  participants,  sufficient  signal  can  be  generated  to  make  them  powerful 
in  quantum  applications,  with  atomic  clocks  being  just  one  prominent  example. 

Here  we  consider  those  experiments  where  continuous  measurement  and  feedback 
have  been  used  to  generate  entanglement  either  within  or  between  atomic  ensembles. 
We  begin  by  discussing  the  use  of  dispersive  measurement  to  produce  a  spin-squeezed 
state  in  a  single  ensemble  in  the  short-time  limit,  and  how  feedback  can  be  used  to 
make  this  process  deterministic.  We  then  focus  on  a  particular  theoretical  limit  where 
the linear approximation fails, but highly entangled eigenstates of the measured
$F_z$ can still be prepared by using the more complete filtering equation and an intuitive
feedback  law.  Finally,  we  briefly  discuss  experiments  and  proposals  involving  the 
creation  of  entanglement  between  two  ensembles  with  and  without  feedback. 
5.5.1. One Ensemble. For state preparation with atomic ensembles, spin-squeezed
states (SSS) [44] are natural target states. These collective spin states are internally
entangled, simply characterized by measured moments of the spin operators, and
useful in metrology tasks [29,65,76]. For an ensemble with $N$ spin-$f$ particles and
collective angular momentum operators $F_i$, a state is defined as spin-squeezed, and
entangled, if+

$$\frac{2fN\, \langle \Delta F_z^2 \rangle}{\langle F_x \rangle^2} < 1 \qquad (59)$$

where the spin state is pointing along $x$ so that $\langle F_x \rangle = F = Nf$ and $\langle F_y \rangle = \langle F_z \rangle = 0$
[63]. Methods to produce these states typically begin with an unentangled coherent
spin state (CSS) with all spins exactly polarized along the $x$ direction and realizing
the equality of the uncertainty relation

$$\langle \Delta F_y^2 \rangle \langle \Delta F_z^2 \rangle \geq \frac{\langle F_x \rangle^2}{4} \qquad (60)$$

For the SSS, the equality is roughly maintained with one component $\langle \Delta F_z^2 \rangle$ squeezed
smaller than the CSS value and the other $\langle \Delta F_y^2 \rangle$ anti-squeezed.
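
The criterion (59) can be checked numerically for small ensembles. The sketch below is an illustration under stated assumptions: it works in the symmetric subspace with total spin J = Nf, represents a state by its real amplitudes in the $F_z$ basis, and uses function names (`css_x_amplitudes`, `squeezing_ratio`, `damp_fz`) that are hypothetical. The Gaussian damping of the $F_z$ distribution is only a stand-in for the narrowing produced by a measurement of $F_z$, not a model of any particular experiment.

```python
import math


def css_x_amplitudes(J2):
    """Amplitudes of the spin-J coherent state along +x in the F_z basis.

    J2 = 2*J is an integer; list index k corresponds to m = k - J.
    """
    norm = 2.0 ** (-J2 / 2.0)
    return [norm * math.sqrt(math.comb(J2, k)) for k in range(J2 + 1)]


def damp_fz(amps, s):
    """Multiply amplitudes by exp(-s*m**2) and renormalize (illustrative only)."""
    J = (len(amps) - 1) / 2.0
    damped = [c * math.exp(-s * (k - J) ** 2) for k, c in enumerate(amps)]
    norm = math.sqrt(sum(c * c for c in damped))
    return [c / norm for c in damped]


def squeezing_ratio(amps, N, f):
    """Left-hand side of (59): 2*f*N*Var(F_z) / <F_x>**2 for real amplitudes."""
    J = (len(amps) - 1) / 2.0
    ms = [k - J for k in range(len(amps))]
    probs = [c * c for c in amps]
    mean_z = sum(m * p for m, p in zip(ms, probs))
    var_z = sum(m * m * p for m, p in zip(ms, probs)) - mean_z ** 2
    # <F_x> = sum_m c_{m+1} c_m sqrt(J(J+1) - m(m+1)) for real amplitudes.
    mean_x = sum(
        amps[k + 1] * amps[k] * math.sqrt(J * (J + 1) - ms[k] * (ms[k] + 1))
        for k in range(len(amps) - 1)
    )
    return 2.0 * f * N * var_z / mean_x ** 2


N, f = 100, 0.5
css = css_x_amplitudes(int(2 * N * f))
print(squeezing_ratio(css, N, f))                 # coherent state: ratio equals 1
print(squeezing_ratio(damp_fz(css, 0.01), N, f))  # narrowed in F_z: ratio below 1
```

The coherent state sits exactly on the boundary of (59), and narrowing the $F_z$ distribution pushes the ratio below one at the price of a slightly shorter mean spin.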

There  are  many  ways  one  can  imagine  producing  the  spin-correlations  within 
the  ensemble  needed  for  the  collective  state  to  be  squeezed.  Examples  include  using 
direct  Hamiltonian  interactions  [63]  and  also  transferring  correlations  from  an  auxiliary 
system,  e.g.  squeezed  states  of  light  [34,49,56].  We  shall  focus  on  the  production  of 
spin  squeezed  states  via  dispersive  measurement,  the  effects  of  which  were  originally 
discussed and demonstrated in references [47,48]. Subsequently, Thomsen et al. [69]
proposed a feedback procedure, discussed below, that used a measurement-based field
rotation to remove the randomness of the measurement while retaining the desired
squeezing effect. Others have proposed using feedback to an optical pumping beam to
achieve a similar result [56]. It has since been experimentally demonstrated that, using
a procedure similar to [69], feedback can enable the deterministic production of
spin-squeezed states in cold atomic samples [28,30]. Much work continues in this direction,
in particular towards creating squeezed states with the Cesium clock transition, which
would considerably improve current atomic clock performance [59].

To  understand  the  conditional  preparation  of  spin-squeezed  states  by  dispersive 
measurement,  consider  the  apparatus  in  Figure  3.  As  shown  above,  the  filtering 
equation is given by Eq. (52). This equation is only applicable at long times $t \gg 1/M$
if  a  sufficiently  strong  cavity  is  used  to  suppress  the  spontaneous  emission  to  an 
insignificant  level.  Given  existing  experimental  technology  this  is  currently  unrealistic; 
nevertheless  we  consider  the  long  time  dynamics  for  purposes  of  demonstration. 

The  filtering  equation  was  derived  using  a  simplified  one-dimensional  model  of 
the  interaction.  Although  this  model  is  often  an  adequate  description  of  free-space 
experiments  where  a  distribution  of  atoms  interacts  with  a  spatially  extended  probe 
beam,  there  is  much  interest  in  making  the  model  more  accurate  by  extending  it  to 
three  dimensions.  A  complete  model  would  consider  the  scattering  process  where  all 
free-space  field  modes  interact  with  the  atomic  distribution.  Some  of  those  channels 
would  then  be  measured,  and  the  results  used  to  condition  the  atomic  state.  In  this 
picture,  the  conditional  entanglement  results  from  the  indistinguishability  of  the  atoms 
in the measurement and “spontaneous emission” is a term used to describe the effect
of the remaining unobserved channels. A considerable amount of work remains to
be done in describing collective scattering in terms of measurement theory, but much
progress has been made [11,21,46,58].

+ We will denote by $\langle \cdot \rangle$ the expectation of an observable in a general sense. The associated state can
be prepared either unconditionally or conditionally.

Returning  to  the  one-dimensional  model,  we  can  extract  the  conditional  evolution 
equations  for  the  moments  of  any  operator  from  the  filtering  equation.  Under  the 
approximation  that  there  are  many  atoms  and  the  initial  collective  state  is  nearly 
polarized  along  the  x-direction,  we  can  derive  the  closed  set  of  equations 

$$d\pi_t(F_z) \approx F \exp[-Mt/2]\, h(t)\, dt + 2\sqrt{M\eta}\, \pi_t(\Delta F_z^2)\, dW_t \qquad (61)$$

$$d\pi_t(\Delta F_z^2) \approx -4M\eta\, \pi_t(\Delta F_z^2)^2\, dt \qquad (62)$$

These equations are obtained by truncating the exact coupled expressions for $\pi_t(F_z^n)$,
calculated from Eq. (51), at $n = 2$ [29,65]. This reduced description is equivalent
to  a  classical  Kalman  filter  [8,38]  and  corresponds  to  a  local  linearization  of  the  spin 
dynamics. 
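
Eqs. (61) and (62) are simple enough to integrate directly. The following is a minimal Euler-Maruyama sketch, assuming an initial coherent spin state with conditional mean 0 and variance F/2; the function names and the optional control callback `h` are hypothetical. Because Eq. (62) is deterministic, it also has the closed-form solution v(t) = v0 / (1 + 4*M*eta*v0*t), which the sketch uses as a check.

```python
import math
import random


def simulate_moments(F, M, eta, t_final, dt, h=None, seed=0):
    """Integrate Eqs. (61)-(62) with the Euler-Maruyama scheme.

    h(t, mean, var) is an optional control law; None means open loop.
    Returns the conditional mean and variance of F_z at t_final.
    """
    rng = random.Random(seed)
    mean, var = 0.0, F / 2.0  # coherent spin state polarized along x
    t = 0.0
    for _ in range(int(round(t_final / dt))):
        dW = rng.gauss(0.0, math.sqrt(dt))  # innovation increment
        control = 0.0 if h is None else h(t, mean, var)
        d_mean = (F * math.exp(-M * t / 2.0) * control * dt
                  + 2.0 * math.sqrt(M * eta) * var * dW)
        d_var = -4.0 * M * eta * var * var * dt
        mean, var = mean + d_mean, var + d_var
        t += dt
    return mean, var


def variance_closed_form(F, M, eta, t):
    """Exact solution of Eq. (62) for the initial variance F/2."""
    v0 = F / 2.0
    return v0 / (1.0 + 4.0 * M * eta * v0 * t)


mean, var = simulate_moments(F=50.0, M=1.0, eta=1.0, t_final=0.05, dt=1e-4)
print(mean, var, variance_closed_form(50.0, 1.0, 1.0, 0.05))
```

In open loop the variance follows the same curve on every run, while the mean ends at a different random offset for each seed.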

Eqs. (61) and (62) are valid only in the short time limit $t \ll 1/M$, past
which  the  full  filtering  equation  is  needed.  At  longer  times  terms  neglected  in  the 
approximation  grow  to  the  point  that  the  variance  becomes  stochastic  [65],  and  the 
moment  truncation  is  no  longer  a  good  description.  This  process  can  be  seen  in  Fig. 
4a,  where  at  small  times  the  variance  is  deterministic,  but  then  becomes  random  at 
longer  times. 

The  deterministically  shrinking  variance  of  Eq.  (62)  at  short  times  signifies  that 
a  spin  squeezed  state  is  prepared  with  a  random  offset  given  by  Eq.  (61).  The  idea 
of [69] was to choose $h(t) \propto Y_t$ with an intelligently chosen gain such that the first
term effectively cancels the second term in Eq. (61), preparing the same SSS on every
trial. Although this exact procedure cannot be implemented in practice due to the
infinite detector and actuator bandwidths implied by the control law, a similar, but
filtered, current feedback law was used in the experiment [30]. Because of the
linearity of the dynamics in the short-time limit, the simple current-based feedback
law does not perform significantly worse than a law that changes $h(t)$ more optimally
according to the state $\pi_t(F_z)$ [65].
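
The cancellation can be made explicit in a short worked calculation. It is a sketch under two assumptions that are not spelled out in this section: that the innovation is related to the measurement record by $dW_t = dY_t - 2\sqrt{M\eta}\, \pi_t(F_z)\, dt$ (the Kalman-filter structure implied by Eqs. (61) and (62)), and that the feedback acts on the increments $dY_t$ of the record with a time-dependent gain $G(t)$, a symbol introduced here for illustration.

```latex
\begin{align*}
h(t)\, dt &= -G(t)\, dY_t, \qquad
G(t) = \frac{2\sqrt{M\eta}\; \pi_t(\Delta F_z^2)}{F\, e^{-Mt/2}}, \\
d\pi_t(F_z) &\approx -2\sqrt{M\eta}\; \pi_t(\Delta F_z^2)\, dY_t
               + 2\sqrt{M\eta}\; \pi_t(\Delta F_z^2)\, dW_t \\
            &= -4M\eta\; \pi_t(\Delta F_z^2)\, \pi_t(F_z)\, dt .
\end{align*}
```

With this gain the noise term drops out, so a state that starts at $\pi_0(F_z) = 0$ stays there while the variance still shrinks according to Eq. (62). Since the variance is deterministic in this regime, such a gain could be computed ahead of time.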

Given  these  dynamics,  another  control  strategy  would  be  to  separate  the 
measurement  and  control  in  time:  simply  measuring  for  a  finite  amount  of  time, 
turning  off  the  probe,  and  using  the  measurement  result  to  rotate  the  spin-squeezed 
state to the desired location. However, as pointed out in [69], the continuous feedback
approach is more robust than this procedure to, e.g., uncertainty in the total atom
number, which is needed to compute the size of the correcting rotation.

To  further  demonstrate  the  utility  of  continuous  measurement  and  feedback, 
we  now  consider  the  long  time  behavior  of  the  filtering  equation,  past  the  point  in 
time  t  >  1/M  when  the  linearized  description  fails.  As  discussed  in  section  4.3,  the 
filtering equation stochastically prepares a random eigenstate of $F_z$ asymptotically in
time*. In [66] we investigated numerically the performance of particular controllers
at producing one $F_z$ eigenstate deterministically on every trial.
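
The open-loop collapse onto a random $F_z$ eigenstate can be illustrated with a small simulation. This is a sketch under explicit assumptions, since Eq. (52) is not reproduced in this section: with $h(t) = 0$ we assume the populations $p_m$ in the $F_z$ basis obey $dp_m = 2\sqrt{M\eta}\, p_m (m - \pi_t(F_z))\, dW_t$, which reproduces the open-loop form of Eq. (61), and we integrate it through the equivalent Bayesian reweighting of the populations by each record increment. The function name and parameters are hypothetical.

```python
import math
import random


def collapse_populations(J2, M, eta, t_final, dt, seed=0):
    """Open-loop evolution of F_z populations under continuous measurement.

    J2 = 2*J is an integer. Starts from an x-polarized coherent spin state
    (binomial populations) and returns the F_z eigenvalues and populations.
    """
    rng = random.Random(seed)
    J = J2 / 2.0
    ms = [k - J for k in range(J2 + 1)]
    p = [math.comb(J2, k) / 2.0 ** J2 for k in range(J2 + 1)]
    a = 2.0 * math.sqrt(M * eta)  # measurement sensitivity
    for _ in range(int(round(t_final / dt))):
        mean = sum(m * pm for m, pm in zip(ms, p))
        # Record increment in innovations form: drift from the estimate plus noise.
        dY = a * mean * dt + rng.gauss(0.0, math.sqrt(dt))
        # Bayesian reweighting by the Gaussian likelihood of dY for each m.
        w = [pm * math.exp(a * m * dY - 0.5 * a * a * m * m * dt)
             for m, pm in zip(ms, p)]
        total = sum(w)
        p = [x / total for x in w]
    return ms, p


ms, p = collapse_populations(J2=4, M=1.0, eta=1.0, t_final=40.0, dt=0.002, seed=1)
print(ms[p.index(max(p))], max(p))
```

Different seeds end in different eigenvalues, which is the randomness that a state-based feedback law is meant to remove.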

Here  it  is  critical  to  point  out  that,  unlike  with  the  Gaussian  spin-squeezed  states, 
a  post-measurement  rotation  strategy  will  not  work  in  this  regime.  If  the  wrong 
eigenstate  is  randomly  prepared  in  one  measurement,  it  cannot  be  transformed  into 
the  correct  eigenstate  by  a  rotation  alone.  Furthermore,  despite  the  adequacy  of  the 
direct current feedback law at short times, such a controller is less useful at longer
times. As opposed to state-based control, this type of control will feed noise into the
system even if the target state is reached, unless the gain is turned to zero. Although
certain gain-tailored schemes can be made to optimize the feedback at small times [69],
it is not at all obvious how such a procedure could be generalized to the long time
case.

* There are other schemes that produce superpositions of $F_z$ eigenstates conditionally but without
control, based on single photon detection of an ensemble in a cavity [22].

Figure 4. (a) $\langle \Delta F_z^2 \rangle(t) = \pi_t(\Delta F_z^2)$ in open loop, $h(t) = 0$. $\langle \Delta F_z^2 \rangle_s$ is the approximate variance of
Eq. (62). (b) $\langle F_z^2 \rangle(t) = \pi_t(F_z^2)$ and (c) $\langle F_z \rangle(t) = \pi_t(F_z)$ with the control law (63) and $\lambda = 10$. Note
that $\mathbb{E}\, \pi_t(F_z^2)$, the cost for preparation of the $m_z = 0$ eigenstate, decreases monotonically. All plots
show 100 sample paths and $M = \eta = 1$.

In  contrast,  if  the  control  variable  h(t)  is  made  a  function  of  the  conditional  state, 
then  it  will  naturally  know  when  the  goal  has  been  achieved  and  no  longer  disturb  the 
state  unnecessarily.  Numerically  we  were  able  to  demonstrate  [65]  that  with  an  initial 
x-polarized  state,  the  control  law 

$$h(t) = -\lambda\, \pi_t(F_z) \qquad (63)$$

appeared to deterministically prepare the highly entangled state $m_z = 0$ on every
trial,  as  seen  in  figure  4.  Thus,  continuous  feedback,  in  addition  to  being  robust,  is 
also  capable  of  preparing  states  on  every  trial  that  would  be  impossible  to  generate 
deterministically  with  measurement  and  control  pulses  separated  in  time. 

Numerical  evidence  is  encouraging,  but  more  analytic  statements  about  the 
performance  of  particular  control  laws  are  still  desirable.  Unfortunately,  the  more 
atoms  the  ensemble  contains,  the  larger  the  Hilbert  space  becomes,  and  the  more 
difficult  it  is  to  analytically  prove  that  certain  states  are  global  attractors  under 
particular  feedback  laws.  However,  as  we  have  shown  in  [71],  there  exist  methods 
adapted  from  non-linear  and  stochastic  control  theory  that  can  prove  the  global 
stability of $F_z$ eigenstates for this problem. Although this has only been demonstrated
for few-atom systems, there is hope that the techniques can be extended to consider
dynamics  on  larger  Hilbert  spaces.  Much  of  the  control  design  process  remains 
guesswork,  but  ultimately  we  desire  methodology  that  allows  us  to  systematically 
construct  both  controllers  and  proofs  that  validate  those  controllers. 

5.5.2. Two Ensembles. The creation of collective entanglement within a single
atomic  ensemble  can  be  motivated  with,  for  example,  the  need  for  noise  reduction 
in  metrology  tasks,  where  the  system  is  used  as  a  relatively  localized  probe  of  some 
parameter  of  interest.  In  other  practical  applications,  like  quantum  communication, 
it  is  desirable  to  have  an  entangled  quantum  state,  but  with  constituents  separated 
substantially  in  space  [23].  Indeed  it  has  been  experimentally  demonstrated  that  by 
detecting  a  single  probe  beam  after  it  passes  through  two  spatially  separate  atomic 
ensembles,  the  two  ensembles  can  be  made  conditionally  entangled  [41]. 
Just as single-mode spin squeezing can be quantified with the collective variables
for the one ensemble, here the “two-mode” squeezing can be quantified with the joint
collective operators describing both ensembles. Furthermore, in analogy to the work
of [69], the random offset observed in the measurement process can in principle be
eliminated with a suitable feedback law to deterministically produce the same
two-mode spin squeezed state on every trial [9].

In  a  related  context,  it  has  recently  been  experimentally  demonstrated  that  the 
two-ensemble system may serve as an effective quantum memory for states of light [42].
This  procedure  differs  from  the  deterministic  state  preparation  discussed  previously 
in  that  the  state  of  light  to  be  mapped  onto  the  ensembles  is  not  known  beforehand. 
However,  the  procedure  described  in  [42]  is  similar  in  that  it  does  use  feedback  to 
rotate  the  Gaussian  ensemble  state  in  a  way  that  maps  one  measured  quadrature  of 
the  optical  state  onto  the  atoms,  while  the  other  unmeasured  quadrature  is  mapped 
unconditionally  by  the  interaction  alone.  Clearly,  this  process  shares  many  of  the 
same  properties  as  the  applications  discussed  previously  and  can  similarly  benefit  from 
analyses  with  technical  notions  of  robustness  and  optimality.  Finally,  this  procedure 
becomes  even  more  efficient  if  the  input  atomic  state  is  a  two-mode  squeezed  state, 
which  highlights  yet  another  practical  application  of  deterministic  entangled  quantum 
state  preparation. 

6.  Conclusion 

In  this  article  we  have  attempted  to  give  a  unified  picture  of  a  quantum  feedback 
control  setup.  Starting  from  elementary  physical  interactions,  as  described  by  a  field- 
theoretic  model,  we  first  performed  statistical  inference  on  this  model,  and  then  used 
this  framework  to  develop  feedback  control  strategies  for  state  preparation  in  atomic 
ensembles.  The  latter  is  directly  related  to  recent  experimental  work  which  we  briefly 
summarized. It is our hope that such a unified picture will help link the basic
physics  and  experimental  reality  to  a  high-level,  control-theoretic  point  of  view. 

Many  open  problems  remain  on  both  ends  of  the  spectrum.  On  the  physics  side 
much  work  remains  to  be  done  on  the  realistic  modeling  of  laboratory  experiments. 
Ultimately  a  full  three-dimensional  field-theoretic  model  will  be  invaluable  for 
quantitative  comparison  of  theory  and  experiments.  On  the  control-theoretic  side 
many  of  the  techniques  that  have  been  used  are  still  heuristic  in  nature.  Systematic, 
constructive  design  methods  for  nonlinear  stochastic  controllers,  the  incorporation  of 
realistic  robustness  criteria,  and  efficient  model  reduction  techniques  with  controllable 
approximation  errors  are  some  of  the  major  outstanding  issues.  We  believe  that  a 
fruitful  interaction  between  the  physics  and  mathematical  control  theory  communities 
will  open  the  road  to  significant  advances  in  these  directions. 
We show that superselection rules do not enhance the information-theoretic security of quantum
cryptographic protocols. Our analysis employs two quite different methods. The first method uses the
concept of a reference system: in a world subject to a superselection rule, unrestricted operations can be
simulated by parties who share access to a reference system with suitable properties. By this method, we
prove that if an n-party protocol is secure in a world subject to a superselection rule, then the security is
maintained even if the superselection rule is relaxed. However, the proof applies only to a limited class of
superselection rules, those in which the superselection sectors are labeled by unitary irreducible
representations of a compact symmetry group. The second method uses the concept of the format of a
message sent between parties: by verifying the format, the recipient of a message can check whether the
message could have been sent by a party who performed charge-conserving operations. By this method, we
prove that protocols subject to general superselection rules (including those pertaining to non-Abelian
anyons in two dimensions) are no more secure than protocols in the unrestricted world. However, the proof
applies only to two-party protocols. Our results show in particular that, if no assumptions are made about
the computational power of the cheater, then secure quantum bit commitment and strong quantum coin
flipping with arbitrarily small bias are impossible in a world subject to superselection rules.

I. INTRODUCTION

The central aim of modern cryptography is to formulate
protocols that achieve cryptographic tasks with computational
security, meaning that a dishonest party would need to
perform a prohibitively difficult computation to break the
protocol. A major goal of quantum cryptography is to formulate
protocols, involving the exchange of quantum states, that
achieve information-theoretic security, meaning that even an
adversary with unlimited computational power would be unable
to defeat the protocol [1]. Information-theoretic security
(sometimes called “unconditional security”) has been established
for quantum key distribution protocols [2-7] but it has
also been shown that, even in the quantum world,
information-theoretic security is not attainable for certain
tasks. For example, unconditionally secure quantum bit commitment
is impossible [8,9], as is (strong) quantum coin flipping
with arbitrarily small bias [10,11].

Superselection rules are limitations on the physically realizable
quantum operations that can be carried out by a local
agent. For example, it is impossible to create or destroy an
isolated particle that carries locally conserved charges, such
as an electrically charged particle, a fermion, or (in a
two-dimensional medium) an anyon. Recently, Popescu [12] has
suggested that superselection rules might have interesting
implications for the security of quantum cryptographic protocols.
The intuitive idea behind this suggestion is that superselection
rules could place inviolable limits on the cheating
strategies available to the dishonest parties, thus
enhancing security. Might, say, unconditionally secure bit
commitment be possible in worlds (perhaps including the
physical world that we inhabit) governed by suitable superselection
rules? An affirmative answer could shake the foundations
of cryptography.


The purpose of this paper is to answer Popescu’s intriguing
question. Sadly, our conclusion is that superselection
rules can never foil a cheater who has unlimited
quantum-computational power.

In the case of quantum bit commitment, and other two-party
protocols, our argument hinges on a quite simple observation.
In a two-party protocol, one participant (Alice) has
control of a local system A, and the other participant (Bob)
has control of another local system B. In addition, there is a
message system M that they pass back and forth. In each step
of the protocol, one party performs a joint quantum operation
on her/his local system and the message system, and then
sends the message system to the other party. Suppose that in
each step, any part of the full system ABM that is beyond
Alice’s control is under Bob’s control and vice versa; no
part of the full system is inaccessible or in the possession of
a third party. Suppose further that the full system ABM has
trivial total charge (belongs to the trivial superselection sector).
Then at any stage of the protocol, the algebra of operations
that Alice can perform is the commutant of the algebra
of operations that Bob can perform; that is, Alice’s algebra
contains all operations that commute with Bob’s algebra.
Likewise, Bob’s algebra is the commutant of Alice’s. By a
minor extension of the standard argument, it then follows
that unconditionally secure quantum bit commitment is impossible
if the total charge shared by the parties is trivial.

Now, if the total charge is nontrivial, then Alice’s algebra
is surely a subalgebra of the commutant of Bob’s, but it may
be a proper subalgebra; similarly, Bob’s algebra may be a
proper subalgebra of the commutant of Alice’s. This unusual
property of the local operations seems to open new possibilities
for the design of quantum protocols. Regrettably, though, there is no
way for an honest party to ensure that the total charge is
really nontrivial when the other party is dishonest. Though
the honest protocol may call for the parties to start out with
nontrivial charges, we may always imagine that there are
actually compensating charges beyond the grasp of Alice and
Bob, so that the total charge of the world is really trivial.
Furthermore, a cheater might seize control of the compensating
charge, while for an honest party it makes no difference
whether the compensating charge is present or not. It follows
that a protocol that calls for the total charge to be nontrivial
can be no more secure than one in which the total charge is
actually trivial; we conclude again that unconditionally secure
quantum bit commitment is impossible, irrespective of
the value of the total charge shared by the parties in the
honest protocol.

Aside from quantum bit commitment, we will also study
the impact of superselection rules on the information-theoretic
security of a broad class of other quantum protocols,
using two different methods. We analyze in detail the
important special case in which the superselection sectors
can be identified with the unitary irreducible representations
of a compact symmetry group. In that case, we argue that it
is possible in principle to prepare a reference state that
establishes a preferred orientation in the symmetry group. A
party with access to the reference state can use it to perform
operations that are ostensibly forbidden by the superselection
rule. In particular, consider an n-party quantum protocol
where up to k < n of the parties are dishonest, and suppose
that in a world with no superselection rules the dishonest
parties have a cheating strategy that breaks the protocol.
Then, even in a world with superselection rules, the dishonest
parties, by sharing a suitable reference state, can simulate
this cheating strategy faithfully. We conclude that if a quantum
protocol is information-theoretically secure in a world
with a superselection rule, the security will be maintained
even if the superselection rule is relaxed, at least in the case
where the superselection rule arises from a compact symmetry
group.

Superselection rules arising from compact symmetry groups are not the
most general possible ones. In particular, an especially rich variety of
superselection rules are potentially realizable in two-dimensional
systems such as those that admit non-Abelian anyons. However, even
superselection rules of this more general kind cannot foil a cheater. We
find that for any two-party protocol that is secure in a world subject
to a superselection rule, the security is maintained when the
superselection rule is relaxed.

Our analysis of these more general superselection rules does not rely on
the concept of a reference system; rather it is founded on a completely
different idea, the concept of the format of a message. A superselection
rule can always be characterized by saying that there are charges that
must be conserved by all local operations, and when we relax the
superselection rule, in effect we are permitting a cheater to violate
these conservation laws. For the purpose of assessing the security of a
two-party protocol, we are interested in how the actions of the cheating
party (Alice) affect the outcomes of measurements performed by the
honest party (Bob). Potentially, if Alice is granted the power to
violate conservation of “charge,” her ability to influence Bob’s
measurements will be strengthened.

However, if the total charge shared by Alice and Bob is trivial (as we
are entitled to assume in an analysis of security), then if charge is
conserved, Alice and Bob hold conjugate charges at each stage of the
protocol. Therefore, Bob always knows what charge Alice is supposed to
have, which constrains the type of message that Alice can send to Bob if
she is honest. When Bob receives a message he can verify its format,
checking whether the message could have been sent by a party who
performed a charge-conserving operation, and he can abort the protocol
if the verification fails. Therefore, if the protocol ends normally,
Alice has been forced to respect charge conservation; her power to flout
the superselection rule does not enhance her ability to fool Bob. This
reasoning shows that superselection rules cannot thwart cheating, but
because the argument relies on the property that Alice and Bob hold
perfectly correlated charges, it works only for two-party protocols.

For cryptographic protocols with more than two parties, and for general
superselection rules, new subtleties arise. In two spatial dimensions,
general charges are not merely locally conserved, they may also have
nontrivial braiding properties: the exchange of two charges may induce a
nontrivial transformation on their joint Hilbert space. This means that
the effect of sending a message from one party to another can depend on
the path along which the message travels. It is an interesting problem
to specify appropriate definitions of security for protocols in this
setting, but we will not attempt to address this issue here. For the
special case of charges labeled by unitary representations of compact
groups, the braiding properties are trivial; therefore in that case we
can analyze multiparty protocols without confronting such questions.

Verstraete and Cirac [13] recently discussed a data-hiding protocol
whose security is premised on a superselection rule. However, as the
authors recognized, the protocol is not unconditionally secure; it can
be broken if the parties establish a suitable shared reference state via
quantum communication. The notion that the naive implications of a
superselection rule can be evaded through the use of a suitable
reference system was emphasized long ago by Aharonov and Susskind [14];
see [15] for a recent discussion. A special case of our main result was
reported earlier in [16].

The rest of this paper is organized as follows: We develop the concept
of a reference system in Sec. II, first for Abelian, then for
non-Abelian symmetries, and we explain how a reference system can be
used to simulate unrestricted operations in a world subject to
superselection rules arising from a symmetry group; this observation is
applied in Sec. III to the analysis of the security of quantum
protocols. In Sec. IV we explore the distinction between an itinerant
reference system that is passed from party to party as needed during a
protocol, and a distributed reference system that can be prepared and
passed out to the parties before the protocol begins. Superselection
rules arising from non-Abelian symmetries are further characterized in
Sec. V, and we comment in Sec. VI on the data-hiding protocol of
Verstraete and Cirac. Our analysis of the impact of superselection rules
on the security of quantum bit commitment is in Sec. VII; we also show
there that for the analysis of security of an n-party protocol, it
suffices to consider the case in which the total charge held by the
parties is trivial. Two-party protocols subject to general
superselection rules are investigated in Sec. VIII, and Sec. IX contains
some concluding comments.

II.  SUPERSELECTION  RULES  AND  REFERENCE 
SYSTEMS 

A superselection rule is a decomposition of Hilbert space into sectors
that are preserved by local operations. The different sectors can be
distinguished by attaching to each sector a label, which we refer to as
the sector’s “charge.” Therefore, an equivalent way to characterize a
superselection rule is to say that the charge is locally conserved. In
the context of a cryptographic protocol, this means that when one of the
parties (Alice, say) performs an operation, the charge in Alice’s
laboratory is preserved.

An important special case arises if the Hilbert space $\mathcal{H}$
transforms as a unitary representation of a compact group $G$, and the
sectors are labeled by the irreducible representations of $G$. An
equivalent way to describe the superselection rule in that case is to
say that the allowed operations must commute with the action of $G$ on
$\mathcal{H}$. In fact, it has been shown by Doplicher and Roberts [17]
that such superselection rules are almost the most general ones allowed
under rather weak conditions that apply in particular to quantum field
theories (without gravity) in three or more spatial dimensions. We say
“almost” because there is an additional freedom to assign to a localized
state an even or odd fermion number. This fermion number is more than
just a conserved charge, because of the property that the wave function
changes sign when two fermions are exchanged.

In two spatial dimensions, there is a richer classification of
superselection rules, reflecting the exotic quantum numbers carried by
pointlike non-Abelian anyons that occur in topological quantum field
theories [18-20]. We will postpone further discussion of non-Abelian
anyons until Sec. VIII, concentrating for now on the superselection
rules associated with compact symmetry groups (and ignoring fermions).

An important example is the group $U(1)$ associated with conservation of
the electric charge $Q$. An agent acting locally can create or
annihilate pairs of particles that carry equal and opposite charges, but
cannot change the total charge in her vicinity. In particular, this
agent is unable to transform any eigenstate of $Q$ into a coherent
superposition of states with different charges, as emphasized by Wick,
Wightman, and Wigner [21,22].

While we might readily accept that local creation of electric charge is
physically impossible, other conservation laws impose superselection
rules that do more violence to our intuition. Suppose, for example (in
nonrelativistic quantum mechanics), that our agent’s actions are
required to conserve the angular momentum $J$ locally. Are we to
conclude that if the agent is presented with a spin-1/2 object polarized
spin-up along the z axis, it is impossible for him to transform it to a
coherent superposition of the spin-up and spin-down states? How are we
to describe what happens when a magnetic field is turned on pointing in
a direction transverse to the z axis and the spin begins to precess? A
partial resolution of this puzzle is attained by noting that the angular
momentum of a classical magnet has an uncertainty large compared to
$\hbar$, so that conservation of angular momentum need not prevent the
magnet from coherently exchanging $J_z = \hbar$ with the spin. But this
explanation does not fully address how the existence of the classical
magnet is itself compatible with the superselection rule.

Such issues were cogently discussed many years ago by Aharonov and
Susskind [14]. They emphasized that even if the total angular momentum
has a definite value (like zero), we can still speak sensibly of the
relative orientation of two subsystems. Whenever an experimentalist
observes the precession of a spin, it is implicit that a reference state
has been established that in effect breaks the rotational symmetry, and
that the precession is measured relative to this reference standard.
Furthermore, Aharonov and Susskind [14] emphasized that just as
conservation of angular momentum need not prevent us from measuring the
relative angular orientation of two objects, so the charge
superselection rule need not prevent us from measuring relative phases
in superpositions of states of different charge.


A.  Abelian  case 

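Before we discuss the more general case in which the symmetry may be
non-Abelian, it will be useful to consider the symmetry group
$G = U(1)$. Then the charge operator $Q$ (the generator of $G$) has
eigenvalues $q \in \mathbb{Z}$, and we denote the corresponding
orthonormal eigenstates by $|q\rangle$. Formal states of definite phase
(with continuum normalization) can be constructed as

$$|\theta\rangle = \frac{1}{\sqrt{2\pi}} \sum_{q=-\infty}^{\infty} e^{-iq\theta}\,|q\rangle, \qquad (1)$$

where

$$\langle\theta'|\theta\rangle = \frac{1}{2\pi} \sum_{q=-\infty}^{\infty} e^{-iq(\theta-\theta')} = \delta(\theta'-\theta) \qquad (2)$$

and

$$|q\rangle = \frac{1}{\sqrt{2\pi}} \int_0^{2\pi} d\theta\, e^{iq\theta}\,|\theta\rangle. \qquad (3)$$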

The phase state $|\theta\rangle$ is the improper eigenstate with
eigenvalue $e^{i\theta}$ of the unitary operator

$$U_+ = \sum_{q=-\infty}^{\infty} |q+1\rangle\langle q| \qquad (4)$$

that increments the value of the charge by one unit. While the phase
$\theta$ is physically unobservable due to the charge superselection
rule, the relative phase $\theta'-\theta$ of the two states
$|\theta'\rangle$ and $|\theta\rangle$ commutes with the charge operator
$Q$ and so is measurable in principle. Indeed, the state

$$\int_0^{2\pi} d\theta''\,|\theta'+\theta''\rangle \otimes |\theta+\theta''\rangle = \sum_{q=-\infty}^{\infty} e^{-iq(\theta-\theta')}\,|-q\rangle \otimes |q\rangle \qquad (5)$$

has a definite value of the relative phase $\theta'-\theta$ and total
charge zero. That is, it is an (unnormalizable) eigenstate with
eigenvalue $e^{i(\theta-\theta')}$ of the charge-conserving operator
$U_- \otimes U_+$, where $U_- = U_+^\dagger$.
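
The eigenvalue property stated for Eq. (4) can be checked numerically in a finite setting. The sketch below is an illustration only, not part of the original analysis: it assumes that charges are taken modulo N (the group Z_N standing in for U(1)), so that U_+ becomes a cyclic shift and the allowed phases are theta = 2*pi*k/N. The names `phase_state`, `shift_up`, and `max_eigen_error` are hypothetical.

```python
import cmath
import math

N = 8  # assumption: charges are taken modulo N so that U_+ is a finite matrix

def phase_state(k):
    """Amplitudes <q|theta> = exp(-i q theta)/sqrt(N), with theta = 2*pi*k/N."""
    theta = 2 * math.pi * k / N
    return [cmath.exp(-1j * q * theta) / math.sqrt(N) for q in range(N)]

def shift_up(state):
    """Cyclic analogue of U_+ = sum_q |q+1><q|: the amplitude of |q-1> moves to |q>."""
    return [state[(q - 1) % N] for q in range(N)]

def max_eigen_error(k):
    """Largest deviation of U_+|theta> from exp(i theta)|theta>."""
    theta = 2 * math.pi * k / N
    psi = phase_state(k)
    shifted = shift_up(psi)
    return max(abs(shifted[q] - cmath.exp(1j * theta) * psi[q]) for q in range(N))

errors = [max_eigen_error(k) for k in range(N)]
print(max(errors))  # expected to be at the level of floating-point rounding
```

In this finite model the eigenvalue relation holds exactly for each of the N allowed phases; the continuum statement in the text is the formal limit of large N.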

Similarly, the phases $\phi_q$ appearing in the expansion of the state
$|\psi\rangle_A$ of a system $A$,

$$|\psi\rangle_A = \sum_q \psi_q\, e^{-i\phi_q}\,|q\rangle_A \qquad (6)$$

(where the $\psi_q$'s are real and positive), are themselves
unobservable, but they can be meaningfully compared to the phases
appearing in the state $|\theta\rangle_R$ of a charge reservoir $R$. For
example, by projecting $|\theta\rangle_R \otimes |\psi\rangle_A$ onto
the sector with total charge zero, we obtain the state

$$|\psi\rangle_{RA} = \frac{1}{\sqrt{2\pi}} \int_0^{2\pi} d\theta'\,|\theta+\theta'\rangle_R \otimes e^{-iQ\theta'}|\psi\rangle_A = \sum_q \psi_q\, e^{-i\phi_q} e^{iq\theta}\,|-q\rangle_R \otimes |q\rangle_A, \qquad (7)$$

which has measurable relative phases. A state like $|\theta\rangle_R$ of
a charge reservoir $R$ that provides a phase standard with which other
states can be compared will be called a “reference state” or a
“condensate.”
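
The projection onto the sector of total charge zero can be illustrated in the same finite setting. The following sketch assumes charges modulo N (Z_N in place of U(1)) and a hypothetical system state with moduli `moduli[q]` and phases `phases[q]`; it keeps only the components with q_R + q_A = 0 (mod N) and compares them with the coefficients psi_q exp(-i phi_q) exp(i q theta) expected from the projection, up to the overall normalization 1/sqrt(N) of the finite model.

```python
import cmath
import math

N = 4                       # assumption: charges modulo N
k = 1
theta = 2 * math.pi * k / N  # phase of the reference state

# Hypothetical system state: real positive moduli psi_q and phases phi_q.
moduli = [0.5, 0.5, 0.5, 0.5]
phases = [0.0, 0.3, 1.1, -0.7]
amp_A = [m * cmath.exp(-1j * p) for m, p in zip(moduli, phases)]

# Reference state |theta>_R expanded in charge eigenstates.
amp_R = [cmath.exp(-1j * q * theta) / math.sqrt(N) for q in range(N)]

# Keep only joint components |q_R>|q_A> with total charge zero (mod N).
projected = {}
for q_R in range(N):
    for q_A in range(N):
        if (q_R + q_A) % N == 0:
            projected[q_A] = amp_R[q_R] * amp_A[q_A]

# Coefficients expected from the projection formula, with finite-model normalization.
expected = {
    q: moduli[q] * cmath.exp(-1j * phases[q]) * cmath.exp(1j * q * theta) / math.sqrt(N)
    for q in range(N)
}

agree = all(abs(projected[q] - expected[q]) < 1e-12 for q in range(N))
print(agree)
```

Each surviving component carries the phase q*theta - phi_q, which depends only on the phase of the system relative to the reservoir.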

In the state $|\psi\rangle_{RA}$, the charge of the system $A$ is
compensated (“screened”) by the charge of the reservoir $R$. Therefore,
the system and reservoir are entangled, and tracing out the reservoir
destroys the coherence of the superposition of charge states for the
system. While formally correct, this statement can be misleading if the
reservoir remains accessible and is allowed to interact with the system
during subsequent operations. For example, the operator $(U_+)_A$ that
increases the charge of the system by one unit is disallowed by the
superselection rule, but it can be accurately simulated by the allowed
charge-conserving operator $(U_-)_R \otimes (U_+)_A$ acting on
$|\psi\rangle_{RA}$; this operator increases the charge of $A$ by
borrowing a unit of charge from $R$. If the reservoir remains accessible
at all times, then an arbitrary (not necessarily charge-conserving)
operation acting on $A$ can be perfectly simulated by a
charge-conserving operation acting on $RA$. Thus, at least as a matter
of principle, the charge superselection rule places no inescapable
restrictions on the allowed operations. This is the main point stressed
by Aharonov and Susskind [14].

The phase reference state can be interpreted physically as a static
piece of superconducting material with a definite value of the
superconducting phase. While the phase itself is not gauge-invariant,
the relative phase of the system and reservoir has observable
consequences (like the Josephson effect) when the two are brought into
contact. Similar issues, discussed in [23-27], arise when considering
the physical content of relative phases in optical systems.

B.  Non-Abelian  case 

Our discussion of the Abelian case has suggested that superselection
rules are nullified if suitable reference systems are available. Now we
consider the more general case, where the symmetry group is $G$, which
may be either a finite group or a compact Lie group. The superselection
rule dictates that allowed local operations must commute with $G$. But
we may anticipate that if a condensate is accessible that completely
breaks the $G$ symmetry, then in effect there is no operative symmetry
at all, and the superselection rules place no restrictions on the
allowed operations.

Formally, if the symmetry is completely broken, then the possible
orientations of the condensate are in one-to-one correspondence with the
elements of the symmetry group $G$. In a particular “fixed gauge,” the
states of the condensate are denoted $|\phi\rangle$, where
$\phi \in G$, and these states transform as the left regular
representation of $G$. That is, a symmetry transformation $g \in G$
acting on the condensate is represented by the unitary $U(g)$ where

$$U(g): |\phi\rangle \to |g\phi\rangle. \qquad (8)$$

These states can be expanded in the basis of irreducible representations
of $G$ as

$$|\phi\rangle = \sum_{q,i,a} \sqrt{\frac{n_q}{n_G}}\, D^q_{ia}(\phi)\,|q,i,a\rangle, \qquad (9)$$

where $n_q$ denotes the dimension of the irreducible representation
$D^q(\phi)$ and $n_G$ is the order of $G$. Inverting the Fourier
transform we obtain

$$|q,i,a\rangle = \sum_{\phi \in G} \sqrt{\frac{n_q}{n_G}}\, D^q_{ia}(\phi)^{*}\,|\phi\rangle. \qquad (10)$$

Note that in Eqs. (9) and (10) we have used notation appropriate for a
finite group; in the case of a compact Lie group, the sum over
$\phi \in G$ would be replaced by an integral with respect to an
invariant measure on the group. The states $|q,i,a\rangle$ transform
under $G$ as

$$U(g)\,|q,i,a\rangle = \sum_j |q,j,a\rangle\, D^q_{ji}(g). \qquad (11)$$

In keeping with standard physics terminology, we will refer to the index
$i = 1, 2, \ldots, n_q$ in $|q,i,a\rangle$ as the “color index,” and to
the action Eq. (11) of $U(g)$ on this index as a “gauge transformation.”
The index $a = 1, 2, \ldots, n_q$, distinguishing the $n_q$ copies of
the representation $D^q$ that occur in the decomposition of the regular
representation, will be called the “flavor” index. The physical
“$G$-invariant” operations are those that commute with all gauge
transformations; these preserve $q$ and act nontrivially only on the
flavor, not the color. Therefore, by including the color we have chosen
a redundant description of the physical Hilbert space. This redundancy,
while not absolutely necessary, is quite convenient, and in particular
will be useful for our discussion in Sec. III of the security of quantum
protocols.

In addition to the $G$ gauge symmetry, there is also a group $G$ of
“global” transformations that commute with $U(g)$, under which the
states $|\phi\rangle$ transform as the right regular representation of
$G$; the element $h$ of the global group is represented by $V(h)$, where

$$V(h)\,|\phi\rangle = |\phi h^{-1}\rangle \qquad (12)$$

and

$$V(h)\,|q,i,a\rangle = \sum_b |q,i,b\rangle\, D^{q*}_{ba}(h). \qquad (13)$$

Thus the global transformations act on the flavor index $a$ of the
states in the $\{|q,i,a\rangle\}$ basis; unlike the gauge
transformations, they act nontrivially on the physical states.
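
That the gauge and global actions of Eqs. (8) and (12) commute, and that each respects the group law, can be verified by brute force for a small non-Abelian group. The sketch below is an illustration under stated assumptions: it takes G = S_3, represents group elements as permutation tuples, and lets `U` and `V` act on the labels phi of the condensate basis states. All function names are hypothetical.

```python
from itertools import permutations

G = list(permutations(range(3)))  # assumption: G = S_3, elements as permutation tuples

def mul(a, b):
    """Group product: (a*b)(x) = a(b(x))."""
    return tuple(a[b[x]] for x in range(3))

def inv(a):
    """Inverse permutation."""
    out = [0] * 3
    for x, y in enumerate(a):
        out[y] = x
    return tuple(out)

def U(g, phi):
    """Gauge (color) transformation, Eq. (8): phi -> g phi."""
    return mul(g, phi)

def V(h, phi):
    """Global (flavor) transformation, Eq. (12): phi -> phi h^{-1}."""
    return mul(phi, inv(h))

# The two actions commute on every basis label.
commute = all(U(g, V(h, phi)) == V(h, U(g, phi)) for g in G for h in G for phi in G)

# Each action is a homomorphism: acting with g2 then g1 equals acting with g1*g2.
U_is_rep = all(U(g1, U(g2, phi)) == U(mul(g1, g2), phi) for g1 in G for g2 in G for phi in G)
V_is_rep = all(V(h1, V(h2, phi)) == V(mul(h1, h2), phi) for h1 in G for h2 in G for phi in G)

print(commute, U_is_rep, V_is_rep)
```

The inverse in the definition of `V` is what makes right multiplication a representation rather than an anti-representation.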

In more geometric terms, a condensate may be interpreted as an
asymmetric classical rigid body that can be rotated either “actively” or
“passively.” What we have called the color (gauge) rotation is a passive
rotation that acts on the space-fixed axes; it does not change the
actual orientation of the body but only changes our mathematical
description of the orientation. In contrast, what we have called the
flavor (global) rotation is an active rotation that acts on the
body-fixed axes and alters the physical orientation. A flavor rotation
is $G$-invariant in the sense that it commutes with color rotations, and
so is a physical operation, allowed by the superselection rule.

In contrast to the flavor orientation, the color orientation of an
isolated system $A$ has no invariant meaning, as it is modified by a
color rotation. However, the orientation of $A$ relative to the
condensate $R$ does have meaning, and an operator that rotates the
relative orientation admits an invariant description. Suppose, for
example, that system $A$ is itself a condensate in the state $\phi_A$,
while the state of $R$ is $\phi_R$. The relative orientation

$$\phi_{RA} = \phi_R^{-1}\phi_A \qquad (14)$$

is invariant if a common color rotation

$$U(h)_{RA}: \phi_A \to h\phi_A, \quad \phi_R \to h\phi_R \qquad (15)$$

is applied to both objects. The transformation $U(g)^{\rm inv}_{RA}$
that changes the relative orientation according to

$$U(g)^{\rm inv}_{RA}: \phi_{RA} \to g\,\phi_{RA} \qquad (16)$$

has an invariant meaning and commutes with the color rotation
$U(h)_{RA}$. We may interpret the invariant rotation as one that rotates
$A$ while $R$ is “held fixed,” acting as

$$U(g)^{\rm inv}_{RA}\left(|\phi_R\rangle \otimes |\phi_A\rangle\right) = |\phi_R\rangle \otimes |\phi_R\, g\, \phi_R^{-1} \phi_A\rangle, \qquad (17)$$

or equivalently

$$U(g)^{\rm inv}_{RA} = \sum_{\phi \in G} (|\phi\rangle\langle\phi|)_R \otimes U(\phi g \phi^{-1})_A. \qquad (18)$$
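
The statements surrounding Eqs. (14) through (17) can be checked exhaustively for a small group. The sketch below is an illustration only: it assumes G = S_3 with elements stored as permutation tuples, and treats both R and A as condensates labeled by group elements. The helper names (`relative`, `color_rotation`, `invariant_rotation`) are hypothetical.

```python
from itertools import permutations

G = list(permutations(range(3)))  # assumption: G = S_3 as a small non-Abelian example

def mul(a, b):
    """Group product: (a*b)(x) = a(b(x))."""
    return tuple(a[b[x]] for x in range(3))

def inv(a):
    """Inverse permutation."""
    out = [0] * 3
    for x, y in enumerate(a):
        out[y] = x
    return tuple(out)

def relative(phi_R, phi_A):
    """Relative orientation of Eq. (14): phi_R^{-1} phi_A."""
    return mul(inv(phi_R), phi_A)

def color_rotation(h, phi_R, phi_A):
    """Common color rotation of Eq. (15), applied to both objects."""
    return mul(h, phi_R), mul(h, phi_A)

def invariant_rotation(g, phi_R, phi_A):
    """Eq. (17): rotate A while R is held fixed."""
    return phi_R, mul(mul(mul(phi_R, g), inv(phi_R)), phi_A)

triples = [(x, r, a) for x in G for r in G for a in G]

# Eq. (14) is unchanged by a common color rotation.
relative_is_invariant = all(
    relative(*color_rotation(h, r, a)) == relative(r, a) for h, r, a in triples)

# Eq. (16): the invariant rotation multiplies the relative orientation by g.
rotates_relative = all(
    relative(*invariant_rotation(g, r, a)) == mul(g, relative(r, a)) for g, r, a in triples)

# The invariant rotation commutes with every common color rotation.
commutes = all(
    color_rotation(h, *invariant_rotation(g, r, a))
    == invariant_rotation(g, *color_rotation(h, r, a))
    for h in G for g, r, a in triples)

print(relative_is_invariant, rotates_relative, commutes)
```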

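If system $A$ is not a reference system but rather an object
transforming as the irreducible representation $q$ of $G$, then
$U(\phi g \phi^{-1})$ can be expanded as

$$U(g)^{\rm inv}_{RA} = \sum_{\phi \in G} (|\phi\rangle\langle\phi|)_R \otimes \Big( \sum_{i,j,a,b} |q,i\rangle\, D^q_{ia}(\phi)\, D^q_{ab}(g)\, D^q_{bj}(\phi^{-1})\, \langle q,j| \Big)_A. \qquad (19)$$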

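More generally, any transformation

$$M_A: |q,i\rangle_A \to \sum_j |q,j\rangle_A\, M_{ji} \qquad (20)$$

acting on the color degree of freedom can be simulated by the invariant
operation

$$M^{\rm inv}_{RA} = \sum_{\phi \in G} (|\phi\rangle\langle\phi|)_R \otimes \Big( \sum_{i,j,a,b} |q,i\rangle\, D^q_{ia}(\phi)\, M_{ab}\, D^q_{bj}(\phi^{-1})\, \langle q,j| \Big)_A. \qquad (21)$$

$M^{\rm inv}_{RA}$ has an invariant meaning because it transforms the
color of $A$ relative to the color of the reference system $R$; in
effect, the color rotation is simulated by converting the color index
into a flavor index (depending on $\phi$), on which $M$ may act with
impunity. For fixed $\phi$, the simulation is achieved via the
isomorphism

$$|q,a\rangle_A \to |q,\phi,a\rangle_{RA} = |\phi\rangle_R \otimes \sum_j |q,j\rangle_A\, D^q_{ja}(\phi), \qquad (22)$$

such that

$$M^{\rm inv}_{RA}\,|q,\phi,a\rangle_{RA} = \sum_b |q,\phi,b\rangle_{RA}\, M_{ba}. \qquad (23)$$

Furthermore, this isomorphism can be extended to operators $M$ that
change the value of $q$ as well as rotating the color for fixed $q$; the
operator

$$M_A\,|q,i\rangle_A = \sum_{q',j} |q',j\rangle_A\, M^{q'q}_{ji} \qquad (24)$$

is simulated by

$$M^{\rm inv}_{RA}\,|q,\phi,a\rangle_{RA} = \sum_{q',b} |q',\phi,b\rangle_{RA}\, M^{q'q}_{ba}, \qquad (25)$$

which generalizes the result

$$M^{\rm inv}_{RA}\left(|\theta\rangle_R \otimes e^{-iq\theta}|q\rangle_A\right) = |\theta\rangle_R \otimes \sum_{q'} e^{-iq'\theta}\,|q'\rangle_A\, M_{q'q} \qquad (26)$$

that we found in the case of $G = U(1)$.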

C.  Properties  of  the  simulation 

We will refer to the world in which all operations are required to
commute with the action of the symmetry group $G$ as the “invariant
world” or “I world,” and we refer to the world in which arbitrary
operations are allowed as the “unrestricted world” or “U world.” What we
have observed in Eqs. (22) and (25) is that the physics of the U world
can be faithfully reproduced in the I world, as long as a suitable
reference system is at our disposal.

Let us restate the main conclusion in a more succinct notation: Suppose
$A$ is an arbitrary system that transforms as some representation of the
group $G$, and let $R$ be a “reference system” that transforms as the
left regular representation of $G$. Let $M$ be an arbitrary
transformation acting on $A$. Then there is a corresponding
transformation $M^{\rm inv}$ acting on $R$ and $A$ defined as

$$M^{\rm inv} = \sum_{\phi \in G} (|\phi\rangle\langle\phi|)_R \otimes [U(\phi)\, M\, U(\phi)^{-1}]_A. \qquad (27)$$

$M^{\rm inv}$ is an invariant operator whose action on $RA$ simulates
the action of $M$ on $A$.

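That is, the operators $M^{\rm inv}$ have the following easily verified
properties:

(i) $M^{\rm inv}$ is $G$-invariant. Proof: From the transformation
properties of $R$ and $A$ we have

$$[U(g) \otimes U(g)]\, M^{\rm inv}\, [U(g)^{-1} \otimes U(g)^{-1}] = \sum_{\phi \in G} (|g\phi\rangle\langle g\phi|) \otimes [U(g\phi)\, M\, U(g\phi)^{-1}] = M^{\rm inv}, \qquad (28)$$

where in the last step we have reparametrized the sum by replacing
$g\phi \to \phi$.

(ii) Invariant operators on $RA$ provide a representation of operators
on $A$. Proof: We have

$$M_1^{\rm inv} M_2^{\rm inv} = \sum_{\phi_1,\phi_2 \in G} (|\phi_1\rangle\langle\phi_1|\phi_2\rangle\langle\phi_2|) \otimes [U(\phi_1) M_1 U(\phi_1)^{-1}\, U(\phi_2) M_2 U(\phi_2)^{-1}] = \sum_{\phi \in G} (|\phi\rangle\langle\phi|) \otimes [U(\phi)\, M_1 M_2\, U(\phi)^{-1}] = (M_1 M_2)^{\rm inv}. \qquad (29)$$

(iii) If $M$ is $G$-invariant, then $M^{\rm inv} = I_R \otimes M_A$.
Proof: If $U(\phi)$ commutes with $M$ for each $\phi$, then

$$M^{\rm inv} = \sum_{\phi \in G} (|\phi\rangle\langle\phi|) \otimes M = I \otimes M. \qquad (30)$$

(iv) If $\rho$ is invariant and ${\rm tr}(\rho_R) = 1$, then

$${\rm tr}\, M^{\rm inv}(\rho_R \otimes \rho) = {\rm tr}\, M\rho. \qquad (31)$$

Proof: If $U(\phi)$ commutes with $\rho$ for each $\phi$, then

$${\rm tr}\, M^{\rm inv}(\rho_R \otimes \rho) = \sum_{\phi \in G} \langle\phi|\rho_R|\phi\rangle\, {\rm tr}[M\, U(\phi)^{-1} \rho\, U(\phi)] = {\rm tr}(\rho_R)\, {\rm tr}(M\rho) = {\rm tr}(M\rho). \qquad (32)$$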

The properties (i) and (iv) mean that as long as the state $\rho$ of $A$
is $G$-invariant, then by making use of a reference system, measurements
in the U world can be faithfully simulated by measurements in the I
world. That is, given an arbitrary measurement performed on $A$ (with
operation elements that are not necessarily $G$-invariant), there is an
invariant measurement performed on $RA$ (with $G$-invariant operation
elements) that has the same probability distribution of outcomes.
Furthermore, it follows from property (ii) that the physics of the U
world can be faithfully reproduced in the I world even if the
measurement is preceded by a series of unitary transformations: applying
$V^{\rm inv}$ in the I world has the same effect as applying $V$ in the
U world. Property (iii) tells us that, as expected, the reference system
$R$ is superfluous if the U-world transformation acting on $A$ is
already $G$-invariant.
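
Properties (i) through (iv) lend themselves to a direct numerical check. The sketch below is an illustration, not the construction used in the text: it assumes the finite group G = Z_3 in place of a general compact group, a hypothetical three-state system A with one state per charge, and arbitrary test matrices `M1`, `M2`, `rho`, and `rho_R`. It builds the invariant operator of Eq. (27) with plain Python lists and tests each property.

```python
import cmath
import math

N = 3                    # assumption: G = Z_3, a finite stand-in for U(1)
charges = [0, 1, 2]      # hypothetical system A: one basis state per charge
d = len(charges)
w = cmath.exp(2j * math.pi / N)

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
            for i in range(len(X))]

def dagger(X):
    return [[X[j][i].conjugate() for j in range(len(X))] for i in range(len(X[0]))]

def kron(X, Y):
    return [[X[i][j] * Y[k][l] for j in range(len(X[0])) for l in range(len(Y[0]))]
            for i in range(len(X)) for k in range(len(Y))]

def add(X, Y):
    return [[X[i][j] + Y[i][j] for j in range(len(X[0]))] for i in range(len(X))]

def trace(X):
    return sum(X[i][i] for i in range(len(X)))

def close(X, Y, tol=1e-12):
    return all(abs(X[i][j] - Y[i][j]) < tol
               for i in range(len(X)) for j in range(len(X[0])))

def U_A(g):
    """exp(-i Q theta) on A, with theta = 2*pi*g/N."""
    return [[w ** (-g * charges[a]) if a == b else 0 for b in range(d)] for a in range(d)]

def U_R(g):
    """Left regular representation on R: |phi> -> |g + phi mod N>."""
    return [[1 if r == (g + c) % N else 0 for c in range(N)] for r in range(N)]

def projector(phi):
    return [[1 if r == c == phi else 0 for c in range(N)] for r in range(N)]

def inv_op(M):
    """Eq. (27): sum over phi of |phi><phi|_R (x) U(phi) M U(phi)^{-1}."""
    total = [[0] * (N * d) for _ in range(N * d)]
    for phi in range(N):
        rotated = matmul(matmul(U_A(phi), M), dagger(U_A(phi)))
        total = add(total, kron(projector(phi), rotated))
    return total

# Arbitrary (non-invariant) test operators and states; all values are hypothetical.
M1 = [[0.2, 1.0, 0.5j], [0.3, -0.4, 2.0], [1.5, 0.7j, 0.1]]
M2 = [[1.0, 0.0, 0.3], [0.6j, 0.5, -1.0], [0.2, 0.9, 0.4j]]
D = [[1, 0, 0], [0, 2, 0], [0, 0, 3]]                # commutes with U_A(g)
rho = [[0.5, 0, 0], [0, 0.3, 0], [0, 0, 0.2]]        # invariant state of A
rho_R = [[0.5, 0.2, 0.1j], [0.2, 0.3, 0.0], [-0.1j, 0.0, 0.2]]  # trace one

checks = {}
checks["(i) invariance"] = all(
    close(matmul(matmul(kron(U_R(g), U_A(g)), inv_op(M1)), dagger(kron(U_R(g), U_A(g)))),
          inv_op(M1))
    for g in range(N))
checks["(ii) product"] = close(matmul(inv_op(M1), inv_op(M2)), inv_op(matmul(M1, M2)))
checks["(iii) invariant M"] = close(inv_op(D), kron(U_R(0), D))
checks["(iv) trace"] = abs(trace(matmul(inv_op(M1), kron(rho_R, rho)))
                           - trace(matmul(M1, rho))) < 1e-12
print(checks)
```

Note that `rho_R` is chosen with off-diagonal entries on purpose: only its trace enters property (iv), in line with the remark that no condition is needed on the state of the reference system.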

To derive these properties, we require that the reference system
transform as the regular representation of $G$, but no condition is
needed on the state $\rho_R$ of the reference system. Loosely speaking,
the reference system is needed so that when a noninvariant operation
acts on $A$, the change in the charge of $A$ can be balanced by a
compensating change in the charge of $R$. But if the state $\rho$ of $A$
is invariant, then only the charge-conserving part of $M$ contributes to
the expectation value ${\rm tr}(M\rho)$ anyway. In the simulation of
this charge-conserving part of $M$, the reference system is superfluous
and its state irrelevant.

Note that if $G$ is a Lie group rather than a finite group, then the
regular representation is infinite-dimensional, and our formal arguments
require $R$ to be an infinite-dimensional system. How is the fidelity of
the simulation affected if $R$ is truncated to a finite-dimensional
system? In fact, the fidelity will still be perfect if the charge
remains bounded in the process to be simulated. Consider, for example,
the case $G = U(1)$, for which Eq. (27) becomes, e.g.,

$$(|q-r\rangle\langle q|)^{\rm inv} = \sum_{q'} (|q'+r\rangle\langle q'|)_R \otimes (|q-r\rangle\langle q|)_A; \qquad (33)$$

in the I world, a process in which $r$ units of charge are removed from
$A$ is simulated by adding the $r$ units to $R$. Suppose we are assured
that the total charge added to or removed from $A$ will never exceed $r$
units. Then we may choose the initial state of $R$ to carry charge zero,
and we can limit $R$ to the $(2r+1)$-dimensional space spanned by the
states $|q_R\rangle$, $q_R = -r, -r+1, \ldots, r-1, r$. This truncated
reference system suffices because states with $|q_R| > r$ will never be
accessed in the simulation anyway. A similar remark applies if $G$ is an
arbitrary compact Lie group.
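
The charge bookkeeping behind the truncation argument can be sketched in a few lines. The example below is a toy illustration under stated assumptions: it tracks only the charges q_R and q_A (not quantum amplitudes), uses a hypothetical sequence of transfers whose running total never exceeds r in magnitude, and the function name `simulate_transfers` is invented for this sketch.

```python
def simulate_transfers(q_A_start, transfers, r):
    """Track charges when each step removes t units from A and deposits them in R.

    The reservoir starts with charge zero and is truncated to |q_R| <= r.
    """
    q_R, q_A = 0, q_A_start
    total = q_R + q_A
    visited = [q_R]
    for t in transfers:
        q_A -= t
        q_R += t
        if abs(q_R) > r:
            raise ValueError("reservoir truncation exceeded")
        assert q_R + q_A == total  # total charge is conserved at every step
        visited.append(q_R)
    return q_R, q_A, visited

r = 2
# Hypothetical process: the net charge moved out of A stays within [-r, r].
q_R, q_A, visited = simulate_transfers(5, [1, 1, -2, -1, -1, 2], r)
print(q_R, q_A, sorted(set(visited)))
```

In this run the reservoir visits exactly the 2r+1 charge values from -r to r and never needs a state outside the truncated space.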

III.  REFERENCE  SYSTEMS  AND  QUANTUM  PROTOCOLS 

We have concluded that in the presence of a suitable reference system,
superselection rules place no inescapable restrictions on the allowed
operations. We may anticipate, therefore, that a cryptographic protocol
is secure in the invariant “I world” (governed by the superselection
rule) if and only if it is secure in the unrestricted “U world.” If we
faithfully adhere to the usual stringent principles of quantum
cryptology and place no restrictions on the resources available to our
adversaries, then we must admit the possibility that the dishonest
parties could share access to a reference system during the execution of
the protocol. For the case of superselection rules arising from compact
symmetry groups, this observation suffices to answer Popescu’s question
about the impact of superselection rules on the security of quantum
protocols.

Let  us  now  discuss  this  point  in  greater  detail.  To  be  ex¬ 
plicit,  consider  at  first  a  protocol  involving  two  parties,  Alice 
and  Bob.  Alice  holds  a  private  local  system  A  that  is  beyond 
Bob’s  control,  and  Bob  holds  a  private  local  system  B  that  is 
beyond  Alice’s  control.  In  addition,  there  is  a  message  sys¬ 
tem  M  that  they  can  pass  back  and  forth.  At  the  beginning  of 
the  protocol,  they  share  a  product  state  pA<S> pB®  pM.  In  each 
round  of  the  protocol,  one  of  the  parties  performs  a  joint 
quantum  operation  on  her/his  local  system  and  the  message, 
and  then  sends  the  message  system  to  the  other  party.  Finally, 
after  all  quantum  communication  is  completed,  both  parties 
perform  local  measurements.  (See  Fig.  1.) 
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A  out 


B  out 


G-invariant.  In  effect,  then,  Bob  measures  the  invariant  op¬ 
erator 


FB,h=V^EB'bV 


(36) 


FIG.  1.  A  two-player  quantum  game.  Alice  and  Bob  have  pri¬ 
vate  systems,  and  a  message  system  that  they  pass  back  and  forth. 
At  the  end  of  the  game,  Alice  and  Bob  measure  their  private 
systems. 

For  example,  the  goal  of  the  protocol  might  be  to  flip  an 
unbiased  coin.  In  that  case,  the  final  measurement  performed 
by  each  party  has  two  possible  outcomes,  0  or  1.  If  both 
parties  follow  the  protocol,  then  both  obtain  the  same  out¬ 
come.  Furthermore,  the  two  outcomes  are  equiprobable.  A 
coin-flipping  protocol  is  secure  if  neither  party,  by  departing 
from  the  protocol,  can  bias  significantly  the  outcome  of  the 
other  party’s  measurement. 

We  say  that  a  strong  coin-flipping  protocol  has  bias  s  if 
neither  party  by  cheating  can  force  either  outcome  to  occur 
with  probability  greater  than  \  +  e.  In  a  weak  coin-flipping 
protocol,  Alice  wins  if  the  outcome  is  0  and  Bob  wins  if  the 
outcome  is  1,  and  we  say  that  the  bias  is  s  if  neither  can 
force  a  win  with  probability  greater  than  \  +  e.  (Thus,  in  a 
weak  protocol  with  bias  s,  a  cheater  might  be  able  to  lose  on 
purpose  with  a  probability  exceeding  |  +  e.)  Note  that  the 
protocol  might  abort  if  cheating  is  detected;  by  “the  probabil¬ 
ity  of  outcome  0”  we  mean  the  joint  probability  that  the 
protocol  does  not  abort  and  the  outcome  is  0.  Kitaev  [10,11] 
has  shown  that,  if  no  superselection  rules  are  imposed,  then 
strong  quantum  coin  flipping  is  impossible  with  bias 
6<  (1/ V2)  — 1  =  0.207.  Ambainis  [28]  has  shown  that  a  weak 
coin-flipping  protocol  with  bias  s  requires  at  least 
I2[log  log(l/s)]  rounds  of  communication. 

We  are  interested  in  whether  these  conclusions  about 
coin-flipping  in  the  U  world  remain  valid  in  the  /  world.  For 
a  coin-flipping  protocol  in  the  /  world,  we  may  assume  that 
the  initial  state  shared  by  Alice  and  Bob  is  a  tensor  product 
of  invariant  states  pA  0  pB®  pM.  In  the  honest  protocol,  Alice 
and  Bob  take  turns  applying  G-invariant  operations  to  the 
system  that  they  share,  then  measure  invariant  observables. 
In  fact,  without  loss  of  generality,  we  may  assume  [8]  that 
each  operation  applied  by  Alice  or  Bob  is  an  invariant  uni¬ 
tary  transformation,  and  that  the  final  measurement  is  an  in¬ 
variant  projective  measurement. 

If Alice and Bob play the game honestly, then the probability $P_B(b)$ that Bob's measurement yields the particular outcome $b$ can be expressed as

$$P_B(b) = \mathrm{tr}\left[E_{B,b}\, V(\rho_A \otimes \rho_B \otimes \rho_M) V^\dagger\right], \tag{34}$$

where

$$V = V_{B,n} V_{A,n} \cdots V_{B,1} V_{A,1}. \tag{35}$$

Here the $V_{A,j}$ are unitary transformations applied to $AM$ (we have assumed that Alice makes the first move in the game), the $V_{B,j}$ are unitary transformations applied to $BM$, and the $E_{B,b}$ are the projectors defining Bob's final measurement.
Furthermore, in the $I$-world protocol, $V_{A,j}$, $V_{B,j}$, and $E_{B,b}$ are $G$-invariant, and the system begins in the invariant state $\rho_A \otimes \rho_B \otimes \rho_M$.

Of course, a protocol in the $I$ world can be regarded as a special case of a protocol in the $U$ world, where the initial state is a product state, and Kitaev's result applies to this $U$-world protocol. Therefore, one of the parties (Alice, say) can force one of the outcomes (0, say) with probability at least $1/\sqrt{2}$. However, Alice's cheating strategy that achieves this result might employ operations that are not $G$-invariant. To show that Kitaev's result also applies to the original $I$-world protocol, we must show that Alice's cheating strategy in the $U$ world can be faithfully simulated in the $I$ world by making use of a suitable reference system. For this purpose, we apply the properties of the invariant operator $M^{\rm inv}$ that were discussed in Sec. II C.

When Alice cheats in the $U$ world, she replaces the operator $V_{A,j}$ called for in the honest protocol with an arbitrary operator $V'_{A,j}$ applied to $AM$, where $V'_{A,j}$ is not necessarily $G$-invariant. Then Bob's measurement yields the outcome $b$ with probability

$$P'_B(b) = \mathrm{tr}\left[F'_{B,b}(\rho_A \otimes \rho_B \otimes \rho_M)\right], \tag{37}$$

where

$$F'_{B,b} = V'^{\dagger} E_{B,b} V' \tag{38}$$

and

$$V' = V_{B,n} V'_{A,n} \cdots V_{B,1} V'_{A,1}. \tag{39}$$

This cheating strategy in the $U$ world can be simulated in the $I$ world if Alice has a reference system $R$: instead of applying the noninvariant operator $V'_{A,j}$ to the system $AM$, she applies the invariant operator $V'^{\rm inv}_{A,j}$ to $RAM$. Note that since Bob follows the honest protocol, which requires $V_{B,j}$ to be $G$-invariant, applying $V_{B,j}$ to $BM$ is equivalent to applying $V^{\rm inv}_{B,j}$ to $RBM$, by property (iii) in Sec. II C. Therefore, when Alice adopts the $I$-world strategy, Bob obtains outcome $b$ with probability

$$P''_B(b) = \mathrm{tr}\left[F''_{B,b}(\rho_R \otimes \rho_A \otimes \rho_B \otimes \rho_M)\right], \tag{40}$$

where

$$F''_{B,b} = V''^{\dagger} E_{B,b} V'' \tag{41}$$

and

$$V'' = V^{\rm inv}_{B,n} V'^{\rm inv}_{A,n} \cdots V^{\rm inv}_{B,1} V'^{\rm inv}_{A,1}. \tag{42}$$


But since the invariant operators provide a representation [property (ii)], we may write $V'' = V'^{\rm inv}$, and since $E_{B,b} = E^{\rm inv}_{B,b}$ as well, we have




KITAEV,  MAYERS,  AND  PRESKILL 


PHYSICAL  REVIEW  A  69,  052326  (2004) 


$$F''_{B,b} = F'^{\rm inv}_{B,b}. \tag{43}$$

Finally, the initial state $\rho_A \otimes \rho_B \otimes \rho_M$ shared by Alice and Bob is $G$-invariant; therefore, by property (iv),

$$P''_B(b) = P'_B(b); \tag{44}$$

the measurement outcome $b$ in the $I$-world protocol occurs with the same probability as the outcome $b$ in the $U$-world protocol.

Therefore, Alice's simulated cheating strategy in the $I$ world perfectly reproduces the probability distribution for Bob's measurement outcome that is achieved by her cheating strategy in the $U$ world. The same is true if Bob makes the first move in the game instead of Alice. Similarly, if Bob is the cheater, Bob has a strategy in the $I$ world that simulates his $U$-world cheating strategy. We conclude that if Alice (or Bob) can cheat in the $U$ world, then she (he) can cheat just as successfully in the $I$ world. Thus, Kitaev's proof of the impossibility of strong coin flipping with bias $\varepsilon < (1/\sqrt{2}) - \tfrac{1}{2}$, originally formulated in the $U$ world, also applies to the $I$ world. Similarly, Ambainis's lower bound on the number of rounds of communication needed for weak coin flipping also applies to the $I$ world.

This conclusion that cheating in the $U$ world can be successfully simulated in the $I$ world applies not just to coin-flipping protocols, but to any two-party protocol in which the goal of a cheating Alice is to bias the outcome of a measurement performed by an honest Bob. Furthermore, it is straightforward to generalize the argument to an $n$-party protocol, in which $k$ cheating parties wish to bias the outcomes of measurements performed by the $n-k$ honest parties. For such a protocol in the $I$ world, where the initial state is a product of invariant states, any cheating strategy that can be executed in the $U$ world can be simulated perfectly in the $I$ world if the $k$ cheating parties share access to a reference system. Therefore, the protocol can be no more secure in the $I$ world than in the $U$ world.

To summarize, let us refer to an $n$-party quantum game as an $I$-world game if the initial state is a product of invariant states, and if in the honest protocol all operations performed by the parties are invariant operations. If $k<n$ parties are cheaters, we say that their cheating strategy is an $I$-world cheating strategy if the cheaters are required to perform invariant operations, and we say that their cheating strategy is a $U$-world cheating strategy if the operations performed by the cheaters are unrestricted. Let us say that an $I$-world cheating strategy is equivalent to a $U$-world cheating strategy if both strategies produce the same probability distributions for the outcomes of the measurements performed by the $n-k$ honest parties. We have proved the following.

Theorem 1. Suppose that in the $I$ world all quantum operations are required to be $G$-invariant, where $G$ is a compact Lie group, and that in the $U$ world quantum operations are unrestricted. Consider an $n$-party $I$-world quantum game, and a $U$-world cheating strategy $A'$ in which $k<n$ parties cheat. Then there is an $I$-world cheating strategy $A^I$ that is equivalent to $A'$.


As we observed in Sec. II C, the reference system required by the cheaters in the $I$ world can be finite-dimensional, as long as the cheaters in the $U$ world apply operations that change the "charge" by a bounded amount.


IV.  DISTRIBUTED  REFERENCE  SYSTEMS 

The key ingredient in our discussion of $I$-world quantum protocols is the observation that $G$-noninvariant operations can be faithfully simulated through the use of a reference system. Suppose, for example, that Alice and Bob take turns acting on a system $C$ that they pass back and forth. Then Alice and Bob in the $I$ world can simulate an arbitrary $U$-world protocol in which the initial state of $C$ is $G$-invariant. They carry out the simulation by passing the reference system $R$ back and forth along with $C$, each taking turns applying invariant operations to $RC$. Similarly, in our analysis of cheating in Sec. III, we allowed the $k$ cheaters to pass the reference system $R$ among themselves as needed during the execution of the protocol. A reference system that travels from place to place might be called itinerant.

Here we will briefly discuss an alternative scenario, in which the parties share a distributed reference system: each party holds a fixed portion of this system throughout the execution of the protocol. This discussion is not actually needed for our analysis of security, but it is helpful nonetheless for understanding the physics of superselection rules. Indeed, in many physical situations in which reference systems are used (e.g., in optical physics), the system is distributed rather than itinerant.

Let $A$ denote Alice's part of the reference system, $B$ denote Bob's part, and suppose that at the start of the protocol $AB$ is prepared in the state

$$|0\rangle_{AB} = \frac{1}{\sqrt{n_G}} \sum_{\phi \in G} |\phi\rangle_A \otimes |\phi\rangle_B. \tag{45}$$

This state has trivial total charge; indeed, when expressed in the Fourier-transformed charge-eigenstate basis, it is

$$|0\rangle_{AB} = \frac{1}{\sqrt{n_G}} \sum_{q,i,a} |q,i,a\rangle_A \otimes |\bar{q},i,a\rangle_B. \tag{46}$$

Thus, in principle Alice (say) could prepare $|0\rangle_{AB}$ in her lab and then ship half of it to Bob. (The state $|0\rangle_{AB}$ is unnormalizable and unphysical if $G$ is a Lie group. For now we will suppose that $G$ is a finite group, but we will comment on the case of a Lie group below.)

In the state $|0\rangle_{AB}$, Alice's condensate and Bob's have values that are distributed uniformly over the group $G$, but these values are locked together. Therefore, if $|\psi\rangle_C$ is any pure state of $C$, then $M^{\rm inv}_{AC}$ and $M^{\rm inv}_{BC}$ act on $|0\rangle_{AB} \otimes |\psi\rangle_C$ in the same way,

$$M^{\rm inv}_{AC}\left(|0\rangle_{AB} \otimes |\psi\rangle_C\right) = M^{\rm inv}_{BC}\left(|0\rangle_{AB} \otimes |\psi\rangle_C\right) = \frac{1}{\sqrt{n_G}} \sum_{\phi \in G} |\phi\rangle_A \otimes |\phi\rangle_B \otimes \left[U(\phi) M U(\phi)^{-1}\right] |\psi\rangle_C. \tag{47}$$

Furthermore, $M^{\rm inv}_{AC}$ and $M^{\rm inv}_{BC}$ act identically on any state of the form

$$|\Psi\rangle_{ABC} = \frac{1}{\sqrt{n_G}} \sum_{\phi \in G} |\phi\rangle_A \otimes |\phi\rangle_B \otimes |\psi_\phi\rangle_C, \tag{48}$$

where $|\psi_\phi\rangle_C$ might depend on $\phi$, a form that is maintained as successive invariant operations are applied to $AC$ and to $BC$. Therefore, the outcome of the protocol would be the same if each invariant operation $M^{\rm inv}_{BC}$ applied to $BC$ were replaced by the corresponding invariant operation $M^{\rm inv}_{AC}$ applied to $AC$. We conclude that the simulation in which the distributed reference system $AB$ is prepared in the initial state $|0\rangle_{AB}$ is equivalent to a simulation that uses an itinerant reference system $A$. Since this latter simulation has all of the properties listed in Sec. II C, we find that a bipartite $I$-world protocol using the distributed reference system can faithfully simulate an arbitrary $U$-world protocol.
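
The equivalence of the two invariant operations on the locked reference state can be checked numerically in a toy setting. The sketch below makes several assumptions that are not in the text: the group is the cyclic group $Z_N$, the system $C$ is a single qubit with charges 0 and 1, and the noninvariant operator $M$ is the Pauli $X$. All function and variable names are illustrative.

```python
import cmath

N = 4  # order of the cyclic group Z_N (an assumption made for this toy example)

def twisted_x(k):
    """Matrix of U(k) X U(k)^-1 on qubit C, where U(k) = diag(1, w^k), w = exp(2*pi*i/N)."""
    w = cmath.exp(2j * cmath.pi * k / N)
    return [[0, w.conjugate()], [w, 0]]  # indexed as [c_out][c_in]

def apply_inv(state, ref):
    """Apply X^inv, controlled on reference register `ref` (0 = Alice's A, 1 = Bob's B)."""
    out = {}
    for (a, b, c), amp in state.items():
        m = twisted_x((a, b)[ref])
        for c_out in (0, 1):
            coeff = m[c_out][c] * amp
            if coeff != 0:
                key = (a, b, c_out)
                out[key] = out.get(key, 0) + coeff
    return out

# |0>_AB (x) |psi>_C, with the two reference registers locked together (a == b).
psi_c = {0: 0.6, 1: 0.8j}
state = {(k, k, c): amp / N ** 0.5 for k in range(N) for c, amp in psi_c.items()}

via_alice = apply_inv(state, 0)
via_bob = apply_inv(state, 1)
same = all(abs(via_alice[key] - via_bob.get(key, 0)) < 1e-12 for key in via_alice)
print(same)  # True
```

Because every component of the state has equal labels on $A$ and $B$, controlling the twist on either register gives the same result, which is the content of Eq. (47) in this toy case.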

Note that the distributed state can serve the same purpose if there is a fixed offset of Bob's condensate relative to Alice's, as long as the offset is known. That is, if Alice and Bob share the state

$$|0,\phi\rangle_{AB} = \frac{1}{\sqrt{n_G}} \sum_{\phi' \in G} |\phi'\rangle_A \otimes |\phi'\phi\rangle_B = \frac{1}{\sqrt{n_G}} \sum_{q,i,a,b} D^{q}_{ab}(\phi)\, |q,i,a\rangle_A \otimes |\bar{q},i,b\rangle_B, \tag{49}$$

then the invariant operations $M^{\rm inv}_{BC}$ and $[U(\phi) M U(\phi)^{-1}]^{\rm inv}_{AC}$ act in the same way. If Bob knows $\phi$, then he can participate successfully in the simulation by "twisting" his operations appropriately.

Similarly, in a protocol with $k$ parties, the distributed reference state

$$|0\rangle_{k\ \mathrm{parties}} = \frac{1}{\sqrt{n_G}} \sum_{\phi \in G} |\phi\rangle_{R_1} \otimes |\phi\rangle_{R_2} \otimes \cdots \otimes |\phi\rangle_{R_k} \tag{50}$$

provides a common "phase standard" for all the participants, allowing them to simulate a $U$-world protocol in the $I$ world: the $\ell$th party simulates the noninvariant operation $M$ by applying $M^{\rm inv}$ to the target system and her part $R_\ell$ of the reference system. Again, the parties can twist their local operations to compensate for known relative offsets of their condensates, if necessary.

In the state $|0\rangle_{AB}$, there is a quantum correlation between Alice's condensate and Bob's. A common reference standard can be provided instead by a classically correlated state such as

$$\rho_{AB} = \frac{1}{n_G} \sum_{\phi \in G} (|\phi\rangle\langle\phi|)_A \otimes (|\phi\rangle\langle\phi|)_B. \tag{51}$$

If Alice and Bob are equipped with the state $\rho_{AB}$, then again $M^{\rm inv}_{AC}$ and $M^{\rm inv}_{BC}$ act in the same way, hence they can use this distributed reference state to simulate a $U$-world protocol in the $I$ world. The state is $G$-invariant, but unlike $|0\rangle_{AB}$ it is not a charge eigenstate; rather it is a mixture of (invariant) states with various charges. For example, in the case $G = U(1)$, $|0\rangle_{AB}$ is the (unnormalizable) state

$$|0\rangle_{AB} = \int_0^{2\pi} \frac{d\theta}{2\pi}\, |\theta\rangle_A \otimes |\theta\rangle_B = \sum_{q=-\infty}^{\infty} |-q\rangle_A \otimes |q\rangle_B; \tag{52}$$

Alice's charge and Bob's charge are perfectly anticorrelated. In contrast, $\rho_{AB}$ is

$$\rho_{AB} \propto \int d\theta\, (|\theta\rangle\langle\theta|)_A \otimes (|\theta\rangle\langle\theta|)_B \propto \sum_{q_A, q_B, q} |q_A, q_B\rangle\langle q_A - q,\, q_B + q|. \tag{53}$$

Formally, this state appears to be separable, as it is a mixture of the product states $|\theta\rangle \otimes |\theta\rangle$, but this is deceptive, because $|\theta\rangle \otimes |\theta\rangle$ is not $G$-invariant and is therefore incompatible with the superselection rule. On the other hand, in the charge-eigenstate basis, $\rho_{AB}$ can be expressed as a mixture of $G$-invariant pure states, each with a definite total charge; however, these pure states are highly entangled, with an indefinite value of Alice's (and Bob's) local charge. The state $\rho_{AB}$ is not a mixture of invariant product states, and therefore cannot be prepared without quantum communication between Alice and Bob. Classical communication alone is insufficient for Alice and Bob to establish their common phase standard.

Now let us return to the question we postponed earlier: what if $G$ is a Lie group, so that the states $|0\rangle_{AB}$ and $\rho_{AB}$ are unnormalizable? To be specific, consider again the case $G = U(1)$, and suppose that Alice and Bob are instructed to perform this protocol: Alice is presented with a charge-zero state $|0\rangle$. She is instructed to rotate this state to the superposition of charge eigenstates $(|0\rangle + |1\rangle)/\sqrt{2}$ and to send the resulting state to Bob. Bob is to perform an orthogonal measurement in the basis $(|0\rangle \pm |1\rangle)/\sqrt{2}$ and so verify that Alice prepared the correct state. To make sense of this procedure, Alice and Bob must share a common reference state that serves to lock together their phase conventions; for example, this state could be a shared pure state $|\psi\rangle_{AB}$ with definite total charge. Alice's coherent operation on system $C$ acts as

$$|\psi\rangle_{AB} \otimes |0\rangle_C \to \frac{1}{\sqrt{2}}\left(|\psi\rangle_{AB} \otimes |0\rangle_C + (U_-)_A |\psi\rangle_{AB} \otimes |1\rangle_C\right); \tag{54}$$

that is, Alice simulates the charge-nonconserving operator $(U_+)_C$ by applying the invariant operator $(U_-)_A \otimes (U_+)_C$ to $AC$. When Bob receives system $C$, he performs his measurement by first simulating the transformation

$$|0\rangle_C \to \frac{1}{\sqrt{2}}\left(|0\rangle_C + |1\rangle_C\right), \qquad |1\rangle_C \to \frac{1}{\sqrt{2}}\left(|0\rangle_C - |1\rangle_C\right), \tag{55}$$

and then measuring the charge of $C$. After Bob's first step, the state of $ABC$ has become

$$\frac{1}{2}\left[I_A \otimes I_B + (U_-)_A \otimes (U_+)_B\right] |\psi\rangle_{AB} \otimes |0\rangle_C + \frac{1}{2}\left[I_A \otimes (U_-)_B - (U_-)_A \otimes I_B\right] |\psi\rangle_{AB} \otimes |1\rangle_C. \tag{56}$$

When Bob measures the charge, the probability that he obtains the outcome $|1\rangle_C$ and fails to verify Alice's state is

$$P_1 = \frac{1}{2}\left[1 - \mathrm{Re}\; {}_{AB}\langle\psi| (U_-)_A \otimes (U_+)_B |\psi\rangle_{AB}\right]. \tag{57}$$

If, for example, the shared reference state is

$$|\psi\rangle_{AB} = \frac{1}{\sqrt{N}} \left(\sum_{q=1}^{N} |-q\rangle_A \otimes |q\rangle_B\right), \tag{58}$$

a normalizable approximation to the state $|0\rangle_{AB}$, our expression for $P_1$ becomes

$$P_1 = \frac{1}{2N}. \tag{59}$$

Thus, for finite $N$, the state received by Bob does not match perfectly with the state prepared by Alice; the superposition of charge eigenstates decoheres slightly. But this decoherence becomes negligible in the limit $N \to \infty$, where the "charge fluctuations" of the shared condensate are large.
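
The scaling $P_1 = 1/(2N)$ can be verified directly from Eq. (57). The sketch below assumes a truncated reference state with $N$ equally weighted terms $|-q\rangle_A \otimes |q\rangle_B$ for $q = 1, \dots, N$; the summation range and the function names are assumptions made for illustration, and any $N$ consecutive charges give the same overlap.

```python
def reference_state(n):
    """Amplitudes of (1/sqrt(n)) * sum_{q=1..n} |-q>_A |q>_B, keyed by (q_A, q_B)."""
    return {(-q, q): n ** -0.5 for q in range(1, n + 1)}

def failure_probability(n):
    """P_1 = (1/2) * [1 - Re <psi| (U_-)_A (x) (U_+)_B |psi>] for the truncated state."""
    psi = reference_state(n)
    # (U_-)_A lowers Alice's charge by one; (U_+)_B raises Bob's charge by one.
    shifted = {(qa - 1, qb + 1): amp for (qa, qb), amp in psi.items()}
    # Amplitudes are real, so the overlap is a plain sum of products.
    overlap = sum(amp * shifted.get(key, 0.0) for key, amp in psi.items())
    return 0.5 * (1.0 - overlap)

for n in (1, 10, 1000):
    print(n, failure_probability(n))
```

Only $N-1$ of the $N$ shifted terms overlap with the original state, so the overlap is $(N-1)/N$ and the failure probability falls off as $1/(2N)$.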

The lesson we learn from this example generalizes to non-Abelian compact Lie groups. We can replace the unnormalizable state

$$|0\rangle_{AB} = \frac{1}{\sqrt{n_G}} \sum_{q,i,a} |q,i,a\rangle_A \otimes |\bar{q},i,a\rangle_B \tag{60}$$

by a normalizable state with a truncated sum over the charge $q$. If Alice and Bob use this truncated distributed reference state to simulate a $U$-world protocol, their simulation will not have perfect fidelity. But as long as all operations applied by Alice and Bob change the charge by a bounded amount, the fidelity can be arbitrarily close to 1 if the reference state is chosen appropriately. If Alice and Bob are permitted to use a truncated itinerant reference system rather than a distributed one, then perfect fidelity can be achieved, as observed in Sec. II C.


V.  INVARIANT  OPERATIONS  AND  COMMUTANTS 

Our observations in Sec. II B emphasized the similarities between Abelian and non-Abelian superselection rules, enabling us to formulate a security analysis in Sec. III that applies to both Abelian and non-Abelian symmetry groups. But in several respects the arguments in Sec. III are still not adequate. For one thing, so far we have treated only the special case of superselection sectors labeled by unitary irreducible representations of compact groups. For another, while it is possible to formulate a security analysis of quantum bit commitment within the framework of our argument in Sec. III, it is more natural to structure the argument differently, following more closely the standard analysis of quantum bit commitment.

In this section, we will emphasize the essential differences between superselection rules arising from non-Abelian symmetry groups and those arising from Abelian groups. The discussion will pave the way for our analysis of quantum bit commitment in Sec. VII and of general two-party protocols in Sec. VIII.

A crucial difference between Abelian and non-Abelian charges is that non-Abelian charges are nonadditive: the charges of two subsystems $A$ and $B$ do not necessarily determine the charge of the composite system $AB$. This feature can be restated as a property of the algebra of observables of the bipartite system. Let $\mathcal{A}$ denote the algebra of local operators (an associative algebra, closed under Hermitian conjugation, that commutes with all locally conserved charges) acting on subsystem $A$, and let $\mathcal{B}$ denote the algebra of local operators acting on $B$. The commutant of $\mathcal{A}$, denoted $\mathcal{A}'$, is the algebra of operators acting on the composite system $AB$ that commute with everything in $\mathcal{A}$, and similarly for $\mathcal{B}'$. Now, if all superselection rules are Abelian, then $\mathcal{A}' = \mathcal{B}$ and $\mathcal{B}' = \mathcal{A}$. But if the superselection rules are non-Abelian, the theory has sectors with nontrivial total charge in which this relation does not hold. This unusual structure of the local observables has potential implications for the security of quantum protocols.

To be more explicit, suppose that the superselection rules arise from a non-Abelian symmetry group $G$, and the operations that Alice (or Bob) can perform must commute with $G$. A state $|\psi\rangle$ in Alice's (or Bob's) Hilbert space can be decomposed into irreducible representations of $G$, as

$$|\psi\rangle = \sum_{q,i,a} \psi^{q}_{ia}\, |q,i,a\rangle; \tag{61}$$

here $q$ labels the irreducible representation (or "charge"), $i$ is the "color" index acted upon by the representation of $G$, and $a$ is the "flavor" index that distinguishes among the various copies of the irreducible representation $q$ appearing in the decomposition. Note that since we are no longer assuming that Alice's system transforms as the regular representation of $G$, there need be no connection between the number of flavors and the number of colors associated with $q$. The action of a color gauge rotation representing $g \in G$ on $|\psi\rangle$ is

$$U(g)|\psi\rangle = \sum_{q,i,j,a} \psi^{q}_{ia}\, |q,j,a\rangle\, D^{q}_{ji}(g). \tag{62}$$

An operator $M$ allowed by the superselection rule, which must commute with each $D^{q}(g)$, preserves the charge $q$ and acts only on the flavor index according to

$$M|\psi\rangle = \sum_{q,i,a,b} \psi^{q}_{ia}\, |q,i,b\rangle\, M^{q}_{ba}. \tag{63}$$

Since allowed operations act nontrivially only on the flavor index, it is convenient to use a notation that suppresses the color index $i$. We denote by $\mathcal{H}_q$ the invariant Hilbert space in the charge-$q$ sector, spanned by states $|q,a\rangle$ that are labeled only by the flavor $a$ within the sector. The corresponding operator algebra respecting the superselection rule is $\mathcal{L}(\mathcal{H}_q)$, spanned by linear operators acting on this invariant space. Thus Alice's invariant Hilbert space is

$$\mathcal{H}_A = \bigoplus_q \mathcal{H}_{A,q}, \tag{64}$$

and Alice's local operator algebra is

$$\mathcal{A} = \bigoplus_q \mathcal{L}(\mathcal{H}_{A,q}). \tag{65}$$

Similarly, Bob's operator algebra is

$$\mathcal{B} = \bigoplus_q \mathcal{L}(\mathcal{H}_{B,q}). \tag{66}$$

Now consider the composite system $AB$. Its invariant Hilbert space too can be expressed as a direct sum over charge sectors

$$\mathcal{H} = \bigoplus_q \mathcal{H}_q, \tag{67}$$

while the full operator algebra is $\bigoplus_q \mathcal{L}(\mathcal{H}_q)$. But we should consider how $\mathcal{H}_q$ is related to the invariant Hilbert spaces of the subsystems. The charge-$q$ Hilbert space of the joint system can be expressed as

$$\mathcal{H}_q = \bigoplus_{q_A, q_B} \mathcal{H}_{A,q_A} \otimes \mathcal{H}_{B,q_B} \otimes V_q^{q_A,q_B}, \tag{68}$$

where $V_q^{q_A,q_B}$ denotes the space of invariant linear maps from the irreducible representation $q$ to the tensor product of irreducible representations $q_A \otimes q_B$. This space can be nontrivial (of dimension greater than 1) if the tensor product contains the representation $q$ more than once.

When expressed in terms of a particular color basis for the irreducible representations $q$, $q_A$, and $q_B$, the components of $V_q^{q_A,q_B}$ are the Clebsch-Gordan coefficients ($3j$ symbols) of the group $G$. Let $\{|q_A,i\rangle\}$ denote an orthonormal basis for the representation $q_A$, $\{|q_B,j\rangle\}$ a basis for $q_B$, and $\{|q(\alpha),k\rangle\}$ a basis for $q(\alpha)$, where the index $\alpha$ labels the various copies of the representation $q$ that may be contained in $q_A \otimes q_B$. Then the components of $V_q^{q_A,q_B}$ are

$$\left[V_q^{q_A,q_B}(\alpha)\right]^{k}_{ij} = \left(\langle q_A,i| \otimes \langle q_B,j|\right) |q(\alpha),k\rangle. \tag{69}$$

These components comprise a $G$-invariant tensor with the property

$$\left[V_q^{q_A,q_B}(\alpha)\right]^{k}_{ij} = \sum_{i',j',k'} D^{q_A}_{ii'}(g)\, D^{q_B}_{jj'}(g) \left[V_q^{q_A,q_B}(\alpha)\right]^{k'}_{i'j'} D^{q}_{k'k}(g^{-1}). \tag{70}$$

Invariant operations act not on the color indices of $[V_q^{q_A,q_B}(\alpha)]^{k}_{ij}$, but rather on the index $\alpha$ that distinguishes the flavors of $q$ contained in $q_A \otimes q_B$. Furthermore, the invariant operations can also alter the charges $q_A$ and $q_B$ appearing in Eq. (68), while preserving the total charge $q$.

The notation of Eq. (68) and its implications may be clarified by discussing specific examples. The trivial representation ($q=1$) is contained only in the tensor product of $q_A$ with its conjugate representation $\bar{q}_A$, and it occurs only once in this product. Therefore, in the case where the total charge is $q=1$, Eq. (68) reduces to

$$\mathcal{H}_1 = \bigoplus_q \mathcal{H}_{A,q} \otimes \mathcal{H}_{B,\bar{q}}; \tag{71}$$

in this case, the factor $V_q^{q_A,q_B}$ is superfluous. Now, the joint operator algebra contains operations that cannot be executed by Alice and Bob locally; these operations change Alice's charge and Bob's while preserving the total charge (of course, this can happen even if $G$ is Abelian). But any operation that commutes with Alice's algebra $\mathcal{A}$ must preserve Alice's charge $q$, and act trivially in each of Alice's charge sectors; such operations preserve Bob's charge $\bar{q}$ as well, and thus are in Bob's algebra $\mathcal{B}$. Therefore, $\mathcal{A}$ and $\mathcal{B}$ are commutants of one another.

However, if the total charge is nontrivial, then $\mathcal{B}$ need not be the commutant of $\mathcal{A}$. To illustrate this phenomenon, consider the case $G = SU(2)$, where the irreducible representation is labeled by the spin $j$. For $SU(2)$, $V_j^{j_A,j_B}$ is always one- (or zero-) dimensional, and Eq. (68) reduces to

$$\mathcal{H}_j = \bigoplus_{j_A, j_B} \mathcal{H}_{A,j_A} \otimes \mathcal{H}_{B,j_B}, \tag{72}$$

where it is implicit that each product of representations appearing on the right-hand side transforms as spin $j$. To be concrete, suppose that Alice's system has spin $\tfrac{1}{2}$, Bob's contains both a spin-0 and a spin-1 component, and the total spin is $\tfrac{1}{2}$; then

$$\mathcal{H}_{1/2} = \mathcal{H}_{A,1/2} \otimes \left(\mathcal{H}_{B,0} \oplus \mathcal{H}_{B,1}\right). \tag{73}$$

Note that in this case, contrary to the case in which the total charge is trivial, a single value of $j_A$ can be combined with either of two different values of $j_B$ to obtain the same total charge $j$. Therefore, there are invariant operations acting on the joint system that preserve Alice's charge and the total charge, but change Bob's charge. These operations are in the commutant of $\mathcal{A}$ but not in $\mathcal{B}$; hence $\mathcal{A}' \neq \mathcal{B}$.
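
The counting behind this example follows from the standard $SU(2)$ addition rule, under which $j_A \otimes j_B$ contains each spin from $|j_A - j_B|$ to $j_A + j_B$ in integer steps exactly once. The following sketch, with illustrative function names chosen here, lists which of Bob's spins are compatible with a given total spin and a given spin for Alice.

```python
from fractions import Fraction

def fuse(j_a, j_b):
    """Spins contained in j_a (x) j_b: |j_a - j_b|, ..., j_a + j_b in integer steps."""
    j_a, j_b = Fraction(j_a), Fraction(j_b)
    j = abs(j_a - j_b)
    spins = []
    while j <= j_a + j_b:
        spins.append(j)
        j += 1
    return spins

def compatible_bob_spins(j_total, j_a, bob_spins):
    """Values of j_b (from the given candidates) that can fuse with j_a to give j_total."""
    return [Fraction(j_b) for j_b in bob_spins if Fraction(j_total) in fuse(j_a, j_b)]

half = Fraction(1, 2)
# Nontrivial total charge j = 1/2: both of Bob's sectors are allowed.
print(compatible_bob_spins(half, half, [0, 1]))
# Trivial total charge j = 0: Bob's spin is fixed by Alice's spin.
print(compatible_bob_spins(0, half, [0, half, 1]))
```

With total spin $\tfrac{1}{2}$ both $j_B = 0$ and $j_B = 1$ survive, which is what leaves room for joint invariant operations that change Bob's charge. With trivial total spin only the matching spin survives.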

We arrive at another way of looking at this property of $\mathcal{H}_{1/2}$ if we imagine that there is a third party Charlie who holds a compensating charge, so that the total charge is trivial. Now

$$\mathcal{H}_0 = \mathcal{H}_{A,1/2} \otimes \left(\mathcal{H}_{B,0} \otimes \mathcal{H}_{C,1/2} \oplus \mathcal{H}_{B,1} \otimes \mathcal{H}_{C,1/2}\right); \tag{74}$$

an operation in $\mathcal{A}'$ can be performed by Bob and Charlie acting together, but not by Bob alone.

In order that $\mathcal{A}' \neq \mathcal{B}$, it is not necessary for one of the parties to possess a state with indefinite charge. For example, in the case $G = SU(3)$, the tensor product of the irreducible octet representation $\mathbf{8}$ with itself contains two copies of $\mathbf{8}$, one symmetric and one antisymmetric under interchange of the factors,

$$\mathbf{8}_A \otimes \mathbf{8}_B \supset \mathbf{8}_{\rm sym} \oplus \mathbf{8}_{\rm anti}. \tag{75}$$

Thus, in the decomposition

$$\mathcal{H}_8 = \mathcal{H}_{A,8} \otimes \mathcal{H}_{B,8} \otimes V_8^{8,8}, \tag{76}$$

the joint invariant Hilbert space is two-dimensional, while Alice and Bob both have one-dimensional Hilbert spaces and trivial invariant operator algebras. Then $\mathcal{A}'$ is the full operator algebra, clearly different from $\mathcal{B}$, and similarly $\mathcal{B}'$ is different from $\mathcal{A}$. Again, an alternative description of the invariant space is to note that Charlie could hold a compensating $\mathbf{8}$ charge, in which case the total charge is trivial and

$$\mathcal{H}_1 = \left(\mathcal{H}_{A,8} \otimes \mathcal{H}_{B,8} \otimes \mathcal{H}_{C,8}\right) \otimes V_1^{8,8,8} \tag{77}$$

is two-dimensional.

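For the purpose of describing $G$-invariant operations, it is always legitimate to introduce a compensating charge without incurring any loss of generality. To see this, first note that if $\mathcal{E}$ is a $G$-invariant quantum operation, then

$$\mathcal{E}\left[U(g)\rho U(g)^{-1}\right] = U(g)\mathcal{E}(\rho)U(g)^{-1} \tag{78}$$

for any $g \in G$ and any state $\rho$. In particular, then,

$$\mathcal{G}[\mathcal{E}(\rho)] = \mathcal{E}[\mathcal{G}(\rho)], \tag{79}$$

where $\mathcal{G}$ is the map

$$\mathcal{G}(\rho) = \frac{1}{n_G} \sum_{g \in G} U(g)\rho U(g)^{-1}, \tag{80}$$

which induces decoherence of a superposition of distinct irreducible representations of $G$,

$$\mathcal{G}\left(|q,i,a\rangle\langle q',j,b|\right) = \delta^{qq'}\delta_{ij}\left(\frac{1}{n_q}\sum_{l} |q,l,a\rangle\langle q,l,b|\right). \tag{81}$$

Equation (79) means [26] that the state

$$|\psi\rangle = \sum_{q,i,a} \psi^{q}_{ia}\, |q,i,a\rangle \tag{82}$$

cannot be distinguished by any $G$-invariant operation from the state

$$\mathcal{G}(|\psi\rangle\langle\psi|) = \sum_{q,a,b,i} \psi^{q}_{ia}\psi^{q*}_{ib}\left(\frac{1}{n_q}\sum_{j} |q,j,a\rangle\langle q,j,b|\right). \tag{83}$$

Now, consider a system $A$ whose charge is screened by a system $C$, so that the state of the joint system has trivial total charge,

$$|\psi\rangle_{AC} = \sum_{q,a,i} \frac{\psi^{q}_{a}}{\sqrt{n_q}}\, |q,i,a\rangle_A \otimes |\bar{q},i\rangle_C. \tag{84}$$

Tracing over system $C$ produces the state

$$\mathrm{tr}_C\left(|\psi\rangle\langle\psi|\right)_{AC} = \sum_{q,a,b} \psi^{q}_{a}\psi^{q*}_{b}\left(\frac{1}{n_q}\sum_{j} |q,j,a\rangle\langle q,j,b|\right). \tag{85}$$

But the state Eq. (83) is just a convex combination of states of the form Eq. (85). Therefore, if only $G$-invariant operations are to be considered, it is always harmless to replace system $A$ by half of a bipartite state that carries trivial total charge.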
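
The decohering action of the averaging map in Eqs. (80) and (81) is easy to see in the Abelian case. The sketch below assumes $G = U(1)$ acting as $U(\theta) = \mathrm{diag}(e^{iq\theta})$ on a basis of charge eigenstates, and replaces the group integral by an average over equally spaced angles, which is exact once the number of samples exceeds the largest charge difference. The function name and the sampling choice are illustrative.

```python
import cmath

def twirl(rho, charges, samples=None):
    """Average U(theta) rho U(theta)^-1 over equally spaced angles theta.

    `charges[r]` is the U(1) charge of basis state r; repeated charges play the
    role of different flavors within one charge sector.
    """
    dim = len(charges)
    if samples is None:
        samples = 2 * (max(charges) - min(charges)) + 1
    out = [[0j] * dim for _ in range(dim)]
    for k in range(samples):
        theta = 2 * cmath.pi * k / samples
        for r in range(dim):
            for c in range(dim):
                phase = cmath.exp(1j * (charges[r] - charges[c]) * theta)
                out[r][c] += phase * rho[r][c] / samples
    return out

# Density matrix of (|0> + |1>)/sqrt(2), a superposition of charges 0 and 1.
rho_plus = [[0.5, 0.5], [0.5, 0.5]]
twirled = twirl(rho_plus, charges=[0, 1])
print([[round(abs(x), 12) for x in row] for row in twirled])
```

Coherences between different charges average to zero, while the populations (and any coherences between flavors of equal charge) are left untouched.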


Up until now, we have explicitly discussed only the case of superselection sectors arising from a compact symmetry group, but much of the formalism we have outlined in this section can be extended to a more general setting. Whatever the origin of the superselection rule, the allowed operations act on a suitable invariant space. Sectors can still be classified by conserved charges, but in the general case, the space $V_q^{q_A,q_B}$ is defined more abstractly, rather than in terms of group representations. One important property that continues to hold in the general setting (which will play a central role in our analysis of quantum bit commitment in Sec. VII and of general two-party games in Sec. VIII) is that for each value $q$ of the charge, there is a unique conjugate charge $\bar{q}$ such that the fusion of the charges contains the trivial charge sector.

VI.  DATA  HIDING 

Verstraete and Cirac [13] described a data-hiding protocol whose security is founded on the charge superselection rule for $G = U(1)$. Suppose that a trusted third party Charlie prepares one of the two orthogonal states,

$$|\pm\rangle = \frac{1}{\sqrt{2}}\left(|01\rangle \pm |10\rangle\right), \tag{86}$$

where $|0\rangle$ and $|1\rangle$ denote states of charge 0 and 1, respectively, and distributes half to Alice and half to Bob. If Alice and Bob could each measure the Pauli operator $X$ that interchanges $|0\rangle$ and $|1\rangle$, they could distinguish the states $|+\rangle$ and $|-\rangle$ by performing these measurements and comparing their outcomes. However, $X$ does not commute with the electric charge $Q$; if Alice and Bob are permitted only to perform local charge-conserving operations and to communicate classically, then they will be powerless to distinguish the two possible states.
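
One way to build intuition for this claim is to remove, by hand, every coherence that a local charge-conserving party cannot access. The sketch below is an illustration of that idea rather than a full proof of security against local operations and classical communication: it dephases the two-party density matrix in Alice's local charge and checks that $|+\rangle$ and $|-\rangle$ become the same state. The basis ordering and the function names are assumptions made here.

```python
import math

# Basis states labeled by (Alice's charge, Bob's charge): |00>, |01>, |10>, |11>.
BASIS = [(0, 0), (0, 1), (1, 0), (1, 1)]

def density(sign):
    """Density matrix of (|01> + sign * |10>) / sqrt(2)."""
    amp = [0.0, 1 / math.sqrt(2), sign / math.sqrt(2), 0.0]
    return [[x * y for y in amp] for x in amp]

def dephase_local_charge(rho, party):
    """Drop coherences between different local charges of `party` (0 = Alice, 1 = Bob)."""
    return [[rho[r][c] if BASIS[r][party] == BASIS[c][party] else 0.0
             for c in range(4)] for r in range(4)]

plus = dephase_local_charge(density(+1), 0)
minus = dephase_local_charge(density(-1), 0)
print(density(+1) == density(-1))  # False: the states differ before dephasing
print(plus == minus)               # True: they coincide after local dephasing
```

The sign that encodes the hidden bit lives entirely in the coherence between $|01\rangle$ and $|10\rangle$, which connects different local charges and therefore disappears.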

On the other hand, if Alice and Bob share access to a common phase reference state, their activities will be unrestricted and nothing will prevent them from performing the $X$ measurements that unlock the classical bit stored in the state prepared by Charlie [aside from the small loss of fidelity that arises if the reference state has large but finite charge fluctuations, as in Eq. (58)]. In Bloch sphere language, Alice and Bob have no a priori means of orienting their measurement axes in the $x$-$y$ plane, but a shared phase standard enables them to lock their axes together and compare their measurements. Since the state prepared by Charlie is invariant under rotations about the $z$ axis, the overall orientation in the $x$-$y$ plane is irrelevant; only the relative orientation needs to be fixed to identify Charlie's state.

To be more explicit, while $X$ does not commute with the charge,

$$X^{\rm inv}_{AA'} = (U_-)_A \otimes \sigma^{+}_{A'} + (U_+)_A \otimes \sigma^{-}_{A'} \tag{87}$$

commutes with $Q$, as does $X^{\rm inv}_{BB'}$. If Alice and Bob share a distributed reference state $|\psi\rangle_{AB}$ that is an eigenstate of $(U_-)_A \otimes (U_+)_B$ with eigenvalue 1, then

$$|\psi\rangle_{AB} \otimes |\pm\rangle_{A'B'} \tag{88}$$

is an eigenstate of

$$X^{\rm inv}_{AA'} \otimes X^{\rm inv}_{BB'} \tag{89}$$

with eigenvalue $\pm 1$. Therefore, Alice and Bob can unlock the hidden bit by each measuring $X^{\rm inv}$ and comparing their results. The same holds, of course, if the shared reference state $\rho_{AB}$ is a mixture of eigenstates of $(U_-)_A \otimes (U_+)_B$, each with eigenvalue 1, as in Eq. (53). As Verstraete and Cirac observed [13], quantum communication is needed to establish this shared phase standard.

In the absence of a shared phase standard, neither Alice nor Bob can detect the bit encoded in the state $|\pm\rangle$ of Eq. (86); however, either Alice or Bob can manipulate the bit. Each can measure the charge $q$, and either can apply a phase to the state conditioned on the charge, flipping $|+\rangle \leftrightarrow |-\rangle$. But the property that $\mathcal{B}' \neq \mathcal{A}$ indicates that the situation can be more subtle in the non-Abelian case (with nontrivial total charge). Suppose, for example, that $G = SU(2)$ with total charge $j = \frac{1}{2}$ as in Eq. (73). Two states with the same value of the total charge and of Alice's charge, but different values of Bob's charge, are $|j = \frac{1}{2}, j_A = \frac{1}{2}, j_B = 0\rangle$ and $|j = \frac{1}{2}, j_A = \frac{1}{2}, j_B = 1\rangle$. Charlie might prepare either of the linear combinations

$$|\pm\rangle = \frac{1}{\sqrt{2}}\left( \left|j = \tfrac{1}{2}, j_A = \tfrac{1}{2}, j_B = 0\right\rangle \pm \left|j = \tfrac{1}{2}, j_A = \tfrac{1}{2}, j_B = 1\right\rangle \right) \tag{90}$$


and  then  distribute  the  AB  system  to  Alice  and  Bob.  Again, 
neither  Alice  nor  Bob  can  detect  the  hidden  bit,  but  now 
there  is  a  notable  asymmetry  between  Alice’s  power  and 
Bob’s.  Since  Bob  has  a  superposition  of  two  different  charge 
states,  he  can  tamper  with  the  hidden  bit  by  applying  a  phase 
controlled  by  the  charge.  Alice,  on  the  other  hand,  has  a 
trivial  invariant  operator  algebra,  and  has  no  control  over  the 
shared  state. 

We  may  take  this  observation  a  step  further.  Suppose,  for 
example,  that  G=SU(3)  with  total  charge  q  =  8  as  in  Eq.  (76). 
Charlie  might  prepare  either  of  the  linear  combinations 


$$|\pm\rangle = \frac{1}{\sqrt{2}}\left( |q = 8_{\mathrm{sym}}, q_A = 8, q_B = 8\rangle \pm |q = 8_{\mathrm{anti}}, q_A = 8, q_B = 8\rangle \right), \tag{91}$$

and  then  distribute  the  AB  system  to  Alice  and  Bob.  Again, 
neither  Alice  nor  Bob  can  detect  the  hidden  bit,  but  further¬ 
more,  neither  one  can  tamper  with  the  bit’s  value. 

However,  in  the  non-Abelian  case  as  in  the  Abelian  case, 
the  hidden  bit  can  be  opened  via  local  operations  and  classi¬ 
cal  communication  between  Alice  and  Bob  if  they  are  pro¬ 
vided  with  correlated  reference  systems  that  effectively  re¬ 
move  the  restrictions  imposed  by  the  superselection  rule. 


VII.  QUANTUM  BIT  COMMITMENT 
AND  SUPERSELECTION  RULES 

During the commitment stage of quantum bit commitment, Alice encodes a classical bit by preparing one of two distinguishable quantum states with density operators $\rho_0$ or $\rho_1$, and then she sends half of the state to Bob. In the unveiling stage, Alice sends the other half of the state to Bob, so that he can verify whether the state is $\rho_0$ or $\rho_1$. The protocol is binding if, after commitment, Alice is unable to change the value of the bit. The protocol is concealing if, after commitment and before unveiling, Bob is unable to discern the value of the bit. The protocol is secure if it is both binding and concealing.

In the absence of superselection rules, unconditionally secure quantum bit commitment is impossible [8,9]. If we imagine that the states $\rho_0$ and $\rho_1$ are pure states shared by Alice and Bob, then if the protocol is concealing, Bob's density operator (obtained by tracing over Alice's system) must be the same in both cases: $\rho_{0,B} = \rho_{1,B}$. But then by the HJW theorem [29] Alice can apply a unitary transformation to her half of the state that transforms $\rho_0$ to $\rho_1$, so that the protocol is not binding.
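The cheating step can be illustrated numerically. The sketch below is not taken from the paper: it assumes small, equal dimensions for the two halves and a full-rank state, and it finds Alice's unitary as the polar factor of $M_1 M_0^\dagger$, where $M_i$ is the coefficient matrix of $|\psi_i\rangle = \sum_{a,b} (M_i)_{ab} |a\rangle_A |b\rangle_B$.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 3  # assumed dimension of each half (illustrative choice)

def random_unitary(n):
    """Random unitary from the QR decomposition of a Gaussian matrix."""
    z = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    q, _ = np.linalg.qr(z)
    return q

def bob_reduced(m):
    """Bob's density operator for the state sum_ab m[a, b] |a>|b>."""
    return m.T @ m.conj()

def alice_cheating_unitary(m0, m1):
    """Unitary U on Alice's side with U @ m0 = m1 (polar factor of m1 m0^dagger)."""
    w, _, vh = np.linalg.svd(m1 @ m0.conj().T)
    return w @ vh

# psi0 is random; psi1 is chosen so that Bob's reduced state is unchanged.
m0 = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
m0 /= np.linalg.norm(m0)
m1 = random_unitary(d) @ m0

u = alice_cheating_unitary(m0, m1)
print(np.allclose(bob_reduced(m0), bob_reduced(m1)), np.allclose(u @ m0, m1))
```

The polar-factor construction recovers the unitary exactly here because the random state has full Schmidt rank; a rank-deficient state would leave part of the unitary undetermined.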


A.  Bit  commitment  with  mixed  states 

We reached this conclusion under the assumption that $\rho_0$ and $\rho_1$ are pure states, but we can extend the argument to the case where the states are mixed by appealing to the concept of a purification of a mixed state. We will describe this extension in detail, as we will follow very similar reasoning in our discussion in Sec. VII C of bit commitment with nontrivial total charge.

Suppose that at the start of the bit commitment protocol, Alice and Bob share a product state $\rho_A \otimes \rho_B$, where the states $\rho_A$ and $\rho_B$ are mixed. An equivalent way to describe Alice's initial state is to introduce the ancilla system $C$ and a pure state $|\psi\rangle_{AC}$ (a purification of $\rho_A$), such that the density operator $\rho_A$ is obtained from $|\psi\rangle_{AC}$ by tracing over system $C$:

$$\rho_A = \mathrm{tr}_C\left(|\psi\rangle\langle\psi|\right)_{AC}. \tag{92}$$

Similarly, to describe $\rho_B$ we can introduce the ancilla $D$ and a state $|\varphi\rangle_{BD}$ that purifies $\rho_B$. Without loss of generality, we may assume that in each step of the protocol, Alice or Bob applies a unitary transformation, so that the state of the full system $ABCD$ remains pure. (A general quantum operation performed by Alice, say, can be realized as a unitary transformation applied jointly to Alice's system and to an appropriate ancilla; therefore, the operation is unitary provided that we include this ancilla as part of the system.) In particular, after the bit is committed, the state of the full system is one of the two pure states $|\psi_0\rangle_{ABCD}$ or $|\psi_1\rangle_{ABCD}$.
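As a concrete illustration of Eq. (92), the following sketch builds a purification from the spectral decomposition of a density matrix and checks that tracing over the ancilla returns the original state. The example matrix and the helper names `purify` and `trace_out_c` are illustrative assumptions, not part of the paper.

```python
import numpy as np

def purify(rho):
    """Return M with |psi> = sum_ac M[a, c] |a>_A |c>_C and tr_C |psi><psi| = rho."""
    evals, evecs = np.linalg.eigh(rho)
    evals = np.clip(evals, 0.0, None)  # remove tiny negative round-off
    return evecs @ np.diag(np.sqrt(evals))

def trace_out_c(m):
    """Partial trace over C of the pure state with coefficient matrix m."""
    return m @ m.conj().T

# Assumed example of a mixed single-qubit state rho_A.
rho_a = np.array([[0.7, 0.2], [0.2, 0.3]], dtype=complex)
m = purify(rho_a)
print(np.allclose(trace_out_c(m), rho_a))
```

Any other purification differs from this one by a unitary acting on $C$ alone, which is why the honest party's statistics cannot depend on which purification is used.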

If  both  parties  are  honest,  the  ancillas  C  and  D  are  off 
limits — Alice  can  manipulate  only  A  and  Bob  can  manipu¬ 
late  only  B — and  in  that  case  the  mixed  state  protocol  and  its 
purification  are  completely  equivalent.  Furthermore,  if  one 
party  cheats,  whether  the  other  party  starts  out  with  a  mixed 
state  or  its  purification  has  no  impact  on  the  effectiveness  of 
the  cheating  strategy,  because  the  honest  party  never  touches 
the  purifying  ancilla  anyway. 

Now let us see that in any quantum bit commitment protocol, one of the players can cheat successfully. First suppose that Bob cheats. Though the honest protocol calls for Bob to start out with the mixed state $\rho_B$, a cheating Bob can throw this state away, and replace it with the purification $|\varphi\rangle_{BD}$, where $D$ is now an ancilla system that Bob controls. Therefore, if the protocol is perfectly concealing (even when Bob cheats), then

$$\rho_{0,BD} \equiv \mathrm{tr}_{AC}\left(|\psi_0\rangle\langle\psi_0|\right)_{ABCD} = \rho_{1,BD} \equiv \mathrm{tr}_{AC}\left(|\psi_1\rangle\langle\psi_1|\right)_{ABCD}; \tag{93}$$

Bob is unable to collect any information about the committed bit through any joint measurement on $BD$.

Similarly, a cheating Alice could throw away her initial state and replace it by its purification; then Alice could control both $A$ and the ancilla $C$. Applying the HJW theorem as before, we conclude that if $\rho_{0,BD} = \rho_{1,BD}$, then Alice can apply a unitary transformation to $AC$ that transforms $|\psi_0\rangle_{ABCD}$ to $|\psi_1\rangle_{ABCD}$. We conclude that if the protocol is concealing, then it is not binding. Unconditionally secure quantum bit commitment is impossible, even with mixed states. That quantum bit commitment is impossible even when mixed strategies are used was proved in [8] using a slightly different approach.

B.  Trivial  total  charge 

The  argument  in  Sec.  VII  A  shows  that  for  an  analysis  of 
the  security  of  quantum  bit  commitment,  we  may  assume 
that  Alice  and  Bob  share  a  pure  state.  But  how  is  the  security 
affected  if  superselection  rules  constrain  Alice’s  and  Bob’s 
operations?  We  will  first  consider  the  special  case  in  which 
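the total charge that Alice and Bob share is trivial. After commitment, then, Alice and Bob share one of the two pure states $|\psi_0\rangle$ or $|\psi_1\rangle$, each with trivial total charge. Choosing the Schmidt basis in each charge sector, the state $|\psi_0\rangle$ can be expanded as

$$|\psi_0\rangle_{AB} = \sum_q \sqrt{p_q} \sum_b \sqrt{\lambda_{q,b}}\, |q,b\rangle_A \otimes |q,b\rangle_B, \tag{94}$$

where Bob's density operator is

$$\rho_{0,B} = \mathrm{tr}_A\left(|\psi_0\rangle\langle\psi_0|\right) = \sum_q p_q\, \rho_{0,B,q} \tag{95}$$

and

$$\rho_{0,B,q} = \sum_b \lambda_{q,b}\, |q,b\rangle\langle q,b|. \tag{96}$$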

Bob can measure the probability $p_q$ that his charge is $q$; therefore if the protocol is concealing, then the distribution $\{p_q\}$ must be the same for $|\psi_1\rangle$ as for $|\psi_0\rangle$. Furthermore, Bob's density operator in the charge-$q$ sector must not depend on whether the state is $|\psi_0\rangle$ or $|\psi_1\rangle$; therefore $|\psi_1\rangle$ can be expanded as

$$|\psi_1\rangle_{AB} = \sum_q \sqrt{p_q} \sum_b \sqrt{\lambda_{q,b}}\, |q,b\rangle'_A \otimes |q,b\rangle_B, \tag{97}$$

where $\{|q,b\rangle'_A\}$ is another basis for Alice's charge-$q$ sector. But now Alice can apply a unitary transformation conditioned on the charge that rotates one basis to the other:

$$U_q : |q,b\rangle \to |q,b\rangle', \tag{98}$$

which transforms $|\psi_0\rangle$ to $|\psi_1\rangle$. Therefore, the protocol is not binding.

FIG. 2. “Purification” of a two-party game with nontrivial total charge. At the beginning of the game, the charge of $C$ (hidden behind a brick wall) compensates for Alice's charge $q_A$, and the charge of $D$ (also hidden) compensates for Bob's charge $q_B$. Honest players never touch the compensating charges, but a cheating Alice might manipulate $C$ and a cheating Bob might manipulate $D$.

Obviously, the same argument applies, in the Abelian case, even if the total charge is nontrivial [16]. The key property of the states that is used in the argument is that Alice's charge is perfectly correlated with Bob's, so that $\mathcal{B}' = \mathcal{A}$.
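The charge-conditioned rotation of Eq. (98) can be sketched as a block-diagonal unitary. In the example below, the two charge values, the sector dimensions, the distribution $\{p_q\}$, and the Schmidt weights $\lambda_{q,b}$ are all assumed for illustration; the point is only that a unitary built from one block per charge sector commutes with Alice's charge operator and leaves Bob's density operator untouched.

```python
import numpy as np

rng = np.random.default_rng(1)

def random_unitary(n):
    """Random unitary from the QR decomposition of a Gaussian matrix."""
    z = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    q, _ = np.linalg.qr(z)
    return q

def block_diag(blocks):
    """Direct sum of square matrices, one block per charge sector."""
    n = sum(b.shape[0] for b in blocks)
    out = np.zeros((n, n), dtype=complex)
    i = 0
    for b in blocks:
        k = b.shape[0]
        out[i:i + k, i:i + k] = b
        i += k
    return out

p = np.array([0.6, 0.4])                  # assumed charge distribution {p_q}
lam = np.array([[0.5, 0.5], [0.9, 0.1]])  # assumed Schmidt weights lambda_{q,b}
charge_a = np.diag([0, 0, 1, 1]).astype(complex)  # Alice's charge operator

# Coefficient matrix of |psi_0> in the basis |q,b>_A |q,b>_B, as in Eq. (94).
m0 = np.diag(np.sqrt((p[:, None] * lam).ravel())).astype(complex)

# Charge-conditioned unitary: U_q acts within each charge sector, as in Eq. (98).
u = block_diag([random_unitary(2), random_unitary(2)])
m1 = u @ m0  # coefficient matrix of |psi_1>, as in Eq. (97)

print(np.allclose(u @ charge_a, charge_a @ u))
print(np.allclose(m0.T @ m0.conj(), m1.T @ m1.conj()))
```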

C.  Nontrivial  total  charge 

The property that $\mathcal{B}' \neq \mathcal{A}$ in the non-Abelian case (with nontrivial total charge) encourages one to hope that a bit commitment protocol can be formulated whose security is founded on a non-Abelian superselection rule. Indeed, consider again the case $G = SU(2)$ with total charge $j = \frac{1}{2}$ as in Eq. (73). When Alice has control of the full $AB$ system, she can prepare either of the states $|\pm\rangle_{AB}$ shown in Eq. (90), and then she can send the $B$ system to Bob. Now Bob is unable to distinguish the two states, because he cannot measure the relative phase in a superposition of two states of different charge. Furthermore there is no invariant operation Alice can apply that changes $|+\rangle$ to $|-\rangle$ or vice versa. It seems, then, that the protocol is both concealing and binding. At any rate, quantum bit commitment in a world with non-Abelian superselection rules seems fundamentally different from quantum bit commitment in a world in which all superselection rules are Abelian.

But,  as  always  in  a  discussion  of  information-theoretic 
security,  we  must  be  sure  to  consider  the  most  general  pos¬ 
sible  cheating  strategies.  And  in  fact,  we  can  argue  that  for 
the  security  analysis,  there  is  no  loss  of  generality  if  we 
assume  that  the  charge  shared  by  the  parties  is  trivial,  the 
case  we  have  already  dealt  with  in  Sec.  VII  B.  This  reduction 
to  the  case  of  trivial  total  charge  follows  closely  our  discus¬ 
sion  in  Sec.  VII  A,  where  we  showed  that  it  suffices  to  as¬ 
sume  that  the  parties  share  a  pure  state. 

Consider a general two-party quantum bit commitment protocol in which the initial state shared by Alice and Bob is a tensor product $\rho_A \otimes \rho_B$ of invariant states. The state $\rho_A$ can be purified if we introduce an ancilla $C$; furthermore, the pure state of $AC$ can be chosen to have trivial total charge. Similarly, we can purify $\rho_B$ using the ancilla $D$, in such a way that the pure state of $BD$ has trivial total charge. (See Fig. 2.) Each operation performed by Alice or Bob can be taken to be a charge-conserving unitary transformation; therefore, at each stage of the protocol, the state of the full system $ABCD$ is a pure state with trivial total charge.

In the honest protocol, the ancillas $C$ and $D$ are inaccessible. But if Bob cheats, he can throw away the initial invariant state $\rho_B$ called for in the protocol, and replace it by a trivially charged pure state of $BD$, where $D$ is now an ancilla that Bob controls. Therefore, if the bit commitment protocol is concealing, then $\rho_{0,BD} = \rho_{1,BD}$; that is, Bob cannot learn anything about the committed bit from any invariant joint measurement on $BD$. Since the state of the full system $ABCD$ is a pure state with trivial charge, the argument of Sec. VII B suffices to show that Alice can transform $|\psi_0\rangle$ to $|\psi_1\rangle$ with an invariant local operation applied to $AC$. Hence, the protocol is not binding. We have proved, then, that, even when the protocol calls for a nontrivial total charge, if Bob is unable to cheat then Alice can cheat: unconditionally secure quantum bit commitment is impossible. We have the following.

Theorem 2. Consider a quantum bit commitment protocol in the I world, where at the beginning of the protocol Alice and Bob share a product of invariant states. Then if the protocol is concealing, it is not binding.

Our  proof,  which  reduces  the  case  of  nontrivial  total 
charge  to  the  case  of  trivial  total  charge,  is  really  just  a  minor 
variant  of  the  argument  in  Sec.  VII  A  that  reduces  the  case  of 
a  protocol  where  Alice  and  Bob  share  a  mixed  state  to  the 
case  where  they  share  a  pure  state. 

In the case of our bit commitment protocol in which the total charge of $AB$ is $j = \frac{1}{2}$, if Alice is unable to access the compensating charge in $C$, then she cannot cheat successfully. But if Alice controls the whole $AC$ system, then Alice's charge $j_{AC} = 0, 1$ is perfectly correlated with Bob's, and she can rotate the relative phase of the $j_{AC} = 0$ and $j_{AC} = 1$ components of her state, transforming $|+\rangle$ to $|-\rangle$.

This reduction of a protocol with nontrivial total charge to a protocol with trivial total charge can be generalized. In the I world, consider an $n$-party protocol in which up to $k < n$ of the parties might cheat, where the initial state is the product of invariant states $\bigotimes_{i=1}^{n} \rho_i$ and where all operations performed by the parties are required to conserve the local charge. Then
we  may  imagine  that  each  party  is  issued  a  compensating 
charge  at  the  beginning  of  the  protocol,  so  that  each  party 
actually  starts  out  with  trivial  charge.  The  honest  parties  will 
never  touch  their  compensating  charges,  but  a  cheating  party 
cannot  be  prevented  from  performing  arbitrary  joint  opera¬ 
tions  on  her  system  and  her  compensating  charge.  This  strat¬ 
egy  is  realizable  because  the  cheater  might  throw  away  the 
invariant  state  she  holds  at  the  beginning  of  the  protocol,  and 
replace  it  by  a  charge-zero  state  that  she  controls  fully.  Fur¬ 
thermore,  if  an  attack  by  the  cheaters  is  successful  in  the 
protocol  where  the  honest  players  start  out  with  trivial 
charge,  then  it  will  also  be  successful  if  the  honest  players 
start  out  with  a  product  of  charged  invariant  states;  since 
honest  players  never  make  use  of  the  compensating  charges, 
their  presence  can  have  no  impact  on  the  effectiveness  of  the 
attack.  Therefore,  we  have  the  following. 

Theorem  3.  Let  P  be  an  n -party  quantum  protocol  in  the  I 
world  that  securely  realizes  a  task  n,  where  the  initial  state 
in  P  is  a  product  of  n  invariant  states.  Then  there  is  an 
I- world  protocol  P'  that  also  securely  realizes  n,  where  the 
initial  state  in  P'  is  a  product  of  n  pure  states,  each  with 
trivial  charge. 

In other words, in a security analysis, we may assume without any loss of generality that each party holds a pure state with trivial charge at the start of the protocol.

Note that for the proofs of Theorems 2 and 3, our observations from Secs. II and III on the use of reference systems are not needed. Rather, to prove Theorems 2 and 3, we use only two properties of the I-world superselection sectors: first, that for each charge sector $\mathcal{H}_q$ there is a unique conjugate charge sector $\mathcal{H}_{\bar q}$ such that the trivial sector $\mathcal{H}_1$ is contained in $\mathcal{H}_q \otimes \mathcal{H}_{\bar q}$, and second, that any invariant state has a purification with trivial total charge. These properties hold not just for the case of superselection rules arising from a symmetry group $G$, but also for the more general superselection rules considered in Sec. VIII. Therefore, Theorems 2 and 3 apply in this more general setting.

VIII.  TWO-PARTY  PROTOCOLS  IN  GENERAL 
A.  Overview 

We will now analyze the impact of superselection rules on the security of general two-party protocols. We will show that for any protocol $P$ in the invariant world (I world) subject to the superselection rule, there is a corresponding protocol $\tilde{P}$ in the unrestricted world (U world), where $\tilde{P}$ simulates $P$ in the following sense: First, when performed honestly, $P$ and $\tilde{P}$ accomplish the same task. And second, for any cheating strategy that can be adopted by a dishonest party in $\tilde{P}$, there is a corresponding cheating strategy in $P$ that is just as effective. In particular then, if $\tilde{P}$ is insecure, then so is $P$. We conclude, therefore, that superselection rules cannot enhance the (information-theoretic) security of two-party protocols. The methods we will use to establish this result are quite different from those used in Sec. III to treat the case of superselection rules arising from a symmetry group.

Before going into detail, we will briefly describe the main ideas used in our argument. First of all, we will restrict our attention to a protocol in which the total charge shared by the two parties is trivial (belongs to the trivial superselection sector). We know from Theorem 3 in Sec. VII C that it suffices to treat this special case in an analysis of security. A protocol with trivial total charge has this useful property: if Alice knows that she holds charge $q$ after sending a message to Bob, then Alice also knows that Bob will hold the conjugate charge $\bar{q}$ upon receiving the message. Similarly, Bob knows what Alice's charge will be after she receives a message sent by Bob. Our analysis of security relies on the property that Bob has a definite charge if Alice does, and therefore it applies only to two-party protocols.

In the I world, charge is conserved, so that the total charge shared by Alice and Bob is trivial at each stage of the protocol; furthermore, local operations performed by Alice or Bob must preserve the conserved charge. In the U world, charge need not be conserved, but the protocol $\tilde{P}$ that simulates the I-world protocol $P$ can be chosen to respect conservation of a fictitious “charge” that behaves like the actual conserved charge of the I world. However, a dishonest party who is not bound to follow the protocol $\tilde{P}$ can perform operations that violate “charge” conservation. Our task is to ensure that the greater freedom enjoyed by a dishonest party in the U world does not enhance her ability to cheat successfully.

For this purpose, our argument relies on the concept of the format of a message exchanged between the parties. In the U world, the format is simply the Hilbert space containing the message. In the protocol $\tilde{P}$, the recipient of a message always checks that the format of the message is valid, and aborts the protocol if the message is invalid. A valid message corresponds to one that could have been sent in the I world, while a message is invalid only if the sender violated the local conservation of “charge” before sending it. Thus, a message that upon receipt is found to be in the proper format could have been sent by a party who performed a charge-conserving local operation; in effect the sender is unable to play a charge-nonconserving strategy without being detected. Since effective charge conservation is enforced by halting the protocol when a charge nonconservation is detected, it will be essential for our argument to consider games that can be aborted at any stage by either party. A cheating strategy for the I-world protocol $P$ and the corresponding cheating strategy for its U-world counterpart $\tilde{P}$ will cause the game to halt prematurely with the same probability, as well as produce the same probability distribution of outcomes in the event that the game ends normally, without being aborted.

B.  Superselection  rules  and  charges 

Before  proceeding  to  our  proof,  we  should  recall  the  prop¬ 
erties  of  superselection  rules  and  charges  that  will  be  in¬ 
voked  in  the  argument.  These  properties  have  been  explored 
already  in  Sec.  V,  for  the  special  case  of  super-selection  sec¬ 
tors  labeled  by  irreducible  unitary  representations  of  compact 
groups.  Here  we  wish  to  emphasize  that  some  of  the  same 
ideas  can  be  extended  to  a  more  general  setting,  and  we  will 
indicate  how  a  two-party  protocol  in  which  conserved 
charges  are  exchanged  can  be  simulated  using  ordinary  qu¬ 
bits. 

In  general,  a  superselection  rule  is  a  decomposition  of 
Hilbert  space  into  a  direct  sum  of  sectors  such  that  each 
sector  is  preserved  by  the  allowed  operations.  The  charge  q 
is  a  label  that  distinguishes  the  distinct  sectors,  and  we  may 
say  that  the  operations  allowed  by  the  superselection  rule 
conserve  the  charge.  Thus,  the  Hilbert  space  is  expressed  as 

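$$\mathcal{H} = \bigoplus_q \mathcal{H}_q, \tag{99}$$

and the allowed operations belong to the algebra

$$\bigoplus_q \mathcal{L}(\mathcal{H}_q), \tag{100}$$

where $\mathcal{L}(\mathcal{H}_q)$ denotes linear operators acting on $\mathcal{H}_q$.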
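A small numerical sketch may help make Eqs. (99) and (100) concrete. It assumes two charge sectors of dimensions 2 and 3 (an arbitrary choice) and shows that an operator from the block-diagonal algebra cannot tell a coherent superposition of different charges from the corresponding mixture with the cross-sector coherences removed.

```python
import numpy as np

rng = np.random.default_rng(2)
dims = [2, 3]  # assumed sector dimensions for two charge values

def random_block_operator(dims):
    """Random element of the direct sum of L(H_q): one block per sector."""
    n = sum(dims)
    op = np.zeros((n, n), dtype=complex)
    start = 0
    for d in dims:
        block = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
        op[start:start + d, start:start + d] = block
        start += d
    return op

def dephase(rho, dims):
    """Drop all coherences between different charge sectors."""
    out = np.zeros_like(rho)
    start = 0
    for d in dims:
        s = slice(start, start + d)
        out[s, s] = rho[s, s]
        start += d
    return out

# A random pure state that superposes both sectors.
psi = rng.normal(size=sum(dims)) + 1j * rng.normal(size=sum(dims))
psi /= np.linalg.norm(psi)
rho = np.outer(psi, psi.conj())

obs = random_block_operator(dims)
print(np.isclose(np.trace(obs @ rho), np.trace(obs @ dephase(rho, dims))))
```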

Depending on the particular form of the superselection rule, there are specific rules governing how the charge behaves when a system splits into two subsystems, or when two systems fuse to become a single system. These rules can be encoded in vector spaces $V^{c}_{a,b}$ defined by

$$\mathcal{H}_c = \bigoplus_{a,b} \mathcal{H}_a \otimes \mathcal{H}_b \otimes V^{c}_{a,b}. \tag{101}$$

The space $V^{c}_{a,b}$ is $n$-dimensional if there are $n$ distinguishable ways that a charge $c$ object can arise when objects with charges $a$ and $b$ fuse. Consistency of Eq. (101) with associativity of the tensor product requires the $V^{c}_{a,b}$'s to obey certain identities, but we will not discuss these further as they will not be needed for our proof.
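For a familiar special case, the fusion multiplicities can be tabulated directly. The sketch below assumes the group case $G = SU(2)$, where charges are spins and the standard angular-momentum addition rule gives a fusion space of dimension 0 or 1; the function names are illustrative and the general superselection rules discussed in this section need not have this form.

```python
from fractions import Fraction

def fusion_dim(a, b, c):
    """Fusion multiplicity for SU(2) spins: 1 if spin c occurs in a (x) b, else 0."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    allowed = abs(a - b) <= c <= a + b and (a + b - c).denominator == 1
    return 1 if allowed else 0

def fusion_channels(a, b):
    """All spins c that can arise when spins a and b fuse."""
    a, b = Fraction(a), Fraction(b)
    c = abs(a - b)
    out = []
    while c <= a + b:
        out.append(c)
        c += 1
    return out

def dim(j):
    """Dimension 2j + 1 of the spin-j irreducible representation."""
    return int(2 * Fraction(j) + 1)

half = Fraction(1, 2)
# Total spin 1/2 can arise from j_A = 1/2 with j_B = 0 or with j_B = 1.
print(fusion_dim(half, 0, half), fusion_dim(half, 1, half))
# Dimension check for 1/2 (x) 1 = 1/2 (+) 3/2.
print(dim(half) * dim(1), sum(dim(c) for c in fusion_channels(half, 1)))
```

The first printed line corresponds to the two branches superposed in Eq. (90). Every SU(2) spin is its own conjugate in this sketch, since `fusion_dim(j, j, 0)` equals 1 for each `j`.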

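There is a trivial-charge sector, denoted $\mathcal{H}_1$, that behaves as the identity under fusion,

$$\mathcal{H}_c \otimes \mathcal{H}_1 = \mathcal{H}_c. \tag{102}$$

Furthermore, there is a unique charge $\bar{q}$, the conjugate of $q$, that can fuse with $q$ to yield the identity

$$\mathcal{H}_1 = \bigoplus_q \mathcal{H}_q \otimes \mathcal{H}_{\bar{q}}. \tag{103}$$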

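Now, in the I world, consider a bipartite system shared by Alice and Bob. The Hilbert space decomposes as

$$\mathcal{H} = \bigoplus_q \mathcal{H}_q, \qquad \mathcal{H}_q = \bigoplus_{q_A,q_B} \mathcal{H}_{A,q_A} \otimes \mathcal{H}_{B,q_B} \otimes V^{q}_{q_A,q_B}, \tag{104}$$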

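where $q$ is the total charge, $q_A$ is the charge of Alice's system, and $q_B$ is the charge of Bob's system. The physical operations, allowed by the superselection rule, conserve the total charge, and hence belong to the algebra

$$\mathcal{O} = \bigoplus_q \mathcal{L}(\mathcal{H}_q). \tag{105}$$

The operations Alice can perform, which conserve Alice's charge and act trivially on Bob's system, belong to

$$\mathcal{A} = \bigoplus_{q,q_A,q_B} \mathcal{L}(\mathcal{H}_{A,q_A}) \otimes I^{B,q}_{q_A,q_B}, \tag{106}$$

where $I^{B,q}_{q_A,q_B}$ denotes the identity acting on $\mathcal{H}_{B,q_B} \otimes V^{q}_{q_A,q_B}$. Similarly, the algebra of operations that Bob can perform is

$$\mathcal{B} = \bigoplus_{q,q_A,q_B} I^{A,q}_{q_A,q_B} \otimes \mathcal{L}(\mathcal{H}_{B,q_B}), \tag{107}$$

where $I^{A,q}_{q_A,q_B}$ denotes the identity acting on $\mathcal{H}_{A,q_A} \otimes V^{q}_{q_A,q_B}$. In contrast, the commutant $\mathcal{B}'$ of $\mathcal{B}$, which conserves the total charge and Bob's charge but need not conserve Alice's, is

$$\mathcal{B}' = \bigoplus_{q,q_B} \mathcal{L}\left( \bigoplus_{q_A} \mathcal{H}_{A,q_A} \otimes V^{q}_{q_A,q_B} \right) \otimes I_{B,q_B}, \tag{108}$$

where $I_{B,q_B}$ is the identity on $\mathcal{H}_{B,q_B}$, and similarly

$$\mathcal{A}' = \bigoplus_{q,q_A} I_{A,q_A} \otimes \mathcal{L}\left( \bigoplus_{q_B} \mathcal{H}_{B,q_B} \otimes V^{q}_{q_A,q_B} \right). \tag{109}$$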

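Thus $\mathcal{A}' = \mathcal{B}$ and $\mathcal{B}' = \mathcal{A}$ if and only if the charges $q_A$ and $q_B$ are perfectly correlated (there is a unique $q_B$ corresponding to each $q_A$ and vice versa). This condition holds, in particular, if the total charge is trivial, in which case our formulas simplify to

$$\mathcal{H} = \mathcal{H}_1 = \bigoplus_q \mathcal{H}_{A,q} \otimes \mathcal{H}_{B,\bar{q}},$$

$$\mathcal{A} = \mathcal{B}' = \bigoplus_q \mathcal{L}(\mathcal{H}_{A,q}) \otimes I_{B,\bar{q}},$$

$$\mathcal{B} = \mathcal{A}' = \bigoplus_q I_{A,q} \otimes \mathcal{L}(\mathcal{H}_{B,\bar{q}}). \tag{110}$$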

C.  Simulating  charge  exchange 

A novelty of a two-party protocol in the I world is that when Alice (for example) sends a message to Bob, she may choose to split the charge she possesses into two parts: the charge she retains and the charge of the message that she sends. If the total charge is trivial, then the full Hilbert space comprising Alice's system $A$, Bob's system $B$, and the message system $M$ can be expressed as

$$\mathcal{H}_1 = \bigoplus_{q_A,q_B,q_M} \mathcal{H}_{A,q_A} \otimes \mathcal{H}_{B,q_B} \otimes \mathcal{H}_{M,q_M} \otimes V^{1}_{q_A,q_B,q_M}. \tag{111}$$

The isomorphisms

$$V^{1}_{q_A,q_B,q_M} \cong V^{\bar{q}_B}_{q_A,q_M} \cong V^{\bar{q}_A}_{q_B,q_M} \tag{112}$$

invite us to interpret Eq. (111) in complementary ways: namely, the charge $\bar{q}_B$ of $AM$ is conjugate to the charge $q_B$ of $B$, and the charge $\bar{q}_A$ of $BM$ is conjugate to the charge $q_A$ of $A$. Thus, Eq. (111) describes the splitting of Alice's initial charge $\bar{q}_B$ into the charge $q_A$ that she retains and the charge $q_M$ of the message, as well as the fusion of the charge $q_M$ of the message with Bob's initial charge $q_B$ to yield Bob's final charge $\bar{q}_A$. Furthermore, if $V^{1}_{q_A,q_B,q_M}$ is of dimension greater than 1, then a vector in $V^{1}_{q_A,q_B,q_M}$ describes the particular manner in which Alice performs the splitting, which in turn determines the result of Bob's fusion.

While the information encoded in $V^{1}_{q_A,q_B,q_M}$ is an intrinsic property in the I world, if we are to simulate the process of charge exchange in the U world, then this information must be carried by ordinary qubits. In such a simulation, the Hilbert space of Alice's system, Bob's system, and the message is expanded to

$$\mathcal{H} = \bigoplus_{q_1,q_2,q_A,q_B,q_M} \mathcal{H}_{A,q_1} \otimes \mathcal{H}_{B,q_2} \otimes \mathcal{H}_{M,q_M} \otimes V^{1}_{q_A,q_B,q_M}, \tag{113}$$

but where now $V^{1}_{q_A,q_B,q_M}$ is to be regarded as an explicit part of the message. If the conditions $q_1 = q_A$ and $q_2 = q_B$ were imposed, then the “format” of this message would coincide perfectly with the information content of a message sent in the I world. But while in the I world these conditions arise from the intrinsic physics of the superselection rule, in the U world they must be imposed by hand through proper design of the protocol.

Thus, in the U-world protocol $\tilde{P}$ that simulates the I-world protocol $P$, we will require the recipient of a message to verify its format: Alice checks that $q_1 = q_A$ and Bob checks that $q_2 = q_B$. Of course, at a given stage of the protocol $\tilde{P}$, Alice or Bob might hold a coherent superposition of different charges, even though the total charge is always guaranteed to be trivial. Therefore, the verification step in $\tilde{P}$ must be performed coherently; Alice, for example, checks that $q_1$ and $q_A$ match without learning the value of $q_1$ or $q_A$. If verification fails, then the message recipient has detected cheating by the other party and aborts the protocol. If verification succeeds, then the message has been projected onto the valid format, and as far as the recipient is concerned, it is just as though the message had been sent in the right format to begin with.
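The coherent format check can be modeled as a two-outcome projective measurement. The toy sketch below is an assumption-laden illustration, not the construction used in the formal proof: it keeps only two classical-looking registers holding the labels $q_1$ and $q_A$, takes three possible charge values, and projects onto the subspace where the labels agree. The squared norm of the projected vector is the probability that the recipient does not abort.

```python
import numpy as np

n_charges = 3  # assumed number of charge values (illustrative)

def match_projector(n):
    """Projector onto the subspace where the two charge labels agree."""
    proj = np.zeros((n * n, n * n))
    for q in range(n):
        e = np.zeros(n)
        e[q] = 1.0
        proj += np.kron(np.outer(e, e), np.outer(e, e))
    return proj

def verify(state):
    """Coherent check: (acceptance probability, unnormalized accepted state)."""
    out = match_projector(n_charges) @ state
    return float(np.vdot(out, out).real), out

def basis(q1, qa):
    """Basis vector with register labels q1 and qA."""
    v = np.zeros(n_charges * n_charges, dtype=complex)
    v[q1 * n_charges + qa] = 1.0
    return v

# Valid format: a superposition of charges with matching labels passes intact.
honest = (basis(0, 0) + basis(1, 1) + basis(2, 2)) / np.sqrt(3)
# Invalid component: one branch has mismatched labels and is projected away.
cheat = (basis(0, 0) + basis(1, 2)) / np.sqrt(2)

p_honest, out_honest = verify(honest)
p_cheat, out_cheat = verify(cheat)
print(p_honest, p_cheat)
```

Because the check has only the outcomes "match" and "mismatch", the honest superposition is left undisturbed, which is the sense in which the recipient learns nothing about the value of $q_1$ or $q_A$.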

Whenever Alice cheats in the U-world protocol $\tilde{P}$ by modifying her charge, she risks detection, and if her cheating is undetected, then her operation is equivalent to a charge-conserving one. Therefore, Alice has an equivalent strategy in the I-world protocol $P$, in which she either halts the game herself with some probability before sending her message, or if the game does not halt, performs an operation allowed by the superselection rule. This observation suffices to establish that $\tilde{P}$ simulates $P$, and thus that the superselection rule cannot thwart cheating.

To  summarize,  for  the  purpose  of  characterizing  Alice’s 
ability  to  cheat,  we  are  only  interested  in  how  Alice’s  activi¬ 
ties  will  affect  Bob’s  measurements.  Although  in  the  U  world 
Alice  has  the  power  to  violate  conservation  of  “charge,”  she 
is  unable  to  fool  Bob  into  accepting  a  message  that  is  not 
isomorphic  to  one  that  could  have  been  created  in  the  I 
world.  Therefore,  Alice’s  elevated  power  in  the  U  world 
gives  her  no  advantage. 

D.  Definitions 

Having  explained  the  main  ideas,  we  will  now  present  a 
more  formal  proof  of  our  result.  To  begin,  we  must  define  the 
general  notions  of  “protocol”  and  “simulation”  in  accord 
with  our  goals.  The  definitions  are  quite  natural,  but  there  are 
some  technicalities  that  are  necessary  for  the  proof  to  work. 

We  consider  quantum  games  between  two  parties,  Alice 
and  Bob.  We  assume  that  Alice  sends  the  first  message  and 
the  players  alternate.  The  protocol  of  a  game  specifies  the 
total  number  of  messages,  their  format,  the  strategies  for 
honest  players,  and  a  way  to  determine  the  game  outcome. 
By “format” in the U world we mean the Hilbert space $\mathcal{H}_M$ of a given message. In the I world, we specify the space $\mathcal{H}_{M,q_M}$ for each value of the message charge $q_M$.

To define an honest strategy in the I world, we specify for each value of Alice’s charge $q_A$ her corresponding space $\mathcal{H}_{A,q_A}$; likewise, we specify Bob’s space $\mathcal{H}_{B,q_B}$ for each $q_B$.

The game starts with a pure state

$$|\xi_A\rangle \otimes |\xi_B\rangle \in \mathcal{H}_{A,1} \otimes \mathcal{H}_{B,1}, \qquad (114)$$

where 1 stands for the trivial charge. If one of the players (say, Alice) cheats, she may use a different set of private spaces $\mathcal{H}'_{A,q_A}$, but the initial state still must be of the form $|\xi'_A\rangle \otimes |\xi_B\rangle$, where $|\xi'_A\rangle \in \mathcal{H}'_{A,1}$.

Alice’s and Bob’s actions in the kth step are described by operators $W_{A_k}$, $W_{B_k}$. The final outcome is determined by a pair of measurements that are performed independently on Alice’s and Bob’s subsystems at the end of the game. We are interested in the joint probability distribution of the measurement results. However, if one of the players cheats, only the honest player’s subsystem is measured.

FIG. 3. [Diagram: the I-world column lists the protocol $P$, Alice strategy $A'$, and Bob strategy $B'$; each is reached by an arrow from the corresponding U-world entry $\tilde{P}$, $\tilde{A}'$, $\tilde{B}'$.] The U-world protocol $\tilde{P}$ simulates the I-world protocol $P$ if the honest protocols realize the same task, and if for any cheating strategy in $\tilde{P}$ there is an equivalent cheating strategy in $P$.

For the reasons explained in Sec. VIII A, we will assume that the game can be aborted by either player. If the game is aborted, we will not need to keep track of who ends the game or when it ends; we will only be interested in whether the game ends normally and, if so, what the outcome is. For this purpose, the quantum state can be characterized by a vector $|\psi\rangle$ such that $\langle\psi|\psi\rangle$ is the probability that the game has not been aborted. Operations performed by each player may then be described by contracting maps, i.e., operators $W$ such that $W^\dagger W \le 1$. We assume that the game is never aborted if both players are honest, so that the probabilities of different outcomes add up to 1 in the honest game. If one of the players cheats, the total probability of all outcomes is generally less than 1.

Now  we  define  what  it  means  for  one  protocol  to  simulate 
another  (see  Fig.  3). 

Definition. A protocol $\tilde{P}$ simulates the protocol $P$ if the following conditions are fulfilled:

(i) The honest strategies in $\tilde{P}$ and $P$ give rise to the same probability distribution of the outcomes.

(ii) For any cheating strategy $\tilde{A}'$ by Alice compatible with the protocol $\tilde{P}$, there exists an equivalent strategy $A'$ for the protocol $P$. (“Equivalent” means that Bob’s measurement result has the same probability distribution in both cases.)

(iii) For any cheating strategy $\tilde{B}'$ by Bob compatible with the protocol $\tilde{P}$, there is an equivalent strategy $B'$ for the protocol $P$.

Note that when we say that the two cheating strategies are equivalent, we mean in particular that the probability that the game ends normally is the same for both strategies.

To better understand our concept of simulation, it is helpful to consider this simple example: Suppose that the message space $\mathcal{H}_M$ of $P$ is embedded in a larger space $\tilde{\mathcal{H}}_M$ of $\tilde{P}$. Honest players follow the same strategies in $\tilde{P}$ as in $P$, so that condition (i) is obviously satisfied. However, the players in $\tilde{P}$ must be prepared to receive messages that do not obey the format of $P$, i.e., do not fit into the subspace $\mathcal{H}_M$. In $\tilde{P}$ such messages are rejected, and the game is aborted. This rule prevents a dishonest player from gaining any advantage (relative to simply quitting the game) by sending an invalid message. More formally, suppose that Alice cheats using some strategy $\tilde{A}'$. In the corresponding strategy $A'$, Alice projects her message system $\tilde{\mathcal{H}}_M$ onto the subspace $\mathcal{H}_M$ before sending each message. Thus if the strategy $\tilde{A}'$ calls for Alice to apply the operator $\tilde{W}'_{A_k}$ in the kth round, then in the strategy $A'$ Alice applies the contracting map $W'_{A_k} = \Pi \tilde{W}'_{A_k}$, where $\Pi$ is the orthogonal projector onto $\mathcal{H}_M$. The strategies $\tilde{A}'$ and $A'$ are equivalent: whenever a message sent according to $\tilde{A}'$ causes Bob to abort the game, the strategy $A'$ requires Alice to abort the game herself. Similarly, given any cheating strategy $\tilde{B}'$ for Bob in the game $\tilde{P}$, there is an equivalent cheating strategy $B'$ in $P$. Thus, conditions (ii) and (iii) are satisfied, and $\tilde{P}$ simulates $P$.
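
The equivalence claimed in this example can be checked numerically. The following is only a minimal sketch under assumed toy dimensions: a 4-dimensional enlarged message space containing a 2-dimensional valid subspace, with a random unitary standing in for the cheater's operator. The names `random_unitary`, `W_tilde`, and `W_prime` are hypothetical and do not come from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_unitary(dim, rng):
    """Random unitary from the QR decomposition of a complex Gaussian matrix."""
    z = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    q, _ = np.linalg.qr(z)
    return q

dim_big, dim_valid = 4, 2                        # assumed toy dimensions
Pi = np.zeros((dim_big, dim_big), dtype=complex)
Pi[:dim_valid, :dim_valid] = np.eye(dim_valid)   # projector onto the valid subspace

psi = np.zeros(dim_big, dtype=complex)
psi[0] = 1.0                                     # unit norm: game not yet aborted

W_tilde = random_unitary(dim_big, rng)           # cheater's operator in the enlarged space
W_prime = Pi @ W_tilde                           # contracting map: apply, then project

# Enlarged game: Bob checks the format and aborts unless the message is valid.
sent = W_tilde @ psi
p_bob_accepts = np.real(np.vdot(sent, Pi @ sent))

# Restricted game: Alice aborts herself; the squared norm is the survival probability.
kept = W_prime @ psi
p_alice_continues = np.real(np.vdot(kept, kept))

# W_prime is contracting: every eigenvalue of W'^dagger W' is at most 1.
max_eig = np.linalg.eigvalsh(W_prime.conj().T @ W_prime).max()

print(p_bob_accepts, p_alice_continues, max_eig)
```

The two survival probabilities agree, and the unnormalized state that survives is the same vector in both games, which is the sense in which the two strategies are equivalent.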

Our  analysis  of  superselection  rules  in  Sec.  VIII  E  will  be 
based  on  a  closely  related  method  of  simulation. 

We also remark that Theorem 1 proved in Sec. III can be restated: for a multiparty protocol $P$ in the G-invariant world, there is a G-world protocol $\tilde{P}$ that simulates $P$. In that case, we implicitly adopt a redundant description of the physical states appearing in $P$, admitting fictitious color degrees of freedom. Then $\tilde{P}$ is exactly the same protocol as $P$, but with the color now reinterpreted as a physical variable. Similarly, Theorem 3 in Sec. VII C can be stated: any n-party I-world protocol in which the initial state is a product of n invariant states can be simulated by an I-world protocol in which the initial state is a product of n pure states, each with trivial charge.

E.  Proof 

Our  goal  is  to  prove  the  following. 

Theorem 4. Let $P$ be a two-party game in the I world, such that both parties hold trivial charges at the beginning of the game. Then there is a U-world game $\tilde{P}$ that simulates $P$.

In the proof, we construct the U-world protocol $\tilde{P}$ that simulates the I-world protocol $P$, and explain how the cheating strategy $A'$ that is equivalent to $\tilde{A}'$ is formulated. We achieve this by applying the procedure for simulating charge exchange in the U world that was described in Sec. VIII C.

Consider the I-world protocol $P$. If the total charge is trivial, then the full Hilbert space including Alice’s system $A$, Bob’s system $B$, and the message $M$ is

$$\mathcal{H} = \bigoplus_{q_A,q_B,q_M} \mathcal{H}_{A,q_A} \otimes \mathcal{H}_{B,q_B} \otimes \mathcal{H}_{M,q_M} \otimes V^{1}_{q_A,q_B,q_M}. \qquad (115)$$

Without loss of generality, we assume that the spaces $\mathcal{H}_{A,q_A}$, $\mathcal{H}_{B,q_B}$, $\mathcal{H}_{M,q_M}$ are the same in each step of the protocol. We may also assume that the message is present at the beginning and at the end of the game and that the initial state has the form $|\xi_A\rangle \otimes |\xi_B\rangle \otimes |0\rangle$, where $|0\rangle \in \mathcal{H}_{M,1}$.

Each time Alice receives one message and sends another, she applies an operator to $AM$ that preserves Bob’s charge $q_B$; this is a contracting map belonging to the algebra

$$\bigoplus_{q_B} \mathbf{L}\Bigl( \bigoplus_{q_A,q_M} \mathcal{H}_{A,q_A} \otimes \mathcal{H}_{M,q_M} \otimes V^{1}_{q_A,q_B,q_M} \Bigr). \qquad (116)$$

Alice’s honest strategy consists of a sequence of such operators: in the kth step she applies an operator $W_{A_k}$. Similarly, Bob’s honest strategy is defined by operators $W_{B_k}$.

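Now consider the U-world protocol $\tilde{P}$ that simulates $P$. The Hilbert space of $\tilde{P}$ is

$$\tilde{\mathcal{H}} = \tilde{\mathcal{H}}_A \otimes \tilde{\mathcal{H}}_B \otimes \tilde{\mathcal{H}}_M, \qquad (117)$$

where

$$\tilde{\mathcal{H}}_A = \bigoplus_{q_1} \mathcal{H}_{A,q_1}, \qquad \tilde{\mathcal{H}}_B = \bigoplus_{q_2} \mathcal{H}_{B,q_2}, \qquad \tilde{\mathcal{H}}_M = \bigoplus_{q_A,q_B,q_M} \mathcal{H}_{M,q_M} \otimes V^{1}_{q_A,q_B,q_M}. \qquad (118)$$

Thus the space $\mathcal{H}$ of the protocol $P$ can be embedded in $\tilde{\mathcal{H}}$ by requiring $q_1 = q_A$ and $q_2 = q_B$. In $\tilde{P}$ these constraints are enforced by checks performed by both parties. A dishonest player’s attempt to break the constraints will be detected immediately by the other party, in which case the game will halt.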

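Let us describe Alice’s honest strategy in $\tilde{P}$. When Alice receives a message, she gains control of the space $\tilde{\mathcal{H}}_A \otimes \tilde{\mathcal{H}}_M$. First she verifies that $q_1 = q_A$ (without determining the value of $q_1$ or $q_A$); if verification fails, she aborts the game. Thus Alice effectively projects her input state onto the subspace

$$\mathcal{H}_{AM} = \bigoplus_{q_A,q_B,q_M} \mathcal{H}_{A,q_A} \otimes \mathcal{H}_{M,q_M} \otimes V^{1}_{q_A,q_B,q_M}. \qquad (119)$$

Then she applies the operator $W_{A_k}$ (from the protocol $P$), which acts on $\mathcal{H}_{AM}$ and preserves $q_B$. Thus Alice’s strategy is defined by the contracting maps

$$\tilde{W}_{A_k} = F W_{A_k} F^\dagger, \qquad (120)$$

where $F$ denotes the embedding $\mathcal{H}_{AM} \to \tilde{\mathcal{H}}_A \otimes \tilde{\mathcal{H}}_M$. Bob’s honest strategy is defined similarly.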

If both players play the game $\tilde{P}$ honestly, then the verification always succeeds and the conditions $q_1 = q_A$ and $q_2 = q_B$ are maintained throughout the game. Thus the honest strategies for $\tilde{P}$ and $P$ are clearly equivalent. Note that in $\tilde{P}$ some information is encoded redundantly; for example, Alice can access the value of $q_A$ by examining either the charge label of $\mathcal{H}_{A,q_A}$ or one of the slots of the tensor $V^{1}_{q_A,q_B,q_M}$; similarly $q_M$ is encoded both in $\mathcal{H}_{M,q_M}$ and in $V^{1}_{q_A,q_B,q_M}$. However, this redundancy has no deleterious effect on the fidelity of the simulation.

Now suppose that Alice cheats in the game $\tilde{P}$. Then she may use an arbitrary Hilbert space $\tilde{\mathcal{H}}'_A$ and operators $\tilde{W}'_{A_k}$ acting on

$$\tilde{\mathcal{H}}'_{AM} = \tilde{\mathcal{H}}'_A \otimes \tilde{\mathcal{H}}_M = \tilde{\mathcal{H}}'_A \otimes \Bigl( \bigoplus_{q_A,q_B,q_M} \mathcal{H}_{M,q_M} \otimes V^{1}_{q_A,q_B,q_M} \Bigr). \qquad (121)$$

In particular, when Alice cheats, her action on the message need not respect the condition $q_B = q_2$. To prove the theorem, we are to define an equivalent cheating strategy for the game $P$.

When Alice cheats in $P$, she uses an arbitrary Hilbert space $\mathcal{H}'_{A,q_A}$ for each value of her charge $q_A$, and she applies operators $W'_{A_k}$ that conserve Bob’s charge $q_B$ to the space

$$\mathcal{H}'_{AM} = \bigoplus_{q_A,q_B,q_M} \mathcal{H}'_{A,q_A} \otimes \mathcal{H}_{M,q_M} \otimes V^{1}_{q_A,q_B,q_M}. \qquad (122)$$

The spaces $\tilde{\mathcal{H}}'_{AM}$ and $\mathcal{H}'_{AM}$ seem to be distinct: in $\mathcal{H}'_{AM}$ the charge label carried by $\mathcal{H}'_A$ matches the label in one of the slots of $V^{1}_{q_A,q_B,q_M}$, while in $\tilde{\mathcal{H}}'_{AM}$ there is no such correlation. However, in the U world the variable $q_A$ would be encoded redundantly if it appeared in both $\mathcal{H}'_{A,q_A}$ and $V^{1}_{q_A,q_B,q_M}$, and it is not necessary to adopt this redundant encoding in order to emulate the physics of the I world. Instead, let us specify $\mathcal{H}'_{A,q_A} = \tilde{\mathcal{H}}'_A$ for each $q_A$; then $\mathcal{H}'_{AM}$ and $\tilde{\mathcal{H}}'_{AM}$ are of the same form, but where it is understood in Eq. (121) that the information about the charge $q_A$ is carried only by $V^{1}_{q_A,q_B,q_M}$. With this choice Alice’s operator $\tilde{W}'_{A_k}$ in $\tilde{P}$ and her operator $W'_{A_k}$ in $P$ act on isomorphic spaces; however, $W'_{A_k}$ must conserve Bob’s charge $q_B$, while $\tilde{W}'_{A_k}$ need not conserve charge.

Therefore, we define the corresponding cheating strategy in $P$ by specifying

$$W'_{A_k} = \sum_{q_B} \Pi_{q_B} \tilde{W}'_{A_k} \Pi_{q_B}, \qquad (123)$$

where $\Pi_{q_B}$ is the projector onto the subspace with the given value of $q_B$. That is, $\Pi_{q_B}$ projects $\tilde{\mathcal{H}}_M$ onto the space in which $V^{1}_{q_A,q_B,q_M}$ has the value $q_B$ in the appropriate slot. The contracting map $W'_{A_k}$ preserves $q_B$ and therefore is admissible in the protocol $P$. Applying this $W'_{A_k}$ causes Alice to abort the game $P$ in the case where $q_B$ would change in the game $\tilde{P}$. But in that case the new value of $q_B$ would not match Bob’s variable $q_2$; therefore, Bob would reject Alice’s message and abort the game $\tilde{P}$. Hence the two games $\tilde{P}$ and $P$ are aborted with the same probability; furthermore, the final state that Bob measures in $\tilde{P}$, if $\tilde{P}$ does not abort, is identical to the final state that Bob measures in $P$, if $P$ does not abort. Therefore, when Alice cheats, Bob’s measurement outcome has the same probability distribution in $\tilde{P}$ as in $P$. The same is true for Alice’s measurement when Bob cheats. Therefore, $\tilde{P}$ simulates $P$, which completes the proof of Theorem 4.
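
The construction of Eq. (123), which keeps only the part of an operator that is block diagonal in Bob’s charge, can be illustrated with a small numerical sketch. Everything specific here is an assumption made for illustration: a 6-dimensional space, integer labels standing in for the charge $q_B$, and the hypothetical helper names `sector_projector` and `pinch`. The sketch checks that the resulting operator commutes with the charge and is still a contracting map.

```python
import numpy as np

rng = np.random.default_rng(1)

# Assumed toy setup: a 6-dimensional space split into sectors labeled by q_B.
charges = np.array([0, 0, 1, 1, 1, 2])      # hypothetical q_B label of each basis state
dim = charges.size

def sector_projector(q):
    """Diagonal projector onto the basis states carrying charge label q."""
    return np.diag((charges == q).astype(complex))

def pinch(W):
    """Return sum_q Pi_q W Pi_q, the charge-conserving part of W."""
    return sum(sector_projector(q) @ W @ sector_projector(q)
               for q in np.unique(charges))

# Random unitary standing in for an unrestricted cheating operator.
z = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
W_tilde, _ = np.linalg.qr(z)

W_prime = pinch(W_tilde)
Q = np.diag(charges.astype(complex))        # charge operator

commutator_norm = np.linalg.norm(W_prime @ Q - Q @ W_prime)
largest_singular_value = np.linalg.svd(W_prime, compute_uv=False).max()
print(commutator_norm, largest_singular_value)
```

The vanishing commutator expresses charge conservation, and a largest singular value no greater than 1 expresses that the projected operator remains admissible as a contracting map.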

IX.  CONCLUSIONS 

Recent progress in the theory of quantum computation and quantum cryptography highlights the importance of adopting a computational model compatible with fundamental physics: tasks that would be impossible in a classical world may be physically realizable because Nature is quantum-mechanical. Further refinements of the model could lead to further insights regarding what information-processing tasks are achievable. Therefore, as Popescu [12] emphasized, the impact of superselection rules on the security of quantum protocols is of considerable potential interest. However, our disappointing conclusion is that superselection rules cannot foil a cheater who has unlimited quantum-computational power.

Contemplating  this  issue  has  led  us  to  consider  how  phys¬ 
ics  in  the  invariant  world  can  simulate  physics  in  the  unre¬ 
stricted  world,  and  vice  versa.  We  feel  that  the  simulation 
schemes  we  have  devised  offer  fruitful  insights  into  the 
physical  meaning  of  superselection  rules. 

Our results do not address whether the security of protocols with more than two parties can be enhanced by superselection rules that do not arise from compact symmetry groups. New issues arise in this setting, because of the nontrivial braiding properties of non-Abelian anyons. For example, in the case of three parties (Alice, Bob, and Charlie), Alice can split her charge into two parts, and send one part on a voyage that circles Bob’s lab and then returns to Alice’s lab. This action can induce a change in the charge held by Alice, accompanied by a compensating change in the total charge held by Bob and Charlie, even though the local charge in Bob’s lab, and in Charlie’s, is unaltered. Though strictly speaking Alice’s operation is not “local,” she can carry it out surreptitiously, without any cooperation from Bob and Charlie. Such new possibilities enhance the potential power of cheaters, but may also provide the honest parties with new methods for detecting cheating. Addressing the security of multiparty quantum protocols subject to general superselection rules will require different methods from those we have used in this paper, and might provide further enlightenment concerning the physics of non-Abelian anyons.

We describe the formalism for optimally estimating and controlling both the state of a spin ensemble and a scalar magnetic field with information obtained from a continuous quantum limited measurement of the spin precession due to the field. The full quantum parameter estimation model is reduced to a simplified equivalent representation to which classical estimation and control theory is applied. We consider both the tracking of static and fluctuating fields in the transient and steady-state regimes. By using feedback control, the field estimation can be made robust to uncertainty about the total spin number.

I. INTRODUCTION

As  experimental  methods  for  manipulating  physical  sys¬ 
tems  near  their  fundamental  quantum  limits  improve  [1-5], 
the  need  for  quantum  state  and  parameter  estimation  meth¬ 
ods  becomes  critical.  Integrating  a  modern  perspective  on 
quantum  measurement  theory  with  the  extensive  methodolo¬ 
gies  of  classical  estimation  and  control  theory  provides  in¬ 
sight  into  how  the  limits  imposed  by  quantum  mechanics 
affect  our  ability  to  measure  and  control  physical  systems 
[6-9]. 

In  this  paper,  we  illustrate  the  processes  of  state  estima¬ 
tion  and  control  for  a  continuously  observed,  coherent  spin 
ensemble  (such  as  an  optically  pumped  cloud  of  atoms)  in¬ 
teracting  with  an  external  magnetic  field.  In  the  situation 
where  the  magnetic  field  is  either  zero  or  well  characterized, 
continuous  measurement  (e.g.,  via  the  dispersive  phase  shift 
or  Faraday  rotation  of  a  far  off-resonant  probe  beam)  can 
produce  a  spin-squeezed  state  [10]  conditioned  on  the  mea¬ 
surement  record  [11].  Spin-squeezing  indicates  internal  en¬ 
tanglement  between  the  different  particles  in  the  ensemble 
[12]  and  promises  to  improve  precision  measurements  [13]. 
When,  however,  the  ambient  magnetic  environment  is  either 
unknown  or  changing  in  time,  the  external  field  can  be  esti¬ 
mated  by  observing  Larmor  precession  in  the  measurement 
signal  [2,14-16],  see  Fig.  1.  Recently,  we  have  shown  that 
uncertainty  in  both  the  magnetic  field  and  the  spin  ensemble 
can  be  simultaneously  reduced  through  continuous  measure¬ 
ment  and  adequate  quantum  filtering  [17]. 

Here, we expand on our recent results [17] involving Heisenberg-limited magnetometry by demonstrating the advantages of including feedback control in the estimation process. Feedback is a ubiquitous concept in classical applications because it enables precision performance despite the presence of potentially large system uncertainty. Quantum optical experiments are evolving to the point where feedback can be used, for example, to stabilize atomic motion within optical lattices [4] and high finesse cavities [5]. Recently, we demonstrated the use of feedback on a polarized ensemble of laser-cooled cesium atoms to robustly estimate an applied magnetic field [2]. In this work, we investigate the theoretical limits of such an approach and demonstrate that an external magnetic field can be measured with high precision despite substantial ignorance of the size of the spin ensemble.

The paper is organized as follows. In Sec. II, we provide a general introduction to quantum parameter estimation followed by a specialization to the case of a continuously measured spin ensemble in a magnetic field. By capitalizing on the Gaussian properties of both coherent and spin-squeezed states, we formulate the parameter estimation problem in such a way that techniques from classical estimation theory apply to the quantum system. Sec. III presents basic filtering and control theory in a pedagogical manner with the simplified spin model as an example. This theory is applied in Sec. IV, where we simultaneously derive mutually dependent magnetometry and spin-squeezing limits in the ideal case where the observer is certain of the spin number. We consider the optimal measurement of both constant and fluctuating fields in the transient and steady-state regimes. Finally, we show in Sec. V that the estimation can be made robust to uncertainty about the total spin number by using precision feedback control.


II.  QUANTUM  PARAMETER  ESTIMATION 

First,  we  present  a  generic  description  of  quantum  param¬ 
eter  estimation  [6-9].  This  involves  describing  the  quantum 
system  with  a  density  matrix  and  our  knowledge  of  the  un¬ 
known  parameter  with  a  classical  probability  distribution. 
The  objective  of  parameter  estimation  is  then  to  utilize  infor¬ 
mation  gained  about  the  system  through  measurement  to 
conditionally  update  both  the  density  matrix  and  the  param¬ 
eter  distribution.  After  framing  the  general  case,  our  particu¬ 
lar  example  of  a  continuously  measured  spin  ensemble  is 
introduced. 


A.  General  problem 

The following outline of the parameter estimation process could be generalized to treat a wide class of problems (discrete measurement, multiple parameters), but for simplicity, we will consider a continuously measured quantum system with scalar Hamiltonian parameter $\theta$ and measurement record $y(t)$.

FIG. 1. (a) A spin ensemble is initially prepared in a coherent state polarized along x, with symmetric variance in the y and z directions. Subsequently, a field along y causes the spin to rotate as the z component is continuously measured. (b) Experimental schematic for the measurement process. A far-off-resonant probe beam traverses the sample and measures the z component of spin via Faraday rotation. The measurement strength could be improved by surrounding the ensemble with a cavity. (c) Experimental apparatus subsumed by the Plant block, which serves to map the total field to the photocurrent, $h(t) \to y(t)$.

Suppose first that the observer has full knowledge of the parameter $\theta$. The proper description of the system would then be a density matrix $\rho_\theta(t)$ conditioned on the measurement record $y(t)$. The first problem is to find a rule to update this density matrix with the knowledge obtained from the measurement. As in the problem of this paper, this mapping may take the form of a stochastic master equation (SME). The SME is by definition a filter that maps the measurement record to an optimal estimate of the system state.

Now if we allow for uncertainty in $\theta$, then a particularly intuitive choice for our new description of the system is

$$\rho(t) = \int_\theta \rho_\theta(t)\, p(\theta,t)\, d\theta, \qquad (1)$$

where $p(\theta,t)$ is a probability distribution representing our knowledge of the system parameter. In addition to the rule for updating each $\rho_\theta(t)$, we also need to find a rule for updating $p(\theta,t)$ according to the measurement record. By requiring internal consistency, it is possible to find a Bayes rule for updating $p(\theta,t)$ [6]. These two update rules in principle solve the estimation problem completely.

Because evolving $\rho(t)$ involves performing calculations with the full Hilbert space in question, which is often computationally expensive, it is desirable to find a reduced description of the system. Fortunately, it is often possible to find a closed set of dynamical equations for a small set of moments of $\rho(t)$. For example, if $c$ is an operator, then we can define the estimate moments

$$\langle c\rangle(t) = \mathrm{Tr}[\rho(t)\,c],$$
$$\langle \Delta c^2\rangle(t) = \mathrm{Tr}[\rho(t)\,(c - \langle c\rangle)^2],$$
$$\langle \theta\rangle(t) = \int p(\theta,t)\,\theta\, d\theta,$$
$$\langle \Delta\theta^2\rangle(t) = \int p(\theta,t)\,(\theta - \langle\theta\rangle)^2\, d\theta,$$

and derive their update rules from the full update rules, resulting in a set of $y(t)$-dependent differential equations. If those differential equations are closed, then this reduced description is adequate for the parameter estimation task at hand. This situation (with closure and Gaussian distributions) is to be expected when the system is approximately linear.
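
As a rough illustration of the Bayes update for $p(\theta,t)$ and of the moments $\langle\theta\rangle$ and $\langle\Delta\theta^2\rangle$, the sketch below tracks a parameter distribution on a grid. It is not the quantum filter of this paper: it assumes a purely classical stand-in record $y(t)\,dt = \theta\,dt + \sqrt{\sigma}\,dW$, and the values of `theta_true`, `sigma`, the grid, and the step size are all arbitrary choices made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

theta_true = 0.7      # value used to generate the synthetic record (assumed)
sigma = 1.0           # noise power of the record (assumed)
dt, n_steps = 0.1, 2000

theta = np.linspace(-2.0, 2.0, 401)   # grid of candidate parameter values
d_theta = theta[1] - theta[0]
log_p = np.zeros_like(theta)          # flat prior, stored as a log-density

for _ in range(n_steps):
    # Synthetic increment of the record: y dt = theta dt + sqrt(sigma) dW.
    dy = theta_true * dt + np.sqrt(sigma * dt) * rng.standard_normal()
    # Bayes rule: add the Gaussian log-likelihood of dy for every candidate
    # theta (terms that do not depend on theta are dropped).
    log_p += (theta * dy - 0.5 * theta**2 * dt) / sigma

p = np.exp(log_p - log_p.max())       # subtract the maximum for numerical safety
p /= p.sum() * d_theta                # normalize on the grid

mean_theta = np.sum(p * theta) * d_theta
var_theta = np.sum(p * (theta - mean_theta) ** 2) * d_theta
print(mean_theta, var_theta)
```

For this linear Gaussian stand-in the posterior stays Gaussian, so the two moments carry all of the information in $p(\theta,t)$; its variance shrinks as $\sigma/t$, which is the kind of closure the reduced description relies on.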

B.  Continuously  measured  spin  system 

This approach can be applied directly to the problem of magnetometry considered in this paper. The problem can be summarized by the situation illustrated in Fig. 1: a spin ensemble of possibly unknown number is initially polarized along the x axis (e.g., via optical pumping), an unknown, possibly fluctuating scalar magnetic field $b$ directed along the y axis causes the spins to then rotate within the x-z plane, and the z component of the collective spin is measured continuously. The measurement can, for example, be implemented as shown, where we observe the difference photocurrent, $y(t)$, in a polarimeter which measures the Faraday rotation of a linearly polarized far-off-resonant probe beam traveling along z [2,14,18]. The goal is to optimally estimate $b(t)$ via the measurement record and unbiased prior information. If a control field $u(t)$ is included, as it will be eventually, the total field is represented by $h(t) = b(t) + u(t)$.

In terms of our previous discussion, we have here the observable $c = \sqrt{M} J_z$, where $M$ is the measurement rate (defined in terms of probe beam parameters), and the parameter $\theta = b$. When $b$ is known, our state estimate evolves by the stochastic master equation [19]

$$d\rho_b(t) = -i[H(b), \rho_b(t)]\,dt + \mathcal{D}[\sqrt{M} J_z]\rho_b(t)\,dt + \sqrt{\eta}\,\mathcal{H}[\sqrt{M} J_z]\bigl(2\sqrt{M\eta}\,[y(t)\,dt - \langle J_z\rangle_b\,dt]\bigr)\rho_b(t), \qquad (2)$$

where $H(b) = \gamma J_y b$, $\gamma$ is the gyromagnetic ratio, and

$$\mathcal{D}[c]\rho = c\rho c^\dagger - (c^\dagger c\rho + \rho c^\dagger c)/2,$$
$$\mathcal{H}[c]\rho = c\rho + \rho c^\dagger - \mathrm{Tr}[(c + c^\dagger)\rho]\rho.$$

The stochastic quantity $2\sqrt{M\eta}\,[y(t)\,dt - \langle J_z\rangle_b(t)\,dt] \equiv dW(t)$ is a Wiener increment (Gaussian white noise with variance $dt$) by the optimality of the filter. The definition of the photocurrent may be scaled by any constant gain factor, as in Ref. [17], as long as the statistics of the SME remains invariant. The sensitivity of the photodetection per $\sqrt{\mathrm{Hz}}$ is represented by $1/(2\sqrt{M\eta})$, where the quantity $\eta$ represents the quantum efficiency of the detection. If $\eta = 0$, we are essentially ignoring the measurement result and the conditional SME becomes a deterministic unconditional master equation. If $\eta = 1$, the detectors are maximally efficient. Note that our initial state $\rho(0) = \rho_b(0)$ is made equal to a coherent state (polarized in x) and is representative of our prior information.
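
To make the structure of the stochastic master equation concrete, the sketch below implements the two superoperators and a single Euler step for a small spin. It is only an illustration under stated assumptions: a spin $J = 2$, the field switched off (zero Hamiltonian), a random pure initial state, and hypothetical function names (`spin_z`, `dissipator`, `innovation`, `sme_step`). A plain Euler step is used for simplicity and is not claimed to be the integration scheme of the original work.

```python
import numpy as np

def spin_z(j):
    """J_z in the basis m = j, j-1, ..., -j."""
    m = np.arange(j, -j - 1, -1)
    return np.diag(m).astype(complex)

def dissipator(c, rho):
    """D[c]rho = c rho c^dag - (c^dag c rho + rho c^dag c)/2."""
    cd = c.conj().T
    return c @ rho @ cd - 0.5 * (cd @ c @ rho + rho @ cd @ c)

def innovation(c, rho):
    """H[c]rho = c rho + rho c^dag - Tr[(c + c^dag) rho] rho."""
    cd = c.conj().T
    return c @ rho + rho @ cd - np.trace((c + cd) @ rho) * rho

def sme_step(rho, hamiltonian, c, eta, dW, dt):
    """One Euler step of the conditional evolution driven by the increment dW."""
    d_rho = (-1j * (hamiltonian @ rho - rho @ hamiltonian) * dt
             + dissipator(c, rho) * dt
             + np.sqrt(eta) * innovation(c, rho) * dW)
    return rho + d_rho

rng = np.random.default_rng(0)
J, M, eta, dt = 2, 1.0, 0.8, 1e-3       # assumed illustrative values
Jz = spin_z(J)
c = np.sqrt(M) * Jz

# Random pure state as a stand-in initial condition.
v = rng.normal(size=2 * J + 1) + 1j * rng.normal(size=2 * J + 1)
v /= np.linalg.norm(v)
rho = np.outer(v, v.conj())

dW = np.sqrt(dt) * rng.standard_normal()           # Wiener increment, variance dt
rho_next = sme_step(rho, np.zeros_like(Jz), c, eta, dW, dt)
print(np.trace(rho_next).real)
```

Both superoperators are traceless on a normalized state, so the step preserves the trace; setting `eta = 0` removes the stochastic term and leaves the unconditional master equation.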

The  stochastic  master  equation,  Eq.  (2),  has  previously 
been  derived  for  homodyne  detection  of  the  output  of  a  cav¬ 
ity  with  a  single  mode  dispersively  coupled  to  the  collective 
atomic  spin  within  the  cavity  [19].  The  resulting  form  of  the 
equation  is,  however,  the  most  generic  form  of  a  continuous 
stochastic  QND  measurement  and  also  applies  under  similar 
approximations  to  the  free  space  Faraday  rotation  measure¬ 
ment  [18]  diagrammed  in  Fig.  1. 

It can be shown that the unnormalized probability $p(b,t)$ evolves according to [6]

$$dp(b,t) = 4M\eta\,\langle J_z\rangle_b(t)\, p(b,t)\, y(t)\,dt. \qquad (3)$$

The evolution Eqs. (2) and (3) together with Eq. (1) solve the problem completely, albeit in a computationally expensive way. Clearly, for large ensembles it would be advantageous to reduce the problem to a simpler description.

If we consider only the estimate moments $\langle J_z\rangle(t)$, $\langle \Delta J_z^2\rangle(t)$, $\langle b\rangle(t)$, and $\langle \Delta b^2\rangle(t)$ and derive their evolution with the above rules, it can be shown that the filtering equations for those variables are closed under certain approximations. First, the spin number $J$ must be large enough that the distributions for $J_y$ and $J_z$ are approximately Gaussian for an x-polarized coherent state. Second, we only consider times $t \ll 1/M$ because the total spin becomes damped by the measurement at times comparable to the inverse of the measurement rate.

Although this approach is rigorous and fail-safe, the resulting filtering equations for the moments can be arrived at in a more direct manner as discussed in Appendix A. Essentially, the full quantum-mechanical mapping from $h(t)$ to $y(t)$ is equivalent to the mapping derived from a model which appears classical, and assumes an actual, but random, value for the z component of spin. This correspondence generally holds for a stochastic master equation corresponding to an arbitrary linear quantum-mechanical system with continuous measurement of observables that are linear combinations of the canonical variables [20].

From this point on we will only consider the simplified Gaussian representation (used in the following section) since it allows us to apply established techniques from estimation and control theory. The replacement of the quantum mechanical model with a classical noise model is discussed more fully in the Appendix. Throughout this treatment, we keep in mind the constraints that the original model imposed. Again, we assume $J$ is large enough to maintain the Gaussian approximation and that time is small compared to the inverse of the measurement-induced damping rate, $t \ll 1/M$. Also, the description of our original problem demands that $\langle \Delta J_z^2\rangle(0) = J/2$ for a coherent state [32]. Hence our prior information for the initial value of the spin component will always be dictated by the structure of Hilbert space.
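
The requirement that an x-polarized coherent state has variance $J/2$ in $J_z$ is a standard property of spin coherent states and can be verified directly. The sketch below builds the spin operators for an assumed small value $J = 5$ and uses a hypothetical helper `spin_operators`; the coherent state is obtained as the eigenvector of $J_x$ with the largest eigenvalue.

```python
import numpy as np

def spin_operators(j):
    """Return (J_x, J_z) for spin j in the basis m = j, j-1, ..., -j."""
    m = np.arange(j, -j - 1, -1)
    jz = np.diag(m).astype(complex)
    # <m+1| J_+ |m> = sqrt(j(j+1) - m(m+1)), placed on the first superdiagonal.
    raising = np.sqrt(j * (j + 1) - m[1:] * (m[1:] + 1))
    jp = np.diag(raising, k=1).astype(complex)
    jx = 0.5 * (jp + jp.conj().T)
    return jx, jz

J = 5                                              # assumed small spin for illustration
jx, jz = spin_operators(J)

eigenvalues, eigenvectors = np.linalg.eigh(jx)     # eigenvalues in ascending order
coherent_x = eigenvectors[:, -1]                   # eigenvalue +J: polarized along x

mean_jz = np.real(np.vdot(coherent_x, jz @ coherent_x))
var_jz = np.real(np.vdot(coherent_x, jz @ jz @ coherent_x)) - mean_jz**2
print(eigenvalues[-1], mean_jz, var_jz)
```

The computed variance equals $J/2$ up to rounding, which is the value imposed on the prior for the spin component in the simplified model.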

III.  OPTIMAL  ESTIMATION  AND  CONTROL 

We now describe the dynamics of the simplified representation. Given a linear state-space model (L), a quadratic performance criterion (Q), and Gaussian noise (G), we show how to apply standard LQG analysis to optimize the estimation and control performance [21].

The system state we are trying to estimate is represented by
State.

$$x(t) = \begin{bmatrix} z(t) \\ b(t) \end{bmatrix}, \qquad (4)$$

where $z(t)$ represents the small z component of the collective angular momentum and $b(t)$ is a scalar field along the y axis.

Our best guess of $x(t)$, as we filter the measurement record, will be denoted as
Estimate.

$$m(t) = \begin{bmatrix} \tilde{z}(t) \\ \tilde{b}(t) \end{bmatrix}. \qquad (5)$$

As stated in the Appendix, we implicitly make the associations $\tilde{z}(t) = \langle J_z\rangle(t) = \mathrm{Tr}[\rho(t) J_z]$ and $\tilde{b}(t) = \int p(b,t)\, b\, db$, although no further mention of $\rho(t)$ or $p(b,t)$ will be made.

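We assume the measurement-induced damping of $J$ to be negligible for short times ($J\exp[-Mt/2] \approx J$ if $t \ll 1/M$) and approximate the dynamics as
Dynamics.

$$dx(t) = A\,x(t)\,dt + B\,u(t)\,dt + \begin{bmatrix} 0 \\ \sqrt{\sigma_{bF}} \end{bmatrix} dW_1, \qquad (6)$$

$$A = \begin{bmatrix} 0 & \gamma J \\ 0 & -\gamma_b \end{bmatrix}, \qquad B = \begin{bmatrix} \gamma J \\ 0 \end{bmatrix}, \qquad \Sigma_0 = \begin{bmatrix} \sigma_{z0} & 0 \\ 0 & \sigma_{b0} \end{bmatrix}, \qquad \Sigma_1 = \begin{bmatrix} 0 & 0 \\ 0 & \sigma_{bF} \end{bmatrix},$$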

where the initial value $x(0)$ for each trial is drawn randomly from a Gaussian distribution of mean zero and covariance matrix $\Sigma_0$. The initial field variance $\sigma_{b0}$ is considered to be due to classical uncertainty, whereas the initial spin variance $\sigma_{z0}$ is inherently nonzero due to the original quantum state description. Specifically, we impose $\sigma_{z0} = \langle \Delta J_z^2\rangle(0)$. The Wiener increment $dW_1(t)$ has a Gaussian distribution with mean zero and variance $dt$. $\Sigma_1$ represents the covariance matrix of the last vector in Eq. (6).

We have given ourselves a magnetic-field control input,
$u(t)$, along the same axis, $y$, as the field to be measured, $b(t)$.
We have allowed $b(t)$ to fluctuate via a damped diffusion
(Ornstein-Uhlenbeck) process [22]

$$db(t) = -\gamma_b\, b(t)\,dt + \sqrt{\sigma_{bF}}\,dW_1. \tag{7}$$

The $b(t)$ fluctuations are represented in this particular way
because Gaussian noise processes are amenable to LQG
analysis. The variance of the field at any particular time is
given by the expectation $\sigma_{bFree} \equiv E[b(t)^2] = \sigma_{bF}/2\gamma_b$.
(Throughout the paper we use the notation $E[x(t)]$ to represent
the average of the generally stochastic variable $x(t)$
at the same point in time, over many trajectories.) The
bandwidth of the field is determined by the frequency $\gamma_b$
alone. When considering the measurement of fluctuating
fields, a valid choice of prior might be $\sigma_{b0} = \sigma_{bFree}$, but we
choose to let $\sigma_{b0}$ remain independent. For constant fields,
we set $\sigma_{bFree} = 0$, but $\sigma_{b0} \neq 0$.
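The relation between the diffusion strength and the free-field variance in Eq. (7) can be checked with a short sketch. The function names `ou_variance` and `ou_step` and the numerical values are illustrative assumptions, not part of the original analysis; the closed-form variance is the standard result for a zero-mean Ornstein-Uhlenbeck process.

```python
import math
import random

def ou_variance(t, var0, gamma_b, sigma_bF):
    """Variance E[b(t)^2] for db = -gamma_b*b*dt + sqrt(sigma_bF)*dW, zero-mean start."""
    if gamma_b == 0.0:
        # Pure diffusion: the variance grows linearly in time.
        return var0 + sigma_bF * t
    decay = math.exp(-2.0 * gamma_b * t)
    return var0 * decay + (sigma_bF / (2.0 * gamma_b)) * (1.0 - decay)

def ou_step(b, dt, gamma_b, sigma_bF, rng):
    """Advance b by dt using the exact one-step transition of the process."""
    decay = math.exp(-gamma_b * dt)
    noise_var = ou_variance(dt, 0.0, gamma_b, sigma_bF)
    return b * decay + math.sqrt(noise_var) * rng.gauss(0.0, 1.0)

# Assumed illustrative values, chosen so that sigma_bFree = sigma_bF / (2*gamma_b) = 1.
gamma_b, sigma_bF = 1.0e5, 2.0e5
sigma_bFree = sigma_bF / (2.0 * gamma_b)

# Starting from the stationary variance, the variance stays there at later times.
stationary = ou_variance(1.0, sigma_bFree, gamma_b, sigma_bF)
```

Setting `gamma_b = 0` and `sigma_bF = 0` recovers the constant-field case, in which the variance simply remains at its prior value.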

Note that only the small-angle limit of the spin motion is
considered. Otherwise we would have to consider different
components of the spin vector rotating into each other. The
small-angle approximation would be invalid if a field caused
the spins to rotate excessively, but using adequate control
ensures this will not happen. Hence, we use control for
essentially two reasons in this paper: first to keep our small-angle
approximation valid and second to make our estimation
process robust to our ignorance of $J$. The latter point
will be discussed in Sec. V.

Our measurement of $z$ is described by the process

Measurement.

$$y(t)\,dt = C\,x(t)\,dt + \sqrt{\sigma_M}\,dW_2(t), \tag{8}$$

$$C \equiv \begin{bmatrix} 1 & 0 \end{bmatrix}, \qquad \Sigma_2 \equiv \sigma_M \equiv \frac{1}{4M\eta},$$

where the measurement shot noise is represented by the
Wiener increment $dW_2(t)$ of variance $dt$. Again, $\sqrt{\sigma_M}$ represents
the sensitivity of the measurement, $M$ is the measurement
rate (with unspecified physical definition in terms of
probe parameters), and $\eta$ is the quantum efficiency of the
measurement. The increments $dW_1$ and $dW_2$ are uncorrelated.

Following Ref. [21], the optimal estimator for mapping
$y(t)$ to $m(t)$ takes the form
Estimator.

$$dm(t) = A\,m(t)\,dt + B\,u(t)\,dt + K_O(t)[y(t) - C\,m(t)]\,dt, \tag{9}$$

$$m(0) = \begin{bmatrix} 0 \\ 0 \end{bmatrix}, \qquad K_O(t) = \Sigma(t)\,C^T \Sigma_2^{-1},$$

$$\frac{d\Sigma(t)}{dt} = \Sigma_1 + A\Sigma(t) + \Sigma(t)A^T - \Sigma(t)C^T\Sigma_2^{-1}C\Sigma(t), \tag{10}$$

$$\Sigma(t) \equiv \begin{bmatrix} \sigma_{zR}(t) & \sigma_{cR}(t) \\ \sigma_{cR}(t) & \sigma_{bR}(t) \end{bmatrix}, \tag{11}$$

$$\Sigma(0) = \Sigma_0 \equiv \begin{bmatrix} \sigma_{z0} & 0 \\ 0 & \sigma_{b0} \end{bmatrix}. \tag{12}$$

Equation (9) is the Kalman filter, which depends on the
solution of the matrix Riccati equation (10). The Riccati
equation gives the optimal observation gain $K_O(t)$ for the
filter. The estimator is designed to minimize the average
quadratic estimation error for each variable: $E[(z(t) - \tilde{z}(t))^2]$ and
$E[(b(t) - \tilde{b}(t))^2]$. If the model is correct, and we assume the
observer chooses his prior information $\Sigma(0)$ to match the
actual variance of the initial data $\Sigma_0$, then we have the
self-consistent result

$$\sigma_{zE}(t) \equiv E[(z(t) - \tilde{z}(t))^2] = \sigma_{zR}(t),$$

$$\sigma_{bE}(t) \equiv E[(b(t) - \tilde{b}(t))^2] = \sigma_{bR}(t).$$

Hence, the Riccati equation solution represents both the
observer gain and the expected performance of an optimal filter
using that same gain.

Now consider the control problem, which is in many
respects dual to the estimation problem. We would like to
design a controller to map $y(t)$ to $u(t)$ in a manner that
minimizes the quadratic cost function
Minimized cost.


$$H = \int_0^T \left[ x^T(t)\,P\,x(t) + u(t)\,Q\,u(t) \right] dt + x^T(T)\,P_1\,x(T), \tag{13}$$

$$P \equiv \begin{bmatrix} p & 0 \\ 0 & 0 \end{bmatrix}, \qquad Q \equiv q,$$

where $P_1$ is the end-point cost. Only the ratio $p/q$ ever
appears, of course, so we define the parameter $\lambda \equiv \sqrt{p/q}$ and
use it to represent the cost of control. By setting $\lambda \to \infty$, as
we often choose to do in the subsequent analysis to simplify
results, we are putting no cost on our control output. This is
unrealistic because, for example, making $\lambda$ arbitrarily large
implies that we can apply transfer functions with finite gain
at arbitrarily high frequencies, which is not experimentally
possible. Despite this, we will often consider the limit
$\lambda \to \infty$ to set bounds on achievable estimation and control
performance. The optimal controller for minimizing Eq. (13) is
Controller.

$$u(t) = -K_C(t)\,m(t), \tag{14}$$

$$K_C(t) = Q^{-1} B^T V(T - t),$$

$$\frac{dV(T)}{dT} = P + A^T V(T) + V(T)A - V(T)\,B\,Q^{-1}B^T V(T), \tag{15}$$

$$V(T = 0) = P_1.$$

Here $V(T)$ is solved in reverse time $T$, which can be
interpreted as the time left to go until the stopping point. Thus if
$T \to \infty$, then we only need to use the steady state of the $V$
Riccati equation (15) to give the steady-state controller gain
$K_C$ for all times. In this case, we can ignore the (reverse)
initial condition $P_1$ because the controller is not designed to
stop. Henceforth, we will make $K_C$ equal to this constant
steady-state value, such that the only time-varying coefficients
will come from $K_O(t)$.

In principle, the above results give the entire solution to
the ideal estimation and control problem. However, in the
nonideal case where our knowledge of the system is incomplete,
e.g., $J$ is unknown, our estimation performance will
suffer. Notation is now introduced which produces trivial
results in the ideal case, but is helpful otherwise. Our goal is
to collect the above equations into a single structure which
can be used to solve the nonideal problem. We define the
total state of the system and estimator as

Total state.

$$\theta(t) \equiv \begin{bmatrix} x(t) \\ m(t) \end{bmatrix} = \begin{bmatrix} z(t) \\ b(t) \\ \tilde{z}(t) \\ \tilde{b}(t) \end{bmatrix}. \tag{16}$$

Consider the general case where the observer assumes the
plant contains spin $J'$, which may or may not be equal to the
actual $J$. All design elements depending on $J'$ instead of $J$ are
now labeled with a prime. Then it can be shown that the total
state dynamics from the above estimator-controller architecture
are a time-dependent Ornstein-Uhlenbeck process,

Total state dynamics.

$$d\theta(t) = \alpha(t)\,\theta(t)\,dt + \beta(t)\,dW(t), \tag{17}$$

$$\alpha(t) \equiv \begin{bmatrix} A & -BK_C' \\ K_O'(t)\,C & A' - B'K_C' - K_O'(t)\,C \end{bmatrix},$$

$$\beta(t) \equiv \begin{bmatrix}
0 & 0 & 0 & 0 \\
0 & \sqrt{\sigma_{bF}} & 0 & 0 \\
0 & 0 & \sqrt{\sigma_M}\,K_{O1}'(t) & 0 \\
0 & 0 & \sqrt{\sigma_M}\,K_{O2}'(t) & 0
\end{bmatrix},$$

where the covariance matrix of $dW$ is $dt$ times the identity.
Now the quantity of interest is the following covariance matrix:

Total state covariance.

$$\Theta(t) \equiv E[\theta(t)\theta^T(t)] = \begin{bmatrix}
\sigma_{zz} & \sigma_{zb} & \sigma_{z\tilde{z}} & \sigma_{z\tilde{b}} \\
\sigma_{zb} & \sigma_{bb} & \sigma_{b\tilde{z}} & \sigma_{b\tilde{b}} \\
\sigma_{z\tilde{z}} & \sigma_{b\tilde{z}} & \sigma_{\tilde{z}\tilde{z}} & \sigma_{\tilde{z}\tilde{b}} \\
\sigma_{z\tilde{b}} & \sigma_{b\tilde{b}} & \sigma_{\tilde{z}\tilde{b}} & \sigma_{\tilde{b}\tilde{b}}
\end{bmatrix}, \tag{18}$$

$$\sigma_{zz} \equiv E[z(t)^2], \qquad \sigma_{zb} \equiv E[z(t)\,b(t)], \qquad \ldots$$

It can be shown that this total covariance matrix obeys the
deterministic equations of motion.

Total state covariance dynamics.

$$\frac{d\Theta(t)}{dt} = \alpha(t)\,\Theta(t) + \Theta(t)\,\alpha^T(t) + \beta(t)\,\beta^T(t), \tag{19}$$

$$\Theta(t) = \exp\left[-\int_0^t \alpha(t')\,dt'\right] \Theta_0 \exp\left[-\int_0^t \alpha^T(t')\,dt'\right]
+ \int_0^t dt'\, \exp\left[-\int_{t'}^t \alpha(s)\,ds\right] \beta(t')\,\beta^T(t') \exp\left[-\int_{t'}^t \alpha^T(s)\,ds\right], \tag{20}$$

$$\Theta_0 = \begin{bmatrix}
\sigma_{z0} & 0 & 0 & 0 \\
0 & \sigma_{b0} & 0 & 0 \\
0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0
\end{bmatrix}.$$

Equation  (20)  is  the  matrix  form  of  the  standard  integrating 
factor  solution  for  time-dependent  scalar  ordinary  differential 
equations  [22].  Whether  we  solve  this  problem  numerically 
or  analytically,  the  solution  provides  the  quantity  that  we 
ultimately  care  about. 

Average magnetometry error.

$$\sigma_{bE}(t) \equiv E[(\tilde{b}(t) - b(t))^2]
= E[b^2(t)] + E[\tilde{b}^2(t)] - 2E[b(t)\,\tilde{b}(t)]
= \sigma_{bb}(t) + \sigma_{\tilde{b}\tilde{b}}(t) - 2\sigma_{b\tilde{b}}(t). \tag{21}$$

FIG. 2. The Riccati equation solution gives the ideal field
estimation performance. The parameters used here are $J = 10^6$,
$\sigma_{z0} = J/2$ (for an ensemble of spin-1/2's), $\gamma = 10^6$, $M = 10^4$,
$\sigma_{b0} = \sigma_{bFree} = 1$. (All quantities within the figures are kept
dimensionless, although expressions within the text may be interpreted
as having dimension.) The solution starts at the free field fluctuation
variance and saturates at $\sigma_{bS}$. The plot is not valid at times $t > 1/M$.
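The qualitative behavior described in the Fig. 2 caption can be reproduced by integrating the Riccati equation (10) directly. The following is a minimal sketch, not the procedure used to generate the figure: it writes out the three independent components of Eq. (10) for the $A$, $C$, $\Sigma_1$, and $\Sigma_2$ defined above and steps them with a classical Runge-Kutta integrator. The function names and the dimensionless parameter values are assumptions chosen so that an explicit integrator is adequate; the Fig. 2 values ($\gamma J = 10^{12}$) make the problem stiff and would call for an implicit solver.

```python
def riccati_rhs(S, g, gamma_b, sigma_bF, sigma_M):
    """Right-hand side of Eq. (10) for Sigma = [[sz, sc], [sc, sb]], with g = gamma*J."""
    sz, sc, sb = S
    return (
        2.0 * g * sc - sz * sz / sigma_M,
        g * sb - gamma_b * sc - sz * sc / sigma_M,
        sigma_bF - 2.0 * gamma_b * sb - sc * sc / sigma_M,
    )

def rk4_step(S, dt, *params):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(k, h):
        return tuple(s + h * ki for s, ki in zip(S, k))
    k1 = riccati_rhs(S, *params)
    k2 = riccati_rhs(shifted(k1, dt / 2.0), *params)
    k3 = riccati_rhs(shifted(k2, dt / 2.0), *params)
    k4 = riccati_rhs(shifted(k3, dt), *params)
    return tuple(
        s + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
        for s, a, b, c, d in zip(S, k1, k2, k3, k4)
    )

def integrate_riccati(S0, t_end, dt, *params):
    """Return the list of (t, sz, sc, sb) samples from t = 0 to t_end."""
    steps = int(round(t_end / dt))
    S, out = tuple(S0), [(0.0,) + tuple(S0)]
    for n in range(1, steps + 1):
        S = rk4_step(S, dt, *params)
        out.append((n * dt,) + S)
    return out

# Assumed illustrative parameters: g, gamma_b, sigma_bF, sigma_M.
params = (10.0, 1.0, 2.0, 0.1)
sigma_b0 = params[2] / (2.0 * params[1])     # prior set equal to sigma_bFree
trajectory = integrate_riccati((1.0, 0.0, sigma_b0), 5.0, 1.0e-3, *params)
sigma_bR_final = trajectory[-1][3]
```

With the prior set to the free-field variance, `sigma_bR` is initially flat, then falls, and finally settles at a constant value, which is the shape the caption describes.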

When all parameters are known (and $J' = J$), this total
state description is unnecessary because $\sigma_{bE}(t) = \sigma_{bR}(t)$. This
equality is by design. However, when the wrong parameters
are assumed (e.g., $J' \neq J$) the equality does not hold,
$\sigma_{bE}(t) \neq \sigma_{bR}(t)$, and either Eq. (19) or Eq. (20) must be used to find
$\sigma_{bE}(t)$. Before addressing this problem, we consider in detail
the performance in the ideal case, where all system parameters
are known by the observer, including $J$.

At this point, we have defined several variables. For clarity,
let us review the meaning of several before continuing.
Inputs to the problem include the field fluctuation strength
$\sigma_{bF}$, Eq. (7), and the measurement sensitivity $\sigma_M$, Eq. (8).
The prior information for the field is labeled $\sigma_{b0}$, Eq. (12).
The solution to the Riccati equation is $\sigma_{bR}(t)$, Eq. (11), and is
equal to the estimation variance $\sigma_{bE}(t)$, Eq. (21), when the
estimator model is correct. In the following section, we
additionally use $\sigma_{bS}$, Eq. (24), and $\sigma_{bT}(t)$, Eq. (25), to represent
the steady-state and transient values of $\sigma_{bE}(t)$, respectively.

IV. OPTIMAL PERFORMANCE: J KNOWN

We start by observing qualitative characteristics of the
$b$-estimation dynamics. Figure 2 shows the average estimation
performance, $\sigma_{bR}(t)$, as a function of time for a realistic
set of parameters. Note that $\sigma_{bR}$ is constant for small and
large times, below $t_1$ and above $t_2$. If $\sigma_{b0}$ is finite then
the curve is constant for small times, as it takes some time to
begin improving the estimate from the prior. If $\sigma_{b0}$ is infinite,
then $t_1 = 0$ and the sloped transient portion extends towards
infinity as $t \to 0$. At long times, $\sigma_{bR}$ will become constant
again, but only if the field is fluctuating ($\sigma_{bF} \neq 0$ and
$\gamma_b \neq 0$). The performance saturates because one can track a field
only so well if the field is changing and the signal-to-noise
ratio is finite. If the field to be tracked is constant, then
$t_2 = \infty$ and the sloped portion of the curve extends to zero as
$t \to \infty$ (given the approximations discussed in Sec. II B). After
the point where the performance saturates ($t \gg t_2$), all of the
observer and control gains have become time independent
and the filter can be described by a transfer function.

However, as will be shown, applying only this steady-state
transfer function is nonoptimal in the transient regime
($t_1 \ll t \ll t_2$), because the time dependence of the gains is
clearly crucial for optimal transient performance.

A. Steady-state performance

We start by examining the steady-state performance of the
filter. At large enough times (where we have yet to define
large enough), $K_O$ becomes constant and if we set $T \to \infty$
(ignoring the end-point cost), then $K_C$ is always constant.
Setting $d\Sigma/dt = 0$ and $dV/dT = 0$ we find the steady-state
gains $K_O$ and $K_C$, where $\lambda = \sqrt{p/q}$.

Now assuming the gains to be constant, we can derive the
three relevant transfer functions from $y(t)$ to $m(t)$ ($\tilde{z}$ and $\tilde{b}$)
and $u$. We proceed as follows. First, we express the estimates
in terms of only themselves and the photocurrent

$$\frac{dm(t)}{dt} = A\,m(t) + B\,u(t) + K_O[y(t) - C\,m(t)]$$
$$= A\,m(t) + B[-K_C\,m(t)] + K_O[y(t) - C\,m(t)]$$
$$= (A - BK_C - K_OC)\,m(t) + K_O\,y(t).$$

To get the transfer functions, we take the Laplace transform
of the entire equation, use differential transform rules to give
$s$ factors (where $s = j\omega$, $j = \sqrt{-1}$), ignore initial condition
factors, and rearrange terms. However, this process only
gives meaningful transfer functions if the coefficients $K_O$
and $K_C$ are constant. Following this procedure, we have

$$m(s) = (sI - A + BK_C + K_OC)^{-1} K_O\, y(s) = G_m(s)\,y(s),$$

$$u(s) = -K_C\,m(s) = -K_C\,(sI - A + BK_C + K_OC)^{-1} K_O\, y(s) = G_u(s)\,y(s),$$

where

$$G_m(s) = \begin{bmatrix} G_z(s) \\ G_b(s) \end{bmatrix}.$$
FIG. 3. The Bode plot of $G_u(s)$, the transfer function of the filter
in steady state, for a typical parameter regime. Notice that the
controller closes the plant with adequate phase margin to avoid
closed-loop instability. At high frequencies the controller rolls off
at $\omega_Q$ if $\lambda \neq \infty$.


The three transfer functions [$G_z(s)$, $G_b(s)$, and $G_u(s)$]
serve three different tasks. If estimation is the concern, then
$G_b(s)$ will perform optimally in steady state. Note that, while
the Riccati solution is the same with and without control ($K_C$
nonzero or zero), this transfer function is not the same in the
two cases. So, even though the transfer functions are different,
they give the same steady-state performance.
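To make the dependence of $G_b(s)$ on the control gain concrete, the transfer functions can be evaluated numerically from the $2 \times 2$ matrix inverse. This is a sketch under stated assumptions: the closed-form steady-state gains in `steady_state_gains` were obtained here by setting the time derivatives in Eqs. (10) and (15) to zero for the $A$, $B$, $C$ defined above, and are not a transcription of a displayed result. The function names and parameter values are likewise hypothetical.

```python
import math

def steady_state_gains(g, gamma_b, sigma_bF, sigma_M, lam):
    """Steady-state K_O = (k1, k2) and K_C = (c1, c2), with g = gamma*J.

    Assumes gamma_b + lam*g > 0 so that c2 is well defined.
    """
    s0 = math.sqrt(sigma_bF / sigma_M)
    k1 = math.sqrt(gamma_b**2 + 2.0 * g * s0) - gamma_b
    k2 = k1 * k1 / (2.0 * g)
    c1 = lam
    c2 = lam * g / (gamma_b + lam * g)
    return (k1, k2), (c1, c2)

def transfer_functions(s, g, gamma_b, K_O, K_C):
    """Return (G_z, G_b, G_u) evaluated at the complex frequency s."""
    (k1, k2), (c1, c2) = K_O, K_C
    # M = s*I - A + B*K_C + K_O*C with A = [[0, g], [0, -gamma_b]],
    # B = [g, 0]^T and C = [1, 0].
    m11, m12 = s + g * c1 + k1, -g * (1.0 - c2)
    m21, m22 = k2, s + gamma_b
    det = m11 * m22 - m12 * m21
    G_z = (m22 * k1 - m12 * k2) / det      # first row of M^{-1} K_O
    G_b = (-m21 * k1 + m11 * k2) / det     # second row of M^{-1} K_O
    G_u = -(c1 * G_z + c2 * G_b)           # u(s) = -K_C m(s)
    return G_z, G_b, G_u

# Assumed illustrative values: g = 10, gamma_b = 1, sigma_bF = 2, sigma_M = 0.1, lambda = 5.
K_O, K_C = steady_state_gains(10.0, 1.0, 2.0, 0.1, 5.0)
with_control = transfer_functions(0.0, 10.0, 1.0, K_O, K_C)
without_control = transfer_functions(0.0, 10.0, 1.0, K_O, (0.0, 0.0))
```

In this sketch the observer gain `K_O` is identical in both calls, yet the zero-frequency value of `G_b` differs between them, which is one way to see that the field-estimate transfer function depends on whether control is applied.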

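Let us now consider the controller transfer function $G_u(s)$
in more detail. We find the controller to be of the form

$$G_u(s) = G_{u,DC}\,\frac{1 + s/\omega_H}{1 + (1 + s/\omega_Q)\,s/\omega_L}. \tag{22}$$

Here each frequency $\omega$ represents a transition in the Bode
plot of Fig. 3. A similar controller transfer function is derived
via a different method in Appendix C.

If we are not constrained experimentally, we can make the
approximations $\lambda^2 \gg \sqrt{\sigma_{bF}/\sigma_M}/2\gamma J$ and
$\gamma J \gg \gamma_b^2\sqrt{\sigma_M/\sigma_{bF}}$, giving

$$G_u(s) \to G_{u,DC}\,\frac{1 + s/\omega_H}{1 + s/\omega_L},$$

$$\omega_L \to \gamma_b, \qquad
\omega_H \to \sqrt{\frac{\gamma J}{2}\sqrt{\frac{\sigma_{bF}}{\sigma_M}}}, \qquad
\omega_C \to \sqrt{2\gamma J\sqrt{\frac{\sigma_{bF}}{\sigma_M}}}, \qquad
\omega_Q \to \lambda\gamma J,$$

$$G_{u,DC} \to -\frac{1}{\gamma_b}\sqrt{\frac{\sigma_{bF}}{\sigma_M}}, \qquad
G_{u,AC} \to -\sqrt{\frac{2}{\gamma J}\sqrt{\frac{\sigma_{bF}}{\sigma_M}}},$$

where $G_{u,AC}$ is the gain at high frequencies ($\omega \gg \omega_H$) and we
find the closing frequency $\omega_C$ from the condition
$|P_z(j\omega_C)\,G_u(j\omega_C)| = 1$, with the plant transfer function being
the normal integrator $P_z(s) = \gamma J/s$. Notice that the controller
closes in the very beginning of the flat high-frequency region
(hence with adequate phase margin) because $\omega_C = 2\omega_H$.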
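The relation between the closing frequency and $\omega_H$ can be checked in one line. This is a sketch under the approximations stated above, taking the limiting values $\omega_H \to \sqrt{(\gamma J/2)\sqrt{\sigma_{bF}/\sigma_M}}$ and $|G_{u,AC}| \to \sqrt{(2/\gamma J)\sqrt{\sigma_{bF}/\sigma_M}}$ as assumptions, and replacing $|G_u(j\omega_C)|$ by its flat high-frequency asymptote (the exact magnitude at $\omega_C$ differs by a factor of order unity, since $\omega_C$ is only twice $\omega_H$):

$$|P_z(j\omega_C)\,G_u(j\omega_C)| \approx \frac{\gamma J}{\omega_C}\,|G_{u,AC}| = 1
\quad\Longrightarrow\quad
\omega_C \approx \gamma J\sqrt{\frac{2}{\gamma J}\sqrt{\frac{\sigma_{bF}}{\sigma_M}}}
= \sqrt{2\gamma J\sqrt{\frac{\sigma_{bF}}{\sigma_M}}} = 2\,\omega_H .$$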

Finally, consider the steady-state estimation performance
for $z$ and $b$. The variances are the same with and without control
(hence $\lambda$ independent) and, under the simplifying assumption
$\gamma J \gg \gamma_b^2\sqrt{\sigma_M/\sigma_{bF}}$, are given by

$$\sigma_{zR}(t) \to \sqrt{2\gamma J}\;\sigma_M^{3/4}\,\sigma_{bF}^{1/4} \equiv \sigma_{zS}, \tag{23}$$

$$\sigma_{bR}(t) \to \sqrt{\frac{2}{\gamma J}}\;\sigma_{bF}^{3/4}\,\sigma_M^{1/4} \equiv \sigma_{bS}. \tag{24}$$

If the estimator reaches steady state at $t \ll 1/M$, then the
above variance $\sigma_{zR}$ represents a limit to the amount of spin
squeezing possible in the presence of fluctuating fields.

Also, the $J$ scaling of the saturated field sensitivity,
$\sigma_{bR} \propto J^{-1/2}$, is not nearly as strong as the $J$ scaling in the transient
period, $\sigma_{bR} \propto J^{-2}$. Next, we demonstrate this latter result as we
move from the steady-state analysis to calculating the
estimation performance during the transient period.
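The large-$\gamma J$ forms of Eqs. (23) and (24) and the $J^{-1/2}$ scaling can be compared against the exact steady state of the Riccati equation (10). The sketch below is illustrative only: the exact expressions in `steady_state_variances` were derived here by setting $d\Sigma/dt = 0$, the function names are hypothetical, and the numerical values assume $\eta = 1$ (so $\sigma_M = 1/4M$) together with $\gamma_b = 10^5$ and $\sigma_{bFree} = 1$.

```python
import math

def steady_state_variances(J, gamma, gamma_b, sigma_bF, sigma_M):
    """Exact steady state (sigma_zR, sigma_bR) of Eq. (10), from dSigma/dt = 0."""
    g = gamma * J
    s0 = math.sqrt(sigma_bF / sigma_M)
    k1 = math.sqrt(gamma_b**2 + 2.0 * g * s0) - gamma_b
    sigma_z = sigma_M * k1
    sigma_c = sigma_M * k1 * k1 / (2.0 * g)
    sigma_b = sigma_c * (gamma_b + k1) / g
    return sigma_z, sigma_b

def asymptotic_variances(J, gamma, sigma_bF, sigma_M):
    """Large-gamma*J forms corresponding to Eqs. (23) and (24)."""
    g = gamma * J
    sigma_zS = math.sqrt(2.0 * g) * sigma_M**0.75 * sigma_bF**0.25
    sigma_bS = math.sqrt(2.0 / g) * sigma_bF**0.75 * sigma_M**0.25
    return sigma_zS, sigma_bS

# Assumed values: J = 1e6, gamma = 1e6, gamma_b = 1e5, sigma_bF = 2e5, sigma_M = 2.5e-5.
exact = steady_state_variances(1e6, 1e6, 1e5, 2e5, 2.5e-5)
approx = asymptotic_variances(1e6, 1e6, 2e5, 2.5e-5)

# Quadrupling J halves the saturated field variance (J^{-1/2} scaling).
scaling = asymptotic_variances(4e6, 1e6, 2e5, 2.5e-5)[1] / approx[1]
```

For these values the exact and asymptotic results agree closely, because $\gamma_b$ is small compared with $\sqrt{2\gamma J\sqrt{\sigma_{bF}/\sigma_M}}$.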


B. Transient performance

We now consider the transient performance of the ideal
filter: how quickly and how well the estimator-controller will
lock onto the signal and achieve steady-state performance. In
many control applications, the transient response is not of
interest because the time it takes to acquire the lock is
negligible compared to the long steady-state period of the
system. However, in systems where the measurement induces
continuous decay, this transient period can be a significant
portion of the total lifetime of the experiment.

We will evaluate the transient performance of two different
filters. First, we look at the ideal dynamic version, with
time-dependent observer gains derived from the Riccati
equation. This limits to a transfer function at long times
when the gains have become constant. Second, we numerically
look at the case where the same steady-state transfer
functions are used for the entire duration of the measurement.
Because the gains are not adjusted smoothly, the small-time
performance of this estimator suffers. Of course, for
long times the estimators are equivalent.




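1. Dynamic estimation and control

Now consider the transient response of $\Sigma(t)$ [giving
$K_O(t)$]. We will continue to impose that $V$ (thus $K_C$) is
constant because we are not interested in any particular stopping
time.

TABLE I. Field tracking error, $\sigma_{bR}(t)$, for different initial variances of $b$ and $z$.

$$\begin{array}{c|ccc}
 & \sigma_{b0} = 0 & \sigma_{b0} & \sigma_{b0} \to \infty \\ \hline
\sigma_{z0} = 0 & 0 & \dfrac{3\sigma_{b0}\sigma_M}{3\sigma_M + \gamma^2J^2t^3\sigma_{b0}} & \dfrac{3\sigma_M}{\gamma^2J^2t^3} \\[2ex]
\sigma_{z0} & 0 & \dfrac{12\sigma_{b0}\sigma_M(\sigma_M + \sigma_{z0}t)}{12\sigma_M^2 + \gamma^2J^2\sigma_{b0}\sigma_{z0}t^4 + 4\sigma_M(3\sigma_{z0}t + \gamma^2J^2t^3\sigma_{b0})} & \dfrac{12\sigma_M(\sigma_M + \sigma_{z0}t)}{\gamma^2J^2t^3(4\sigma_M + \sigma_{z0}t)} \\[2ex]
\sigma_{z0} \to \infty & 0 & \dfrac{12\sigma_{b0}\sigma_M}{12\sigma_M + \gamma^2J^2t^3\sigma_{b0}} & \dfrac{12\sigma_M}{\gamma^2J^2t^3}
\end{array}$$

The Riccati equation for $\Sigma(t)$ [Eq. (10)] appears difficult
to solve because it is nonlinear. Fortunately, it can be reduced
to a much simpler linear problem. See Appendix B for an
outline of this method.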

The solution to the fluctuating field problem ($\sigma_{bF} \neq 0$ and
$\gamma_b \neq 0$) is represented in Fig. 2. This solution is simply the
constant field solution ($\sigma_{bF} = 0$ and $\gamma_b = 0$) smoothly
saturating at the steady-state value of Eq. (24) at time $t_2$. Thus,
considering the long-time behavior of the constant field
solution will tell us about the transient behavior when measuring
fluctuating fields. Because the analytic form for the constant
field solution is simple, we consider only it and
disregard the full analytic form of the fluctuating field solution.

The analytic form of $\Sigma(t)$ is highly instructive. The general
solutions to $\sigma_{bR}(t)$ and $\sigma_{zR}(t)$, with arbitrary prior
information $\sigma_{b0}$ and $\sigma_{z0}$, are presented in the central entries of
Tables I and II, respectively. The other entries of the tables
represent the limits of these somewhat complicated expressions
as the prior information assumes extremely large or
small values. Here, we notice several interesting tradeoffs.

First, the left-hand column of Table I is zero because if a
constant field is being measured, and we start with complete
knowledge of the field ($\sigma_{b0} = 0$), then our job is completed
trivially. Now notice that if $\sigma_{b0}$ and $\sigma_{z0}$ are both nonzero,
then at long times we have the lower right entry of Table I,

$$\sigma_{bR}(t) \to \frac{12\sigma_M}{\gamma^2J^2t^3} \equiv \sigma_{bT}(t). \tag{25}$$

This is the same result one gets when the estimation procedure
is simply to perform a least-squares line fit to the noisy
measurement curve for constant fields. (Note that all of these
results are equivalent to the solutions of Ref. [17], but without
$J$ damping.) If it were physically possible to ensure
$\sigma_{z0} = 0$, then our estimation would improve by a factor of 4 to
the upper right result. However, quantum mechanics imposes
that this initial variance is nonzero (e.g., $\sigma_{z0} = J/2$ for a
coherent state and less, but still nonzero, for a squeezed state),
and the upper right solution is unattainable.
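The limits discussed above can be checked numerically from the central entry of Table I. The sketch below is an illustration with hypothetical function names; the parameter values assume $\gamma J = 10^{12}$, $\sigma_M = 2.5 \times 10^{-5}$ (that is, $M = 10^4$ with $\eta = 1$), $\sigma_{z0} = J/2$, and $\sigma_{b0} = 1$, evaluated at a time $t = 10^{-5}$ inside the window $t < 1/M$.

```python
def sigma_bR_constant_field(t, g, sigma_M, sigma_z0, sigma_b0):
    """Central entry of Table I: field variance for a constant field, g = gamma*J."""
    num = 12.0 * sigma_b0 * sigma_M * (sigma_M + sigma_z0 * t)
    den = (12.0 * sigma_M**2
           + g**2 * sigma_b0 * sigma_z0 * t**4
           + 4.0 * sigma_M * (3.0 * sigma_z0 * t + g**2 * t**3 * sigma_b0))
    return num / den

def sigma_bT(t, g, sigma_M):
    """Long-time transient limit, Eq. (25)."""
    return 12.0 * sigma_M / (g**2 * t**3)

g, sigma_M, t = 1.0e12, 2.5e-5, 1.0e-5

# Coherent-state spin variance: the general entry approaches Eq. (25).
coherent = sigma_bR_constant_field(t, g, sigma_M, 5.0e5, 1.0)
ratio_coherent = coherent / sigma_bT(t, g, sigma_M)

# Hypothetical sigma_z0 = 0: the upper-right entry, a factor of 4 smaller.
no_spin_noise = sigma_bR_constant_field(t, g, sigma_M, 0.0, 1.0)
ratio_no_spin_noise = no_spin_noise / sigma_bT(t, g, sigma_M)
```

At $t = 0$ the same function returns the prior $\sigma_{b0}$, and with $\sigma_{b0} = 0$ it returns zero at all times, matching the left-hand column of the table.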


Now consider the dual problem of spin estimation performance
$\sigma_{zR}(t)$ as represented in Table II, where we can make
analogous tradeoff observations. If there is no field present,
we set $\sigma_{b0} = 0$ and

$$\sigma_{zR}(t) = \frac{\sigma_{z0}\,\sigma_M}{\sigma_M + t\,\sigma_{z0}}. \tag{26}$$

When $\sigma_{zR}(t)$ is interpreted as the quantum variance $\langle \Delta \hat{J}_z^2 \rangle(t)$,
this is the ideal (nondamped) conditional spin-squeezing result,
which is valid at $t \ll 1/M$, before damping in $J$ begins to
take effect [19]. If we consider the solution for $t \gg 1/JM$, we
have the lower left entry of Table II, $\sigma_{zR}(t) = \sigma_M/t$. However,
if we must include constant field uncertainty in our estimation,
then our estimate becomes the lower right entry,
$\sigma_{zR}(t) = 4\sigma_M/t$, which is, again, a factor of 4 worse.

If our task is field estimation, intrinsic quantum-mechanical
uncertainty in $z$ limits our performance just as, if our task
is spin-squeezed state preparation, field uncertainty limits
our performance.

2. Transfer function estimation and control

Suppose that the controller did not have the capability to
adjust the gains in time as it tracked a fluctuating field. One
approach would then be to apply the steady-state transfer
functions derived above for the entire measurement. While
this approach performs optimally in steady state, it approaches
the steady state in a nonoptimal manner compared
to the dynamic controller. Figure 4 demonstrates this poor
transient performance for tracking fluctuating fields of differing
bandwidth. Notice that the performance only begins to
improve around the time that the dynamic controller saturates.
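The comparison between the dynamic and constant-gain estimators can be sketched by propagating both error covariances side by side. This is an illustration under stated assumptions, not a reproduction of Fig. 4: the constant-gain covariance equation used in `fixed_gain_rhs` is the standard one for a linear filter run with a fixed gain $K$, the steady-state gain is the one obtained by setting $d\Sigma/dt = 0$ in Eq. (10), and all function names and dimensionless parameter values are hypothetical.

```python
import math

def optimal_rhs(S, p):
    """Riccati equation (10); the gain K_O(t) = Sigma C^T / sigma_M is implicit."""
    g, gb, sF, sM = p
    sz, sc, sb = S
    return (2*g*sc - sz*sz/sM, g*sb - gb*sc - sz*sc/sM, sF - 2*gb*sb - sc*sc/sM)

def fixed_gain_rhs(S, p, K):
    """Error covariance of the same filter run with a constant gain K = (k1, k2)."""
    g, gb, sF, sM = p
    sz, sc, sb = S
    k1, k2 = K
    return (
        2*(g*sc - k1*sz) + sM*k1*k1,
        g*sb - (k1 + gb)*sc - k2*sz + sM*k1*k2,
        sF - 2*(k2*sc + gb*sb) + sM*k2*k2,
    )

def rk4(rhs, S, dt, steps, *args):
    """Integrate dS/dt = rhs(S, *args) with classical RK4; return every sample."""
    out = [S]
    for _ in range(steps):
        a = rhs(S, *args)
        b = rhs(tuple(s + 0.5*dt*x for s, x in zip(S, a)), *args)
        c = rhs(tuple(s + 0.5*dt*x for s, x in zip(S, b)), *args)
        d = rhs(tuple(s + dt*x for s, x in zip(S, c)), *args)
        S = tuple(s + dt*(w + 2*x + 2*y + z)/6 for s, w, x, y, z in zip(S, a, b, c, d))
        out.append(S)
    return out

p = (10.0, 1.0, 2.0, 0.1)                  # assumed g, gamma_b, sigma_bF, sigma_M
k1 = math.sqrt(p[1]**2 + 2*p[0]*math.sqrt(p[2]/p[3])) - p[1]
K_ss = (k1, k1*k1/(2*p[0]))                # steady-state observer gain
S0 = (1.0, 0.0, 1.0)                       # sigma_z0, cross term, sigma_b0
dynamic = rk4(optimal_rhs, S0, 1e-3, 5000, p)
fixed = rk4(fixed_gain_rhs, S0, 1e-3, 5000, p, K_ss)
```

In this sketch the third component of each sample is the field error variance. The constant-gain value never falls below the dynamic one, and the two coincide once the dynamic gains have settled.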

Also notice that the transfer function $G_b(s)$ is dependent
on whether or not the state is being controlled, i.e., whether
or not $\lambda$ is zero. The performance shown in Fig. 4 is for one
particular value of $\lambda$, but others will give different estimation
performances for short times. Still, all of the transfer functions
generated from any value of $\lambda$ will limit to the same
performance at long times. Also, all of them will perform
poorly compared to the dynamic approach during the transient
time.

TABLE II. Spin tracking error, $\sigma_{zR}(t)$, for different initial variances of $b$ and $z$.

FIG. 4. Estimation performance for estimators based on the
dynamic gain solution of the Riccati equation, compared against
estimators with constant estimation gain. The latter are the transfer
function limits of the former; hence they have the same long-term
performance. Three different bandwidth $b$ processes are considered.


V. ROBUST PERFORMANCE: J UNKNOWN

Until this point, we have assumed the observer has complete
knowledge of the system parameters, in particular, the
spin number $J$. We will now relax this assumption and consider
the possibility that, for each measurement, the collective
spin $J$ is drawn randomly from a particular distribution.
Although we will be ignorant of a given $J$, we may still
possess knowledge about the distribution from which it is
derived. For example, we may be certain that $J$ never assumes
a value below a minimal value $J_{min}$ or above a maximal
value $J_{max}$. This is a realistic experimental situation, as it
is unusual to have particularly long tails on, for example,
trapped atom number distributions. We do not explicitly consider
the problem of $J$ fluctuating during an individual measurement,
although the subsequent analysis can clearly be
extended to this problem.

Given a $J$ distribution, one might imagine completely
reoptimizing the estimator-controller with the full distribution
information in mind. Our initial approach is more basic and
in line with robust control theory: we design our filter as
before, assuming a particular $J'$, then analyze how well this
filter performs on an ensemble with $J \neq J'$. With this information
in mind, we can decide if estimator-controllers built
with $J'$ are robust, with and without control, given the
bounds on $J$. We will find that, under certain conditions,
using control makes our estimates robust to uncertainty
about the total spin number.

The essential reason for this robustness is that when a
control field is applied to zero the measured signal, that control
field must be approximately equal to the field to be
tracked. Because $J$ is basically an effective gain, variations in
$J$ will affect the performance, but not critically, so the error
signal will still be approximately zero. If the applied signal is
set to be the estimate, then the tracking error must also be
approximately zero. (See Appendix C for a robustness analysis
along these lines in frequency space.)

Of course, this analysis assumes that we can apply fields
with the same precision that we measure them. While the
precision with which we can apply a field is experimentally
limited, we here consider the ideal case of infinite precision.
In this admittedly idealized problem, our estimation is limited
by only the measurement noise and our knowledge of $J$.

First, to motivate this problem, we describe how poorly
our estimator performs given ignorance about $J$ without control.

A. Uncontrolled ignorance

Let us consider the performance of our estimation procedure
at estimating constant fields when $J' \neq J$. In general,
this involves solving the complicated total covariance matrix
Eq. (20). However, in the long-time limit ($t \gg 1/JM$) of
estimating constant fields, the procedure amounts to simply
fitting a line to the noisy measurement with a least-squares
estimate. Suppose we record an open-loop measurement
which appears as a noisy sloped line for small angles of
rotation due to the Larmor precession. Regardless of whether
or not we know $J$, we can measure the slope of that line and
estimate it to be $m$. If we knew $J$, we would know how to
extract the field from the slope correctly: $\tilde{b} = m/\gamma J$. If we
assumed the wrong spin number, $J' \neq J$, we would get the
nonoptimal estimate: $\tilde{b}' = m/\gamma J' = \tilde{b}J/J'$.

First assume that this is a systematic error and $J$ is unknown,
but the same, on every trial. We assume that the
constant field is drawn randomly from the $\sigma_{b0}$ distribution
for every trial. In this case, if we are wrong, then we are
always wrong by the same factor. It can be shown that the
error always saturates,

$$\sigma_{bE} \to (1 - f)^2\,\sigma_{b0},$$

where $f = J/J'$. Of course, because this error is systematic,
the variance of the estimate does not saturate, only the error.
This problem is analogous to ignorance of the constant
electronic gains in the measurement and can also be calibrated
away.
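The saturation value follows in one step from the least-squares picture. As a sketch, assume the slope estimate has converged to its noiseless value $m = \gamma J b$ for a constant field $b$ drawn with mean zero and variance $\sigma_{b0}$ (an idealization of the long-time limit):

$$\tilde{b}' = \frac{m}{\gamma J'} = \frac{J}{J'}\,b = f\,b
\quad\Longrightarrow\quad
E[(b - \tilde{b}')^2] = (1 - f)^2\,E[b^2] = (1 - f)^2\,\sigma_{b0} .$$

In this limit the error vanishes for $f = 1$, and for $f = 2$ it equals the prior variance, so that the estimate is no better than the prior.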

However, a significant problem arises when, on every
trial, a constant $b$ is drawn at random and $J$ is drawn at
random from a distribution, so the error is no longer systematic.
In this case, we would not know whether to attribute the
size of the measured slope to the size of $J$ or to the size of $b$.
Given the same $b$ every trial, all possible measurement
curves fan out over some angle due to the variation in $J$.
After measuring the slope of an individual line to beyond
this fan-out precision, it makes no sense to continue measuring.

We should also point out procedures for estimating fields
in open-loop configuration, but without the small-angle
approximation. For constant large fields, we could observe
many cycles before the spin damped significantly. By fitting
the amplitude and frequency independently, or computing the
Fourier transform, we could estimate the field somewhat
independently of $J$, which only determines the amplitude.
However, the point here is that $b$ might not be large enough
to give many cycles before the damping time or any other
desired stopping time. In this case, we could not independently
fit the amplitude and frequency because they appear
as a product in the initial slope. Similar considerations apply
for the case of fluctuating $b$ and fluctuating $J$. See Ref. [23]
for a complete analysis of Bayesian spectrum analysis with
free induction decay examples.

Fortunately, using precise control can make the estimation
process relatively robust to such spin number fluctuations.

B. Controlled ignorance: Steady-state performance

We first analyze how the estimator designed with $J'$ performs
on a plant with $J$ at tracking fluctuating fields with and
without control. To determine this we calculate the steady
state of Eq. (19).

For the case of no control ($\lambda = 0$), we simplify the resulting
expression by taking the same large $J'$ approximation as
before. This gives the steady-state uncontrolled error

$$\sigma_{bE} \to (1 - f)^2\,\frac{\sigma_{bF}}{2\gamma_b} = (1 - f)^2\,\sigma_{bFree},$$

where $f = J/J'$. Because the variance of the fluctuating $b$ is
$\sigma_{bFree}$, the uncontrolled estimation performs worse than no
estimation at all if $f > 2$.

On the other hand, when we use precise control the performance
improves dramatically. We again simplify the
steady-state solution with the large $J'$ and $\lambda$ assumptions
from before, giving

$$\sigma_{bS}(J, J') \to \left(\frac{1 + f}{2f}\right) \sqrt{\frac{2}{\gamma J'}}\;\sigma_{bF}^{3/4}\,\sigma_M^{1/4}
= \left(\frac{1 + f}{2f}\right) \sigma_{bS}(J'),$$

where $\sigma_{bS}(J, J')$ is the steady-state controlled error when a
plant with $J$ is controlled with a $J'$ controller and $\sigma_{bS}(J')$ is
the error when $J = J'$. One simple interpretation of this result
is that if we set $J'$ to be the minimum of the $J$ distribution
($f \geq 1$) then we never do worse than $\sigma_{bS}(J')$ and we never do
better than twice as well ($f \to \infty$). See Fig. 5 for a demonstration
of this performance.


C. Controlled ignorance: Transient performance

Now consider measuring constant fields with the wrong
assumed $J'$. Again, when control is not used, the error saturates
at

$$\sigma_{bE} \to (1 - f)^2\,\sigma_{b0}.$$

When control is used, the transient performance again improves
under certain conditions. The long-time transient solution
of Eq. (19) is difficult to manage analytically, yet the
behavior under certain limits is again simple. For large $\lambda$ and
$J'$ and for $f > 1/2$, we numerically find the transient performance
to be approximately

$$\sigma_{bT}(J, J') \to \left(\frac{f^2 + 2}{4f^2 - 1}\right) \frac{12\sigma_M}{\gamma^2 J'^2 t^3}
= \left(\frac{f^2 + 2}{4f^2 - 1}\right) \sigma_{bT}(J'), \tag{27}$$

where $\sigma_{bT}(J, J')$ is the transient controlled error when a plant
with $J$ is controlled with a $J'$ controller and $\sigma_{bT}(J')$ is the
error when $J = J'$. See Fig. 6 for a demonstration of this
performance for realistic parameters. As $f \to \infty$ the $f$-dependent
prefactor saturates at a value of 1/4. However, as $f \to 1/2$
the system takes longer to reach such a simple
asymptotic form, and the solution of Eq. (27) becomes invalid.

FIG. 5. Steady-state estimation performance for estimator
designed with $J' = 10^6$, and actual spin numbers
$J = J' \times [0.5, 0.75, 1, 1.25, 2, 10, 100]$. Other parameters: $\gamma = 10^6$, $M = 10^4$,
$\gamma_b = 10^5$, $\sigma_{bFree} = 1$ (fluctuating field), $\lambda = 0.1$ (this is large enough to
satisfy large-$\lambda$ limits discussed in text). The inset compares the
normalized robust estimation performance (curve) at a particular
time, to the ideal performance (line) when $J$ is known.

FIG. 6. Transient estimation performance for controller
designed with $J' = 10^6$, and actual spin numbers
$J = J' \times [0.75, 1, 1.25, 2, 10, 100, 1000]$. Other parameters: $\gamma = 10^6$,
$M = 10^4$, $\gamma_b = 0$, $\sigma_{bFree} = 0$ (constant field), $\lambda = 1$. Note that this behavior
is valid for $t < 1/M = 10^{-4}$. The inset compares the normalized
robust estimation performance (curve) at a particular time, to the ideal
performance (line) when $J$ is known.

Accordingly, one robust strategy would be the following. Suppose that the lower bound of the $J$ distribution was known and equal to $J_{\min}$. Also assume that $\sigma_{bT}(J_{\min})$ represents an acceptable level of performance. In this case, we could simply design our estimator based on $J' = J_{\min}$ and we would be guaranteed at least the performance $\sigma_{bT}(J_{\min})$ and at best the performance $\sigma_{bT}(J_{\min})/4$.

This approach would be suitable for experimental situations because typical $J$ distributions are narrow: the difference between $J_{\min}$ and $J_{\max}$ is rarely greater than an order of magnitude. Thus, the overall sacrifice in performance between the ideal case and the robust case would be small. The estimation performance still suffers because of our ignorance of $J$, but not nearly as much as in the uncontrolled case.
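The bounds quoted for this robust strategy can be checked numerically. The sketch below is only illustrative: it assumes that $f$ denotes the ratio $J/J'$ (as the lists $J = J' \times [\ldots]$ in the figure captions suggest) and evaluates the $f$-dependent prefactor of Eq. (27) for the ratios listed in the caption of Fig. 6. The function name `transient_prefactor` is hypothetical.

```python
def transient_prefactor(f):
    """Ratio sigma_bT(J, J') / sigma_bT(J') from Eq. (27), assuming f = J / J'."""
    if f <= 0.5:
        # Eq. (27) is only stated for f > 1/2; the prefactor diverges at f = 1/2.
        raise ValueError("Eq. (27) requires f > 1/2")
    return (f ** 2 + 2.0) / (4.0 * f ** 2 - 1.0)


# Spin-number ratios f = J / J' taken from the caption of Fig. 6.
ratios = [0.75, 1, 1.25, 2, 10, 100, 1000]
prefactors = {f: transient_prefactor(f) for f in ratios}

for f, p in prefactors.items():
    print(f"f = {f:>7}: sigma_bT(J, J') / sigma_bT(J') = {p:.4f}")
```

For $f \geq 1$ (that is, $J' = J_{\min}$) the printed ratio stays between 1/4 and 1, while a ratio below one ($f = 0.75$) gives a prefactor above 1, which is why designing for the lower bound of the distribution is the safe choice.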




VI.  CONCLUSION 

The analysis of this paper contained several key steps which should be emphasized. Our first goal was to outline the proper approach to quantum parameter estimation. The second was to demonstrate that reduced representations of the full filtering problem are relevant and convenient because, if a simple representation can be found, then existing classical estimation and control methods can be readily applied. The characteristic that led to this simple description was the approximately Gaussian nature of the problem. Next, we attempted to present basic classical filtering and control methodology in a self-contained, pedagogical format. The results emphasized the inherent tradeoffs in simultaneous estimation of distinct, but dynamically coupled, system parameters. Because these methods are potentially critical in any field involving optimal estimation, we consider the full exposition of this elementary example to be a useful resource for future analogous work.

We have also demonstrated the general principle that precision feedback control can make estimation robust to the uncertainty of system parameters. Despite the need to assume that the controller produced a precise cancellation field, this approach deserves further investigation because of its inherent ability to precisely track broadband field signals [2]. It is anticipated that these techniques will become more pervasive in the experimental community as quantum systems are refined to levels approaching their fundamental limits of performance.

FIG. 7. Equivalent models for the filtering problem (see discussion at the beginning of Appendix A). Each version can be inserted into the plant block of Fig. 1(c). The filters all presume complete knowledge of $h(t) = b(t) + u(t)$.

APPENDIX A: SIMPLIFIED REPRESENTATION OF THE PLANT

In Sec. II we outlined a general approach to quantum parameter estimation based on the stochastic master equation (SME), but subsequently we derived optimal observer and controller gains from an explicit representation of the plant dynamics [Eq. (6)]. This representation appears classical in that the plant state is given by a scalar variable $z$ rather than a density operator. In this section we present a derivation of this simplified representation and discuss the equivalence of our approach to the original quantum estimation problem.

From the perspective of quantum filtering theory we will simply show that a Gaussian approximation to the relevant SME can be viewed as a Kalman filter, which in turn induces a simplified representation of the dynamics of the spin state. In this simplified representation the quantum state of the spin system is replaced by a scalar variable $z$ and $\langle J_z\rangle(t)$ is viewed as the optimal estimate of the random process $z(t)$. Equations for $dz(t)$ and its relation to the observed photocurrent $y(t)dt$ are given in Eqs. (A3) and (A6), which have the convenient property of being formally time invariant. The technical approach in the main body of the text is then to replace Eq. (A1), which is derived from the SME, by a state-space observer derived directly from the simplified model of Eq. (A3). By doing so we achieve transparent correspondence with classical estimation and control theory. We should note that the diagrams in Fig. 7 indicate signal flows and dependencies in a way that is quite at odds with the quantum filtering perspective. This figure is meant solely to motivate the simplified model [Eq. (A3)] for readers who prefer a more traditional quantum optics perspective, in which the Ito increment in the SME corresponds to optical shot noise (as opposed to an innovation process derived from the photocurrent) and the SME itself plays the role of a "physical" evolution equation mapping $h(t)$ to $y(t)dt$.

Adopting the latter perspective, let us briefly discuss (with reference to the top diagram in Fig. 7) the overall structure of our estimation problem. The physical system that exists in the laboratory (the spins and optical probe beam) acts as a transducer, whose key role in the magnetometry scheme is to imprint a statistical signature of the magnetic field $h(t)$ onto the observable photocurrent, $y(t)dt$. Hence whatever theoretical model we adopt for describing the spin and probe dynamics must provide an accurate description of the mapping from $h(t)$ to $y(t)dt$, as represented by the plant in Fig. 1(c). An open-loop estimator, designed on the basis of this plant model, would construct a conditional probability distribution for $h(t)$ based on passive observation of $y(t)dt$. In a closed-loop estimation procedure we would allow the controller to apply compensation fields to the system in order to gain accuracy and/or robustness. In either case, the essential role of the spin-probe (plant) model in the design process is to provide an accurate description of the influence of an arbitrary time-dependent field $h(t)$ on the photocurrent $y(t)dt$. Note that the consideration of arbitrary $h(t)$ subsumes all possible effects of real-time feedback.

Thomsen and co-workers [19] have derived an accurate plant model for our magnetometry problem, in the form of an SME [Eq. (2)]. Following a common convention in quantum optics, let us here write this SME and the corresponding photocurrent equation in the form

$$d\rho(t) = -i\,dt\,[H(h),\rho(t)] + \mathcal{D}[\sqrt{M}J_z]\rho(t)\,dt + \sqrt{\eta}\,\mathcal{H}[\sqrt{M}J_z]\rho(t)\,dW(t),$$

$$y(t)\,dt = \langle J_z\rangle(t)\,dt + \sqrt{\sigma_M}\,dW(t),$$

where $H(h) = \gamma h J_y$ and $\rho(t)$ is the state of the spin system conditioned on the measurement record $y(t)dt$. The quantity $dW(t)$ is a Wiener increment that heuristically represents shot noise in the photodetection process [24], and these are to be interpreted as Ito stochastic differential equations. If $h(t)$ and $y(t)dt$ are considered as input and output signals, respectively, this pair of equations jointly implements a plant transfer function as depicted in Fig. 7, with $\rho(t)$ taking on the role of the plant state.

For a large spin ensemble, however, $\rho(t)$ will have very high dimension and it would be impractical to utilize the full SME for design purposes. It is straightforward to derive a reduced model by employing a moment expansion for the observable of interest. Extracting the conditional expectation values of the first two moments of $J_z$ from the SME gives the following scalar stochastic differential equations:

$$d\langle J_z\rangle(t) = \gamma\langle J_x\rangle(t)h(t)\,dt + \frac{\langle\Delta J_z^2\rangle(t)}{\sqrt{\sigma_M}}\,dW(t),$$

$$d\langle\Delta J_z^2\rangle(t) = -\frac{\langle\Delta J_z^2\rangle^2(t)}{\sigma_M}\,dt - i\gamma\langle[\Delta J_z^2,J_y]\rangle(t)h(t)\,dt + \frac{\langle\Delta J_z^3\rangle(t)}{\sqrt{\sigma_M}}\,dW(t).$$

If the spins are initially fully polarized along $x$ and the spin angle $\approx\langle J_z\rangle/\langle J_x\rangle$ is kept small (e.g., by active control), then, by using the evolution equation for the $x$ component, we can show $\langle J_x\rangle(t) \approx J\exp[-Mt/2] \approx J$ for times $t \ll 1/M$. Making the Gaussian approximation at small times, the third-order terms $\langle\Delta J_z^3\rangle$ and $-i\gamma\langle[\Delta J_z^2,J_y]\rangle(t)h(t)$ can be neglected. The Holstein-Primakoff transformation [25], commonly used in the condensed matter physics literature, makes it possible to derive this Gaussian approximation as an expansion in $1/J$. Both of the removed terms can be shown to be $\sim 1/\sqrt{J}$ smaller than the retained nonlinear term. Additionally, the second removed term will be reduced if $h(t) \approx 0$ by active control.

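These approximations give

$$d\langle J_z\rangle(t) = \gamma J h(t)\,dt + \frac{\langle\Delta J_z^2\rangle(t)}{\sqrt{\sigma_M}}\,dW(t), \qquad (A1)$$

$$d\langle\Delta J_z^2\rangle(t) = -\frac{\langle\Delta J_z^2\rangle^2(t)}{\sigma_M}\,dt, \qquad (A2)$$

which constitute a Gaussian, small-time approximation to the full SME that represents the essential dynamics for magnetometry. Note that we can analytically solve

$$\langle\Delta J_z^2\rangle(t) = \frac{\langle\Delta J_z^2\rangle(0)\,\sigma_M}{\sigma_M + t\,\langle\Delta J_z^2\rangle(0)},$$

where $\langle\Delta J_z^2\rangle(0) = J/2$ for an initial coherent spin state.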
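As a quick consistency check, the closed-form variance can be compared with a direct numerical integration of Eq. (A2). The sketch below uses a hand-written fourth-order Runge-Kutta step; the parameter values ($J = 10^6$, $\sigma_M = 2.5 \times 10^{-5}$, final time $10^{-9}$) are illustrative assumptions, not values taken from the text, and the function names are hypothetical.

```python
def variance_analytic(t, v0, sigma_m):
    """Closed-form solution of dV/dt = -V**2 / sigma_m with V(0) = v0."""
    return v0 * sigma_m / (sigma_m + v0 * t)


def variance_rk4(t_end, v0, sigma_m, steps=10000):
    """Integrate dV/dt = -V**2 / sigma_m with classical fourth-order Runge-Kutta."""
    def rhs(v):
        return -v * v / sigma_m

    h = t_end / steps
    v = v0
    for _ in range(steps):
        k1 = rhs(v)
        k2 = rhs(v + 0.5 * h * k1)
        k3 = rhs(v + 0.5 * h * k2)
        k4 = rhs(v + h * k3)
        v += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return v


# Illustrative (assumed) numbers: coherent-state variance J/2 with J = 1e6.
J = 1.0e6
V0 = J / 2.0
SIGMA_M = 2.5e-5
T_END = 1.0e-9

numeric = variance_rk4(T_END, V0, SIGMA_M)
exact = variance_analytic(T_END, V0, SIGMA_M)
print(f"numerical: {numeric:.6e}, analytic: {exact:.6e}")
```

The two values agree closely, and both lie well below the initial value $J/2$, which is the conditional squeezing discussed later in this appendix.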

At this point we may note that Eqs. (A1) and (A2) have the algebraic form of a Kalman filter. [This is not at all surprising since the SME, as written in Eq. (2), represents an optimal nonlinear filter for the reduced spin state [6,9] and our subsequent approximations have enforced both linearity and sufficiency of second-order moments.] Viewed as such, the quantity $\langle J_z\rangle(t)$ would represent an optimal (least-squares) estimate of some underlying variable $z(t)$ based on observation of a signal $d\xi(t)$, and $\langle\Delta J_z^2\rangle(t)$ would represent the uncertainty (variance) of this estimate. It thus stands to reason that we might be able to simplify our magnetometry model even further if we could find an "underlying" model for the evolution of $z(t)$ and $d\xi(t)$, for which our equations derived from the SME would be the Kalman filter.

It is not difficult to do so, and indeed a very simple model suffices,

$$dz(t) = \gamma J h(t)\,dt,$$

$$d\xi(t) = z(t)\,dt + \sqrt{\sigma_M}\,d\bar{W}(t), \qquad (A3)$$

where $d\bar{W}(t)$ is a Wiener increment that is distinct from (though related to) $dW(t)$. In order to match initial conditions with the equations derived from the SME, we should assume that the expected value of $z(t=0)$ is zero and that the variance of our prior distribution for $z(0)$ is $J/2$. Written in canonical form, the Kalman filter for this hypothetical system is then

$$d\hat{z}(t) = \gamma J h(t)\,dt + \frac{\sigma_{zR}(t)}{\sigma_M}\,[d\xi(t) - \hat{z}(t)\,dt],$$

$$d\sigma_{zR}(t) = -\frac{\sigma_{zR}^2(t)}{\sigma_M}\,dt.$$

Here $\hat{z}(t)$ is the optimal estimate of $z(t)$ and $\sigma_{zR}(t)$ is the variance. We exactly recover the SME model, Eqs. (A1) and (A2), by the identifications

$$\hat{z}(t) \leftrightarrow \langle J_z\rangle(t),$$

$$\sigma_{zR}(t) \leftrightarrow \langle\Delta J_z^2\rangle(t), \qquad (A4)$$

$$\frac{[d\xi(t) - \hat{z}(t)\,dt]}{\sqrt{\sigma_M}} \leftrightarrow dW(t).$$

It is important to note that the quantity $[d\xi(t) - \hat{z}(t)\,dt]/\sqrt{\sigma_M}$ represents the so-called innovation process of this Kalman filter, and it is thus guaranteed (by least-squares optimality of the filter [26]) to have Gaussian white-noise statistics. Hence we have solid grounds for identifying it with the Ito increment appearing in the SME.
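The filter for the simplified model of Eq. (A3) is easy to try out numerically. The following is a minimal sketch, not the authors' code: it uses an Euler-Maruyama discretization, dimensionless illustrative units (prior variance 1, $\sigma_M = 1$), and a constant drive standing in for $\gamma J h(t)$. All names and parameter values are assumptions chosen for illustration. It checks two properties stated above: the ensemble mean-squared error of the estimate follows the filter variance, and the normalized innovation has unit power per unit time.

```python
import math
import random


def simulate_filter(n_traj=1000, steps=200, dt=0.02, v0=1.0, sigma_m=1.0,
                    drive=0.5, seed=0):
    """Simulate dz = drive*dt, dxi = z*dt + sqrt(sigma_m)*dWbar and its Kalman filter.

    Returns (ensemble mean-squared estimation error at the final time,
             filter variance at the final time,
             mean of innovation**2 / (sigma_m * dt)).
    """
    rng = random.Random(seed)
    sq_err_sum = 0.0
    innov_sq_sum = 0.0
    p = v0
    for _ in range(n_traj):
        z = rng.gauss(0.0, math.sqrt(v0))  # hidden variable drawn from the prior
        z_hat = 0.0                        # prior mean
        p = v0                             # prior variance
        for _ in range(steps):
            d_wbar = rng.gauss(0.0, math.sqrt(dt))
            d_xi = z * dt + math.sqrt(sigma_m) * d_wbar   # observed increment
            innovation = d_xi - z_hat * dt
            innov_sq_sum += innovation ** 2 / (sigma_m * dt)
            z_hat += drive * dt + (p / sigma_m) * innovation  # estimate update
            p -= (p * p / sigma_m) * dt                       # variance update
            z += drive * dt                                   # noiseless dynamics
        sq_err_sum += (z - z_hat) ** 2
    return sq_err_sum / n_traj, p, innov_sq_sum / (n_traj * steps)


mse, p_final, innovation_power = simulate_filter()
print(f"ensemble MSE = {mse:.3f}, filter variance = {p_final:.3f}")
print(f"normalized innovation power = {innovation_power:.3f}")
```

With these settings the final time is 4 in units where $\sigma_M/\sigma_{zR}(0) = 1$, so the continuous-time variance would be $1/(1+4) = 0.2$; the Euler update reproduces this up to a small discretization error.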

Given this insight, we see that our original magnetometry problem can equivalently be viewed in a way that corresponds to the middle diagram of Fig. 7. In this version, we posit the existence of a hidden transducer that imprints statistical information about the magnetic field $h(t)$ onto a signal $d\xi(t)$. A Kalman filter receives this signal, and from it computes an estimate $\hat{z}(t)$ as well as an innovation process $dW(t)$. [Note that as indicated in the diagram, the Kalman filter will only function correctly if it "has knowledge of" the true magnetic field $h(t)$ in the way that a physical system would, but this is not an important point for what follows.] According to the model equations, the Kalman filter then emits the following signal to be received by our photodetector,

$$y(t)\,dt = \hat{z}(t)\,dt + \sqrt{\sigma_M}\,dW(t). \qquad (A5)$$

Note that $dW(t)$ now appears as an internal variable to the Kalman filter, computed from the input signal $d\xi(t)$ and the recursive estimate $\hat{z}(t)$, while the inherent randomness is referred back to $d\bar{W}(t)$. Although this may seem like an unnecessarily complicated story, it should be noted that the compound model with $z(t)$ and the Kalman filter predicts an identical transfer function from $h(t)$ to the experimentally observed signal $y(t)dt$ to that of the equations originally derived from the SME (top diagram in Fig. 7). Hence, for the purposes of analyzing and designing magnetometry schemes, these are equivalent models.

Combining several definitions above we find

$$y(t)\,dt = \hat{z}(t)\,dt + \sqrt{\sigma_M}\,\frac{[d\xi(t) - \hat{z}(t)\,dt]}{\sqrt{\sigma_M}} = d\xi(t). \qquad (A6)$$

It thus follows that in the compound model, the Kalman filter actually implements a trivial transfer function and can in fact be eliminated from the diagram. Doing this, we obtain the simplified representation in the bottom diagram of Fig. 7. Here the perspective is to pretend that the internal dynamics of the transducing physical system corresponds to the simplified model [Eq. (A3)], since we can do so without making any error in our description of the effect of $h(t)$ on the recorded signal. We thus conclude that for the purposes of open- or closed-loop estimations of $h(t)$, filters and controllers can in fact be designed, without loss of performance, using the simplified model [Eq. (A3)].

It is interesting to note that $z(t)$ can loosely be interpreted as a "classical value" of the spin projection $J_z$. Since the operator $J_z$ is a backaction evading observable, the continuous measurement we consider is quantum nondemolition and its backaction on the system state is minimal (conditioning without disturbance). Hence if $h(t) = 0$, we may think of the measurement process as gradually "collapsing" the quantum state of the spin system from an initial coherent state towards an eigenstate of $J_z$; the hidden variable $z(t)$ in the simplified model Eq. (A3) would then represent the eigenvalue corresponding to the ultimate eigenstate, and $\hat{z}(t) = \langle J_z\rangle(t)$ in the Kalman filter would be our converging estimate of it. (Again, this is as expected from the abstract perspective of quantum filtering theory for open quantum systems.) Conditional spin squeezing in this case can then be understood as nothing more than the reduction of our uncertainty as to the underlying value of $z$: as we acquire information about $z$ through observation of $d\xi(t) = y(t)dt$, our uncertainty $\sigma_{zR}(t) \leftrightarrow \langle\Delta J_z^2\rangle(t)$ naturally decreases below its initial coherent-state value of $J/2$. Still, the quantum-mechanical nature of the spin system is not without consequence, as it is known that continuous QND measurement produces entanglement among the spins in the ensemble [12].

It seems worth commenting on the fact that Eq. (A3) clearly predicts stationary statistics for the photocurrent $y(t)dt$, whereas Eq. (A1) contains a time-dependent diffusion coefficient that might color the statistics of $y(t)dt = \langle J_z\rangle(t)dt + \sqrt{\sigma_M}\,dW(t)$. In fact there is no discrepancy. It is possible [24] to derive the second-order time-correlation function of the observed signal $y(t)dt$ directly from the stochastic master Eq. (2),

$$\langle y(t)y(t+\tau)\rangle = [\langle J_z(t)J_z(t+\tau)\rangle + \langle J_z(t+\tau)J_z(t)\rangle]/2 + \frac{1}{4\eta M}\,\delta(\tau).$$

(This result could also be obtained from the standard input-output theory of quantum optics.) Since the master equation results in linear equations for the mean values $\langle J_x(t)\rangle$ and $\langle J_z(t)\rangle$ the quantum regression theorem [27] allows the correlation functions $\langle J_z(t)J_z(t+\tau)\rangle$ and $\langle J_z(t+\tau)J_z(t)\rangle$ to be calculated explicitly. In this paper we are most interested in the early time evolution for which we obtain the expressions

$$\langle y(t)\rangle = \langle J_z(t)\rangle = \gamma b J t + O(t^2),$$

$$\langle y(0)y(\tau)\rangle - \langle y(0)\rangle\langle y(\tau)\rangle = \frac{1}{4\eta M}\,\delta(\tau) + \langle\Delta J_z^2\rangle(0) + O(\tau^2).$$

These correlation functions correspond to a white-noise signal which is a linear ramp with gradient $\gamma b J$ with a random offset of variance $\langle\Delta J_z^2\rangle(0)$, in perfect agreement with our simplified model Eq. (A3). If the statistics of $y(t)$ were Gaussian these first- and second-order moments would be enough to characterize the signal completely, and indeed for sufficiently large $J$ the problem does become effectively Gaussian.

As a final comment we note that the essential step in the above discussion is to observe that the equations for the first- and second-order moments of the quantum state derived from the stochastic master equation correspond to a Kalman filter for some classical model of a noisy measurement. This correspondence holds for the stochastic master equation corresponding to an arbitrary linear quantum mechanical system with continuous measurement of observables that are linear combinations of the canonical variables [20]. In the general case of measurements that are not QND the equivalent classical model will have noise-driven dynamical equations as well as noise on the measured signal. The noise processes driving the dynamics and the measured signal may also be correlated. The case of position measurement of a harmonic oscillator shows all of these features [28].

APPENDIX B: RICCATI EQUATION SOLUTION METHOD

The matrix Riccati equation is ubiquitous in optimal control. Here, following Ref. [29], we show how to reduce the nonlinear problem to a set of linear differential equations. Consider the generic Riccati equation

$$\frac{d\mathbf{V}(t)}{dt} = \mathbf{C} - \mathbf{D}\mathbf{V}(t) - \mathbf{V}(t)\mathbf{A} - \mathbf{V}(t)\mathbf{B}\mathbf{V}(t).$$

We propose the decomposition

$$\mathbf{V}(t) = \mathbf{W}(t)\mathbf{U}^{-1}(t)$$

with the linear dynamics

$$\begin{bmatrix} d\mathbf{W}(t)/dt \\ d\mathbf{U}(t)/dt \end{bmatrix} = \begin{bmatrix} -\mathbf{D} & \mathbf{C} \\ \mathbf{B} & \mathbf{A} \end{bmatrix} \begin{bmatrix} \mathbf{W}(t) \\ \mathbf{U}(t) \end{bmatrix}.$$

It is straightforward to then show that this linearized solution is equivalent to the Riccati equation

$$\frac{d\mathbf{V}(t)}{dt} = \frac{d\mathbf{W}(t)}{dt}\mathbf{U}^{-1}(t) + \mathbf{W}(t)\frac{d\mathbf{U}^{-1}(t)}{dt}$$

$$= [-\mathbf{D}\mathbf{W}(t) + \mathbf{C}\mathbf{U}(t)]\mathbf{U}^{-1}(t) - \mathbf{W}(t)\mathbf{U}^{-1}(t)[\mathbf{B}\mathbf{W}(t) + \mathbf{A}\mathbf{U}(t)]\mathbf{U}^{-1}(t)$$

$$= \mathbf{C} - \mathbf{D}\mathbf{V}(t) - \mathbf{V}(t)\mathbf{A} - \mathbf{V}(t)\mathbf{B}\mathbf{V}(t),$$

where we have used the identity

$$\frac{d\mathbf{U}^{-1}(t)}{dt} = -\mathbf{U}^{-1}(t)\frac{d\mathbf{U}(t)}{dt}\mathbf{U}^{-1}(t).$$

Thus the proposed solution works and the problem can be solved with a linear set of differential equations.

FIG. 8. Spin control system with plant transfer function $P(s) = \gamma J/s$. $r(t)$ is the reference signal, which is usually zero. $e(t)$ is the error signal. $u(t)$ is the controller output. $b(t)$ is the external field to be tracked. $h(t) = b(t) + u(t)$ is the total field. $\tilde{b}(t)$ is the field estimate.

APPENDIX C: ROBUST CONTROL IN FREQUENCY SPACE

Here we apply traditional frequency-space robust control methods [30,31] to the classical version of our system. This analysis is different from the treatment in the body of the paper in several respects. First, we assume nothing about the noise sources (bandwidth, strength, etc.). Also, this approach is meant for steady-state situations, with the resulting estimator-controller being a constant gain transfer function. The performance criterion we present here is only loosely related to the more complete estimation description above. Despite these differences, this analysis gives a very similar design procedure for the steady-state situation.

We proceed as follows with the control system shown in Fig. 8, where we label $h(t) = u(t) + b(t)$ as the total field. Consider the usual spin system but ignore noise sources and assume we can measure $z(t)$ directly, so that $z(t) = y(t)$. For small angles of rotation, the transfer function from $h(t)$ to $y(t)$ is an integrator

$$\frac{dy(t)}{dt} = \frac{dz(t)}{dt} = \gamma J h(t),$$

$$s\,y(s) = \gamma J h(s),$$

$$y(s) = P(s)h(s),$$

$$P(s) = \gamma J/s.$$

Now we define the performance criterion. First notice that the transfer function from the field to be measured $b(t)$ to the total field $h(t)$ is $S(s)$ where

$$h(s) = S(s)b(s),$$

$$S(s) = \frac{1}{1 + P(s)C(s)}.$$

[Also notice that this represents the transfer function from the reference to the error signal $e(s) = S(s)r(s)$.] Because our field estimate will be $\tilde{b}(t) = -u(t)$, we desire $h(t)$ to be significantly suppressed. Thus we would like $S(s)$ to be small in magnitude [controller gain $|C(s)|$ large] in the frequency range of interest. However, because the gain $|C(s)|$ must physically decrease to zero at high frequencies we must close the feedback loop with adequate phase margin to keep the closed-loop system stable. This is what makes the design of $C(s)$ nontrivial.
Proceeding, we now define a function $W_1(s)$ which represents the degree of suppression we desire at the frequency $s = j\omega$. So our controller $C(s)$ should satisfy the following performance criterion:

$$\|W_1(s)S(s)\|_\infty < 1.$$

Thus the larger $W_1(s)$ becomes, the more precision we desire at the frequency $s$. We choose the following performance function:

$$W_1(s) = \frac{W_{10}}{1 + s/\omega_1},$$

such that $\omega_1$ is the frequency below which we desire suppression $1/W_{10}$.

Because our knowledge of $J$ is imperfect, we need to consider all plant transfer functions in the range

$$P = \frac{\gamma}{s}\{J_{\min} \to J_{\max}\}.$$

Our goal is now to find a $C(s)$ that can satisfy the performance condition for any plant in this family. We choose our nominal controller as

$$C_0(s) = \frac{\omega_C}{\gamma J'}.$$

So if $J = J'$ then the system closes at $\omega_C$ [i.e., $|P(i\omega_C)C_0(i\omega_C)| = 1$], whereas in general the system will close at $\omega_{CR} = \omega_C(J/J')$. We choose this controller because $P(s)C(s)$ should be an integrator ($\propto 1/s$) near the closing frequency for optimal phase margin and closed-loop stability.

Next we insert this solution into the performance condition. We make the simplifying assumption $\omega_1 \ll \omega_C(J/J')$ (we will check this later to be self-consistent). Then the optimum of the function is obvious and the condition of Eq. (C1) becomes

$$\omega_1 W_{10} < \omega_{CR} = \omega_C\frac{J}{J'}.$$

We want this condition to be satisfied for all possible spin numbers, so we must have

$$\omega_1 W_{10} = \min[\omega_{CR}] = \omega_C\frac{J_{\min}}{J'}. \qquad (C1)$$

Experimentally, we are forced to roll off the controller at some high frequency that we shall call $\omega_Q$. Electronics can only be so fast. Of course, we never want to close above this frequency because the phase margin would become too small, so this determines the maximum $J$ that the controller can reliably handle

$$\omega_Q = \max[\omega_{CR}] = \omega_C\frac{J_{\max}}{J'}. \qquad (C2)$$

Combining Eqs. (C1) and (C2) we find our fundamental tradeoff

$$W_{10}\,\omega_1 = \omega_Q\frac{J_{\min}}{J_{\max}}, \qquad (C3)$$

which is the basic result of this section. Given experimental constraints (such as $J_{\min}$, $J_{\max}$, and $\omega_Q$), it tells us what performance to expect ($1/W_{10}$ suppression) below a chosen frequency $\omega_1$.

From Eq. (C3), we recognize that the controller gain at the closing frequency needs to be

$$|C|_c = \frac{\omega_C}{\gamma J'} = \frac{\omega_1 W_{10}}{\gamma J_{\min}} = \frac{\omega_Q}{\gamma J_{\max}}.$$

In the final analysis, we do not need to use $J'$ and $\omega_C$ to parametrize the controller, only the tradeoff and the gain. Also, notice that now we can express $\min[\omega_{CR}] = \omega_1 W_{10}$.

To check our previous assumption

$$\omega_1 \ll \omega_{CR} = \omega_C\frac{J}{J'} = \omega_1 W_{10}\frac{J}{J_{\min}},$$

which is true if $W_{10} \gg 1$.

Finally, the system will never close below the frequency $\min[\omega_{CR}]$ so we should increase the gain below a frequency $\omega_H$ which we might as well set equal to $\min[\omega_{CR}]$. This improves the performance above and beyond the criterion above. Of course we will be forced to level off the gain at some even lower frequency $\omega_L$ because infinite dc gain (a real integrator) is unreasonable. So the final controller can be expressed as

$$C(s) = |C|_c\,\frac{1}{1 + s/\omega_Q}\,\frac{\omega_H(1 + s/\omega_H)}{\omega_L(1 + s/\omega_L)}$$

with the frequencies obeying the order

$$\omega_L < \omega_H = \min[\omega_{CR}] = \omega_1 W_{10} < \omega_{CR} = \frac{J}{J_{\min}}\,\omega_1 W_{10} < \omega_Q = \max[\omega_{CR}] = \frac{J_{\max}}{J_{\min}}\,\omega_1 W_{10}.$$

Notice that the controller now looks like the steady-state transfer function in Fig. 3 derived from the steady state of the full dynamic filter. (The notation is the same to make this correspondence clear.) Here $\omega_Q$ was simply stated, whereas there it was a function of $\lambda$ that went to infinity as $\lambda \to \infty$. Here the high gain due to $\omega_L$ and $\omega_H$ was added manually, whereas before it came from the design procedure directly.
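The tradeoff of Eq. (C3) can be illustrated with a short numerical sketch. It is a simplified check under stated assumptions, not the authors' design code: it uses only the constant-gain part of the controller (no roll-off at $\omega_Q$ and no low-frequency boost), and the numbers ($\gamma = 10^6$, $J_{\min} = 10^6$, $J_{\max} = 10^7$, $\omega_Q = 10^6$, $W_{10} = 100$) are hypothetical values chosen for illustration. The function names are likewise hypothetical.

```python
def design_controller(j_min, j_max, omega_q, w10, gamma):
    """Apply the tradeoff of Eq. (C3) and the gain relation that follows it."""
    omega_1 = omega_q * j_min / (j_max * w10)  # omega_1 * W10 = omega_Q * J_min / J_max
    gain = omega_q / (gamma * j_max)           # |C|_c = omega_Q / (gamma * J_max)
    return omega_1, gain


def weighted_sensitivity_peak(j, gain, omega_1, w10, gamma, n_points=2000):
    """Largest |W1(i w) S(i w)| on a logarithmic grid for a constant-gain controller."""
    peak = 0.0
    for k in range(n_points):
        # Grid spans ten decades, from omega_1 * 1e-3 to omega_1 * 1e7.
        w = omega_1 * 10.0 ** (-3.0 + 10.0 * k / (n_points - 1))
        s = 1j * w
        plant = gamma * j / s                      # P(s) = gamma * J / s
        sensitivity = 1.0 / (1.0 + plant * gain)   # S(s) = 1 / (1 + P C)
        weight = w10 / (1.0 + s / omega_1)         # W1(s) = W10 / (1 + s / omega_1)
        peak = max(peak, abs(weight * sensitivity))
    return peak


GAMMA = 1.0e6
J_MIN, J_MAX = 1.0e6, 1.0e7
OMEGA_Q = 1.0e6
W10 = 100.0

omega_1, gain = design_controller(J_MIN, J_MAX, OMEGA_Q, W10, GAMMA)
peaks = {j: weighted_sensitivity_peak(j, gain, omega_1, W10, GAMMA)
         for j in (J_MIN, 3.0e6, J_MAX)}

print(f"omega_1 = {omega_1:.3g}, |C|_c = {gain:.3g}")
for j, peak in peaks.items():
    print(f"J = {j:.1e}: max |W1 S| = {peak:.3f}")
```

In this simplified setting the criterion is tightest at $J = J_{\min}$, where the peak sits just below 1, and becomes progressively easier to satisfy as $J$ increases toward $J_{\max}$.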
Cooled, low-loss nanomechanical resonators offer the prospect of directly observing the quantum dynamics of mesoscopic systems. However, the present state of the art requires cooling down to the milliKelvin regime in order to observe quantum effects. Here we present an active feedback strategy based on continuous observation of the resonator position for the purpose of obtaining these low temperatures. In addition, we apply this to an experimentally realizable configuration, where the position monitoring is carried out by a single-electron transistor. Our estimates indicate that with current technology this technique is likely to bring the required low temperatures within reach.

DOI: 10.1103/PhysRevB.68.235328  PACS number(s): 85.85.+j, 85.35.Gv, 03.65.Ta, 45.80.+r

I.  INTRODUCTION 

Nanomechanical resonators are now being built with quality factors in the range of $10^4$, and resonance frequencies of up to several hundred MHz.$^{1}$ The ground state energy of these devices can correspond to temperatures in the milliKelvin range. As a result, the observation of quantum behavior in these devices is becoming a real possibility.$^{2}$ To detect such behavior, the resonator must be sufficiently cold; since a quantum harmonic oscillator driven by thermal noise behaves as a classical oscillator driven by thermal noise, one must ensure that the signatures of quantum effects are not swamped by the thermal behavior. The approach taken so far to achieve low temperatures is to place the resonator in a refrigerator. However, cooling very small devices in this way is inherently inefficient in that the system becomes weakly coupled to the thermal bath. Here we explore the possibility of using feedback control to effect an "active" cooling of the resonator, in order to cool below the possible limits set by the "passive" refrigeration technique.
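To give a feel for the temperature scale involved, the following sketch converts a resonance frequency into the temperature $hf/k_B$ at which the thermal energy equals one vibrational quantum, and evaluates the Bose-Einstein mean occupation at a given bath temperature. The specific frequencies (100 MHz and 500 MHz) and bath temperatures (4.2 K and 1 mK) are illustrative assumptions, and using $hf/k_B$ as the relevant scale is a convention adopted here rather than a definition taken from the text.

```python
import math

PLANCK_H = 6.62607015e-34    # Planck constant in J s (exact SI value)
BOLTZMANN_K = 1.380649e-23   # Boltzmann constant in J/K (exact SI value)


def quantum_temperature(freq_hz):
    """Temperature h*f/k_B at which k_B*T equals one vibrational quantum."""
    return PLANCK_H * freq_hz / BOLTZMANN_K


def thermal_occupation(freq_hz, temp_k):
    """Bose-Einstein mean phonon number of a mode at frequency f and temperature T."""
    return 1.0 / math.expm1(PLANCK_H * freq_hz / (BOLTZMANN_K * temp_k))


for freq in (100e6, 500e6):  # illustrative resonance frequencies in Hz
    t_q = quantum_temperature(freq)
    print(f"f = {freq / 1e6:.0f} MHz: h f / k_B = {t_q * 1e3:.2f} mK, "
          f"n(4.2 K) = {thermal_occupation(freq, 4.2):.0f}, "
          f"n(1 mK) = {thermal_occupation(freq, 1e-3):.3g}")
```

For a 100 MHz mode the scale $hf/k_B$ is a few milliKelvin, so a resonator at liquid-helium temperature holds hundreds of thermal quanta, whereas at 1 mK the mean occupation falls well below one.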

To perform such feedback cooling the resonator must be monitored, and the result fed back in real time to affect the dynamics. A practical method of performing a continuous measurement of the position of the resonator is to use a single-electron transistor (SET).$^{3-5}$ To measure the position of the resonator one locates the central island of the SET next to the resonator. When the resonator is charged, and the SET is biased so that current flows through it, changes in the resonator's position alter the potential on the central island, which in turn changes the current. The current therefore provides a continuous measurement of the position of the resonator, and this is just what is required for implementing a linear feedback cooling algorithm.$^{6,7}$ A feedback force can be applied by applying a voltage to a gate capacitively coupled to the resonator, and adjusting the voltage so as to damp the resonator (see Fig. 1), or by passing a variable current through the oscillator in the presence of a fixed external magnetic field. We will analyze the first system, although the results should apply to the second as well. In our analysis we will use the theory of the dc-SET. While an experiment would most likely use a radio-frequency SET,$^{8,20}$ the characteristic frequency of a SET is typically of the order of 10 GHz, so that the rf drive looks constant to the SET, and the dc-SET equations can be used.

We will use a quantum mechanical model of the measurement and feedback process, but discuss how, in this case, such a description is equivalent to a classical measurement of a noisy classical system. Thus, this paper is intended for both experimentalists familiar with classical descriptions of noise in systems as well as quantum measurement theorists.

Rather than performing a microscopic analysis of the measurement process in terms of the interaction of the SET and the resonator, we start by introducing equations which describe the continuous observation of a quantum observable, and show how this includes the shot noise and back-action, these being the key sources of noise in a continuous quantum measurement. This description can then be tailored to the case of a measurement with a SET by choosing the parameters so that the noise sources match those calculated in microscopic noise analyses which have been performed for the SET.$^{3,8}$

FIG. 1. A schematic of the resonator, measuring, and feedback apparatus. As the resonator moves closer to the SET, the current flowing through the SET changes, and that information is then used to generate a feedback voltage applied to an actuating gate.

A treatment of the continuous quantum measurement of a two-state system using a SET has been carried out by Korotkov,9 using what might be referred to as a partially microscopic approach. The equations we use here may be derived by replacing the two-state observable in those equations by the resonator position.10 A full analysis, along the lines of those performed for quantum optical systems,11,12 can also be expected to produce the same equations under reasonable approximations. The form of these equations is determined by how information is obtained, and not by the specific implementation, which explains why the form of the equations is similar in optical position measurements and position measurement using SETs. If the measurement is of a physical observable, and the resulting error about the expectation value of that observable in a short time interval $\Delta t$ is Gaussian, then the most straightforward implementation of that measurement process has the form used here.

In Sec. II we introduce the equations that describe a continuous measurement process, derive the form of the resulting noise, and give the equivalent classical model. In Sec. III we discuss how this model can be applied to position measurement using a SET, and compare our formulas to those derived using a semiclassical treatment of the SET (Refs. 3 and 8) in order to express our results in terms of experimental parameters. In Sec. IV we discuss the implementation of a feedback algorithm and calculate the minimum achievable temperature in terms of physical parameters. We then calculate estimates of realistic achievable temperatures for an experimentally realizable sample system in Sec. V, and finally conclude with a summary of the results obtained.


II.  CONTINUOUS  QUANTUM  MEASUREMENT 
OF  POSITION 

Given a quantum system whose state is specified by the density matrix $\rho$, and whose evolution is determined by the Hamiltonian $H$, a continuous measurement of the observable $O$ of that system, which provides the continuous output results (measurement record),

$$dr = \langle O\rangle\,dt + \frac{1}{\sqrt{8k}}\,dW, \tag{1}$$

induces the following evolution of the system:6,13,14

$$d\rho = -\frac{i}{\hbar}[H,\rho]\,dt - k[O,[O,\rho]]\,dt + \sqrt{2k}\,\bigl(O\rho + \rho O - 2\langle O\rangle\rho\bigr)\,dW. \tag{2}$$

Here $k$ is proportional to the measurement strength, and $dW$ is a Wiener process. The noise contained in the measurement record is a necessary result of the fact that only a finite amount of information is obtained regarding the observable $O$ in a finite time. This direct noise on the record is called the shot noise. However, this is not the only noise resulting from the measurement process. As a result of Heisenberg's uncertainty relation, information about one observable makes other observables less certain. Due to the dynamics, the uncertainty (noise) in these observables can feed into the observable being measured. This source of noise is referred to as back-action. If the Hamiltonian is such that the increased uncertainty is not fed back into the observable being measured, then the measurement is referred to as "back-action evading."

Now let us examine the case of a position measurement on a harmonic oscillator. To do this, we set $O = x$, and the Hamiltonian becomes

$$H = \frac{p^2}{2m} + \frac{1}{2}m\omega_0^2x^2, \tag{3}$$

where $m$ is the mass of the particle, $\omega_0$ is the (angular) frequency of the oscillation, and $x$ and $p$ are the position and momentum operators, respectively. To make our model sufficiently realistic, we need to include two more sources of noise: the first is the intrinsic thermal noise of the harmonic oscillator, and the second is the possibility that the oscillator may be driven by white noise over and above that required by Heisenberg's uncertainty principle (excess "technical noise").

The second of these is easily included by adding a term $-\beta[x,[x,\rho]]$ to the equation of motion of $\rho$; this describes a noise term identical to the one caused by the back-action, but without the corresponding dynamics of $\rho$ associated with obtaining a measurement result which causes the back-action. It is equivalent to adding a term linear in $x$ to Hamiltonian (3) multiplied by white noise.

The inclusion of thermal fluctuations is only a little more involved, and can be achieved by coupling the oscillator to a thermal bath. In our case the effect of the thermal bath may be included by adding the "standard Brownian motion master equation" (SBMME) (Ref. 15) to our equation of motion for $\rho$:

$$d\rho = -\frac{i}{\hbar}[H,\rho]\,dt - \frac{i\Gamma}{2\hbar}[x,\{p,\rho\}_+]\,dt - \left(k + \beta + \frac{m\omega_0\Gamma}{2\hbar}\coth\frac{\hbar\omega_0}{2k_BT}\right)[x,[x,\rho]]\,dt + \sqrt{2k}\,\bigl(x\rho + \rho x - 2\langle x\rangle\rho\bigr)\,dW, \tag{4}$$


where $\Gamma = \omega_0/Q$, $Q$ being the quality factor of the resonator. The two terms proportional to $\Gamma$ are due to the inclusion of the SBMME, the first representing dissipation due to the reservoir while the second is a diffusion term due to environmental fluctuations. Here we are using an approximate form of the SBMME appropriate for the weak coupling regime (small $\Gamma$, large $Q$) but covering all ranges of temperatures.16 Since the nanomechanical resonators we consider all have large values of $Q$, the weak coupling requirement is easily satisfied. The temperature dependence of the diffusion coefficient is given by $\coth(\hbar\omega_0/2k_BT)$, so that the diffusion does not vanish as $k_BT \to 0$: this correctly accounts for the existence of quantum vacuum fluctuations which exist even at zero temperature. In the absence of a rigorous characterization of the dissipation channels of nanomechanical systems there is as yet no need to include a more sophisticated description of SBMME environmental effects.17 Phenomenological corrections to the SBMME such as the temperature dependence of $\Gamma$ can be added if needed, but these are not significant effects in the high-$Q$ regime.
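The behaviour of the thermal diffusion coefficient can be checked numerically. The following is a minimal sketch, assuming the coefficient has the form $(m\omega_0\Gamma/2\hbar)\coth(\hbar\omega_0/2k_BT)$ with $\Gamma = \omega_0/Q$; the function name is hypothetical, and the inputs are the nominal example values quoted later in the paper (a 12 MHz resonator of mass roughly $10^{-16}$ kg with $Q = 10^4$ at 100 mK), used here only for illustration.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J s
K_B = 1.380649e-23      # Boltzmann constant, J/K


def thermal_diffusion_rate(mass, omega0, quality, temperature):
    """Return (m*omega0*Gamma/(2*hbar)) * coth(hbar*omega0/(2*k_B*T)) in m^-2 s^-1.

    Hypothetical helper: Gamma = omega0/Q is the weak-coupling damping rate.
    """
    gamma = omega0 / quality
    prefactor = mass * omega0 * gamma / (2.0 * HBAR)
    if temperature <= 0.0:
        return prefactor  # coth -> 1 as T -> 0: vacuum fluctuations remain
    x = HBAR * omega0 / (2.0 * K_B * temperature)
    return prefactor / math.tanh(x)  # coth(x) = 1 / tanh(x)


omega0 = 2.0 * math.pi * 12e6  # assumed f0 = 12 MHz
rate_100mK = thermal_diffusion_rate(1e-16, omega0, 1e4, 0.1)
rate_zero = thermal_diffusion_rate(1e-16, omega0, 1e4, 0.0)
print(f"T = 100 mK: {rate_100mK:.3e} m^-2 s^-1")
print(f"T -> 0    : {rate_zero:.3e} m^-2 s^-1")
```

With these inputs the zero-temperature value is smaller than the 100 mK value by a factor of a few hundred, but it does not vanish.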

We also need to include in our model the possibility that there is noise driving the oscillator which is correlated with the noise on the measurement record (the shot noise). This can happen if the noisy behavior of the oscillator explicitly causes some of the noise in the measurement apparatus, or vice versa. In this situation, the measurement record contains more information about the oscillator position, so when it comes to adding feedback, we are able to cool the oscillator further than would otherwise be expected. In Eq. (2) the noise driving the oscillator is purely the quantum back-action. It may appear from Eqs. (1) and (2) that the quantum back-action is correlated with the shot noise due to the fact that the same noise term ($dW$) appears in both equations. However, this is not the case. The term proportional to $dW$ which appears in the equation for $\rho$ describes the random way in which the measurement changes the observer's state of knowledge about the system. Thus, on average, this noise term decreases the entropy of $\rho$. The back-action noise, which is driving the oscillator and consequently increasing the entropy of $\rho$, is described by the term proportional to $k$. The quantum back-action is, in fact, completely uncorrelated with the shot noise.

To drive the oscillator with a random force, one applies the Hamiltonian $\hbar\xi(t)x$, where $\xi(t)$ is the magnitude of the random force. We can choose $\xi(t)$ to be correlated with the shot noise, with the correlation coefficient $\kappa$, by setting

$$d\xi = \sqrt{2\alpha}\,\bigl(\sqrt{\kappa}\,dW + \sqrt{1-\kappa}\,dV\bigr), \tag{5}$$

where $dV$ is a Wiener noise uncorrelated with $dW$. The resulting spectral density of $\xi(t)$ is $\alpha$, so that $\langle\xi(t)\xi(t')\rangle = \alpha\,\delta(t-t')$. The Stratonovich equation which describes the driving by $\xi(t)$ is

$$|\dot\psi\rangle = -i\xi(t)\,x|\psi\rangle, \tag{6}$$

and converting this to an Ito equation gives

$$d|\psi\rangle = -i\sqrt{2\alpha}\,x|\psi\rangle\,d\xi - \alpha x^2|\psi\rangle\,dt. \tag{7}$$
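The construction of a driving noise that is partially correlated with the shot noise can be sketched numerically. The example below assumes Eq. (5) reads $d\xi = \sqrt{2\alpha}(\sqrt{\kappa}\,dW + \sqrt{1-\kappa}\,dV)$ with independent Wiener increments $dW$ and $dV$; the function names and the numerical values of $\alpha$, $\kappa$, and $dt$ are illustrative assumptions. Under this reading $\kappa$ is the fraction of the variance of $d\xi$ that is shared with $dW$, so the sample Pearson correlation between the two comes out near $\sqrt{\kappa}$.

```python
import math
import random


def correlated_increments(alpha, kappa, dt, n, seed=1):
    """Draw n pairs (dW, dxi) with dxi = sqrt(2*alpha)*(sqrt(kappa)*dW + sqrt(1-kappa)*dV)."""
    rng = random.Random(seed)
    sigma = math.sqrt(dt)  # a Wiener increment over dt has variance dt
    dw_list, dxi_list = [], []
    for _ in range(n):
        dw = rng.gauss(0.0, sigma)  # shot-noise increment, available to the observer
        dv = rng.gauss(0.0, sigma)  # independent increment, hidden from the observer
        dw_list.append(dw)
        dxi_list.append(
            math.sqrt(2.0 * alpha)
            * (math.sqrt(kappa) * dw + math.sqrt(1.0 - kappa) * dv)
        )
    return dw_list, dxi_list


def pearson(a, b):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / n
    var_a = sum((x - mean_a) ** 2 for x in a) / n
    var_b = sum((y - mean_b) ** 2 for y in b) / n
    return cov / math.sqrt(var_a * var_b)


ALPHA, KAPPA, DT = 1.0, 0.638, 1e-3  # illustrative values only
dW, dxi = correlated_increments(ALPHA, KAPPA, DT, 200_000)
variance_ratio = (sum(v * v for v in dxi) / len(dxi)) / (2.0 * ALPHA * DT)
correlation = pearson(dW, dxi)
print("variance / (2*alpha*dt):", round(variance_ratio, 3))
print("corr(dxi, dW):", round(correlation, 3), "sqrt(kappa):", round(math.sqrt(KAPPA), 3))
```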


Converting the Ito equation further to an equation for $\rho$ one obtains

$$d\rho = -\alpha[x,[x,\rho]]\,dt - i\sqrt{2\alpha}\,[x,\rho]\,d\xi. \tag{8}$$

Since the observer has access to $dW$, but not to $dV$, she must average over $dV$, and this gives

$$d\rho = -\alpha[x,[x,\rho]]\,dt - i\sqrt{2\kappa\alpha}\,[x,\rho]\,dW. \tag{9}$$

If we allow part of the excess noise given by $\beta$ in our model to be due to driving by the shot noise $dW$ (that is, this noise is correlated with the shot noise $dW$ with correlation coefficient $\kappa$) then the equation of motion for the system becomes

$$d\rho = -\frac{i}{\hbar}[H,\rho]\,dt - \frac{i\Gamma}{2\hbar}[x,\{p,\rho\}_+]\,dt - \left(k + \beta + \frac{m\omega_0\Gamma}{2\hbar}\coth\frac{\hbar\omega_0}{2k_BT}\right)[x,[x,\rho]]\,dt - i\sqrt{2\kappa\beta}\,[x,\rho]\,dW + \sqrt{2k}\,\bigl(x\rho + \rho x - 2\langle x\rangle\rho\bigr)\,dW. \tag{10}$$

This completes our quantum mechanical description of a resonator under continuous observation.

Now that we have an equation that includes all the relevant noise terms, the noise spectrum of the measurement record can be obtained:

$$S(\omega) = \frac{1}{8k} + \left(k + \beta + \frac{m\omega_0\Gamma}{2\hbar}\coth\frac{\hbar\omega_0}{2k_BT}\right)\frac{2(\hbar/m)^2}{\Gamma^2\omega^2 + (\omega^2 - \omega_0^2)^2}. \tag{11}$$

The first term is the shot noise, which is white, the term proportional to $k$ is the quantum back-action, the term proportional to $\Gamma$ is the effect of the noise from the resonator's thermal environment, and the term proportional to $\beta$ gives any excess noise over and above the necessary quantum back-action. Note that the last three terms all have the same form as a function of $\omega$. This is because they are all white noises filtered through the harmonic oscillator spectral response function.

While our treatment so far has been fully quantum mechanical, it is worth noting that a purely classical model of a measured, damped oscillator will completely reproduce the dynamics of this measured quantum system, no matter how cold the resonator, so long as the initial density matrix is Gaussian in $x$ and $p$.6 Thus, one can understand the behavior of the oscillator in terms of classical noise and a classical measurement process. The equations of motion for the position $x_c$ and momentum $p_c$ of this equivalent classical oscillator are

$$dx_c = \frac{1}{m}\,p_c\,dt, \tag{12}$$


$$dp_c = -m\omega_0^2x_c\,dt - \Gamma p_c\,dt + \hbar\sqrt{2k}\,dY_c + \hbar\sqrt{2\beta}\,dV_c + \sqrt{m\hbar\omega_0\Gamma\coth\!\left(\frac{\hbar\omega_0}{2k_BT}\right)}\,dU_c, \tag{13}$$

where $dY_c$, $dV_c$, and $dU_c$ are each zero-mean Gaussian white noise, and mutually uncorrelated. The position of the oscillator is then observed by a continuous classical measurement, which generates the output record

$$dr_c = x_c\,dt + \frac{1}{\sqrt{8k}}\,dZ_c, \tag{14}$$

where $dZ_c$ is zero-mean Gaussian white noise, uncorrelated with $dY_c$. The noise term $dY_c$ is what is required in the classical model to correctly include the back-action of the quantum measurement process. It is now explicit that this noise is uncorrelated with the shot noise on the measurement, $dZ_c$.

In the classical case, the observer's state of knowledge about the oscillator is given by a joint probability density over $x_c$ and $p_c$. This probability density is the classical equivalent of the density matrix $\rho$. So long as the initial probability density is Gaussian, it remains Gaussian as time passes, and as a result the observer's full state of knowledge may be represented by merely five variables: the mean position and momentum, $\langle x_c\rangle$ and $\langle p_c\rangle$, and the variances and covariance, given by

$$\sigma_x^2 = \langle x_c^2\rangle - \langle x_c\rangle^2, \tag{15}$$

$$\sigma_p^2 = \langle p_c^2\rangle - \langle p_c\rangle^2, \tag{16}$$

$$\sigma_{xp}^2 = \langle x_cp_c\rangle - \langle x_c\rangle\langle p_c\rangle. \tag{17}$$

It is the means $\langle x_c\rangle$ and $\langle p_c\rangle$ (being the observer's best estimates of the value of $x_c$ and $p_c$) which are the classical equivalents of the quantum expectation values $\langle x\rangle$ and $\langle p\rangle$. It turns out that if one writes the classical measurement record as

$$dr_c = \langle x_c\rangle\,dt + \frac{1}{\sqrt{8k}}\,dW_c, \tag{18}$$

then $dW_c$ is zero-mean Gaussian white noise,7 uncorrelated with $dZ_c$. The classical model is then equivalent to the quantum model if we equate $dW_c$ with the quantum measurement noise, $dW$, and correlate $dV_c$ with $dW_c$, so that $\langle V_c(t)W_c(t')\rangle = \kappa\,\delta(t-t')$.
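The measurement-record spectrum of Eq. (11) is straightforward to evaluate numerically. The sketch below assumes the form $S(\omega) = 1/(8k) + (k + \beta + \text{thermal})\cdot 2(\hbar/m)^2/[\Gamma^2\omega^2 + (\omega^2-\omega_0^2)^2]$, where "thermal" abbreviates $(m\omega_0\Gamma/2\hbar)\coth(\hbar\omega_0/2k_BT)$. The function name is hypothetical and the numerical inputs are illustrative values of the order of those in the paper's example system; in particular the thermal coefficient of $9.25\times10^{31}\ \mathrm{m^{-2}\,s^{-1}}$ is an assumed value.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J s


def record_spectrum(omega, k, beta, thermal, mass, omega0, gamma):
    """Spectrum of the measurement record: white shot noise plus filtered force noise."""
    shot = 1.0 / (8.0 * k)  # frequency-independent shot-noise floor
    # Oscillator response that filters back-action, excess, and thermal noise alike.
    response = 2.0 * (HBAR / mass) ** 2 / (
        gamma ** 2 * omega ** 2 + (omega ** 2 - omega0 ** 2) ** 2
    )
    return shot + (k + beta + thermal) * response


# Illustrative inputs (assumptions, not fitted values).
beta = 1.01e31                 # excess back-action, m^-2 s^-1
k = 0.184 * beta               # measurement strength, m^-2 s^-1
thermal = 9.25e31              # assumed thermal coefficient, m^-2 s^-1
mass = 1e-16                   # kg
omega0 = 2.0 * math.pi * 12e6  # rad/s
gamma = omega0 / 1e4           # Gamma = omega0 / Q with Q = 1e4

shot_floor = 1.0 / (8.0 * k)
on_resonance = record_spectrum(omega0, k, beta, thermal, mass, omega0, gamma)
far_above = record_spectrum(100.0 * omega0, k, beta, thermal, mass, omega0, gamma)
print("peak / shot floor:", on_resonance / shot_floor)
print("far above resonance / shot floor:", far_above / shot_floor)
```

Far from resonance the record is dominated by the white shot-noise floor, while on resonance the three filtered force-noise contributions add and stand well above it.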

III.  CONTINUOUS  MEASUREMENT 
WITH  A  SINGLE-ELECTRON  TRANSISTOR 

Having obtained a model which is sufficiently general to encompass the dynamics of a resonator monitored by a SET, we need to express the theoretical parameters $k$, $\beta$, and $\kappa$ in terms of the actual experimental parameters of the SET. Since it is by measuring current through the SET that we measure the resonator position, it is the spectral density of this current which determines the shot noise of the measurement. The back-action from the measurement is due to the action of the SET on the resonator, which is the force that the resonator feels from the charge on the SET island. As a result the back-action noise $\beta$ can be calculated from the spectral density of the charge fluctuations on the SET island, and hence $\kappa$ is determined by the correlation between the current and the island charge fluctuations.

However, the dynamics of the SET are sufficiently complex that analytic results for these spectra have as yet only been obtained for certain parameter regimes. These calculations have been performed by Zhang and Blencowe,8 using previous results of Korotkov.3 The technique used is to approximate the dynamics of the electron tunneling on and off the SET island by a classical master equation. That is, the electrons are assumed to tunnel independently across each of the junctions, with certain rates (the rates being obtained using a perturbative quantum calculation). This ignores the possibility that electrons will tunnel coherently across both junctions simultaneously, a quantum effect referred to as cotunneling. This method may be referred to as a "semiclassical" model for the dynamics of the SET, and it is the model that we will use here.

It is important to note that the above semiclassical method for calculating the charge fluctuations does not include the quantum back-action noise. This can be seen from the following argument.18 In the classical treatment, since the fluctuating force on the resonator is due to the electrons jumping on and off the island, in principle the time history of this force can be known by detecting the electrons flowing in the circuit. In principle, then, the effect of the noise can be known, and if desired, undone. As a result it cannot include the quantum back-action, since this cannot, even in principle, be undone. Thus, the charge fluctuations calculated using the semiclassical SET model give the excess noise $\beta$ and the current shot noise gives $k$.

The quantum mechanical measurement model [Eqs. (1) and (2)] describes a valid quantum measurement for any value of $k$ and $\beta$. However, the classical model of the SET will only give an accurate description of the dynamics of the SET, and thus of the true values of $k$ and $\beta$, in certain parameter regimes. In fact, it is useful to note that the ratio $k/\beta$ provides a diagnostic tool for determining when the classical calculation breaks down; if $k/\beta \ll 1$ is not satisfied, then the classical calculation no longer provides a good estimate of the total force noise on the resonator. Thus it should be noted that if $k/\beta \gtrsim 1$, then the classical calculation cannot be relied upon. That is, it is possible in this case that the total noise on the resonator is significantly larger than our estimate $k + \beta$, due to quantum contributions not taken into account in the classical calculation.
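The mapping from SET noise spectra to the model parameters, together with the $k/\beta$ diagnostic, can be sketched as follows. This assumes the relations stated in this section as Eqs. (21) and (23), namely $1/(8k) = S_I/(dI_{ds}/dx)^2$ and $\beta = S_F/(2\hbar^2)$, both evaluated at zero frequency. The function names, the threshold of 0.1 (borrowed from the restriction used in the paper's plots), and all numerical inputs are hypothetical and chosen only to exercise the formulas.

```python
HBAR = 1.054571817e-34  # reduced Planck constant, J s


def measurement_strength(current_noise, dI_dx):
    """k from 1/(8k) = S_I / (dI/dx)^2, with S_I the zero-frequency current noise."""
    return dI_dx ** 2 / (8.0 * current_noise)


def excess_back_action(force_noise):
    """beta from beta = S_F / (2*hbar^2), with S_F the zero-frequency force noise."""
    return force_noise / (2.0 * HBAR ** 2)


def semiclassical_estimate_trusted(k, beta, threshold=0.1):
    """True when k/beta is small enough for the semiclassical noise estimate."""
    return k / beta <= threshold


# Hypothetical inputs, not taken from any measurement.
S_I = 1.0e-24                         # current noise, A^2 s
dI_dx = 4.0e3                         # current sensitivity, A/m
S_F = 2.0 * HBAR ** 2 * 1.01e31       # force noise chosen so that beta = 1.01e31

k = measurement_strength(S_I, dI_dx)
beta = excess_back_action(S_F)
ratio = k / beta
trusted = semiclassical_estimate_trusted(k, beta)
print(f"k = {k:.3e}, beta = {beta:.3e}, k/beta = {ratio:.3f}, trusted = {trusted}")
```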

We find that in the regions of best cooling, which we explore in the following, $k$ is not necessarily much smaller than $\beta$ (although near-optimal cooling can be obtained with $k \ll \beta$, and in particular we will give as an example results for $k = \beta/10$). Hence our calculations should be regarded as estimates of the performance of the feedback algorithm, rather than exact results. We note, however, that a more sophisticated analysis using the diagrammatic techniques developed by Schoeller and Schön19 might provide analytic, or semianalytic results for the parameter regime of most interest for quantum measurement and control, and therefore may provide a method for more accurate calculations.

The spectral densities given by the classical calculation are derived in the Appendix. Approximations which are used in the derivation are detailed there, and come primarily from Zhang and Blencowe.8 The noise spectrum of the displacement of the resonator due to the shot noise of the SET current is

$$S_x(\omega) = \frac{S_I(\omega)}{(dI_{ds}/dx)^2}, \tag{19}$$

where $S_I(\omega)$ is the spectral density of the shot noise, given in Eq. (A10), and $I_{ds}$ is the current through the SET, given in Eq. (A8). The dependence of the current on the displacement of the resonator comes from its dependence on the gate capacitance, which can be approximated by

$$C_g = C_g^0\left(1 - \frac{x}{d}\right). \tag{20}$$

The shot noise $S_I(\omega)$ is, to a very good approximation, frequency independent, as required by our quantum measurement model. Thus

$$\frac{1}{8k} = S_x\Big|_{\omega=0} = \left.\frac{S_I(\omega)}{(dI_{ds}/dx)^2}\right|_{\omega=0}. \tag{21}$$

The spectral density of the classical part of the displacement noise due to the fluctuating force on the resonator is

$$S_{Fx}(\omega) = \frac{S_F(\omega)/m^2}{\Gamma^2\omega^2 + (\omega^2 - \omega_0^2)^2}, \tag{22}$$

where $S_F(\omega)$ is the spectral density of the fluctuating force given in Eq. (A13). Since, once again, $S_F(\omega)$ is effectively frequency independent, we have

$$\beta = \left.\frac{S_F}{2\hbar^2}\right|_{\omega=0}. \tag{23}$$

The correlation coefficient, $\kappa$, between the shot noise and the excess back-action is therefore simply the correlation $C$ between $S_I$ and $S_F$, which is given in Eq. (A11).

IV. FEEDBACK CONTROL

We wish to cool the dynamics of the resonator by using the information obtained continuously about the state of the resonator to direct a time-dependent external force. Such a force may be applied, for example, by passing current through the resonator and immersing it in a magnetic field. It can also be applied by placing an actuating gate near the resonator, and varying the potential difference between the charged resonator and the actuating gate.

In this case the results of modern optimal control theory apply, since the dynamics of the resonator are equivalent to those of a classical oscillator driven by Gaussian noise, so long as we restrict ourselves to a linear external force.6,21 This allows us to obtain the optimal feedback algorithm in a straightforward manner. Choosing the minimization of the energy of the resonator as the feedback objective, it turns out that as long as the force we apply is sufficiently large, this force should be chosen to be6

$$F = -\gamma\,\bigl(m\omega_0\langle x\rangle + \langle p\rangle\bigr), \tag{24}$$

where $\gamma$ is a rate constant which determines the overall strength of the force. This equation gives an optimal performance so long as $\gamma \gg \omega_0$, which is within reach of current experiments, as detailed below.

To calculate the average energy of the controlled resonator, we first need the equations of motion for the means and covariances of $x$ and $p$ in the continually observed and controlled case. To derive these equations, we note that the equation of motion for $\rho$, under feedback, is given by Eq. (10), where one sets

$$H = \frac{p^2}{2m} + \frac{1}{2}m\omega_0^2x^2 - \gamma\,\bigl(m\omega_0\langle x\rangle + \langle p\rangle\bigr)\,x \tag{25}$$

to include the feedback force. Using the fact that $d\langle O\rangle = \mathrm{Tr}[O\,d\rho]$, and $dW^2 = dt$, one obtains, for the means,

$$d\langle x\rangle = \frac{\langle p\rangle}{m}\,dt + 2\sqrt{2k}\,\sigma_x^2\,dW, \tag{26}$$

$$d\langle p\rangle = -m\omega_0^2\langle x\rangle\,dt - \Gamma\langle p\rangle\,dt - \gamma\,\bigl(m\omega_0\langle x\rangle + \langle p\rangle\bigr)\,dt + \hbar\sqrt{2\kappa\beta}\,dW + 2\sqrt{2k}\,\sigma_{xp}^2\,dW, \tag{27}$$

and, for the covariances,

$$\dot\sigma_x^2 = \frac{2}{m}\,\sigma_{xp}^2 - 8k\,(\sigma_x^2)^2, \tag{28}$$

$$\dot\sigma_p^2 = -2m\omega_0^2\sigma_{xp}^2 - 8k\,(\sigma_{xp}^2)^2 - 2\Gamma\sigma_p^2 + 2\hbar^2k + 2\hbar^2\left[(1-\kappa)\beta + \frac{m\omega_0\Gamma}{2\hbar}\coth\frac{\hbar\omega_0}{2k_BT}\right], \tag{29}$$

$$\dot\sigma_{xp}^2 = \frac{\sigma_p^2}{m} - \Gamma\sigma_{xp}^2 - m\omega_0^2\sigma_x^2 - 8k\,\sigma_x^2\sigma_{xp}^2 - 4\hbar\sqrt{\kappa\beta k}\,\sigma_x^2. \tag{30}$$

In these equations, $\sigma_x^2$ and $\sigma_p^2$ are the variances in position and momentum, respectively, and

$$\sigma_{xp}^2 = \frac{1}{2}\langle xp + px\rangle - \langle x\rangle\langle p\rangle \tag{31}$$

is the symmetrized covariance. This system of equations is exactly equivalent to Eq. (10) as long as the initial state is Gaussian. In order to solve this set of equations most easily, we make what we call the truncated Gaussian approximation. We assume that the feedback rate $\gamma$ is much larger than the system's small intrinsic damping $\Gamma$, and we therefore drop all damping terms proportional to $\Gamma$ from the above equations. This approximation is easily justified for current experiments.
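The relaxation of the conditional variances to a steady state can be illustrated with a small numerical integration. The sketch below is a simplified stand-in rather than the paper's full system: it assumes the variance equations with the damping terms dropped and with no shot-noise correlation ($\kappa = 0$), that is $\dot\sigma_x^2 = (2/m)\sigma_{xp} - 8k(\sigma_x^2)^2$, $\dot\sigma_p^2 = -2m\omega^2\sigma_{xp} - 8k\sigma_{xp}^2 + 2D$, and $\dot\sigma_{xp} = \sigma_p^2/m - m\omega^2\sigma_x^2 - 8k\sigma_x^2\sigma_{xp}$, where $D$ lumps together all momentum diffusion. Units are dimensionless ($m = \omega = 1$), and the function names and parameter values are hypothetical.

```python
import math


def integrate_covariances(k, diffusion, mass=1.0, omega=1.0, dt=1e-3, steps=40000):
    """Forward-Euler integration of the simplified variance equations."""
    sx, sp, sxp = 1.0, 1.0, 0.0  # arbitrary Gaussian initial state
    for _ in range(steps):
        dsx = 2.0 * sxp / mass - 8.0 * k * sx * sx
        dsp = -2.0 * mass * omega ** 2 * sxp - 8.0 * k * sxp * sxp + 2.0 * diffusion
        dsxp = sp / mass - mass * omega ** 2 * sx - 8.0 * k * sx * sxp
        sx, sp, sxp = sx + dsx * dt, sp + dsp * dt, sxp + dsxp * dt
    return sx, sp, sxp


def steady_state(k, diffusion, mass=1.0, omega=1.0):
    """Closed-form fixed point obtained by setting the three derivatives to zero."""
    lam = math.sqrt(1.0 + 16.0 * k * diffusion / (mass ** 2 * omega ** 4)) - 1.0
    sxp = mass * omega ** 2 * lam / (8.0 * k)
    sx = math.sqrt(sxp / (4.0 * k * mass))
    sp = mass ** 2 * omega ** 2 * sx * (1.0 + lam)
    return sx, sp, sxp


numeric = integrate_covariances(k=1.0, diffusion=1.0)
analytic = steady_state(k=1.0, diffusion=1.0)
print("numeric :", numeric)
print("analytic:", analytic)
```

Because these equations have the structure of a Riccati equation for a linear Gaussian filter, the variances settle to a fixed point that does not depend on the measurement record.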

The  steady-state  solutions  to  these  equations  are 

V 2(0  _ 

(32)

where

$$\Lambda = \left[1 + \frac{16k\hbar^2\left(k + (1-\kappa)\beta + \dfrac{m\omega_0\Gamma}{2\hbar}\coth\dfrac{\hbar\omega_0}{2k_BT}\right)}{m^2\omega^4}\right]^{1/2} - 1. \tag{35}$$

In the limit of both large and small values of $k$, $\Lambda \sim k$.

The average energy of the resonator under feedback control, being the expectation value of the Hamiltonian [Eq. (25)] averaged over all trajectories, is a linear combination of the variances of $x$ and $p$, since the expectation values of both $x$ and $p$ are zero. These variances are the sum of the intrinsic variances of the Gaussian steady state for each trajectory, and the variances of the means of $x$ and $p$ for each trajectory (usually referred to as the conditional means) across all trajectories. We can calculate these latter variances, which we will denote by $\sigma_{\langle x\rangle}^2$ and $\sigma_{\langle p\rangle}^2$, by substituting into Eqs. (26) and (27) the solutions for the steady-state values of the variances $\sigma_x^2$ and $\sigma_{xp}^2$, and solving for the first and second moments of the conditional means.22 One obtains

Thus the average energy of the oscillator, under feedback, is

$$E = \frac{1}{2}m\omega^2\left(\sigma_x^2 + \sigma_{\langle x\rangle}^2\right) + \frac{1}{2m}\left(\sigma_p^2 + \sigma_{\langle p\rangle}^2\right). \tag{38}$$

(39)

Here we have used the simplifying assumption $\gamma \gg \omega$, since this is inherent in the optimal control condition.

It is clear from Eqs. (35) and (39) that reducing the background temperature allows for lower final temperatures. Extremely low values of $k$ lead to heating, as can be seen from the fact that $\Lambda \sim k$. For large $k$ (corresponding to large gate voltage), the increased sensitivity of the measurement cancels the increased disturbance due to the measurement, with the result that the minimal temperature levels off as $k$ is increased.

V. ESTIMATES FOR ACHIEVABLE TEMPERATURES

Current refrigeration technology allows experiments on nanomechanical resonators to be performed at temperatures of about 100 mK. It is therefore sensible to assume that the feedback algorithm will be applied to a device which is initially at this temperature. In such experiments the resonators typically have fundamental frequencies in the range $f_0 = 1$-$100$ MHz. As our example system we take a realistic resonator with $f_0 = 12$ MHz, which is 6 μm in length, 50 nm wide, and 150 nm thick. We restrict ourselves to relatively low frequencies because of the limits of feedback circuitry, which we estimate can easily operate at 50 MHz. The effective mass of such a resonator is roughly $10^{-16}$ kg. An achievable quality factor $Q$ is on the order of $10^4$.

Realistic values for the resistances and capacitances of the junctions of a SET which would be used to monitor the resonator are $R_1 = R_2 = 50\ \mathrm{k\Omega}$ and $C_1 = C_2 = 100$ aF, and we place it 100 nm from the resonator. We estimate that the capacitance between the gate of the SET and the resonator will be roughly $C_g = 50$ aF, so that $C_\Sigma = 250$ aF ($C_\Sigma = 2C_j + C_g$). It is important to note that the analysis we use in the Appendix to obtain the noise spectra is only a good approximation in certain parameter regimes. In particular, we require that $V_{ds}$, being the drain-source voltage across the SET, satisfies $V_{ds} < e/C_\Sigma$, and that $k/\beta \ll 1$, as discussed in Sec. III.
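The capacitance and voltage scales in this paragraph follow from simple arithmetic, sketched below. The sketch assumes $C_\Sigma = 2C_j + C_g$ with the junction and gate capacitances quoted above, and evaluates both the bound $e/C_\Sigma$ and the operating point $V_{ds} = e/(4C_\Sigma)$ that the paper uses in its numerical example; the function name is hypothetical.

```python
E_CHARGE = 1.602176634e-19  # elementary charge, C


def total_island_capacitance(c_junction, c_gate):
    """C_Sigma = 2*C_j + C_g for two identical junctions and one gate."""
    return 2.0 * c_junction + c_gate


c_sigma = total_island_capacitance(100e-18, 50e-18)  # farads
v_bound = E_CHARGE / c_sigma                         # upper bound on V_ds, volts
v_ds = E_CHARGE / (4.0 * c_sigma)                    # operating point, volts

print(f"C_Sigma   = {c_sigma * 1e18:.0f} aF")
print(f"e/C_Sigma = {v_bound * 1e3:.2f} mV")
print(f"V_ds      = {v_ds * 1e3:.2f} mV")
```

The operating point is a quarter of the bound, so the condition $V_{ds} < e/C_\Sigma$ is met with some margin.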

To apply the feedback force, we place the resonator 100 nm from the actuating gate, and allow the controller to vary the voltage difference between the gate and the resonator from -4 to 4 V. The capacitance of this arrangement is about 50 aF, so the maximum force that can be applied to the resonator is of the order of $10^{-8}$ N. This corresponds to $\gamma \approx 1.08\times10^{13}\ \mathrm{s^{-1}}$, which is much larger than $\omega$ and $\Gamma$, as required by the optimal control condition and truncated Gaussian approximation used in Sec. IV.

In evaluating the effectiveness of the feedback loop at cooling the resonator, it should be noted that the concept of temperature is only well defined for a system at equilibrium with a thermal reservoir. While the resonator starts at thermal equilibrium, the action of the feedback loop is to reduce the energy of the resonator so that it is far from equilibrium. Thus, when we quote results for the achievable steady-state effective "temperature," we will mean the temperature which the resonator would have if it were in thermal equilibrium and had the average energy achieved by the feedback loop.

Before giving theoretical estimates of the achievable steady-state effective temperature (or equivalently, the steady-state average occupation number of the oscillator, $\langle N\rangle = \langle a^\dagger a\rangle$), we need to explain two subtleties which affect the presentation of our results. When one examines the dependence of the steady state $\langle N\rangle$ on the gate voltage, one finds that it oscillates very rapidly, with minima occurring in closely spaced pairs. Since $V_g$ is experimentally easy to tune, all else being equal it would make sense simply to plot these minima and ignore the complex structure. However, as discussed in Sec. III, our results are more trustworthy the smaller $k/\beta$, but this quantity is not necessarily small at the minima. The situation is shown in detail in Fig. 2, in which we display, as a function of $V_g$, two pairs of the $\langle N\rangle$ minima, as well as $k/\beta$ and the current $I_{ds}$. In view of this, when plotting results in what follows, we will show both the minima of the effective temperature with respect to $V_g$, and the (somewhat higher) effective temperature which results if we demand that $k/\beta \le 0.1$. For clarity the points at which $k/\beta = 0.1$ are also displayed in Fig. 2. As will be clear from Figs. 3 and 4, for $T = 100$ mK and $Q = 10^4$, the effect of the restriction $k/\beta \le 0.1$ on the achievable temperature is small. In addition, $k/\beta$ remains fairly small at the minima. Since this is the case, when we quote values in the following, we will give the values obtained at the minima, along with the corresponding values for $k/\beta$.

FIG. 2. The steady-state average occupation number, $\langle N\rangle$, as a function of the gate voltage (solid line), plotted along with the ratio $k/\beta$ (dashed line), and the drain-source current, $I_{ds}$ (dot-dashed line). The lower dotted line gives the minima of $\langle N\rangle$, and the upper dotted line gives the values of $\langle N\rangle$ when $k/\beta = 0.1$.

As an example of the relative magnitudes of the various noise sources at the minima displayed in Fig. 2, if we set the drain-source voltage at $V_{ds} = e/(4C_\Sigma) = 0.16$ mV and the gate voltage at $V_g \approx 1$ V, then the noise sources are

$$\beta = 1.01\times10^{31}\ \mathrm{m^{-2}\,s^{-1}}, \tag{40}$$

$$k = 0.184\,\beta, \tag{41}$$

$$\frac{m\omega_0\Gamma}{2\hbar}\coth\frac{\hbar\omega_0}{2k_BT} = 9.25\times10^{31}\ \mathrm{m^{-2}\,s^{-1}}, \tag{42}$$

and the correlation coefficient is $\kappa = 0.638$.

FIG. 3. Estimates for the minimum achievable effective temperatures as a function of gate voltage for a range of initial temperatures, $T$. On this plot, the increase in achievable temperature which results from the restriction $k/\beta \le 0.1$ is virtually imperceptible for $T$ above 100 mK. For $T = 100$ mK the dotted line shows the result under this restriction. From top to bottom, the initial temperatures are 2 K, 1 K, 500 mK, and 100 mK.

Using the above parameter values to calculate the effective
temperature, T_eff, at the minima, we find that A = 5.1
× 10^-5, and T_eff = 2.11 mK. This corresponds to an energy
of about E_ss = 2.91 × 10^-26 J, and an average occupation
number ⟨N⟩ = 3.17. While this is very encouraging, ideally


FIG. 4. Estimates for the minimum achievable effective
temperatures as a function of gate voltage for a range of resonator
quality factors and an initial temperature of 100 mK. The dotted
lines give the minimum temperature under the additional restriction
that k/β ≤ 0.1. From top to bottom, the quality factors are 10^3, 10^4,
10^5, and 10^6. A quality factor of 10^4 is achievable with current
technology.
one wants to cool below the energy of the first excited state,
and  we  now  examine  what  is  required  to  do  this. 

While  classically  an  increase  in  measurement  strength 
would  automatically  lead  to  an  improved  tracking  of  the 
resonator,  and  therefore  a  more  efficient  cooling,  quantum 
mechanically  the  situation  is  more  complex  due  to  the  fact 
that  a  more  precise  measurement  also  leads  to  increased 
heating  due  to  back-action.  Nevertheless,  in  the  present  case 
one  finds  that  the  increased  sensitivity  of  the  measurement 
with  increasing  measurement  strength  effectively  cancels  this 
heating,  and  as  a  result  a  larger  value  of  Vg  corresponds  to 
better  cooling.  However,  after  a  sharp  increase  in  cooling 
with  increasing  Vg ,  the  minimal  temperature  levels  off,  so  a 
greater  Vg  no  longer  provides  much  benefit.  In  addition,  at 
some  value  of  Vg  snap-in  is  likely  to  occur  as  the  force 
between  the  SET  gate  and  the  resonator  becomes  too  strong. 
This voltage, in our example system, is estimated to be
roughly 4 V. As a result, we limit ourselves to Vg ≤ 4 V. At
Vg = 4 V the steady-state minimum energy is E = 9.83
× 10^-27 J, which is below the energy of the first excited
state. This corresponds to T_eff = 0.71 mK and ⟨N⟩ = 0.74,
with k/β = 0.28. Thus, if the energy were to be measured
directly, immediately after turning off the feedback, energy
jumps as a signature of quantum behavior may well be
observable. As an indication of the return from increasing the
gate voltage, the minimum steady-state energy is E = 1.58
× 10^-26 J for Vg = 2 V, which corresponds to ⟨N⟩ ≈ 1.5,
with k/β = 0.21.
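
The quoted energies, effective temperatures, and occupation numbers can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the conventions E = k_B T_eff and E = ħω_0(⟨N⟩ + 1/2), and it infers ħω_0 from one quoted pair of values because the resonator frequency is not restated in this passage. The function names are hypothetical.

```python
import math

# CODATA constants.
HBAR = 1.054571817e-34  # J s
K_B = 1.380649e-23      # J / K

# Assumption: hbar*omega_0 is inferred from one quoted pair
# (E = 2.91e-26 J with <N> = 3.17); it is not stated in this passage.
HBAR_OMEGA0 = 2.91e-26 / (3.17 + 0.5)  # J


def effective_temperature(energy):
    """Effective temperature in kelvin, assuming E = k_B * T_eff."""
    return energy / K_B


def occupation_number(energy, hbar_omega0=HBAR_OMEGA0):
    """Mean occupation <N>, assuming E = hbar*omega_0 * (<N> + 1/2)."""
    return energy / hbar_omega0 - 0.5


def below_first_excited_state(energy, hbar_omega0=HBAR_OMEGA0):
    """True if E lies below the first excited level, (3/2) * hbar*omega_0."""
    return energy < 1.5 * hbar_omega0


if __name__ == "__main__":
    f0_mhz = HBAR_OMEGA0 / (2 * math.pi * HBAR) / 1e6
    print(f"inferred resonator frequency: {f0_mhz:.1f} MHz")
    for energy in (2.91e-26, 1.58e-26, 9.83e-27):
        print(
            f"E = {energy:.2e} J: "
            f"T_eff = {1e3 * effective_temperature(energy):.2f} mK, "
            f"<N> = {occupation_number(energy):.2f}, "
            f"below first excited state: {below_first_excited_state(energy)}"
        )
```

Under these assumptions only the lowest of the three quoted energies falls below (3/2)ħω_0, which is the sense in which the text says the first excited state has been undercut.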

In Fig. 3 we plot the theoretical estimates for the achievable
steady-state effective temperature as a function of Vg for
a range of starting temperatures. The solid lines correspond
to the absolute minima, and the dotted lines to the minimum
values under the restriction that k/β ≤ 0.1. Of particular
interest is the fact that for a starting temperature of 2 K (i.e.,
with pumped liquid He), we obtain minimum temperatures in
the range of 50 mK. Thus, even for an initial temperature of
2 K, feedback cooling might well be able to compete with
dilution refrigerators. If the resonator is first cooled in a
dilution refrigerator, and then feedback cooled, the
semiclassical theory predicts achievable temperatures below
1 mK, as discussed above. In Fig. 4 we plot the dependence
of the minimum temperature on Vg for a range of quality
factors, which shows that somewhat lower final temperatures
could be achieved by increasing Q.

VI.  DISCUSSION  AND  CONCLUSION 

The  results  obtained  above  are  consistent  with  heuristic 
arguments.  The  response  of  cooling  to  the  measurement 
strength is as expected: for very weak continuous measurements,
we do not learn enough about the state of the system
to  cool  it  effectively,  and  can  in  fact  heat  the  system  due  to 
acting  on  our  poor  information.  For  very  strong  continuous 
measurements,  we  gain  sensitivity,  but  inject  more  quantum 
back-action,  and  approach  a  minimum  only  asymptotically. 
The  range  of  improvement  is  limited,  however,  and  beyond  a 
few  volts,  the  benefits  may  not  warrant  the  additional  effort. 

Higher drain-source voltages provide a larger signal-to-noise
ratio, and therefore improve cooling. However, since
we do not know exactly how our approximations will fail as
Vds approaches e/C_Σ, and we lack a complete theory of the
SET once more than two island states play a significant role
in the dynamics, we have chosen to stay below that limit.

We have made a few additional simplifying assumptions,
as a way to indicate a goal, rather than an immediately
achievable experimental realization. First, we have assumed
a perfectly efficient (and infinite bandwidth) measurement,
that is, that no electron passes the detector without being
detected. While detection efficiency is not as much of a problem
here as in optical experiments, detectors will necessarily
be inefficient to some extent. Second, we have assumed a
perfect, noiseless feedback. In reality, the actuating gate
applying the feedback will not provide a perfect noiseless
voltage. Also, we have assumed that the actuating gate does not
affect the SET. This last assumption is realistic, however, for
two reasons. First, the resonator itself acts as a shield
between the gate and the SET. Second, since the observer
knows the voltage on the feedback gate, she can subtract that
effect off the SET signal, albeit with the addition of some
noise.

As mentioned previously, the dynamics of a quantum mechanical
harmonic oscillator and a classical one are indistinguishable
as long as the wave function is Gaussian, which is
the case in the present analysis. Therefore, although the
oscillator is near the quantum mechanical ground state, the
SET measurement of position will not show any quantum
behavior. In the face of these limitations, it is a pleasant
result that experimentally obtainable situations today allow
for the feedback cooling of a resonator to the point that quantum
behavior could become distinguishable from classical
behavior with an appropriate measurement scheme.
APPENDIX: SPECTRA OF THE SET SHOT NOISE
AND  BACK-ACTION 

Here  we  discuss  briefly  how  the  expressions  for  the  shot 
noise  and  back-action  of  the  position  measurement  via  a  SET 
are obtained. For more details the reader is referred to Zhang
and Blencowe (Ref. 8), from which we obtain most of the following
expressions, and Korotkov (Ref. 3).

The SET consists of a central island, which electrons tunnel
in and out of via junctions on either side. If one requires
that the spacing between the energy levels of the electron
states on the island is sufficiently large compared to the
voltage drop across the SET, then only two island states will
be appreciably populated, these being the states in which
there are n and n + 1 electrons on the island, for some n. This
is because the transition rates which connect these states to
the other states are suppressed. The value of n can be set by
biasing the central island. In particular, n is determined by
the condition

$$ n < \frac{C_g}{e}\left(V_g - \frac{V_{ds}}{2}\right) < n + 1. \tag{A1} $$

As a result, we can write a master equation for the probability
density for the occupation of the two states. Denoting this
density by σ = [σ(n), σ(n+1)]^T, we have

$$ \frac{d\sigma}{dt} = \begin{pmatrix} -a(n) & b(n+1) \\ a(n) & -b(n+1) \end{pmatrix} \sigma, \tag{A2} $$

where a(n) is the transition rate from n to n + 1, and b(n+1)
is the transition rate from n + 1 to n.

If we denote the tunneling rates into the island across the
source junction and the drain junction (see Fig. 1) as a_-(n)
and a_+(n), respectively (the plus and minus subscripts
record whether the tunneling event has a positive or negative
contribution to the SET current), and out of the island as b_+
and b_-, respectively, then

$$ a(n) = a_+(n) + a_-(n), \tag{A3} $$

$$ b(n+1) = b_+(n+1) + b_-(n+1). \tag{A4} $$

It is also useful to define

$$ f(n) = a_+(n) - a_-(n), \tag{A5} $$

$$ g(n+1) = b_+(n+1) - b_-(n+1). \tag{A6} $$

In what follows we will suppress the arguments of these functions,
so that a = a(n), b = b(n+1), etc. The solution to the
master equation is


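$$ \sigma(t) = \frac{1}{a+b}\left[ \begin{pmatrix} b & b \\ a & a \end{pmatrix} + e^{-(a+b)t} \begin{pmatrix} a & -b \\ -a & b \end{pmatrix} \right] \sigma(0). \tag{A7} $$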
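
The closed-form solution of the two-state master equation can be checked numerically. The following is a minimal sketch rather than anything from the original analysis: the rate values are arbitrary illustrative numbers, the function names are hypothetical, and a fixed-step Runge-Kutta integrator stands in for any ODE solver.

```python
import numpy as np


def closed_form(a, b, t, sigma0):
    """Two-state solution: steady-state projector plus a decaying part."""
    steady = np.array([[b, b], [a, a]]) / (a + b)
    decay = np.array([[a, -b], [-a, b]]) / (a + b)
    return (steady + np.exp(-(a + b) * t) * decay) @ np.asarray(sigma0, float)


def integrate_rk4(a, b, t, sigma0, steps=2000):
    """Integrate d(sigma)/dt = M sigma with classical fourth-order Runge-Kutta."""
    m = np.array([[-a, b], [a, -b]], dtype=float)
    sigma = np.asarray(sigma0, float)
    h = t / steps
    for _ in range(steps):
        k1 = m @ sigma
        k2 = m @ (sigma + 0.5 * h * k1)
        k3 = m @ (sigma + 0.5 * h * k2)
        k4 = m @ (sigma + h * k3)
        sigma = sigma + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)
    return sigma


# Hypothetical rates in arbitrary units, chosen only for illustration.
A_RATE, B_RATE = 3.0, 1.0
SIGMA0 = [1.0, 0.0]  # island starts with n electrons

if __name__ == "__main__":
    print(closed_form(A_RATE, B_RATE, 0.7, SIGMA0))
    print(integrate_rk4(A_RATE, B_RATE, 0.7, SIGMA0))
```

At long times the exponential term vanishes and the occupation tends to [b, a]/(a + b), which is the steady state used for the average current and the spectra.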


From this it is straightforward to calculate the average
steady-state current flowing through the SET, the noise spectrum
of the current, S_I(ω), along with that of an arbitrary
function, φ(n), of the island electron number, S_φ(ω), and
their mutual correlation spectrum, C(ω). The average current
is


$$ I = e\,\frac{ag + bf}{2(a+b)}, \tag{A8} $$


and the spectra are

$$ S_\phi(\omega) = \frac{2ab}{(a+b)}\,\frac{[\phi(n) - \phi(n+1)]^2}{(a+b)^2 + \omega^2}, \tag{A9} $$


Sr(co)-- 


2  e2C 


2^2 


(a  +  b)C\ 


ab  + 


(f-g)(a2g~b2f) 


(a  +  b)2+  co1 


(A10) 


_  (ag  +  ^/)2(a  —  fr)2  +  <o2(qg  —  bff 

4  ab[ab[(a  +  b)2+  (o2]  +  (f-  g)(a2g-  b2f)] 

(All) 

The force from the island on the resonator is given by (Ref. 8)

$$ F = \frac{A}{d}\left[C(V_{ds} - 2V_g) - ne\right]^2, \tag{A12} $$

with A = C_g(2C − C_g)/(2C_Σ^3). Thus, using Eq. (A9) we have

$$ S_F(\omega) = \frac{2ab\,e^2A^2}{(a+b)\,d^2}\,\frac{[2C(V_{ds} - 2V_g) - e(2n+1)]^2}{(a+b)^2 + \omega^2}. \tag{A13} $$
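
The step from Eq. (A9) to Eq. (A13) amounts to taking the function φ(n) to be the force of Eq. (A12) and evaluating the difference between the two island states. A short worked version is given below; the shorthand X is introduced here for convenience and does not appear in the original.

```latex
\begin{align*}
\phi(n) - \phi(n+1)
  &= \frac{A}{d}\Big\{[X - ne]^2 - [X - (n+1)e]^2\Big\},
     \qquad X \equiv C(V_{ds} - 2V_g), \\
  &= \frac{Ae}{d}\,\big[2X - (2n+1)e\big], \\
[\phi(n) - \phi(n+1)]^2
  &= \frac{e^2 A^2}{d^2}\,\big[2C(V_{ds} - 2V_g) - e(2n+1)\big]^2 .
\end{align*}
```

Inserting the last line into the general two-state spectrum reproduces the voltage-dependent bracket and the factor e²A²/d² of the force spectrum.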

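Recall that in deriving these expressions we require that
the two-level approximation is valid, and this demands that

$$ V_{ds} < e/C_\Sigma, \qquad k_B T \ll e V_{ds}. $$

The tunneling rates are given by

$$ a_\pm(n) = \frac{(\Delta n \pm \tilde V_{ds})/(R_J C_\Sigma)}{1 - \exp[-(\Delta n \pm \tilde V_{ds})/\tilde T]}, \tag{A14} $$

$$ b_\pm(n+1) = \frac{(-\Delta n \pm \tilde V_{ds})/(R_J C_\Sigma)}{1 - \exp[-(-\Delta n \pm \tilde V_{ds})/\tilde T]}, \tag{A15} $$

where

$$ \Delta n = \frac{C_g V_g}{e} - \frac{C_g V_{ds}}{2e} - n - \frac{1}{2}, \qquad \tilde V_{ds} = \frac{C_\Sigma V_{ds}}{2e}, \tag{A16} $$

$$ \tilde T = \frac{C_\Sigma k_B T}{e^2}. \tag{A17} $$

Note that the condition which determines n [Eq. (A1)] is
equivalent to −0.5 < Δn < 0.5.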

From the expressions for the noise spectra we see that
both sources of noise are effectively white (independent of
ω) so long as ω² is much less than [a(n) + b(n+1)]². If
this is the case then the simple quantum theory of continuous
position measurement presented in the main body of the paper
provides a good model for the SET measurement. Note
that the actual back-action noise on the position of the resonator
is the force noise filtered through the resonator spectral
function. This is therefore


$$ S_x(\omega) = \frac{S_F(\omega)/m^2}{(\omega^2 - \omega_0^2)^2 + \omega^2\omega_0^2/Q^2}, \tag{A18} $$


and  has  the  same  form  as  that  predicted  using  the  quantum 
mechanical  model  [Eq.  (11)],  so  long  as  the  force  noise  is 
white. 

We must therefore evaluate [a(n) + b(n+1)]² for the
range of parameters of interest, and verify that it is much
larger than ω² over the relevant frequency range. First we




HOPKINS,  JACOBS,  HABIB,  AND  SCHWAB 


PHYSICAL  REVIEW  B  68,  235328  (2003) 


note that the form of the spectral equations is such that they
are periodic in the gate voltage. That is, the values of a(n)
and b(n+1) depend only on Δn, not on the particular value
of n in question. As a result we merely need evaluate
[a(n) + b(n+1)]² for a single value of n, and check all
values of Δn between −0.5 and 0.5.

Substituting in realistic parameter values (those that we
use in our examples in the body of the paper) in Eqs. (A16) and
(A17), we find that, regardless of the value of Δn,

$$ [a(n) + b(n+1)] \geq 2 \times 10^{10} \tag{A19} $$

for the range of initial temperatures that we consider, and this
is much greater than the range of ω relevant for the dynamics
of the resonator, as required. Thus, we can drop ω from the
expressions for the spectra [Eqs. (A10), (A13), and (A11)],
and use these to determine the parameters k, β, and κ in the
model of the quantum position measurement.
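
To give a feel for how safely the white-noise condition is met, the sketch below evaluates the Lorentzian factor (a + b)²/[(a + b)² + ω²] that appears in the spectra. It rests on two assumptions that are not stated in this appendix: the quoted bound of 2 × 10^10 is taken to be in units of s^-1, and the resonator frequency is inferred from the energy and occupation number quoted in the main text (E = 2.91 × 10^-26 J with ⟨N⟩ = 3.17). The function name is hypothetical.

```python
HBAR = 1.054571817e-34  # J s (CODATA)

# Assumption: the quoted lower bound on a(n) + b(n+1) is in s^-1.
RATE_SUM_MIN = 2e10

# Assumption: omega_0 inferred from E = hbar*omega_0 * (<N> + 1/2)
# using the values E = 2.91e-26 J and <N> = 3.17 from the main text.
OMEGA0 = (2.91e-26 / (3.17 + 0.5)) / HBAR  # rad / s


def lorentzian_flatness(omega, rate_sum):
    """Spectrum at frequency omega relative to its zero-frequency value."""
    return rate_sum**2 / (rate_sum**2 + omega**2)


if __name__ == "__main__":
    deviation = 1.0 - lorentzian_flatness(OMEGA0, RATE_SUM_MIN)
    print(f"omega_0 = {OMEGA0:.3e} rad/s")
    print(f"fractional deviation from white noise at omega_0: {deviation:.2e}")
```

With these numbers the spectra deviate from their zero-frequency values by roughly one part in 10^5 at the resonator frequency, so dropping ω is a very mild approximation.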
in our optical scattered light image may consist
of primordial solid material.

Within ~50 AU of the star, the time
scales for grain removal by collisions and PR
drag become significantly shorter than the
stellar age. Primordial dust at the inner limit
of our images (Figs. 1 and 2) has mostly
vanished, and the grains observed here, as
well as those discovered as close as 17 AU
from the star (14), must be continually
replenished by the collisional erosion of much
larger objects such as comets and asteroids.
The existence of planetesimals in this region
lends plausibility to the argument that the
same objects will form planets by accretion.
Given that AU Mic is only ~10 My old, we
may be able to observe planets that are still in
the process of accreting mass, or at least
discern disk structure that is sculpted by
planet-mass bodies. Because AU Mic is closer to
the Sun than β Pic, the 2 to 30 AU zone
where terrestrial and gas giant planets might
form can be resolved by current and future
instrumentation (fig. S5). Planets around AU
Mic may also be detected by indirect methods.
The low stellar mass means that the star
will display a significant astrometric reflex
motion (2 milli-arc sec for a Jupiter analog).
The near edge-on orientation favors planet
detection by transits of the stellar photosphere.
Finally, if a planet is detected by
radial velocity techniques, then the near
edge-on orientation gives the planet mass by
constraining the sin(i) ambiguity intrinsic to
these measurements.
A crucial building-block for quantum information
science is a deterministic source of single
photons that generates one-quantum wave packets
in a well-controlled spatiotemporal mode of
the electromagnetic field. For example, protocols
for the implementation of quantum cryptography
(1) and of distributed quantum networks rely on
this capability (2), as do models for scalable
quantum computation with single-photon pulses
as flying qubits (3-6).

The earliest observations of single-photon
emission used the fluorescent light from single
atoms in two- and three-level configurations
(7-9), and thereby produced light with
manifestly quantum or nonclassical character.
Fluctuations in the number of atoms provided
inherent limitations to these original
schemes, and have since been mitigated by
isolating single ions (10) and molecules (11,
12) and by using individual quantum dots
(13, 14) and color centers (15, 16).

With a single dipole, pulsed excitation allows
for “triggered” emission of a single photon within
a prescribed interval, albeit into 4π steradians.
To achieve emission as a directed output with
high efficiency, the dipole emitter can be placed
inside an optical resonator, as by coupling single
quantum dots to microcavities (17-19). These
experiments make use of the Purcell effect to
enhance radiative decay into a cavity mode of
interest and thereby achieve a deterministic bit
stream of single-photon pulses (20) in a regime
of weak coupling in cavity quantum electrodynamics
(cQED).

By contrast, the generation of single photons
within the domain of strong coupling in cQED
(21, 22) enables diverse new capabilities, including
the reversible transfer of quantum states between
atoms and photons as a fundamental primitive
for the realization of quantum networks (2).
A single photon source consisting of a trapped
atom strongly coupled to an optical cavity represents
an ideal node for such a network, in
which long-lived internal atomic states can be
mapped to quantum states of the electromagnetic
field by way of “dark” eigenstates of the
atom-cavity system (23). By way of a quantum repeater
architecture, converting stationary qubits to
flying qubits in this way enables distributed
quantum entanglement over long distances (2).

We report on the deterministic generation
of single-photon pulses by a single atom
strongly coupled to an optical cavity in a
configuration suitable for quantum network
protocols. Single cesium atoms are cooled
and loaded into an optical trap (Fig. 1A),
which localizes them within the mode of a
high-finesse optical cavity (24-26). The atom
is then illuminated by a sequence of laser
pulses {Ω_3(t), Ω_4(t)}, the first of which,
Ω_3(t), drives a “dark-state” transfer between
hyperfine ground states, F = 3 → 4 (Fig. 1,
B and C). In this process, one photon is
created in the cavity mode because the atomic
transition F' = 3' → F = 4 is strongly
coupled to the cavity field with rate g (2, 23).
The emitted photon leaves the cavity as a
freely propagating, spatially Gaussian wave
packet whose temporal profile is determined
by the external field Ω_3(t) (2, 20, 23). The
atom is then recycled back to the original
ground state by a second laser pulse, Ω_4(t),
and the protocol is repeated for subsequent
single-photon generations.

The lifetime for a trapped atom in the
presence of the driving Ω_{3,4} fields is τ_trap =
0.14 s, which should be compared to the
repetition period Δt = 10 μs for single-photon
generation and to the lifetime of 3 s
recorded in the absence of the Ω_{3,4} fields
(25). Given our measured overall efficiency
α = (2.4 ± 0.4)% for escape from the cavity,
for propagation, and for photodetection (26),
this means that on average, we generate (detect)
about 1.4 × 10^4 (350) single-photon
pulses from each trapped atom.
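
The counts per trapped atom follow from simple arithmetic on the quoted lifetime, repetition period, and efficiency. The sketch below is only a back-of-the-envelope check; it assumes one generation attempt per repetition period for one trap lifetime, and the variable names are illustrative.

```python
TRAP_LIFETIME_S = 0.14           # quoted trap lifetime with the drive fields on
REPETITION_PERIOD_S = 10e-6      # quoted repetition period
EFFICIENCY = 0.024               # quoted overall efficiency (2.4%)
EFFICIENCY_UNCERTAINTY = 0.004   # quoted uncertainty (0.4%)

# Assumption: one generation attempt per repetition period for one lifetime.
generated = TRAP_LIFETIME_S / REPETITION_PERIOD_S
detected = generated * EFFICIENCY
detected_low = generated * (EFFICIENCY - EFFICIENCY_UNCERTAINTY)
detected_high = generated * (EFFICIENCY + EFFICIENCY_UNCERTAINTY)

if __name__ == "__main__":
    print(f"generated per atom: {generated:.3g}")
    print(f"detected per atom: {detected:.0f} "
          f"(range {detected_low:.0f} to {detected_high:.0f})")
```

The central estimate comes out slightly below the quoted figure of about 350 detected pulses, which lies comfortably inside the range allowed by the stated efficiency uncertainty.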

The Gaussian beam emerging from the cavity
mirror M2 is directed to a beam splitter and
then to two photon-counting detectors (D_A, D_B).
For each atom k, photoelectric pulses from
D_{A,B} that occur during the trapping interval are
stamped with their time of detection (with δ =
2 ns time resolution) and recorded for later
analysis. An example of the pulse shape for
single-photon generation is shown (Fig. 2A)
over the detection window [t_0^j, t_0^j + δt] within
which the control field Ω_3^j(t) is ON, where δt =
1 μs and t_0^j is the onset of Ω_3^j(t). The histogram
of the total counts n(t) from both detectors
D_{A,B}, binned according to their delay with respect
to t_0^j, is a sum over all repeated trials {j}
of the generation process from all atomic trapping
events {k}. For the particular choice of
Ω_3(t) used here, single-photon pulses have duration
τ = 120 ns (full width at half maximum).
The extended tail for n(t) likely arises from
generation attempts for which the atom resides
in Zeeman sublevels that are weakly coupled to
the control field at the beginning of the Ω_3(t)
pulse (27, 28), as well as from roughly twofold
variations in the coupling coefficient g(r) (29).
Qualitative agreement of this measured pulse
shape has been obtained with multilevel quantum
Monte Carlo simulations (28).

To investigate the quantum character of
the emitted field, we calculate the function
C(τ) obtained by cross-correlating the photoelectric
counting events from the detectors
D_{A,B} as a function of time separation τ (Fig.
3) (26). The large suppression of C(τ) around
τ = 0 strongly supports the nonclassical character
of the light pulses emitted by the atom-cavity
system. The likelihood of two photons
being detected within the same trial is greatly
reduced relative to that for detection events in
different trials.

Suppression of two-photon events is also
quantified by the time dependence of the photon
statistics over the course of the pulse (Fig. 2, B
and C). Figure 2B displays the integrated probabilities
for single P_1(t) and joint P_2(t) detection
events for times t after the onset of the control
pulse Ω_3(t), with P_2(t) normalized to P_1(t)/2. We
calculate P_1(t) and P_2(t) for an effective single
detector without dead time or after-pulsing, and
define P_{1,2} ≡ P_{1,2}(δt). Over the duration of the
control pulse 0 < t < δt, P_1(t) rises to a final
value P_1 = 0.0284; that is, the probability of
registering a single photoelectric event in a trial
is 2.84%. The lower trace in Fig. 2B quantifies
the suppression of joint detection events relative
to that expected for a weak coherent state, which
would have 2P_2(t)/P_1(t) ≈ P_1(t) (as we have
confirmed in separate measurements). By the
end of the control pulse, 2P_2/P_1 has reached the
value 1.8 × 10^-3, which represents a 16-fold
suppression of joint detection events relative to a
Poisson process.

Figure 2C examines the ratio R(t) ≡
P_1(t)²/[2P_2(t)], where R ≈ 1 for a weak
coherent state and increases with suppression
of two-photon events. Significantly, R is
independent of propagation and detection losses
for P_1 ≫ P_2. The trace in Fig. 2C restates
the result that two-photon events are greatly
suppressed relative to a coherent state, namely
R ≡ R(δt) = 15.9 ± 1.0. Also, in Fig. 3,
the average area of the large peaks in C(τ)
around τ = jΔt should exceed that of the
central peak around τ = 0 by a factor of about
R, which we have confirmed.
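
The suppression ratio can be recomputed directly from the two quoted end-of-pulse numbers. The sketch below assumes the definition R = P_1²/(2P_2) and uses the quoted values P_1 = 0.0284 and 2P_2/P_1 = 1.8 × 10^-3; the function name is hypothetical.

```python
def suppression_ratio(p1, p2):
    """R = P1**2 / (2 * P2); close to 1 for a weak coherent state."""
    return p1**2 / (2.0 * p2)


P1 = 0.0284               # quoted single-event probability per trial
TWO_P2_OVER_P1 = 1.8e-3   # quoted end-of-pulse value of 2*P2/P1

# Recover P2 from the quoted ratio, then form R.
P2 = TWO_P2_OVER_P1 * P1 / 2.0
R = suppression_ratio(P1, P2)

# Reference case: a weak coherent state has P2 = P1**2 / 2, so R = 1.
R_COHERENT = suppression_ratio(P1, P1**2 / 2.0)

if __name__ == "__main__":
    print(f"R = {R:.1f} (measured source), R = {R_COHERENT:.1f} (weak coherent state)")
```

The recomputed value agrees with the quoted R within its stated uncertainty and matches the 16-fold suppression relative to a Poisson process.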

The background rate during the Ω_3 drive
pulses is time independent, and can be obtained
from the record of photoelectric detections when
no atom is trapped. The measured background
count probability is P_B = 2.7 × 10^-4 for the
entire window, of which P_D = 0.82 P_B comes
from detector dark counts, and the rest come
from various sources of scattered light. For an
ideal single-photon source, coincidence events

Fig. 2. (A) Total histogram of photoelectric detection
events n(t) from both detectors D_{A,B}. In all
cases, the control field Ω_3(t) is initiated at time
t = 0 with rise time 100 ns. (B) The integrated
probability P_1(t) for a single photoelectric event
and ratio 2P_2(t)/P_1(t), where P_2(t) is proportional
to the integrated coincidence probability for joint
detections from D_{A,B}. For a weak coherent state,
the two traces would nearly overlap. (C) The ratio
R(t) = P_1(t)²/[2P_2(t)] versus time, which indicates
as high as 20-fold suppression of coincidences
relative to a Poisson process.
Fig. 1. Illustration of the generation of single photons by one
atom trapped in an optical cavity. (A) A single Cs atom is
trapped in a cavity formed by the reflective surfaces of mirrors
(M1, M2) and is pumped by the external fields (Ω_3, Ω_4) (25).
(B) The relevant atomic levels of the Cs D2 line at 852.4 nm.
Strong coupling at rate g is achieved for the transition F' = 3'
→ F = 4 near a cavity resonance, where g = 2π × 16 MHz. Atom
and cavity decay rates are (γ, κ)/2π = (2.6 MHz, 4.2 MHz).
(C) The timing sequence for the generation of successive single
photons by way of the Ω_{3,4} fields.
at D_{A,B} in the same trial would arise only
because of background counts, because the
source never emits two photons in one trial.
Using the known values of P_1(t) and P_B, the
background-limited value R_B(t) for this idealized
scenario can be predicted. Our measured
values are actually lower than this prediction
[R_B ≡ R_B(δt) = 52.5], indicating a substantial
rate of excess coincidences.

These excess coincidences most likely arise
from rare events with two atoms trapped within
the cavity (26). We test this hypothesis in Fig. 4
by noting that the two-atom population should
decay at roughly twice the rate of the single-atom
population [as we have confirmed in other
measurements related to figure 4 in (25)]. The
probability P_2 for joint detection should therefore
diminish as a function of duration of the
trapping interval, with a corresponding increase
in the ratio R, which is precisely the behavior
evidenced in Fig. 4.

Operationally, we bin all our detection timestamps
according to their delay with respect to
the trap-loading time (t_T = 0), and then compute
photon statistics separately for each bin. Only
four intervals in t_T are used owing to poor statistics
for the coincidence counts, especially for
large t_T. The analysis is the same as for Fig. 2,
but we concentrate on the value R ≡ R(δt), at the
end of the Ω_3(t) pulse window. Furthermore, the
ratio R_0 plotted in Fig. 4 is obtained from R with
the contribution from the measured dark-count
probability P_D removed, thereby providing a
characterization of the atom-cavity source that is
independent of the dark counts for our particular
detectors. The results clearly support the hypothesis
that rare two-atom events are responsible for
our excess of coincidences.

Also shown in Fig. 4 as the full curve is the
result for R_0 from a model calculation that
assumes that a fraction η_1 of our data are acquired
with a single trapped atom, and that a
fraction η_2 = 1 − η_1 has two atoms trapped,
with η_1, η_2 functions of the time t_T within the
trapping interval (26). The correspondence between
the model and our measurements supports
the conclusion that excess coincidences
arise from rare events with two atoms loaded
into the trap. From this model, we infer that (i)
~3% of the trials are taken with two trapped
atoms; and (ii) the generation of single photons
succeeds with probability consistent with unity,
φ_G = 1.15 ± 0.18 as constrained by our absolute
knowledge of the various efficiencies (26).

Fig. 3. Time-resolved coincidences C(τ) as a
function of delay τ between detections at D_{A,B}.
Around τ = 0, C(τ) is suppressed for two events
from the same trial relative to its values for τ =
jΔt for two events from different trials, where
j = ±1, ±2, .... As indicated in Fig. 1C, Δt = 10
μs is the repetition interval for the generation
of single photons and δt = 1 μs is the duration
of our control pulse Ω_3(t).

Given our ability to distinguish multiatom
trap-loading events in real time [as demonstrated
in figure 4 of (25)], events with N ≥
2 atoms trapped in the cavity could be actively
discarded; alternatively, the extra atoms
could be heated out of the trap, before even
attempting single-photon generation. Moreover,
in its current implementation, our atom-cavity
system generates unpolarized single
photons, and a well-defined polarization is
subsequently selected with 50% efficiency.
This efficiency could be greatly improved by
separating the functions of cooling and of
single-photon generation for the Ω_3 control
field, so that the atom is optically pumped
into a known Zeeman sublevel before excitation.
This separation of function would allow
the interaction configuration of (29) to be
implemented, making the pulse shape and
phase for the photon wave packets insensitive
to randomness of the atomic position.

We have used a single atom trapped within
a high-finesse optical cavity as an efficient
source for the generation of single photons on
demand. The photons are emitted as a Gaussian
beam with user-controlled pulse shapes.
As shown in Fig. 4, the average ratio of
single- to two-photon event probabilities is


Fig. 4. Evolution of the ratio R_0 = P_1²/(2P_2)
versus trapping time t_T, here corrected for
detector dark counts. The data points are
experimentally determined as discussed in the
text, with vertical error bars based on counting
statistics of coincidence events, and horizontal
bars indicating the bin widths in t_T. The full
curve is the prediction from our model calculation
that includes (rare) two-atom events.
The dashed line represents the measured overall
average of R_0 for all t_T.


R_0 = 20.8 ± 1.8, whereas R_0 > 150 for
single-photon generation at long trapping
times t_T = 0.4 s. With this large suppression
of two-photon probability, the Mandel-Q parameter
is determined almost exclusively by
propagation efficiency. For example, for polarized
(unpolarized) photon wave packets,
Q = -0.34 ± 0.05 (Q = -0.68 ± 0.10)
referenced to the total cavity output from
(M1, M2). Absent passive losses from the cavity
boundaries, the generation of single photons
succeeds with probability close to unity,
where this high success probability derives
from the near-ideal nature of the atom-cavity
interaction in a regime of strong coupling.

References  and  Notes 

1.  N.  Lutkenhaus,  Phys.  Rev.  A.  61,  52304  (2000). 

2.  H.-J.  Briegel,  S.  J.  van  Enk,  J.  I.  Cirac,  P.  Zoller,  in  The  Physics 
of  Quantum  Information,  D.  Bouwmeester,  A.  Ekert,  A. 
Zeilinger,  Eds.  (Springer,  Berlin,  2000),  pp.  192-197. 

3.  I.  L.  Chuang,  Y.  Yamamoto,  Phys.  Rev.  A.  52,  3489  (1995). 

4.  Q.  A.  Turchette,  C.  J.  Hood,  W.  Lange,  H.  Mabuchi, 
H.  J.  Kimble,  Phys.  Rev.  Lett.  75,  4710  (1995). 

5.  E.  Knill,  R.  Laflamme,  G.  Milburn,  Nature  409,  46  (2001). 

6.  L.-M.  Duan,  H.  J.  Kimble,  in  press;  preprint  available  at 
http://arXiv.org/abs/quant-ph/0309187. 

7.  J.  F.  Clauser,  Phys.  Rev.  D  9,  853  (1974). 

8.  H.  J.  Kimble,  M.  Dagenais,  L.  Mandel,  Phys.  Rev.  Lett. 
39,  691  (1977). 

9.  P.  Grangier,  G.  Roger,  A.  Aspect,  Europhys.  Lett.  1,  173
(1986).

10.  F.  Diedrich,  H.  Walther,  Phys.  Rev.  Lett.  58,  203  (1987). 

11.  T.  Basche,  W.  E.  Moerner,  M.  Orrit,  H.  Talon,  Phys. 
Rev.  Lett.  69,  1516  (1992). 

12.  B.  Lounis,  W.  E.  Moerner,  Nature  407,  491  (2000). 

13.  P.  Michler  et  al.,  Nature  406,  968  (2000). 

14.  C.  Santori,  M.  Pelton,  G.  Solomon,  Y.  Dale,  Y. 
Yamamoto,  Phys.  Rev.  Lett.  86,  1502  (2001). 

15.  R.  Brouri,  A.  Beveratos,  J.-P.  Poizat,  P.  Grangier,  Opt. 
Lett.  25,  1294  (2000). 

16.  C.  Kurtsiefer,  S.  Mayer,  P.  Zarda,  H.  Weinfurter,  Phys. 
Rev.  Lett.  85,  290  (2000). 

17.  P.  Michler  et  al.,  Science  290,  2282  (2000). 

18.  E.  Moreau  et  al.,  Appl.  Phys.  Lett.  79,  2865  (2001). 

19.  M.  Pelton  et  al.,  Phys.  Rev.  Lett.  89,  233602  (2002). 

20.  C.  K.  Law,  H.  J.  Kimble,  J.  Mod.  Opt.  44,  2067  (1997). 

21.  A.  Kuhn,  M.  Hennrich,  G.  Rempe,  Phys.  Rev.  Lett.  89, 
67901  (2002). 

22.  H.  J.  Kimble,  Phys.  Rev.  Lett.  90,  249801  (2003). 

23.  A.  S.  Parkins,  P.  Marte,  P.  Zoller,  H.  J.  Kimble,  Phys. 
Rev.  Lett.  71,  3095  (1993). 

24.  J.  Ye,  D.  W.  Vernooy,  H.  J.  Kimble,  Phys.  Rev.  Lett.  83, 
4987  (1999). 

25.  J.  McKeever  et  al.,  Phys.  Rev.  Lett.  90,  133602  (2003). 

26.  Materials  and  methods  are  available  as  supporting 
material  on  Science  Online. 

27.  J.  McKeever,  A.  Boca,  A.  D.  Boozer,  J.  R.  Buck,  H.  J. 
Kimble,  Nature  425,  268  (2003). 

28.  A.  D.  Boozer,  A.  Boca,  J.  R.  Buck,  J.  McKeever,  H.  J.
Kimble,  in  press;  preprint  available  at
http://arXiv.org/abs/quant-ph/0309133.

29.  L.-M.  Duan,  A.  Kuzmich,  H.  J.  Kimble,  Phys.  Rev.  A.  67, 
032305  (2003). 

30.  We gratefully acknowledge the contributions of K. M. Birnbaum, S. J. van Enk, H.-C. Nagerl, D. M. Stamper-Kurn, D. W. Vernooy, and J. Ye. This work was supported by the Caltech MURI Center for Quantum Networks under U.S. Army Research Office grant DAAD19-00-1-0374, by the NSF, and by the Office of Naval Research.

Supporting  Online  Material 

www.sciencemag.org/cgi/content/full/1095232/DC1 

Materials  and  Methods 

Table  S1

References 

31  December  2003;  accepted  18  February  2004 
Published  online  26  February  2004; 

10.1126/science.1095232

Include  this  information  when  citing  this  paper. 


1994 


26  MARCH  2004  VOL  303  SCIENCE  www.sciencemag.org 


arXiv:quant-ph/0309187  v1  25  Sep  2003


Scalable  photonic  quantum  computation  through  cavity-assisted  interaction 


L.-M.  Duan1  and  H.  J.  Kimble2 

1  Department  of  Physics  and  FOCUS  Center,  University  of  Michigan,  Ann  Arbor,  MI  48109-1120 
2  Norman  Bridge  Laboratory  of  Physics  12-33,  California  Institute  of  Technology,  Pasadena,  CA  91125 


We propose a scheme for scalable photonic quantum computation based on cavity-assisted interaction between single-photon pulses. The prototypical quantum controlled phase-flip gate between the single-photon pulses is achieved by successively reflecting them from an optical cavity with a single trapped atom. Our proposed protocol is shown to be robust to practical noise and experimental imperfections in current cavity-QED setups.

Realization of quantum computation requires accurate coherent control of a set of qubits. A small-volume optical cavity provides a platform to achieve strong coherent interactions between atoms and photons, and has been exploited as the critical component in several schemes for implementation of quantum computation and communication [1-3]. In the prototypical cavity-based quantum computation scheme of Ref. [1], the atoms are adopted as qubits while photons mediate the interaction between them. Scaling to large-scale quantum computation via this paradigm then requires that many atoms be localized and separately addressed within a tiny optical cavity [1], or alternatively be coherently transported into and out of the cavity mode [4]. However, in spite of recent significant laboratory advances [5,6,4,7,8], these tasks remain daunting experimental challenges.

Here, we propose a scalable quantum computation scheme where qubits are encoded as polarizations of single-photon pulses. An optical cavity with a single trapped atom is employed as the critical resource to achieve controlled gate operations between photonic qubits and to act as a high-efficiency single-photon detector. The proposed computation architecture is based on the state of the art in cavity quantum electrodynamics [5], can be readily scaled up to many qubits, and could be integrated with protocols for the realization of quantum networks [2].

Quantum computation with single-photon polarizations as qubits [9,10] has the obvious advantage that the number of qubits can readily be scaled up by generating many single-photon pulses. The main obstacle to this approach is that it is exceedingly difficult to achieve quantum gate operations between single-photon pulses. The typical photon-photon coupling rate in available materials is orders of magnitude too small to allow for any meaningful gate operation at the single-quantum level. An interesting idea, put forward recently in the so-called linear optics quantum computation scheme [11], is to achieve effective nonlinear interaction between photons through feed-forward from high-efficiency single-photon detectors. Though this approach is a very important advance, a significant obstacle is that the required efficiency a of the single-photon detectors for scalable quantum computing is extremely high (e.g., for gate success with probability p ~ 0.99, a > 0.999987 [12]).

In our proposed scheme, we combine the advantage of scalability from the photonic qubits and the power of strong atom-photon coupling in a high-finesse optical resonator. Such a cavity with one or few atoms in a configuration of far-off-resonant interactions provides an effective Kerr nonlinearity for the input light [10,13,14], as was first observed in Ref. [10]. However, this nonlinear phase shift is typically too small for realization of the prototypical quantum Controlled-NOT gate (C-NOT). Compared with the approach of Ref. [10], our new protocol has the following significant advances: (i) A different interaction mechanism between photon pulses leads to a much larger effective interaction rate, sufficient for the realization of a quantum C-NOT gate with current experimental capabilities. (ii) The conditional phase flip in our scheme is very insensitive to variation of the atom-photon coupling rate, so that high-fidelity gate operations can be realized even if the atom is not localized in the Lamb-Dicke regime. (iii) The pulse shapes for pairs of interacting single photons suffer very small changes due to interactions with the atom-cavity system, which is otherwise quite difficult to achieve [14]. (iv) Finally, the noise properties of our scheme are quite favorable, and should allow significant improvement in the error threshold for large-scale, fault-tolerant quantum computation.

The basis states for our qubit consist of two orthogonal polarization states of a single-photon pulse, denoted by $|h\rangle$ and $|v\rangle$. A series of single-photon pulses is generated by emission from a single atom in a cavity [15,2]; single-qubit operations on these photonic qubits are accurately performed through polarization rotations. The critical problem for quantum computation with these qubits is to achieve a nontrivial two-qubit interaction. Here, we choose the quantum controlled phase flip (CPF), where the CPF gate for qubits j and k flips the phase of the input state if both qubits are in $|h\rangle$ polarizations, and has no effect otherwise. The CPF gates, together with simple single-qubit operations, realize universal quantum computation [16].
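As an illustration of how a CPF gate combines with single-qubit operations, the following minimal sketch builds a controlled-NOT from the CPF matrix. The basis ordering (index 0 for $|h\rangle$, index 1 for $|v\rangle$), the use of NumPy, and all names in the code are assumptions made for this example; they do not come from the paper.

```python
import numpy as np

# Assumed basis ordering for each photonic qubit: index 0 = |h>, index 1 = |v>.
I2 = np.eye(2)
PROJ_H = np.diag([1.0, 0.0])   # |h><h|
PROJ_V = np.diag([0.0, 1.0])   # |v><v|

# CPF gate: flips the sign only when both photons are h-polarized.
CPF = np.diag([-1.0, 1.0, 1.0, 1.0])

# Single-qubit operations (realized optically as polarization rotations).
HADAMARD = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2.0)
SIGN_ON_H = np.diag([-1.0, 1.0])            # sign flip on |h>
X = np.array([[0.0, 1.0], [1.0, 0.0]])      # swaps |h> and |v>


def cnot_from_cpf():
    """Compose (SIGN_ON_H x I) (I x H) CPF (I x H) on control x target."""
    had_on_target = np.kron(I2, HADAMARD)
    return np.kron(SIGN_ON_H, I2) @ had_on_target @ CPF @ had_on_target


# Reference gate: flip the target polarization when the control is |h>.
CNOT_H_CONTROLLED = np.kron(PROJ_H, X) + np.kron(PROJ_V, I2)
```

The composed matrix equals `CNOT_H_CONTROLLED`, so a CPF gate plus polarization rotations suffices for a C-NOT conditioned on the control photon being h-polarized.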

As illustrated in Fig. 1a, the CPF gate for two arbitrary pulses j and k is implemented by simply reflecting them successively from a high-Q cavity which contains a single trapped atom. The atom has three relevant levels as shown in Fig. 1b, and is initially prepared in an equal superposition of the two ground states, i.e., $|\Phi_{ai}\rangle = (|0\rangle + |1\rangle)/\sqrt{2}$. The atomic transition $|1\rangle \rightarrow |e\rangle$ is resonantly coupled to a cavity mode $a_h$, which has h polarization and is resonantly driven by the h polarization component of the input single-photon pulse. The v polarization component of the input pulse is reflected by the mirror M.



FIG. 1. (a) Schematic setup to implement the controlled phase flip (CPF) gate between two single-photon pulses j and k. With a polarization beam splitter (PBS), the h-polarized component of the single-photon pulse is reflected by the cavity, while the v-polarized component is reflected via the mirror M. The optical paths from the PBS to the cavity and to the mirror M are assumed to be equal. (b) The relevant level structure of the atom trapped in the cavity (e.g., the states $|0\rangle$ and $|1\rangle$ could denote hyperfine states of an alkali atom in the ground-state manifold while $|e\rangle$ is an excited state).


Before describing the detailed model and supporting calculations, we first summarize the basic ideas of our scheme, which consists of two critical steps. (A) By reflecting one single-photon pulse, say j, from the cavity and the mirror, a CPF gate between the atom and the pulse j is achieved as described by the unitary operator $U^{CPF}_{aj} = e^{i\pi\,|0\rangle_a\langle 0| \otimes |h\rangle_j\langle h|}$. (B) A composition of the CPF gates between the atom and the pulses j, k generates a CPF gate between the pulses j and k described by the unitary operator $U^{CPF}_{jk} = e^{i\pi\,|h\rangle_j\langle h| \otimes |h\rangle_k\langle h|}$, while restoring the atom into its initial state $|\Phi_{ai}\rangle$. Experimentally the composition is performed by successively "bouncing" the pulses from the cavity (see Fig. 1a).

Step (A) - When the incoming photon is v polarized, it will be reflected by the mirror M without any phase and shape change. When the incoming photon is in h polarization, it is resonant with the bare cavity mode if the atom is in the $|0\rangle$ state and thus acquires a phase of $e^{i\pi}$ after its reflection; however, if the atom is in the $|1\rangle$ state, the frequency of the dressed cavity mode from the resonant atom-cavity coupling is significantly detuned from the frequency of the incoming pulse. In this case, the cavity functions in the same fashion as the mirror M and the photon pulse is reflected without a phase change. A composition of the above sub-processes realizes the desired CPF gate $U^{CPF}_{aj}$ between the atom and the photon.

Step (B) - Critical to the second step of our protocol is the following operator identity:

$$U^{CPF}_{jk}\,|\Psi_{jk}\rangle \otimes |\Phi_{ai}\rangle = U^{CPF}_{aj}\,R_a(-\pi/2)\,U^{CPF}_{ak}\,R_a(\pi/2)\,U^{CPF}_{aj}\,|\Psi_{jk}\rangle \otimes |\Phi_{ai}\rangle, \qquad (1)$$

where $|\Psi_{jk}\rangle$ denotes an arbitrary state of the photonic qubits j and k, and $R_a(\theta)$ is a single-bit rotation on the atom which transforms according to $R_a(\theta)|0\rangle = \cos(\theta/2)|0\rangle + \sin(\theta/2)|1\rangle$ and $R_a(\theta)|1\rangle = -\sin(\theta/2)|0\rangle + \cos(\theta/2)|1\rangle$. The identity (1) demonstrates that the CPF gate between two arbitrary single-photon pulses j and k can be implemented by first reflecting the pulse j from the cavity as shown in Fig. 1a, then applying a $(\pi/2)$-pulse laser on the atom, then reflecting the pulse k from the cavity, then applying a $(-\pi/2)$-pulse laser on the atom, and finally reflecting the pulse j again from the cavity.
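The operator identity (1) can be checked numerically on an 8-dimensional space (atom, photon j, photon k). The sketch below is only an illustration: the tensor ordering, the index convention (0 for the atomic state $|0\rangle$ and for the photonic state $|h\rangle$), and all function and variable names are assumptions introduced here. It uses the fact that $e^{i\pi P} = 1 - 2P$ for any projector $P$.

```python
import numpy as np


def rot(theta):
    """Atomic rotation R_a(theta): |0> -> cos|0> + sin|1>, |1> -> -sin|0> + cos|1>."""
    c, s = np.cos(theta / 2.0), np.sin(theta / 2.0)
    return np.array([[c, -s], [s, c]])


def kron3(a, b, c):
    """Tensor product in the assumed order atom x photon j x photon k."""
    return np.kron(a, np.kron(b, c))


I2 = np.eye(2)
I8 = np.eye(8)
P = np.diag([1.0, 0.0])  # projector on |0> (atom) or on |h> (photon)

# exp(i*pi*P) = 1 - 2P for a projector P, so each CPF gate is a sign flip.
U_AJ = I8 - 2.0 * kron3(P, P, I2)   # atom-photon j
U_AK = I8 - 2.0 * kron3(P, I2, P)   # atom-photon k
U_JK = I8 - 2.0 * kron3(I2, P, P)   # photon j-photon k


def atom_rotation(theta):
    return kron3(rot(theta), I2, I2)


# Right-hand side of Eq. (1); the rightmost operator acts first.
SEQUENCE = (U_AJ @ atom_rotation(-np.pi / 2) @ U_AK
            @ atom_rotation(np.pi / 2) @ U_AJ)

PHI_A = np.array([1.0, 1.0]) / np.sqrt(2.0)  # initial atomic state


def both_sides(psi_jk):
    """Return (sequence applied, direct photonic CPF applied) for a two-photon state."""
    state = np.kron(PHI_A, psi_jk)
    return SEQUENCE @ state, U_JK @ state


rng = np.random.default_rng(0)
psi = rng.normal(size=4) + 1j * rng.normal(size=4)
psi /= np.linalg.norm(psi)
lhs, rhs = both_sides(psi)
```

For a random two-photon state the two vectors agree, which also confirms that the atom is returned to $|\Phi_{ai}\rangle$. The equality holds only for this atomic input state, not as an identity on the full space.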

The CPF gate $U^{CPF}_{aj}$ between the atom and the photon pulse can also be used to achieve quantum non-demolition (QND) measurement of the photon number in the pulse. For this purpose, we simply prepare the atom in the state $|\Phi_{ai}\rangle$, reflect the to-be-measured photon pulse from the cavity, apply a $R_a(\pi/2)$ rotation on the atom, and finally perform a measurement of the atomic state in the basis $\{|0\rangle, |1\rangle\}$. The measurement outcome is "0" if and only if the h component of the pulse has a photon. By the same avenue, we can also measure the parity of several photonic qubits ("parity" concerns whether a series of pulses has a total even or odd photon number in their h components) by successively reflecting them from the cavity, and can as well measure the total photon number of both h and v components of a single pulse by reflecting it twice from the cavity with a polarization flip between the two reflections. Such QND measurements have wide applications for quantum information processing [17,18]. Note that the measurement of atomic internal states can be done with near 100% efficiency through the quantum jump technique [3]. So, the efficiency of our QND measurement is in principle limited only by the inefficiency of the CPF gate between the atom and the photon pulse caused by atomic spontaneous emission loss, which, as we will see later, is significantly less than the inefficiency of conventional destructive single-photon detectors.
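The QND readout described above can be traced through with a two-level atomic state vector. This is a minimal sketch under idealized assumptions (perfect CPF gate, no spontaneous emission); the function names and the index convention (0 for $|0\rangle$) are introduced here for illustration only.

```python
import numpy as np


def rot(theta):
    """Atomic rotation R_a(theta) in the basis (|0>, |1>)."""
    c, s = np.cos(theta / 2.0), np.sin(theta / 2.0)
    return np.array([[c, -s], [s, c]])


PHI_A = np.array([1.0, 1.0]) / np.sqrt(2.0)  # prepared atomic state
SIGN_ON_0 = np.diag([-1.0, 1.0])             # effect of one h photon on the atom


def atom_outcome_probabilities(photon_in_h):
    """Probabilities of reading "0" and "1" after reflection and R_a(pi/2)."""
    atom = PHI_A.copy()
    if photon_in_h:
        # Ideal CPF gate: the |0> component acquires the phase e^{i*pi} = -1.
        atom = SIGN_ON_0 @ atom
    atom = rot(np.pi / 2.0) @ atom
    return np.abs(atom) ** 2
```

With a photon in the h component the atom ends in $|0\rangle$ (up to a sign), and without one it ends in $|1\rangle$, which is the "0 if and only if" statement in the text.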

Now we present a detailed theoretical model to demonstrate that the CPF gate $U^{CPF}_{aj}$ between the atom and the single-photon pulse j can be obtained simply by reflecting the latter from the cavity. The initial state of the pulse j can be expressed as $|\Psi_p\rangle_j = c_{hj}|h\rangle_j + c_{vj}|v\rangle_j$, where $c_{hj}$ and $c_{vj}$ are arbitrary superposition coefficients. The polarization component states $|\mu\rangle_j$ ($\mu = h, v$) have the form $|\mu\rangle_j = \int_0^T f_j(t)\, a^{in\dagger}_\mu(t)\, dt\, |vac\rangle$, where $f_j(t)$ is the normalized pulse shape as a function of time t, T is the pulse duration, $a^{in}_\mu(t)$ are one-dimensional field operators (cavity input operators) with the standard commutation relations $[a^{in}_\mu(t), a^{in\dagger}_{\mu'}(t')] = \delta_{\mu\mu'}\,\delta(t - t')$ [19], and $|vac\rangle$ denotes the vacuum of all the optical modes. The cavity mode $a_h$ is driven by the corresponding cavity input operator $a^{in}_h(t)$ through [19]

$$\dot{a}_h = -i[a_h, H] - (i\Delta + \kappa/2)\,a_h - \sqrt{\kappa}\,a^{in}_h(t), \qquad (2)$$

where $\kappa$ is the cavity (energy) decay rate and the Hamiltonian

$$H = \hbar g\left(|e\rangle\langle 1|\,a_h + |1\rangle\langle e|\,a_h^\dagger\right) \qquad (3)$$

describes the coherent interaction between the atom and the cavity mode $a_h$. The detuning $\Delta$ in Eq. (2) is meant to be 0 for our scheme, but we retain it here for subsequent pedagogical purposes. The cavity output $a^{out}_h(t)$ is connected with the input by the standard input-output relation

$$a^{out}_h(t) = a^{in}_h(t) + \sqrt{\kappa}\,a_h. \qquad (4)$$

As the v component of the pulse is reflected by the mirror M, we simply have $a^{out}_v(t) = a^{in}_v(t)$.

Equations (2)-(4) determine the evolution of the joint state of atom and photon pulse, and can be solved without further approximation through numerical simulation. However, before presenting the simulation results, we first attack this problem analytically with some rough approximations to reveal the underlying physics. If the atom is in the state $|0\rangle$, the Hamiltonian H does not play a role in Eq. (2). In this case, from Eqs. (2) and (4) we find

$$a^{out}_h(t) \approx \frac{i\Delta - \kappa/2}{i\Delta + \kappa/2}\,a^{in}_h(t), \qquad (5)$$

where the high-frequency components of the field operators $a^{in}_h(t)$ and $a^{out}_h(t)$ have been discarded, which is a valid approximation if the input pulse shape $f_j(t)$ changes slowly with time t compared with the cavity decay rate, i.e., $|\partial_t f_j(t)/f_j(t)| \ll \kappa$. Under this approximation, we have $a^{out}_h(t) \approx -a^{in}_h(t)$ for resonant interaction $\Delta = 0$, so the h component acquires the phase $\pi$ after reflection from the cavity. However, if the atom is in the state $|1\rangle$, the response function of the cavity is modified by the coupling (3), where for the case of strong coupling [20], the two dressed cavity modes have frequencies that are effectively detuned from that of the input pulse by $\Delta = \pm g$, respectively. In the case that $g \gg \kappa$, we have $a^{out}_h(t) \approx a^{in}_h(t)$ from Eq. (5), thereby confirming the preceding analysis to give the desired CPF gate $U^{CPF}_{aj}$.
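The two limits of Eq. (5) quoted above can be evaluated directly. The following minimal sketch covers only the empty-cavity reflection coefficient of Eq. (5), in units where $\kappa = 1$; the function name and the sample detuning of $100\kappa$ are illustrative assumptions, and the code does not model the atom-cavity dressed states.

```python
import numpy as np


def empty_cavity_reflection(delta, kappa):
    """Reflection coefficient of Eq. (5): a_out = r * a_in for a slowly varying pulse."""
    return (1j * delta - kappa / 2.0) / (1j * delta + kappa / 2.0)


KAPPA = 1.0  # work in units of the cavity decay rate

# Resonant case (atom in |0>, Delta = 0): phase flip of pi.
r_resonant = empty_cavity_reflection(0.0, KAPPA)

# Far-detuned case (|Delta| >> kappa): the cavity acts like a plain mirror.
r_detuned = empty_cavity_reflection(100.0 * KAPPA, KAPPA)

phase_resonant = np.angle(r_resonant)   # equals pi
phase_detuned = np.angle(r_detuned)     # small, of order kappa / Delta
```

The coefficient has unit modulus for every detuning, so in this lossless approximation the pulse is fully reflected and only its phase depends on $\Delta$.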

Armed with this understanding, we finally present exact numerical simulations for the theoretical model described by Eqs. (2)-(4). In the simulation, we discretize the continuum field operators $a^{in}_h(t)$ and $a^{out}_h(t)$, and change the dynamics into the Schrodinger picture to avoid operator ordering. The details of the simulation method can be found in Ref. [21]. Atomic spontaneous emission noise is effectively described by an imaginary part $(-i\gamma_s/2)\,(|e\rangle\langle e| - |1\rangle\langle 1|)$ in the Hamiltonian H [21], where $\gamma_s$ is the spontaneous emission rate from the state $|e\rangle$. The input pulse is taken to be Gaussian with $f_j(t) \propto \exp\left[-(t - T/2)^2/(T/5)^2\right]$, where t ranges from 0 to T.

The numerical simulations show that the CPF gate $U^{CPF}_{aj}$ works remarkably well. First of all, the conditional phase factor is either $e^{i\pi}$ or $e^{i0}$ depending on the atomic state $|0\rangle$ or $|1\rangle$, and this phase factor is very insensitive to the variation of the coupling rate g in the typical parameter region. For instance, its variation is smaller than $10^{-6}$ for g varying from $6\kappa$ to $\kappa$. This result cannot be understood naively from Eq. (5), from which one gets a phase of $e^{i0}$ only when $g \gg \kappa$. The reason for this discrepancy is that we have two dressed cavity modes with symmetric effective detunings $\Delta = \pm g$, and their joint effect makes the phase factor $e^{i0}$ very stable even if g is reduced to a value comparable with $\kappa$. The stability of the conditional phase against variations of g in the typical parameter region is an important advantage of our scheme, as g in current experiments suffers significant random variation (roughly by a factor of 2) due to residual atomic motion [5].

FIG. 2. (a) The shape functions $|f_j(t)|$ for the input pulse (solid curve) and the reflected pulse with the atom in the state $|0\rangle$ (dashed curve) and $|1\rangle$ (dotted curve), respectively. The dotted and solid curves closely match and are hardly distinguishable in the figure. (b) Fidelity F due to shape mismatch for the quantum CPF gate as a function of the input pulse duration T in units of $\kappa^{-1}$. The gate fidelity quickly approaches 1 for $\kappa T \gg 1$. (c) The probability $P_s$ of spontaneous emission loss versus the normalized cavity coupling rate $g/\kappa$, assuming $\gamma_s = \kappa$ (circles). The solid curve shows the fit by the empirical formula $P_s \approx 1/(1 + 2g^2/\kappa\gamma_s)$. Other parameters: for (a) and (b), $g = 3\kappa$, $\gamma_s = \kappa$, $\Delta = 0$; for (c), $T/5 = 24/\kappa$, $\Delta = 0$.

The simulation also shows that the output pulse basically has the same shape as the input pulse if the pulse duration $T \gg 1/\kappa$. Fig. 2a shows the output pulse shapes $|f_j(t)|$ for the cases of the atomic states $|1\rangle$ and $|0\rangle$, respectively, and demonstrates very good overlap with the input pulse shape shown in the same figure. In more quantitative terms, we consider the fidelity F of the CPF gate $U^{CPF}_{aj}$ for the input atom-photon state $|\Phi_{ai}\rangle \otimes |\Psi_p\rangle_j$ with $|\Psi_p\rangle_j = (|h\rangle + |v\rangle)/\sqrt{2}$. Reductions in F below unity are caused by shape mismatching between the input and the output pulses and can be numerically calculated. Fig. 2b shows the gate fidelity F calculated in this way for different pulse durations T. For $T = 240/\kappa$ (corresponding to a pulse width $T/5 \approx 1$ µs for the parameters of Ref. [5]), the gate fidelity is about 99.9%. The shape of the output pulse is also very insensitive to variation of the coupling rate g in the typical parameter region. For instance, the relative shape change is smaller than $10^{-4}$ for g varying from $6\kappa$ to $\kappa$.

The dominant noise in our CPF gate arises from photon loss due to atomic spontaneous emission, leading to a vacuum-state output when the input is a single-photon pulse. This noise yields a leakage error (also called an erasure error), which means that the final state is outside of the qubit Hilbert space $\{|h\rangle, |v\rangle\}$ [16]. Fig. 2c shows the probability $P_s$ of spontaneous emission loss as a function of $g/\kappa$ for the input state $|1\rangle \otimes |h\rangle$, assuming $\gamma_s = \kappa$. The curve is well simulated by the empirical formula $P_s \approx 1/(1 + 2g^2/\kappa\gamma_s)$. If the initial state of the system is $|\Phi_{ai}\rangle \otimes |\Psi_p\rangle_j$, the average probability of the leakage error per $U^{CPF}_{aj}$ gate is given by $P_e = P_s/4$. In current experiments [5], typically $(\kappa, \gamma_s)/2\pi \approx (8, 5.2)$ MHz and $g/2\pi \approx 25$ MHz, which yields $P_e \approx 0.8\%$. With these parameters, a typical pulse width is $T/5 \approx 24/\kappa \approx 0.5$ µs. As the pulses j and k are injected successively for the CPF gate $U^{CPF}_{jk}$, we need to introduce a time delay of a few µs between them. For demonstration-of-principle experiments, this time delay can be routinely achieved through simple fiber loops. To obtain longer time delays, atomic ensembles could be employed to store photon pulses for several seconds [22-24].
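The numbers quoted in this paragraph follow directly from the empirical formula and the stated experimental rates. The sketch below simply re-evaluates them; the function and variable names are assumptions for this example, and the empirical fit itself is taken from the text rather than derived.

```python
import math


def spontaneous_loss_probability(g, kappa, gamma_s):
    """Empirical fit quoted in the text: P_s ~ 1 / (1 + 2 g^2 / (kappa * gamma_s))."""
    return 1.0 / (1.0 + 2.0 * g ** 2 / (kappa * gamma_s))


# Rates quoted for current experiments, converted to angular frequencies (rad/s).
KAPPA = 2.0 * math.pi * 8.0e6
GAMMA_S = 2.0 * math.pi * 5.2e6
G = 2.0 * math.pi * 25.0e6

p_s = spontaneous_loss_probability(G, KAPPA, GAMMA_S)

# P_e = P_s / 4 as stated in the text for the equal-superposition input state.
p_e = p_s / 4.0

# Pulse width T/5 = 24 / kappa, expressed in microseconds.
pulse_width_us = 24.0 / KAPPA * 1.0e6

print(f"P_s = {p_s:.4f}, P_e = {p_e:.4f}, T/5 = {pulse_width_us:.2f} us")
```

The factor of 4 is consistent with the loss formula applying to the $|1\rangle \otimes |h\rangle$ component, which carries one quarter of the weight of the equal-superposition input state.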

Because the principal noise in our scheme is photon loss during gate operations, which is modeled as a leakage error, very efficient quantum error correcting codes can be incorporated into this computation scheme to achieve fault tolerance [16]. For instance, a rough estimate in Ref. [26] shows that through concatenated coding, quantum computation can tolerate leakage error at a percent level per gate, as compared to the error threshold of about $10^{-5}$ for general quantum errors [16]. The leakage error only affects the probability to register a photon from each pulse and has no influence on the fidelity of its polarization state if a photon is registered for each qubit (e.g., through QND or destructive measurements). So, leakage error induces a small inefficiency for each gate (at a level of a few percent), which is not debilitating for experimental quantum computing up to dozens of CPF gates even without quantum error correction.

In summary, we have shown that a cavity with a single trapped atom, conventionally used as a single-photon source, can be exploited to realize scalable, fully functional quantum computation. The proposed scheme is well based on the state of the art in cavity quantum electrodynamics, is robust to various experimental sources of noise, and offers a promising approach to the realization of large-scale fault-tolerant quantum computation.

Our recent paper reports the experimental realization of a one-atom laser in a regime of strong coupling [1]. Here we provide the supporting theoretical analysis relevant to the operating regime of our experiment. By way of a simplified four-state model, we investigate the passage from the domain of conventional laser theory into the regime of strong coupling for a single intracavity atom pumped by coherent external fields. The four-state model is also employed to exhibit the vacuum-Rabi splitting and to calculate the optical spectrum. We next extend this model to incorporate the relevant Zeeman hyperfine states as well as a simple description of the pumping processes in the presence of polarization gradients and atomic motion. This extended model is employed to make quantitative comparisons with the measurements of Ref. [1] for the intracavity photon number versus pump strength and for the photon statistics as expressed by the intensity correlation function $g^{(2)}(\tau)$.


I.  INTRODUCTION 

Although  a  number  of  theoretical  analyses  related  to  a  one-atom  laser  have  appeared  in  the  literature  [2,  3,  4,  5, 
6,  7,  8,  9,  10,  11,  12,  13,  14,  15,  16,  17],  these  prior  treatments  have  not  been  specific  to  the  parameter  range  of  our 
recent  experiment  as  reported  in  Ref.  [1].  Because  of  this  circumstance,  we  have  carried  out  theoretical  investigations 
in  support  of  our  experimental  program,  and  present  comparisons  of  these  model  calculations  with  our  measurements 
in  this  paper.  In  Section  II  we  introduce  a  simplified  four-state  model  that  captures  the  essential  features  of  the
operation  of  our  one-atom  laser  in  a  domain  of  strong  coupling  but  which  avoids  the  complexity  of  the  full  Zeeman 
substructure  of  the  hyperfine  levels  in  atomic  Cesium.  Sections  III  and  IV  then  present  in  turn  semiclassical  and 
quantum  solutions  for  this  four-state  model  system.  By  way  of  a  physically  motivated  transformation  for  which  the 
length  of  a  Fabry-Perot  cavity  is  made  progressively  shorter,  we  utilize  these  results  to  investigate  the  continuous 
passage  from  a  domain  in  which  conventional  laser  theory  is  applicable  into  a  regime  of  strong  coupling  for  which  the 
full  quantum  theory  is  required.  We  thereby  gain  some  insight  into  the  relationship  of  our  system  to  prior  theoretical 
treatments  related  to  the  definition  of  the  laser  threshold  and  to  “thresholdless”  lasing  [18,  19,  20,  21,  22].  The  four- 
state  model  is  further  employed  to  calculate  the  intracavity  photon  number  versus  pump  detuning,  thereby  exhibiting 
the  “vacuum-Rabi”  splitting  for  the  atom-cavity  system  [23,  24,  25]  and  to  compute  the  optical  spectrum  of  the 
intracavity  field. 

In  Section  V  we  describe  the  procedures  followed  to  obtain  solutions  for  an  expanded  model  that  incorporates 
the  relevant  Zeeman  substructure  for  the  Cesium  atom  (32  atomic  states),  two  modes  of  the  cavity  with  orthogonal 
polarizations,  and  a  simple  model  to  account  for  the  polarization  gradients  of  the  optical  fields.  Comparisons  of 
the  results  from  quantum  jumps  simulations  based  upon  this  expanded  model  with  our  measurements  of  the  mean 
intracavity photon number $\bar{n}$ versus normalized pump intensity x (Figure 3 of Ref. [1]) and with our experimental
determination of the intensity correlation function $g^{(2)}(\tau)$ (Figure 4 of Ref. [1]) are given in Sections V(a) and V(b),
respectively. 

Our  intent  here  is  not  to  belabor  the  comparison  of  our  experiment  with  prior  work  on  micro-masers  and  lasers, 
for  which  extensive  reviews  are  available  [26,  27,  28,  29,  30].  Instead,  our  principal  goal  is  to  establish  quantitative 
correspondence  between  our  measurements  and  fundamental  theoretical  models.  Having  thereby  validated  the  suit¬ 
ability  of  the  theoretical  treatments,  we  can  then  use  these  models  to  inform  further  experimental  investigations  of 
the  atom-cavity  system. 


II.  FOUR-STATE  MODEL 

We begin with a four-state model to describe our experiment in which a single Cesium atom is trapped inside an optical cavity as illustrated in Figure 1. Although the actual level structure of the Cesium $6S_{1/2} \leftrightarrow 6P_{3/2}$ transition is more complex due to the Zeeman substructure, this simpler model offers considerable insight into the nature of the steady states and dynamics. Following the labelling convention in Fig. 1, we introduce the following set of


FIG. 1: Illustration of a one-atom laser. (a) The atom is located in a high-Q optical cavity of decay rate $\kappa$, and is driven by the fields $\Omega_{3,4}$. (b) Inset of the atomic level scheme relevant to our experiment with the $6S_{1/2} \leftrightarrow 6P_{3/2}$ transition in atomic Cesium. The "lasing" transition is from the excited level $F' = 3'$ to the ground level $F = 4$. Pumping of the excited 3' level is by way of coherent excitation from a laser with Rabi frequency $\Omega_3$. Effective decay from the ground 4 level is provided by the combination of a second field with Rabi frequency $\Omega_4$ and spontaneous decay $4' \rightarrow 3$. Various radiative decay rates $\gamma_{ij}$ appropriate to the $D_2$ line in Cs are given in the text.


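Hamiltonians $H_i$ in a suitably defined interaction picture ($\hbar = 1$):

$$\begin{aligned}
H_1 &= g_{43}\left(a^\dagger \sigma_{g4,e3} + \sigma_{e3,g4}\, a\right), \\
H_2 &= \tfrac{1}{2}\Omega_3\left(\sigma_{g3,e3} + \sigma_{e3,g3}\right), \\
H_3 &= \tfrac{1}{2}\Omega_4\left(\sigma_{g4,e4} + \sigma_{e4,g4}\right), \\
H_4 &= \left(\Delta_{AC} + \Delta_4\right) a^\dagger a, \\
H_5 &= \Delta_3\,\sigma_{e3,e3} + \Delta_4\,\sigma_{e4,e4}, \\
H_{tot} &= H_1 + H_2 + H_3 + H_4 + H_5.
\end{aligned} \qquad (1)$$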


In a standard convention, the atomic operators are $\sigma_{ij} = |i\rangle\langle j|$ for states $(i, j)$, with the association of the $F = 3, 4$
ground and the $F' = 3', 4'$ levels with $g3, g4, e3, e4$, respectively. The Hamiltonian $H_1$ accounts for the coherent
coupling of the atomic transition $e3 \leftrightarrow g4$ to the field of a single mode of the cavity with creation and annihilation
operators $(a^\dagger, a)$. The upper state $e3$ of the lasing transition is pumped by the (coherent-state) field $\Omega_3$, while
the lower state $g4$ is depleted by the field $\Omega_4$, as described by $(H_2, H_3)$, respectively. $(H_4, H_5)$ account for various
detunings, including $\Delta_{AC}$ for the offset between the cavity resonance and the $e3 \leftrightarrow g4$ atomic transition, $\Delta_3$ for the
offset between the field $\Omega_3$ and the $g3 \leftrightarrow e3$ transition, and $\Delta_4$ for the offset between the field $\Omega_4$ and the $g4 \leftrightarrow e4$
transition.  Beyond  these  interactions,  we  also  account  for  irreversible  processes  by  assuming  that  the  atom  is  coupled 
to  a  continuum  of  modes  other  than  the  privileged  cavity  mode,  and  likewise  for  the  coupling  of  the  cavity  mode  to 
an  independent  continuum  of  external  modes. 

With these preliminaries, it is then straightforward to derive a master equation for the density operator $\rho$ for the
atom-cavity system [31, 32] in the Born-Markov approximation. For our model system, this equation is

$$
\frac{d\rho}{dt} = -i\,[H, \rho] + \sum_{i=1}^{5} L_i . \tag{2}
$$


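Here, the terms $L_i$ account for each of the various decay channels, and are given explicitly by

$$
\begin{aligned}
L_1 &= \kappa\left(2 a \rho a^\dagger - a^\dagger a \rho - \rho a^\dagger a\right), \\
L_2 &= \gamma_{33}\left(2\sigma_{g3,e3}\,\rho\,\sigma_{e3,g3} - \sigma_{e3,e3}\,\rho - \rho\,\sigma_{e3,e3}\right), \\
L_3 &= \gamma_{43}\left(2\sigma_{g4,e3}\,\rho\,\sigma_{e3,g4} - \sigma_{e3,e3}\,\rho - \rho\,\sigma_{e3,e3}\right), \\
L_4 &= \gamma_{34}\left(2\sigma_{g3,e4}\,\rho\,\sigma_{e4,g3} - \sigma_{e4,e4}\,\rho - \rho\,\sigma_{e4,e4}\right), \\
L_5 &= \gamma_{44}\left(2\sigma_{g4,e4}\,\rho\,\sigma_{e4,g4} - \sigma_{e4,e4}\,\rho - \rho\,\sigma_{e4,e4}\right),
\end{aligned} \tag{3}
$$

where the association of each term $L_i$ with the decay processes in Fig. 1 should be obvious. Spontaneous decay of
the various atomic transitions to modes other than the cavity mode proceeds at (amplitude) rate $\gamma_{ij}$ as indicated in
Fig. 1, while the cavity (field) decay rate is given by $\kappa$.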

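The master equation allows us to derive a set of equations for expectation values of atom $\langle\sigma_{ij}\rangle$ and field $\langle a\rangle$
operators. One example is for the atomic polarization on the $e3 \leftrightarrow g4$ transition, namely

$$
\begin{aligned}
\frac{d\langle\sigma_{g4,e3}\rangle}{dt} = {} & -\left[(\gamma_{33} + \gamma_{43}) + i\Delta_3\right]\langle\sigma_{g4,e3}\rangle \\
& - \tfrac{1}{2}\, i\left(\Omega_3\langle\sigma_{g4,g3}\rangle - \Omega_4\langle\sigma_{e4,e3}\rangle\right) \\
& + i g_{43}\left(\langle\sigma_{e3,e3}\, a\rangle - \langle\sigma_{g4,g4}\, a\rangle\right).
\end{aligned} \tag{4}
$$

A solution to this equation requires not only knowledge of single-operator expectation values $\langle\sigma_{ij}\rangle$ and $\langle a\rangle$, but also
of operator products such as $\langle\sigma_{e3,e3}\, a\rangle$. We can develop coupled equations for such products $\langle\sigma_{ij}\, a\rangle$ but would find
that their solution requires in turn yet higher order correlations, ultimately leading to an unbounded set of equations.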
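The coherent terms in Eq. 4 come from commutators of the polarization operator for the $e3 \leftrightarrow g4$ transition with the pump, recycling, and cavity-coupling Hamiltonians. The following is a minimal numerical check of those commutators, offered as a sketch only: the basis ordering and the helper names `sigma` and `comm` are assumptions made for illustration and are not part of the original analysis.

```python
import numpy as np

# Assumed ordering of the four atomic states (an illustrative choice).
STATES = {"g3": 0, "g4": 1, "e3": 2, "e4": 3}

def sigma(i, j):
    """Return the 4x4 matrix of the atomic operator sigma_ij = |i><j|."""
    m = np.zeros((4, 4))
    m[STATES[i], STATES[j]] = 1.0
    return m

def comm(a, b):
    """Commutator [a, b] = ab - ba."""
    return a @ b - b @ a

s = sigma("g4", "e3")  # polarization operator on the lasing transition

# Pump field on g3 <-> e3: only sigma_{e3,g3} fails to commute with s.
pump = comm(s, sigma("g3", "e3") + sigma("e3", "g3"))
# Recycling field on g4 <-> e4: only sigma_{e4,g4} contributes.
recycle = comm(s, sigma("g4", "e4") + sigma("e4", "g4"))
# Atomic part of the cavity coupling: a population difference appears.
coupling = comm(s, sigma("e3", "g4"))

print(np.array_equal(pump, sigma("g4", "g3")))
print(np.array_equal(recycle, -sigma("e4", "e3")))
print(np.array_equal(coupling, sigma("g4", "g4") - sigma("e3", "e3")))
```

The three commutators reproduce the operators that multiply $\Omega_3$, $\Omega_4$, and $g_{43}$ in the equation of motion for the polarization.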

Conventional  theories  of  the  laser  proceed  beyond  this  impasse  by  one  of  several  ultimately  equivalent  avenues. 
Within  the  setting  of  our  current  approach,  a  standard  way  forward  is  to  factorize  operator  products  in  the  fashion 

$$\langle\sigma_{ij}\, a\rangle = \langle\sigma_{ij}\rangle\langle a\rangle + \left(\langle\sigma_{ij}\, a\rangle - \langle\sigma_{ij}\rangle\langle a\rangle\right) \tag{5}$$

with the additional terms of the form $(\langle\sigma_{ij}\, a\rangle - \langle\sigma_{ij}\rangle\langle a\rangle)$ then treated as Langevin noise. Such approaches rely
on system-size expansions in terms of the small parameters $(1/n_0, 1/N_0)$, where $(n_0, N_0)$ are the critical photon
and atom number introduced in Ref. [1] for our one-atom laser. Within the context of conventional laser theory,
these parameters are described more fully in Refs. [31, 32], while their significance in cavity QED is discussed more
extensively in Ref. [33]. In qualitative terms, conventional theories of the laser in regimes for which $(n_0, N_0) \gg 1$
result in dynamics described by evolution of mean values $\langle\sigma_{ij}\rangle$ and $\langle a\rangle$ (that are of order unity when suitably scaled),
with then small amounts of quantum noise (that arise from higher order correlations of order $(1/n_0, 1/N_0) \ll 1$).

In the following section, we discuss the so-called semiclassical solutions obtained from the factorization $\langle\sigma_{ij}\, a\rangle =
\langle\sigma_{ij}\rangle\langle a\rangle$, neglecting quantum noise. In Section IV, we then describe the full quantum solution obtained directly from
the  master  equation. 

III.  SEMICLASSICAL  THEORY  FOR  A  FOUR-STATE  ATOM 

We will not present the full set of semiclassical equations here since they are derived in a standard fashion from
the master equation Eq. 2 [32, 34]. One example is for the atomic polarization $\langle\sigma_{g4,e3}\rangle$ on the $e3 \leftrightarrow g4$ transition, for
which Eq. 4 becomes

$$
\begin{aligned}
\frac{d\langle\sigma_{g4,e3}\rangle}{dt} = {} & -\left[(\gamma_{33} + \gamma_{43}) + i\Delta_3\right]\langle\sigma_{g4,e3}\rangle \\
& - \tfrac{1}{2}\, i\left(\Omega_3\langle\sigma_{g4,g3}\rangle - \Omega_4\langle\sigma_{e4,e3}\rangle\right) \\
& + i g_{43}\left(\langle\sigma_{e3,e3}\rangle - \langle\sigma_{g4,g4}\rangle\right)\alpha ,
\end{aligned} \tag{6}
$$

where $\alpha = \langle a\rangle$. There is a set of 18 such equations for the real and imaginary components of the various field and
atomic operators, together with the constraint that the sum of populations over the four atomic states be unity. We
obtain the steady state solutions to these equations, where for the present purposes, we restrict attention to the case of
zero detunings $\Delta_{AC} = \Delta_3 = \Delta_4 = 0$. Allowing for nonzero detunings of atom and cavity would add to the complexity
of the semiclassical analysis because of the requirement for the self-consistent solution for the frequency of emission
[see, for example, Ref. [35] for the case of a (multi-atom) Raman laser].
FIG. 2: Results from the semiclassical theory as applied to the atom-cavity system in Fig. 1. (a,b) Intracavity intensity $|\alpha|^2$ in
units of the critical photon number $n_0$ is plotted as a function of the pump intensity $I_3 = (\Omega_3/2\gamma)^2$. A threshold for $|\alpha|^2$ is evident
for $I_3 \simeq 0.8$. (c,d) Populations $\sigma_{ii} = \langle\sigma_{ii}\rangle$ versus $I_3$. In (c), population inversion $\sigma_{e3,e3} > \sigma_{g4,g4}$ occurs over a wide range as the
pump intensity $I_3$ is increased from 0, including in the threshold region $I_3 \simeq 0.8$, with then “population clamping” for $\sigma_{e3,e3}$
as $I_3$ increases beyond threshold. In all cases, the recycling intensity $I_4 = (\Omega_4/2\gamma)^2 = 3$ and the detunings $\Delta_{AC} = \Delta_3 = \Delta_4 = 0$.


The semiclassical solutions are obtained for the parameters relevant to our experiment with atomic Cs, namely

$$(\gamma_{33}, \gamma_{43}, \gamma_{44}, \gamma_{34}) = \left(\tfrac{3}{4}, \tfrac{1}{4}, \tfrac{7}{12}, \tfrac{5}{12}\right)\gamma , \tag{7}$$

where these rates are appropriate to the (amplitude) decay of the levels $6P_{3/2}, F' = 3', 4' \to 6S_{1/2}, F = 3, 4$ with
$\gamma = 2\pi \times 2.6$ MHz (i.e., a radiative lifetime $\tau = 1/2\gamma = 30.6$ ns). The cavity (field) decay rate $\kappa$ is measured to be
$\kappa = 2\pi \times 4.2$ MHz. The rate of coherent coupling $g_{43}$ for the $e3 \leftrightarrow g4$ transition (i.e., $6P_{3/2}, F' = 3' \leftrightarrow 6S_{1/2}, F = 4$) is
calculated from the known cavity geometry (waist and length) and the decay rate $\gamma$, and is found to be $g_{43} = 2\pi \times 16$
MHz based upon the effective dipole moment of the transition.
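As a quick arithmetic check on these parameters, the sketch below recomputes the radiative lifetime and the critical photon and atom numbers from the quoted rates. The definitions $n_0 = \gamma^2/2g^2$ and $N_0 = 2\kappa\gamma/g^2$ used here are the customary cavity QED forms and are an assumption of this sketch (the text defers their definition to Ref. [1]); the function names are hypothetical.

```python
import math

# Rates quoted in the text, expressed as rate/(2*pi) in MHz.
GAMMA = 2.6   # atomic (amplitude) decay rate
KAPPA = 4.2   # cavity (field) decay rate
G43 = 16.0    # coherent coupling rate on the e3 <-> g4 transition

# Branching fractions (gamma_33, gamma_43, gamma_44, gamma_34) / gamma from Eq. 7.
BRANCHING = {"33": 3 / 4, "43": 1 / 4, "44": 7 / 12, "34": 5 / 12}

def lifetime_ns(gamma_mhz):
    """Radiative lifetime tau = 1/(2*gamma) in ns, for gamma/(2*pi) given in MHz."""
    gamma_rad_per_s = 2 * math.pi * gamma_mhz * 1e6
    return 1e9 / (2 * gamma_rad_per_s)

def critical_photon_number(g, gamma):
    """Assumed definition n0 = gamma^2 / (2 g^2)."""
    return gamma**2 / (2 * g**2)

def critical_atom_number(g, kappa, gamma):
    """Assumed definition N0 = 2 * kappa * gamma / g^2."""
    return 2 * kappa * gamma / g**2

tau = lifetime_ns(GAMMA)
n0 = critical_photon_number(G43, GAMMA)
N0 = critical_atom_number(G43, KAPPA, GAMMA)
C1 = 1 / N0
print(f"tau = {tau:.1f} ns, n0 = {n0:.4f}, N0 = {N0:.4f}, C1 = {C1:.1f}")
```

With these assumed definitions the numbers land close to the values quoted in the text: a lifetime near 30.6 ns, a critical photon number near 0.013, and a single-atom cooperativity near 12. The branching fractions out of each excited level sum to unity.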

Examples of the resulting steady-state solutions for the intracavity intensity $|\alpha|^2$ together with the populations
$\sigma_{ii}$ of the four atomic states are displayed in Figure 2. Parts (a) and (c) of the figure illustrate the behavior of $|\alpha|^2$
and $\sigma_{ii}$ around the semiclassical threshold as functions of the pump intensity $I_3$. Parts (b) and (d) explore these
dependencies over a wider range in $I_3$. For fixed ratios among the various decay rates as in Eq. 7, the semiclassical
solutions for $|\alpha|^2/n_0$ as well as the various populations $\sigma_{ii}$ plotted in Fig. 2 depend only on the critical atom number
$N_0$ (or equivalently, the cooperativity parameter $C_1 = 1/N_0$ for a single atom in the cavity). Hence, as emphasized in
the Supplementary Information published with our paper Ref. [1], these steady state solutions from the semiclassical
theory are independent of the cavity length $l$, and provide a point of reference for understanding “lasing” for a single
atom in a cavity. This is because $N_0 = 2\kappa\gamma/g^2$ is independent of cavity length $l$ for a cavity with constant mirror
reflectivity and cavity waist $w_0$.

Importantly, the semiclassical theory predicts threshold behavior for parameters relevant to our experiment, including
inversion $\sigma_{e3,e3} > \sigma_{g4,g4}$ in the threshold region, although this is not essential for Raman gain for $g3 \to g4$
via $e3$. One atom in a cavity can exhibit such a “laser” transition for the steady state solutions in the semiclassical
theory because the cooperativity parameter $C_1 \gg 1$. Indeed, in these calculations we used our experimental value for
the cooperativity parameter $C_1 = 1/N_0 \simeq 12$. Among other relevant features illustrated in Fig. 2 is the quenching
of the laser emission around $I_3 \sim 6.5$, presumably due to an Autler-Townes splitting of the excited state $e3$ at high
pump intensity [8].
A. Relationship to a Raman laser

In many respects our system is quite similar to a three-level Raman scheme, for which there is an extended literature
(e.g., Ref. [35] and references therein). In fact we have carried out an extensive analysis of a Raman scheme analogous
to our system in Fig. 1. Pumping is still done by the field $\Omega_3$ on the $3 \to 3'$ transition. However, recycling $4 \to 4' \to 3$
by the field $\Omega_4$ and decay $\gamma_{34}$ is replaced by direct decay $4 \to 3$ at a fictitious incoherent rate of decay, with level
4' absent. In all essential details, the results from this analysis are in correspondence with those presented from our
four-level analysis in this section. In particular, the threshold onset occurs in precisely the same fashion as in Fig. 2(a), and
the output is “extinguished” at high pump levels for $\Omega_3$. This turn-off appears to be associated with an AC-Stark
splitting of the excited 3' level by the $\Omega_3$ field, which drives the $3' \to 4$ transition out of resonance with the cavity.
Over the range of intensities explored in this section, the “quenching” behavior seems
to be unrelated to any coherence effect associated with the combination of the field $\Omega_4$ and decay $\gamma_{34}$.


IV.  QUANTUM  THEORY  FOR  A  FOUR-STATE  ATOM 

A  one-atom  laser  operated  in  a  regime  of  strong  coupling  has  characteristics  that  are  profoundly  altered  from  the 
familiar  case  (described  e.g.  in  Refs.  [31,  32]),  for  which  the  semiclassical  equations  are  supplemented  with  (small) 
quantum noise terms. The question then arises as to how to recognize a laser in this new regime of strong coupling,
where  we  recall  the  difficulty  that  this  issue  engenders  even  for  systems  with  critical  photon  number  much  greater 
than  unity  [19,  20,  21,  22].  The  perspective  that  we  adopt  here  is  to  investigate  the  continuous  transformation  of  a 
one-atom  laser  from  a  domain  of  weak  coupling  for  which  the  conventional  theory  should  be  approximately  valid  into 
a  regime  of  strong  coupling  for  which  the  full  quantum  theory  is  required. 

Towards this end, we consider a scenario in which the cavity length (and hence its volume) is gradually reduced
from a “large” value for which the conventional theory is valid to a “small” value for which the system is well into
a regime of strong coupling. As illustrated in Figure 3, this transformation is assumed to be under conditions of
constant cavity waist $w_0$ and mirror reflectivity $R$, in which case scaling the length by a factor $f$ causes the other
parameters to scale as follows:

$$
\begin{aligned}
l &\to l_f = f\, l, \\
g &\to g_f = g / f^{1/2}, \\
\kappa &\to \kappa_f = \kappa / f, \\
\gamma &\to \gamma, \\
N_0 &\to N_0, \\
n_0 &\to n_{0f} = f\, n_0 .
\end{aligned} \tag{8}
$$

Recall that in the semiclassical theory illustrated in Fig. 2, the quantity $|\alpha|^2/n_{0f}$ is invariant under this transformation.
By contrast, the role of single photons becomes increasingly important as the cavity length is reduced (i.e., $n_{0f}$ becomes
ever smaller), so that deviations from the familiar semiclassical characteristics should become more important, and
eventually dominant.
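The scaling transformation of Eq. 8 can be tabulated numerically for the cavity lengths considered in this section. The sketch below is illustrative only: the function name `scale_parameters` is hypothetical, rates are in units of $2\pi \times$ MHz, and the expressions $n_0 = \gamma^2/2g^2$ and $N_0 = 2\kappa\gamma/g^2$ are assumed standard definitions rather than formulas stated in the text.

```python
import math

def scale_parameters(f, g=16.0, kappa=4.2, gamma=2.6):
    """Apply the length scaling l -> f*l of Eq. 8 to the rates of the actual cavity."""
    g_f = g / math.sqrt(f)   # coupling grows as the mode volume shrinks
    kappa_f = kappa / f      # fixed mirror reflectivity: a shorter cavity decays faster
    return {
        "g": g_f,
        "kappa": kappa_f,
        "gamma": gamma,                        # atomic decay is unaffected
        "n0": gamma**2 / (2 * g_f**2),         # assumed n0 = gamma^2/(2 g^2); scales as f
        "N0": 2 * kappa_f * gamma / g_f**2,    # assumed N0 = 2 kappa gamma/g^2; invariant
    }

table = {f: scale_parameters(f) for f in (2500, 100, 1, 1 / 99)}
for f, p in table.items():
    print(f"f = {f:10.4f}: n0f = {p['n0']:.3g}, N0 = {p['N0']:.4f}, "
          f"g/gamma = {p['g'] / p['gamma']:.2f}, g/kappa = {p['g'] / p['kappa']:.2f}")
```

The critical atom number comes out the same at every length while the critical photon number scales linearly with $f$, which is the distinction the text draws between the invariant semiclassical solution and the growing importance of single photons.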


FIG. 3: Illustration of the scaling transformation considered in Eq. 8, whereby the length of a spherical mirror Fabry-Perot
cavity is transformed $l \to f l$ while the cavity waist $w_0$ and the atomic position are held constant. The atom is indicated by the
“dot” in the center of the cavity mode.
FIG. 4: The mean intracavity photon number $\bar{n}/n_{0f}$ (blue) and normalized intensity correlation function $g^{(2)}(0)$ (red) are
plotted as functions of pump intensity $I_3 = (\Omega_3/2\gamma)^2$ in (a)-(d). In (a)-(c), the cavity length is made progressively shorter
($2500\, l_0$, $100\, l_0$, $l_0$), where $l_0 = 42.2$ μm is the length of our actual cavity. The corresponding saturation photon numbers are
$n_{0f} = (33.0, 1.32, 0.013)$. $\bar{n}/n_{0f}$ and $g^{(2)}(0)$ are calculated from the quantum theory for the four-state system in Fig. 1, while
$|\alpha|^2/n_{0f}$ given by the black curve is from the semiclassical theory. (d) $\bar{n}$ (blue), $g^{(2)}(0)$ (red), and the Mandel $Q$ parameter
(green) shown over an extended range of pump intensity $I_3$ for $l = l_0$. In all cases, $I_4 = (\Omega_4/2\gamma)^2 = 2$, the $3 \to 3'$ and $4 \to 4'$
transitions are driven on resonance, and the cavity detuning $\omega_{CA} = 0$. Other parameters are as given in the text.


A.  Field  and  atom  variables  for  various  cavity  lengths 

Framed  by  this  perspective,  we  now  present  results  from  the  quantum  treatment  for  a  four-state  model  for  the 
atom.  Our  approach  is  to  obtain  steady  state  results  for  various  operator  expectation  values  directly  from  numerical 
solutions  of  the  master  equation  given  in  Eq.  2  by  way  of  the  Quantum  Optics  Toolbox  written  by  S.  Tan  [36].  Since 
such  numerical  methods  are  by  now  familiar  tools,  we  turn  directly  to  results  from  this  investigation  presented  in 
Figs.  4-9. 
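For readers who want to reproduce the flavor of such a calculation without the Quantum Optics Toolbox, the following is a minimal NumPy sketch of a steady-state solution of the master equation at zero detunings. It is not the authors' code. The basis ordering, the Fock-space truncation `n_fock`, the function name `steady_state`, and the row-replacement trick used to impose unit trace are all assumptions of this sketch; rates are in units of $2\pi \times$ MHz and the pump intensities follow the convention $I = (\Omega/2\gamma)^2$ used in the figure captions.

```python
import numpy as np

def steady_state(I3=3.0, I4=3.0, g=16.0, kappa=4.2, gamma=2.6, n_fock=8):
    """Steady state of the four-state-atom plus cavity master equation (zero detunings)."""
    idx = {"g3": 0, "g4": 1, "e3": 2, "e4": 3}   # assumed basis ordering
    eye_f = np.eye(n_fock)

    def sig(i, j):
        m = np.zeros((4, 4), dtype=complex)
        m[idx[i], idx[j]] = 1.0
        return np.kron(m, eye_f)                 # atom (x) field ordering

    a = np.kron(np.eye(4), np.diag(np.sqrt(np.arange(1, n_fock)), 1)).astype(complex)
    ad = a.conj().T
    om3 = 2 * gamma * np.sqrt(I3)                # Rabi frequencies from I = (Omega/2gamma)^2
    om4 = 2 * gamma * np.sqrt(I4)

    H = (g * (ad @ sig("g4", "e3") + sig("e3", "g4") @ a)
         + 0.5 * om3 * (sig("g3", "e3") + sig("e3", "g3"))
         + 0.5 * om4 * (sig("g4", "e4") + sig("e4", "g4")))
    # (rate, operator c) pairs entering as rate * (2 c rho c^+ - c^+ c rho - rho c^+ c).
    channels = [(kappa, a),
                (3 / 4 * gamma, sig("g3", "e3")), (1 / 4 * gamma, sig("g4", "e3")),
                (5 / 12 * gamma, sig("g3", "e4")), (7 / 12 * gamma, sig("g4", "e4"))]

    d = 4 * n_fock
    one = np.eye(d)
    # Row-major vectorisation: vec(A rho B) = kron(A, B.T) @ vec(rho).
    liou = -1j * (np.kron(H, one) - np.kron(one, H.T))
    for rate, c in channels:
        cdc = c.conj().T @ c
        liou += rate * (2 * np.kron(c, c.conj()) - np.kron(cdc, one) - np.kron(one, cdc.T))

    # Replace one redundant row by the normalisation condition Tr(rho) = 1.
    liou[0, :] = one.reshape(-1)
    rhs = np.zeros(d * d, dtype=complex)
    rhs[0] = 1.0
    rho = np.linalg.solve(liou, rhs).reshape(d, d)

    nbar = np.trace(rho @ ad @ a).real
    g2 = np.trace(rho @ ad @ ad @ a @ a).real / nbar**2
    pops = {k: np.trace(rho @ sig(k, k)).real for k in idx}
    return rho, nbar, g2, pops

rho, nbar, g2, pops = steady_state()
print(f"nbar = {nbar:.4f}, g2(0) = {g2:.3f}, populations = {pops}")
```

The truncation `n_fock=8` is chosen only because the photon numbers of interest for the actual cavity are small; longer cavities with large saturation photon numbers would need a much larger Fock space than a dense solve of this kind can handle comfortably.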

These figures display the behavior of various characteristics of the atom-cavity system as the cavity length is reduced
from $l = 2500\, l_0$ to $l = 100\, l_0$ to $l = l_0$ to $l = l_0/99$, where $l_0 = 42.2$ μm is the actual length of our cavity. Figure
4 provides an overview of the evolution and is reproduced from the Supplementary Information in Ref. [1], while
Figures 5-9 provide more detailed information about the intracavity field and atomic populations.

Figure 4(a-c) and part (a) in Figs. 5, 6, and 7 display the mean intracavity photon number $\bar{n}/n_{0f}$ (where $n_{0f}$
is calculated for the particular length), and compare this result to $|\alpha|^2/n_{0f}$ from the semiclassical theory. The
correspondence is close in Figs. 4(a) and 5(a) since $n_{0f} = 33$ in this case, but becomes increasingly divergent in
Figs. 4(b) and 6(a) for which $n_{0f} = 1.3$, and in Figs. 4(c) and 7(a) for which $n_{0f}(f = 1) = n_0 = 0.013$ (as in our
experiment).

In qualitative terms, the peak in each of the curves for $\bar{n}/n_{0f}$ in Figs. 5, 6, and 7 arises because of a “bottleneck”
in the cycle $g3 \to e3 \to g4 \to e4 \to g3$. For our scheme with one atom in a cavity, this cycle can proceed at a rate
no faster than that set by the decay rate $\gamma_{34}$. For higher pump intensities $I_3$, the quenching of the emission displayed
by the semiclassical theory becomes less and less evident with decreasing $l$ as the coherent coupling rate $g$ becomes
larger in a regime of strong coupling.

Part (b) in Figs. 5, 6, and 7 shows the populations $\sigma_{ii}$ of the four states. A noteworthy trend here is the rapid
FIG. 5: Steady state solutions as functions of pump intensity $I_3$ obtained from the numerical solution of the master equation
Eq. 2 for the four-state atom in a cavity illustrated in Fig. 1. Here, the cavity length $l = 2500\, l_0$, where $l_0 = 42.2$ μm is the cavity
length in our experiment. (a) Mean intracavity photon number $\bar{n}$ normalized to the saturation photon number $n_{0f} = 33$ (in
blue). The corresponding result for $|\alpha|^2/n_{0f}$ from the semiclassical theory is given by the black curve. (b) Populations $\sigma_{ii}$ of the
four states as labelled. (c) Mean intracavity photon number $\bar{n}$ (blue), Mandel $Q$ parameter (green), and intensity correlation
function $g^{(2)}(0)$ (red). (d) Ratio $R$ of photon flux from the cavity mode $\kappa_f \bar{n}$ as compared to the rate of atomic fluorescence
$\gamma_{43}\sigma_{e3,e3}$ for the excited state $e3$. In all cases, the depleting intensity $I_4 = (\Omega_4/2\gamma)^2 = 3$ and the detunings $\Delta_{AC} = \Delta_3 = \Delta_4 = 0$.
Field and atom decay rates are as specified in the text.


reduction of the population $\sigma_{e3,e3}$ with decreasing cavity length. Again, the rate $g$ becomes larger as $l$ is reduced, and
eventually overwhelms all other rates, so that population promoted to this state is suppressed.

Figure 4 and part (c) in Figs. 5, 6, and 7 address the question of the photon statistics by plotting the Mandel $Q$
parameter (or equivalently the Fano factor $F = Q + 1$) as well as the normalized second-order intensity correlation
function $g^{(2)}(0)$ [37]. As shown in Fig. 4(a), for large $l = 2500\, l_0$, the region around the semiclassical threshold
displays the familiar behavior associated with a conventional laser [32, 34, 37, 38, 39], namely that $g^{(2)}(0)$ evolves
smoothly from $g^{(2)}(0) \approx 2$ below the semiclassical threshold to $g^{(2)}(0) \approx 1$ above this threshold. Furthermore, Fig.
5(c) shows that the Mandel $Q$ parameter has a maximum in the region of the threshold [19]. Beyond this conventional
(first) threshold, the Mandel $Q$ parameter in Fig. 5(c) also exhibits a second maximum, which has been described as
a “second” threshold for one-atom lasers [8], and $g^{(2)}(0)$ rises back from 1 to 2. With decreasing cavity length, these
features are lost as we move into a regime of strong coupling. For example, the two peaks in $Q$ merge into one broad
minimum with $Q < 0$, indicating the onset of manifestly quantum or nonclassical character for the emission from the
atom-cavity system.
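To make the photon-statistics measures concrete, the sketch below evaluates the mean photon number, the Mandel $Q$ parameter, and $g^{(2)}(0)$ for three textbook photon-number distributions. The function name `photon_statistics` and the example distributions are illustrative assumptions, not quantities taken from the calculations reported here.

```python
import math
import numpy as np

def photon_statistics(p):
    """Return (nbar, Q, g2) for a photon-number distribution p[n], n = 0, 1, 2, ..."""
    p = np.asarray(p, dtype=float)
    p = p / p.sum()                           # normalise (guards against truncation)
    n = np.arange(len(p))
    nbar = np.sum(n * p)
    var = np.sum(n**2 * p) - nbar**2
    q = var / nbar - 1.0                      # Mandel Q = Fano factor - 1
    g2 = np.sum(n * (n - 1) * p) / nbar**2    # <a^+ a^+ a a> / <a^+ a>^2
    return nbar, q, g2

# Coherent (Poissonian) field with mean 2: expect Q = 0 and g2 = 1.
poisson = [math.exp(-2.0) * 2.0**k / math.factorial(k) for k in range(60)]
# Thermal (Bose-Einstein) field with mean 2: expect Q = nbar and g2 = 2.
thermal = [(1 / 3) * (2 / 3) ** k for k in range(60)]
# Field containing only 0 or 1 photon: expect g2 = 0 and Q = -nbar.
single = [0.9, 0.1]

for name, dist in (("poisson", poisson), ("thermal", thermal), ("single", single)):
    nbar, q, g2 = photon_statistics(dist)
    print(f"{name:8s} nbar = {nbar:.3f}  Q = {q:+.3f}  g2(0) = {g2:.3f}")
```

The three cases bracket the behavior discussed in the text: thermal-like statistics below a conventional threshold, Poissonian statistics above it, and $Q < 0$ with $g^{(2)}(0) < 1$ for manifestly nonclassical light. In every case $Q = \bar{n}\,[g^{(2)}(0) - 1]$.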

Finally, part (d) in Figs. 5, 6, and 7 presents results for the ratio $R$, where

$$R = \frac{\kappa_f\, \bar{n}}{\gamma_{43}\, \sigma_{e3,e3}} \tag{9}$$

gives the ratio of photon flux $\kappa_f \bar{n}$ from the cavity mode to the photon flux $\gamma_{43}\sigma_{e3,e3}$ appearing as fluorescence into
modes other than the cavity mode from the spontaneous decay $e3 \to g4$. For a conventional laser, $\kappa_f \bar{n} \ll \gamma_{43}\sigma_{e3,e3}$
below threshold, and $\kappa_f \bar{n} \gg \gamma_{43}\sigma_{e3,e3}$ above threshold, with the laser threshold serving as the abrupt transition
between these cases in the manner of a nonequilibrium phase transition [34, 39]. As illustrated in Fig. 7, no such
transition is required in the regime of strong coupling; $R \gg 1$ from the onset as the pump $I_3$ is increased. This
behavior is analogous to the “thresholdless” lasing discussed in Refs. [18, 20, 21, 22] and reviewed by Rice and
Carmichael [19].

For the system illustrated in Figure 3, the progression in length reduction has a limit at $l = \lambda_0/2$ corresponding
FIG. 6: Steady state solutions as functions of pump intensity $I_3$ obtained from the numerical solution of the master equation
Eq. 2 for the four-state atom in a cavity illustrated in Fig. 1. Here, the cavity length $l = 100\, l_0$, where $l_0 = 42.2$ μm is the cavity
length in our experiment. (a) Mean intracavity photon number $\bar{n}$ normalized to the saturation photon number $n_{0f} = 1.3$ (in
blue). The corresponding result for $|\alpha|^2/n_{0f}$ from the semiclassical theory is given by the black curve. (b) Populations $\sigma_{ii}$ of the
four states as labelled. (c) Mean intracavity photon number $\bar{n}$ (blue), Mandel $Q$ parameter (green), and intensity correlation
function $g^{(2)}(0)$ (red). (d) Ratio $R$ of photon flux from the cavity mode $\kappa_f \bar{n}$ as compared to the rate of atomic fluorescence
$\gamma_{43}\sigma_{e3,e3}$ for the excited state $e3$. In all cases, the depleting intensity $I_4 = (\Omega_4/2\gamma)^2 = 3$ and the detunings $\Delta_{AC} = \Delta_3 = \Delta_4 = 0$.
Field and atom decay rates are as specified in the text.
to a Fabry-Perot cavity with length equal to the lowest order longitudinal mode $\lambda_0/2$, where $\lambda_0 = 852.3$ nm is the
wavelength of the cavity QED transition. To reach this limit from the length $l_0$ appropriate to our actual cavity, we
must scale $l_0 \to f l_0$ with $f = 1/99$. In a continuation of the sequence shown in Figs. 5, 6, and 7, we display in Fig. 8
results for such a cavity with $l = \lambda_0/2$. Note that although $C_1 = 1/N_0 \simeq 12$ is invariant with respect to this scaling and
the saturation photon number is reduced to $n_{0f} = 1.31 \times 10^{-4}$, nevertheless the atom-cavity system has passed out of
the domain of strong coupling, even though $(n_{0f}, N_0) \ll 1$. This is because strong coupling requires that $g_0 \gg (\gamma, \kappa)$,
so that $(n_0, N_0) \ll 1$ is a necessary but not sufficient condition for achieving strong coupling. For the progression
that we are considering with diminishing length (but otherwise with the parameters of our system), $l = \lambda_0/2$ does
not lie within the regime of strong coupling ($g_{43}/\gamma = 61$, $g_{43}/\kappa = 0.40$), but rather more toward the domain of a
“one-dimensional atom”, for which $\kappa \gg g^2/\kappa \gg \gamma$ (see, for example, Refs. [40, 41] for theoretical discussions and a
previous experimental investigation). In this domain of the Purcell effect [26, 28, 29, 30], the fractional emission into
the cavity mode as compared to fluorescent emission into free space for the $3' \to 4$ transition is characterized by the
parameter

$$\beta_{43} = \frac{2 C_1^{(43)}}{1 + 2 C_1^{(43)}} \simeq 0.99, \tag{10}$$

where $C_1^{(43)} = C_1 \times (\gamma/\gamma_{43}) \simeq 48$.
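The value quoted in Eq. 10 follows directly from the cooperativity and the branching fraction of Eq. 7. The short sketch below works through the arithmetic; the function name `emission_fraction` and the variable name `beta` are labels chosen here for illustration.

```python
def emission_fraction(c1, gamma_ratio):
    """Eq. 10: fraction 2C / (1 + 2C), with C = c1 * gamma_ratio."""
    c = c1 * gamma_ratio
    return 2 * c / (1 + 2 * c)

C1 = 12.0               # single-atom cooperativity quoted in the text
GAMMA_OVER_G43 = 4.0    # gamma / gamma_43, since gamma_43 = gamma / 4 in Eq. 7

beta = emission_fraction(C1, GAMMA_OVER_G43)
ratio = beta / (1 - beta)   # algebraically equal to 2C
print(f"fraction into cavity = {beta:.3f}, cavity/fluorescence ratio = {ratio:.0f}")
```

With $C_1^{(43)} \simeq 48$ the fraction is $96/97$, and the corresponding ratio of cavity emission to fluorescence is $2 C_1^{(43)} = 96$, which sets the scale for the ratio $R$ in this limit.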

As compared to Figs. 5, 6, and 7, a noteworthy feature of the regime depicted in Fig. 8 is the absence of a
dependence of $g^{(2)}(0)$ on the pump level $I_3$. In fact, $g^{(2)}(0) \simeq 0$ over the entire range shown, so that the cavity field is
effectively occupied only by photon numbers 0 and 1. In correspondence to this situation, the Mandel $Q$ parameter in
Fig. 8(c) is essentially given by the mean of the intracavity photon number, $Q \simeq -\bar{n}$, with $\bar{n} \ll 1$. Furthermore, the
dominance of emission into the cavity mode over fluorescence decay becomes even more pronounced than in Fig. 7(d),
as documented by the ratio $R$ in Fig. 8(d). In agreement with expectation set by Eq. 10, note that $R \simeq \beta_{43}/(1 - \beta_{43})$.
All in all, the “bad-cavity” limit specified by $\kappa \gg g^2/\kappa \gg \gamma$ [40, 41] (toward which Fig. 8 is pressing) is a domain of




FIG. 7: Steady state solutions as functions of pump intensity $I_3$ obtained from the numerical solution of the master equation
Eq. 2 for the four-state atom in a cavity illustrated in Fig. 1. Here, the cavity length $l = l_0$, where $l_0 = 42.2$ μm is the cavity
length in our experiment. (a) Mean intracavity photon number $\bar{n}$ normalized to the saturation photon number $n_0 = 0.013$ (in
blue). The corresponding result for $|\alpha|^2/n_0$ from the semiclassical theory is given by the black curve. (b) Populations $\sigma_{ii}$ of the
four states as labelled. (c) Mean intracavity photon number $\bar{n}$ (blue), Mandel $Q$ parameter (green), and intensity correlation
function $g^{(2)}(0)$ (red). (d) Ratio $R$ of photon flux from the cavity mode $\kappa \bar{n}$ as compared to the rate of atomic fluorescence
$\gamma_{43}\sigma_{e3,e3}$ for the excited state $e3$. In all cases, the depleting intensity $I_4 = (\Omega_4/2\gamma)^2 = 3$ and the detunings $\Delta_{AC} = \Delta_3 = \Delta_4 = 0$.
Field and atom decay rates are as specified in the text.


single-photon generation for the atom-cavity system, which for $f \ll 1$ has passed out of the regime of strong coupling.

Figures 5, 6, 7, and 8 provide a step-by-step description of the evolution of the atom-cavity system from the domain
of conventional laser theory ($l \gg l_0$ as in Fig. 5 with $f = 2500$), into the regime of strong coupling ($l = l_0$ as in Fig.
7 with $f = 1$), and then out of the strong-coupling regime into the Purcell domain ($l = l_0/99 \simeq \lambda_0/2$ as approached
in Fig. 8 with $f = 0.01$) [26, 28, 29, 30]. We now attempt to give a more global perspective of the scaling behavior of
the atom-cavity system by examining various field and atomic variables directly as functions of the scale parameter
$f = l/l_0$. A particular set of such results is displayed in Figure 9, where the pump intensity $I_3 = 3$ is fixed near the
peak in the output from the semiclassical theory in Fig. 2, and the recycling intensity $I_4$ is held constant at $I_4 = 3$.

In Fig. 9(a) the mean intracavity photon number $\bar{n}$ is seen to undergo a precipitous drop as the cavity length
is made progressively shorter (i.e., increasing $f^{-1/2}$, since $l \propto f$). However, when $\bar{n}$ is normalized to the critical
photon number $n_{0f}$, the quantity $\bar{n}/n_{0f}$ is seen to approach unity for small $f^{-1/2}$ (i.e., long cavities with $l \gg l_0$, as
appropriate to the conventional theory in Fig. 5). With increases in $f^{-1/2}$ (i.e., shorter cavity lengths), $\bar{n}/n_{0f}$ rises to
a maximum around $f \simeq 3$ for strong coupling with $l \sim l_0$ as in Fig. 7, before then decreasing to approach a constant
value for yet larger values of $f^{-1/2}$ as the system exits from the domain of strong coupling.

Also shown in Fig. 9(a) are the quantities $g^{(2)}(0)$ and $Q + 1$ that characterize the photon statistics of the intracavity
field. As previously noted, $g^{(2)}(0)$ lies in the range $1 < g^{(2)}(0) < 2$ for conventional laser theory, but drops below
unity in the regime of strong coupling and approaches zero for $f \ll 1$. In this same limit of very small cavities in the
Purcell regime, $Q \simeq -\bar{n}$.

Fig. 9(b) displays the populations for the four-state system as functions of $f^{-1/2}$. For the conventional regime
with $f^{-1/2} \ll 1$, there is population inversion, $\sigma_{e3,e3} > \sigma_{g4,g4}$ (which was shown in Fig. 2 for small values of $I_3$), but
this possibility is lost for increasing $f^{-1/2}$ (i.e., decreasing cavity length). Strong coupling dictates that the rate $g$
dominates all others, so that appreciable population cannot be maintained in the state $e3$. Finally, Fig. 9(d) displays
the dependence of the ratio $R = (\kappa_f \bar{n})/(\gamma_{43}\sigma_{e3,e3})$ on $f^{-1/2}$. From values $R < 1$ in the conventional domain, $R$ rises
monotonically with decreasing cavity length, reaching the plateau $R \gg 1$ specified by Eq. 10.



FIG. 8: Steady state solutions as functions of pump intensity $I_3$ obtained from the numerical solution of the master equation
Eq. 2 for the four-state atom in a cavity illustrated in Fig. 1. Here, the cavity length $l = l_0/99 \simeq \lambda_0/2$ (i.e., $f = 1/99$), where
$l_0 = 42.2$ μm is the cavity length in our experiment and $\lambda_0 = 852.3$ nm is the wavelength of the cavity QED transition. (a) Mean
intracavity photon number $\bar{n}$ normalized to the saturation photon number $n_{0f} = 1.31 \times 10^{-4}$ (in blue). The corresponding
result for $|\alpha|^2/n_{0f}$ from the semiclassical theory is given by the black curve. (b) Populations $\sigma_{ii}$ of the four states as labelled.
(c) Mean intracavity photon number $\bar{n}$ (blue), Mandel $Q$ parameter (green), and intensity correlation function $g^{(2)}(0)$ (red).
(d) Ratio $R$ of photon flux from the cavity mode $\kappa_f \bar{n}$ as compared to the rate of atomic fluorescence $\gamma_{43}\sigma_{e3,e3}$ for the excited
state $e3$. In all cases, the depleting intensity $I_4 = (\Omega_4/2\gamma)^2 = 3$ and the detunings $\Delta_{AC} = \Delta_3 = \Delta_4 = 0$. Field and atom decay
rates are as specified in the text.
B. Vacuum-Rabi splitting

In the preceding discussion, we have compared various aspects of our one-atom system with conventional lasers and
have restricted the analysis to the case of resonant excitation with $\Delta_3 = 0$. Our actual system operates in a regime
of strong coupling, so that there should be an explicit manifestation of the “vacuum-Rabi” splitting associated with
one quantum of excitation in the $4 \leftrightarrow 3'$ manifold [23, 24, 25].

To investigate this question, we consider the dependence of the average intracavity photon number $\bar{n}$ on the detuning
$\Delta_3$ of the pump field $\Omega_3$, with the result of this analysis illustrated in Fig. 10. For weak excitation $I_3 < 1$ (well
below the peak in Fig. 7(a)), the intracavity photon number $\bar{n}$ is maximized around $\Delta_3 = \pm g_{43}$ (and not at $\Delta_3 = 0$) in
correspondence to the eigenvalue structure for the $g4 \leftrightarrow e3$ manifold in the presence of strong coupling. The excited state
$e3$ is now represented by a superposition of the nondegenerate states $|\psi_\pm\rangle$ whose energies are split by the coupling
energy $\pm\hbar g_{43}$. However, for large pump intensities $I_3 \sim 10$, this splitting is lost as the Autler-Townes effect associated
with the pump field on the $g3 \leftrightarrow e3$ transition grows to exceed $g_{43}$.
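The eigenvalue structure behind the vacuum-Rabi splitting can be seen from the two-dimensional manifold containing one quantum of excitation. The sketch below diagonalizes the coupling Hamiltonian restricted to the states with the atom excited and no photon, or the atom in the lower lasing level with one photon. The function name `one_excitation_levels`, the basis ordering, and the optional detuning argument are assumptions made for illustration; the pump and recycling fields are deliberately left out.

```python
import numpy as np

def one_excitation_levels(g, delta_ac=0.0):
    """Eigenenergies (hbar = 1) of the states |e3, 0 photons> and |g4, 1 photon>."""
    # Energies are measured from the atomic transition; delta_ac is the
    # cavity-atom detuning and g the coherent coupling rate.
    h = np.array([[0.0, g],
                  [g, delta_ac]])
    return np.linalg.eigvalsh(h)   # returned in ascending order

levels = one_excitation_levels(16.0)   # g_43 in units of 2*pi*MHz
print(levels)
```

On resonance the two dressed states sit at minus and plus the coupling rate, which is why the intracavity photon number peaks when the pump detuning matches either of these values rather than at zero detuning.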


C.  Optical  spectrum  of  the  cavity  emission 

A central feature of a conventional laser is the optical spectrum of the emitted field, defined by

$$\Phi(\Omega) = \int_{-\infty}^{+\infty} d\tau \left\{ \lim_{t\to\infty} \langle a^\dagger(t)\, a(t+\tau) \rangle \right\} \exp(-i\Omega\tau), \tag{11}$$

where as in Eq. 1, $(a^\dagger, a)$ are the creation and annihilation operators for the single-mode field of the cavity coupled
to the atomic transition $e3 \leftrightarrow g4$. The results for the Schawlow-Townes linewidth are well-known and will not be
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FIG.  9:  Scaling  behavior  of  various  quantities  as  the  cavity  length  l  is  varied,  where  /  =  Z/Zo ,  and  lo  =  42.2  pm  for  our  actual 
cavity.  Note  that  g  oc  /_1//2  and  k  oc  /_1  and  that  the  range  in  /  corresponds  to  that  spanned  by  Figs.  5  to  8,  namely 
0.01  <  /  <  2500.  (a)  Mean  intracavity  photon  number  n  (blue),  the  Mandel  Q  parameter  (Q  +  1  in  green),  and  the  intensity 
correlation  function  g^2\0)  (red),  (b)  Populations  an  of  the  four  states  as  labelled,  (c)  Mean  intracavity  photon  number  ft 
normalized  to  the  saturation  photon  number  no/  =  no/  =  0.013  x  /.  (d)  Ratio  R  of  photon  flux  from  the  cavity  mode  n/n  as 
compared  to  the  rate  of  atomic  fluorescence  743cre3,e3  for  the  excited  state  e3,  where  n/  =  n//.  In  all  cases,  the  pumping  and 
recycling  intensities  /3,4  —  3  and  the  detunings  A ac  =  A3  =  A4  =  0.  Field  and  atom  decay  rates  are  as  specified  in  the  text. 


discussed  here  [31,  32,  34,  38,  39].  Instead,  in  Fig.  11  we  present  results  specific  to  the  domain  of  operation  of  our 
system. 

For the choice of parameters corresponding to Fig. 7, $\Phi(\Omega)$ in Fig. 11(a) exhibits a pronounced two-peak structure, with the positions of the peaks corresponding to the Autler-Townes splitting of the ground state by the recycling field $\Omega_4$. Contrary to what might have been expected from the analysis of the previous section, $\Phi(\Omega)$ shows no distinctive features associated with the vacuum-Rabi splitting of the excited state. For reduced values of the pumping and recycling intensities $I_{3,4} = 0.5$, there are small features in the optical spectrum at $\Omega \approx \pm g_{43}$, as is illustrated in Fig. 11(b), where $\Phi(\Omega)$ is plotted on a logarithmic scale. With respect to the complex degree of coherence [37], the coherence properties of the light from the one-atom laser in the regime of strong coupling are set simply by the inverse of the spectral width of $\Phi(\Omega)$, which can be determined from the plots in Fig. 11.

The curves shown in Fig. 11 are calculated by way of the quantum regression theorem applied to the four-state system of Fig. 1. From the quantum regression theorem, the two-time correlation function in Eq. 11 is given by

\[ \langle a^\dagger(0)\, a(\tau) \rangle = \mathrm{Tr}[\rho_{ss}\, a^\dagger(0)\, a(\tau)] = \mathrm{Tr}[\rho(\tau)\, a(0)] , \]

where $\rho(\tau)$ is obtained by numerically evolving

\[ \rho_0 = \rho_{ss}\, a^\dagger(0) \]

under the master equation, and $\rho_{ss}$ is the steady-state density matrix. By Fourier transforming the correlation function according to Eq. 11, we obtain the optical spectrum.
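The regression-theorem recipe above can be sketched numerically on a much smaller problem. The example below is not the four-state calculation behind Fig. 11: it assumes a single damped cavity mode coupled to a thermal bath (a toy system chosen because its correlation function is known in closed form), and it uses a plain fourth-order Runge-Kutta integrator and a trapezoid-rule Fourier transform. All function names and parameter values are hypothetical.

```python
import numpy as np


def liouvillian(H, collapse_ops):
    """Lindblad generator acting on row-major vec(rho): d vec(rho)/dt = L @ vec(rho)."""
    d = H.shape[0]
    eye = np.eye(d)
    L = -1j * (np.kron(H, eye) - np.kron(eye, H.T))
    for C in collapse_ops:
        CdC = C.conj().T @ C
        L += np.kron(C, C.conj()) - 0.5 * np.kron(CdC, eye) - 0.5 * np.kron(eye, CdC.T)
    return L


def steady_state(L, d):
    """Solve L rho = 0 with Tr(rho) = 1 by replacing one equation with the trace condition."""
    A = L.copy()
    A[0, :] = np.eye(d).reshape(-1)
    b = np.zeros(d * d, dtype=complex)
    b[0] = 1.0
    return np.linalg.solve(A, b).reshape(d, d)


def correlation(L, rho_ss, a, taus, substeps=20):
    """<a^dag(0) a(tau)> = Tr[rho(tau) a], with rho(0) = rho_ss a^dag evolved by RK4."""
    d = a.shape[0]
    v = (rho_ss @ a.conj().T).reshape(-1)
    out, t_prev = [], 0.0
    for tau in taus:
        h = (tau - t_prev) / substeps
        for _ in range(substeps):
            k1 = L @ v
            k2 = L @ (v + 0.5 * h * k1)
            k3 = L @ (v + 0.5 * h * k2)
            k4 = L @ (v + h * k3)
            v = v + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)
        t_prev = tau
        out.append(np.trace(v.reshape(d, d) @ a))
    return np.array(out)


def spectrum(corr, taus, omegas):
    """Phi(Omega) = 2 Re int_0^inf C(tau) exp(-i Omega tau) dtau (trapezoid rule)."""
    dt = taus[1] - taus[0]
    phi = []
    for w in omegas:
        f = corr * np.exp(-1j * w * taus)
        phi.append(2.0 * (dt * (f.sum() - 0.5 * (f[0] + f[-1]))).real)
    return np.array(phi)


# Toy system (assumed for illustration): thermal cavity, rotating frame, H = 0.
N, kappa, n_th = 12, 1.0, 0.2
a = np.diag(np.sqrt(np.arange(1, N)), k=1)
c_ops = [np.sqrt(kappa * (n_th + 1)) * a, np.sqrt(kappa * n_th) * a.T]
L = liouvillian(np.zeros((N, N)), c_ops)
rho_ss = steady_state(L, N)
taus = np.linspace(0.0, 40.0, 401)
corr = correlation(L, rho_ss, a, taus)
omegas = np.linspace(-3.0, 3.0, 61)
phi = spectrum(corr, taus, omegas)
```

For this toy system the correlation function should decay as $n_{th}\exp(-\kappa\tau/2)$, so the spectrum is a Lorentzian of half-width $\kappa/2$; comparing against that closed form is a convenient check before applying the same steps to a larger model.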

The optical spectrum of the emitted light from our cavity could in principle be measured by way of heterodyne detection. The cavity output would be combined on a highly transmissive beam splitter with a local oscillator beam that is frequency shifted by an interval $\Delta\omega$ that is large compared to the range of frequencies in the output field. The optical spectrum is then obtained by taking the Fourier transform of the autocorrelation function of the resulting heterodyne current. Although we have not carried out this procedure experimentally, it is straightforward to model using a quantum jumps simulation of the four-state model. We have computed such spectra for several values of $I_3$, using a local oscillator flux equal to $\kappa$. This is an experimentally reasonable value, since it is small enough not to saturate the detectors, yet large enough that, as our further simulations indicate, increasing the flux does not significantly change the resulting spectrum. The results for the spectrum obtained from this quantum jumps simulation agree reasonably well with the results from the quantum regression theorem presented in Fig. 11.

FIG. 10: The mean intracavity photon number $\bar{n}$ versus the detuning $\Delta_3$ (in cycles/sec) of the pump field $\Omega_3$, where $\Delta_3 = 0$ corresponds to the transition frequency $\omega_{33}$. The three curves are for increasing pump intensity: (i) $I_3 = 0.1$, (ii) $I_3 = 1.0$, (iii) $I_3 = 10.0$. The arrows indicate the positions of the expected “vacuum-Rabi” peaks at $\pm g_{43}$, where $g_{43}/2\pi = 16$ MHz. In all cases, the recycling field $\Omega_4$ is on resonance, $\Delta_4 = 0$, and has intensity $I_4 = 3$.

FIG. 11: The optical spectrum $\Phi(\nu)$ as a function of frequency offset $\nu$ (in cycles/sec, $\Omega = 2\pi\nu$), where $\nu = 0$ corresponds to the transition frequency $\omega_{43}$. (a) Three spectra $\Phi(\nu)$ for increasing pump intensity: (i) $I_3 = 0.1$, (ii) $I_3 = 1.0$, (iii) $I_3 = 10.0$, with the recycling intensity $I_4 = 3$ in all cases. The overall normalization of $\Phi(\nu)$ is arbitrary, but is common for the three cases. (b) $\Phi(\nu)$ on a logarithmic scale for decreased intensities $I_3 = I_4 = 0.5$, with the peak value of $\Phi$ scaled to unity. The arrows indicate the positions of the expected “vacuum-Rabi” peaks at $\pm g_{43}$, where $g_{43}/2\pi = 16$ MHz. In all cases in (a) and (b), the pumping field $\Omega_3$ and the recycling field $\Omega_4$ are on resonance with their respective transitions ($\Delta_3 = 0 = \Delta_4$).
V. QUANTUM THEORY INCLUDING ZEEMAN STATES AND TWO CAVITY MODES

In an attempt to provide a more detailed quantitative treatment of our experiment, we have developed a model that includes all of the Zeeman states $(F, m_F)$ for the $F = 3, 4$ ground levels and the $F' = 3', 4'$ excited levels of the $6S_{1/2} \leftrightarrow 6P_{3/2}$ transition in atomic Cesium, of which there are 32 in total. We also include two cavity modes with orthogonal linear polarizations to describe the two nearly degenerate TEM$_{00}$ modes of our cavity [43], with three Fock states $\{|0\rangle, |1\rangle, |2\rangle\}$ for each mode. The total dimension of the Hilbert space for this set of atomic and field states is then $d = 32 \times 3 \times 3 = 288$, making it impractical to obtain steady-state solutions from the master equation directly. Instead, we employ the Quantum Optics Toolbox [36] to implement a quantum jumps simulation, with various expectation values computed from the stochastic trials.
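The state counting above can be checked with a few lines of arithmetic. The sketch below only reproduces the count $d = 288$ and then shows how quickly a density-matrix description grows; the variable names are hypothetical, and the remark about storage assumes a dense superoperator, which is one way (not necessarily the authors' reasoning) to see why a direct steady-state solution is costly.

```python
def zeeman_count(F):
    """Number of Zeeman sublevels m_F = -F, ..., +F of a hyperfine level F."""
    return 2 * F + 1


# F = 3, 4 ground levels and F' = 3', 4' excited levels
hyperfine_levels = [3, 4, 3, 4]
n_atom = sum(zeeman_count(F) for F in hyperfine_levels)

n_fock = 3   # Fock states |0>, |1>, |2> kept per cavity mode
n_modes = 2  # two orthogonally polarized cavity modes

dim = n_atom * n_fock ** n_modes   # Hilbert-space dimension
rho_elements = dim ** 2            # entries of the density matrix
liouvillian_entries = rho_elements ** 2  # entries of a dense superoperator

print(n_atom, dim, rho_elements, liouvillian_entries)
```

A wave-function (quantum jumps) simulation propagates a state vector of length 288 per trajectory, whereas the master equation acts on all 82944 density-matrix elements at once.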

In broad outline, our expanded model includes Hamiltonian terms of the form of Eq. 1, with the terms $\sigma_{ij}$ now generalized to incorporate each of the various Zeeman states. Likewise, the coherent coupling of the atom to the cavity takes into account two orthogonally polarized modes $(a, b)$. The operators $L_i$ are similarly modified to obtain a new master equation that includes the full set of decay paths among the various states (i.e., $\sigma_\pm, \pi$ transitions), as well as the associated quantum collapse terms in the simulation.

We attempt to describe the dynamics arising from the complex state of spatially varying polarization associated with the $\Omega_{3,4}$ beams by way of the following simple model. In a coordinate system with the $x, z$ directions perpendicular to the cavity axis along $y$, the $\Omega_{3,4}$ beams propagate along $x, z$ with orthogonal $\sigma_\pm$ configurations. The helical patterns of linear polarization from pairs of counter-propagating beams then give rise to terms in the interaction Hamiltonians $H_{2,3}$ of the form


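\[ H_2 = \tfrac{1}{2}\Omega_3 \left[ (\Sigma^{z}_{g3,e3} + \Sigma^{z}_{e3,g3}) \sin(\theta_{3x}) + (\Sigma^{x}_{g3,e3} + \Sigma^{x}_{e3,g3}) \sin(\theta_{3z}) \right] + \tfrac{1}{2}\Omega_3 \left[ (\Sigma^{y}_{g3,e3} + \Sigma^{y}_{e3,g3}) \left( \cos(\theta_{3x}) + \cos(\theta_{3z}) \right) \right] \tag{12} \]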

and similarly for $H_3$ to describe the $\Omega_4$ beams with independent phases $(\theta_{4x}, \theta_{4z})$. Here $\Omega_3$ and $\Omega_4$ are Rabi frequencies corresponding to the incoherent sum of the intensities of the four individual beams. In Eq. 12, the operators $\Sigma^{x,y,z}_{g3,e3}$ are linear combinations of various atomic projection operators for the diverse Zeeman-specific transitions for linear polarization along $x, y, z$, and are given explicitly by


\x 

(g3,e  3 

—  ^W,e  3) 

(13) 

‘2/ 

'g  3,e3 

=  y|(^g31,e3  +  ’Egirf) 

(14) 

\X 

'p3,e3 

=  ^3,e3 

(15) 

where 


^3,e3  =  EE  |(/3,  m)  (3,  to;  1,  q\A,  m!)  (g4,  m!\  (16) 

rri  rn' 

The phases $\theta_i$ arise from the spatial variations of the polarization state of the $\Omega_{3,4}$ beams, and are given, for example, by $\theta_{3x} = k_{3x} x$, with $k_{3x}$ as the wave vector of the pair of $\Omega_3$ beams propagating along $x$.

The $\Omega_{3,4}$ beams tend to optically pump the atom into dark states, with this pumping counterbalanced by atomic motion leading to cooling [44] and by any residual magnetic field. In our case, imperfections in the FORT polarization [42, 43] result in a small pseudo-magnetic field along the cavity axis $y$ [45] with peak magnitude $B_y \approx 0.75$ G. This pseudo-field $B_y$ is included in our simulations and tends to counteract optical pumping by the $\Omega_{3,4}$ beams into dark states for linear polarization in the $x$-$z$ plane, $\theta_{3x} = \theta_{3z} = \theta_{4x} = \theta_{4z} = \pi/2$, but has no effect for polarization along the cavity axis $y$, $\theta_{3x} = \theta_{3z} = \theta_{4x} = \theta_{4z} = 0$.

Overall, the operation of our driven atom-cavity system involves an interplay of cycling through the levels $g3 \to e3 \to g4 \to e4 \to g3$ to achieve output light on the $e3 \to g4$ transition, and of polarization gradient cooling for extended trapping times. This latter process involves atomic motion through the polarization gradients of the $\Omega_{3,4}$ beams and is greatly complicated by the presence of $B_y$. The detunings and intensities of the $\Omega_{3,4}$ beams are chosen operationally so as to optimize the output from our one-atom laser in a regime of strong coupling, while at the same time maintaining acceptable trapping times, as shown in Fig. 2 of Ref. [1].
A. Mean intracavity photon number as a function of pump intensity

In this section, we present simulation results for the mean intracavity photon number versus pump intensity. In qualitative terms, we should expect that the output flux $\kappa\bar{n}$ predicted from the full multi-state model is significantly below that calculated from the four-state model presented in Section IV. This is because the atom necessarily spends increased time in manifolds of dark states associated with the pumping by the $\Omega_{3,4}$ beams.

We can modify the four-level model to account for these effects by reducing the decay rate $\gamma_{34} \to \gamma'_{34}$. The slower cycling of the atom due to the reduction of $\gamma_{34}$ approximates, in a phenomenological way, the slowing effect on the recycling of the atom due to optical pumping into dark states. We find that a value $\gamma'_{34} = 0.07 \times \gamma_{34}$ gives a good fit to the data (Fig. 12(a,b)). We plot the intracavity photon number versus $x = (7/9)(I_3/I_4)$, since we estimate that either measured intensity alone is uncertain by a factor of about 2, but the ratio is known much more accurately.

For the multi-level simulation, we use two different models to generate curves of mean intracavity photon number versus pump intensity. In the first model, we neglect the motion of the atom and attempt to capture the essential features of the optical pumping processes via a single constant phase $\theta = \theta_{3x} = \theta_{3z} = \theta_{4x} = \theta_{4z}$. The choice $\theta = 0$ gives no output light, since the $\Omega_{3,4}$ beams pump the atom into dark states. The value $\theta = \pi/2$ chosen for the comparison in Fig. 12(c,d) gives good correspondence between the simulations and our measurements with the adjustment of no other parameters. For this curve, we plot the average $(\bar{n}_a + \bar{n}_b)/2$ of the intracavity field for the two cavity modes $a$ and $b$.

As a second, more sophisticated model, we assume that the atom moves at a constant velocity in the radial direction. This gives time-dependent phases; for example, if we assume that the $x$ coordinate of the atom is

\[ x(t) = x_0 + v_x t , \]

then

\[ \theta_{3x}(t) = k_{3x} x = \theta_{3x,0} + \omega_{3x} t , \]

where $\theta_{3x,0} = k_{3x} x_0$ and $\omega_{3x} = k_{3x} v_x$. For a single simulation run we randomly choose the velocity of the atom and the initial phases of the $\Omega_{3,4}$ pumping beams; the intensities from 20 such runs are averaged for each value of $x$. The velocities are chosen uniformly in the range 10 to 20 cm/s, which gives angular frequencies in the range $2\pi \times (100$ to $200)$ kHz. The resulting input/output curve is plotted in Fig. 12(e,f). As before, we plot the average of the intracavity field for the two cavity modes.
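The sampling step of the constant-velocity model can be sketched as follows. This is only an illustration of how run parameters might be drawn, not the simulation itself: the wavelength of 852 nm for the Cs $6S_{1/2} \leftrightarrow 6P_{3/2}$ line is supplied here as an assumption (it is not quoted in the text), a single speed is used for all four phases, and every function name is hypothetical.

```python
import math
import random

WAVELENGTH_M = 852e-9  # Cs D2 line; assumed here, not quoted in the text


def phase_rate_hz(v_m_per_s, wavelength_m=WAVELENGTH_M):
    """omega/2pi = k v / 2pi = v / lambda for motion along the beam axis."""
    return v_m_per_s / wavelength_m


def sample_run(rng):
    """Draw a speed (uniform in 10-20 cm/s) and random initial phases for one run."""
    v = rng.uniform(0.10, 0.20)  # m/s
    omega = 2.0 * math.pi * phase_rate_hz(v)
    theta0 = {name: rng.uniform(0.0, 2.0 * math.pi) for name in ("3x", "3z", "4x", "4z")}
    return v, omega, theta0


def phase(theta0, omega, t):
    """theta(t) = theta_0 + omega t, as in the constant-velocity model."""
    return theta0 + omega * t


rng = random.Random(0)
runs = [sample_run(rng) for _ in range(20)]  # 20 runs are averaged per value of x
print(phase_rate_hz(0.10), phase_rate_hz(0.20))
```

With this assumed wavelength the rates come out at roughly 117 to 235 kHz, the same order as the range quoted above; in a full calculation each run would feed its phases $\theta(t)$ into the Hamiltonian before the 20 resulting intensities are averaged.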

We make no claim for detailed quantitative agreement between theory and experiment, as the simulations are sensitive to parameters that are known only approximately, such as the intensity of the $\Omega_{3,4}$ pumping beams and the magnitude of the pseudo and real magnetic fields. Also, the simulations neglect a number of features of the real system, such as atomic motion in the axial direction, the dependence of the cavity coupling $g$ on the position of the atom, and a possible intensity imbalance in the $\Omega_{3,4}$ pumping beams. However, the simulations do support the conclusion that the range of coupling values $g$ that contribute to our results is restricted roughly to $0.5 g_0 < g < g_0$. Furthermore, the simulations yield information about the atomic populations, from which we deduce that the rate of emission from the cavity $\kappa\bar{n}$ exceeds that by way of fluorescent decay $3' \to 4$, $\gamma_{43'}\langle\sigma_{3'3'}\rangle$, by roughly tenfold over the range of pump intensity $I_3$ shown in Fig. 12(a).


B. Photon statistics as expressed by the intensity correlation function $g^{(2)}(\tau)$

In addition to measurements of $\bar{n}$ versus pumping rate, we have also investigated the photon statistics of the light emitted by the TEM$_{00}$ mode of the cavity by way of the two single-photon detectors $D_{1,2}$ illustrated in Fig. 1 of Ref. [1]. From the cross-correlation of the resulting binned photon arrival times and the mean counting rates of the signals and the background, we construct the normalized intensity correlation function (see the Supplementary Information accompanying Ref. [1])

\[ g^{(2)}(\tau) = \frac{\langle : \hat{I}(t)\, \hat{I}(t+\tau) : \rangle}{\langle : \hat{I}(t) : \rangle^2} , \tag{17} \]

where the colons denote normal and time ordering for the intensity operators $\hat{I}$ [37].

Two measurements of $g^{(2)}(\tau)$ from Figure 4 of Ref. [1] are reproduced in (a,b) of Figs. 13 and 14, together with results from our quantum jumps simulation for the constant phase model with $\theta = \pi/2$ in (c,d). In Fig. 13, we again have $I_4 \approx 13$, and the pump intensity $I_3$ is set for operation with $x \approx 0.17$ near the “knee” in $\bar{n}$ versus $x$, while in Fig. 14 the pump level is increased to $x \approx 0.83$. These measurements demonstrate that the light from the atom-cavity system is manifestly quantum (i.e., nonclassical) and exhibits photon antibunching $g^{(2)}(0) < g^{(2)}(\tau)$ and sub-Poissonian photon statistics $g^{(2)}(0) < 1$ [37]. In agreement with the trend predicted by the four-state model in Fig. 7(c) (as well as by the full quantum jumps simulation), $g^{(2)}(0)$ increases with increasing pump intensity, with a concomitant decrease in these nonclassical effects. The bottleneck associated with the recycling process leads to this nonclassical character, since detection of a second photon given the first detection event requires that the atom be recycled from the $F = 4$ ground state back to the $F = 3$ ground state. In this regard, we point to the prior work on pump-noise-suppressed lasers in multi-level atomic systems, as, for example, in Ref. [47].

In more quantitative terms, theoretical results for $g^{(2)}(\tau)$ from the full quantum jumps simulation are given in parts (c,d) of Figs. 13 and 14 for $x = 0.17$ and $x = 0.83$. The excess fluctuations $g^{(2)}(\tau) > 1$ extending over $\tau \approx \pm 1\ \mu$s appear to be related to the interplay of atomic motion and optical pumping into dark states [44], as well as Larmor precession that arises from residual ellipticity in the polarization of the intracavity FORT [43, 45].

FIG. 12: Comparison of theory and experiment for the intracavity photon number $\bar{n}$ as a function of pump intensity $x = (7/9)(I_3/I_4)$ for fixed $I_4 = 13$ (corresponding to a measured intensity of 50 mW/cm$^2$). The measurements (points with error bars) are from Figure 3 of Ref. [1]. (a,b) $\bar{n}$ versus pump strength $x$ for the four-level model with $\gamma'_{43} = 0.07 \times \gamma_{43}$. (c,d) $\bar{n}$ versus pump strength $x$ for the constant phase model with $\theta = \pi/2$. (e,f) $\bar{n}$ versus pump strength $x$ for the constant velocity model described in the text. The immediate onset of emission supports the conclusion of “thresholdless” lasing. Two independent sets of measurements are shown as the red and blue points, and agree reasonably well with each other. Details of the measurements can be found in Ref. [1], while the parameters for the simulation are given in the text.
These results for $g^{(2)}(\tau)$ provide a perspective on the issue of whether the cavity is effectively “empty” since $\bar{n}$ is quite small. Based upon the mean photon flux from the cavity, this is a reasonable inference, but it is also misleading.
The  nonzero  values  for  g^2\r  =  0)  ~  0.3,  0.6  0.01  in  Figs.  13  and  14  are  in  fact  due  to  the  presence  of  more  than 

one photon in the cavity. Although the mean intracavity photon number is only $\bar{n} \approx 0.005$, this number is comparable to the saturation photon number $n_0 = 0.013$. Indeed, the quantum statistical character of the intracavity field is determined from the self-consistent interplay of atom and cavity field as in standard laser theories, even though it might appear as if this interplay is not relevant to the determination of a dynamic steady state. Figure 9 attempts to illustrate this point by investigating the passage from the domain of conventional laser theory through the regime of strong coupling and thence into a domain of single photon generation with $g^{(2)}(\tau = 0) \approx 0$ over the entire range of pumping conditions.
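The link between a nonzero $g^{(2)}(0)$ and two-photon population can be made concrete with a short calculation. The sketch below uses the standard single-mode relations $g^{(2)}(0) = \langle n(n-1)\rangle/\langle n\rangle^2$ and $Q = \bar{n}\,(g^{(2)}(0) - 1)$ and assumes that photon numbers above two are negligible (the same truncation as the three Fock states per mode in the simulation). The input values $\bar{n} = 0.005$ and $g^{(2)}(0) = 0.3$ are taken from the discussion above; the function names are hypothetical.

```python
import math


def g2_zero(p):
    """g2(0) = <n(n-1)> / <n>^2 for photon-number probabilities p[n]."""
    mean_n = sum(n * pn for n, pn in enumerate(p))
    pairs = sum(n * (n - 1) * pn for n, pn in enumerate(p))
    return pairs / mean_n ** 2


def mandel_q(p):
    """Mandel Q = (Var(n) - <n>) / <n>; negative for sub-Poissonian light."""
    mean_n = sum(n * pn for n, pn in enumerate(p))
    mean_n2 = sum(n * n * pn for n, pn in enumerate(p))
    return (mean_n2 - mean_n ** 2 - mean_n) / mean_n


def two_photon_probability(mean_n, g2):
    """p2 implied by mean_n and g2(0) when n > 2 is neglected: 2 p2 = g2 * mean_n^2."""
    return 0.5 * g2 * mean_n ** 2


mean_n, g2 = 0.005, 0.3
p2 = two_photon_probability(mean_n, g2)
p1 = mean_n - 2.0 * p2
p = [1.0 - p1 - p2, p1, p2]
print(p2, g2_zero(p), mandel_q(p))
```

Under these assumptions the two-photon probability is only a few parts in $10^6$, yet it fully determines $g^{(2)}(0)$, which is why a small $\bar{n}$ does not by itself mean that two-photon events can be ignored.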


FIG. 13: The intensity correlation function $g^{(2)}(\tau)$ of the one-atom laser. (a,b) $g^{(2)}(\tau)$ for $x \approx 0.17$ as experimentally determined in Ref. [1]. (c,d) Theoretical result for $g^{(2)}(\tau)$ for $x = 0.17$ from a quantum jumps simulation with $\theta = \pi/2$. All traces have been “smoothed” by convolution with a Gaussian function of width $\sigma = 5$ ns.


C.  Discussion  of  possible  coherence  effects 

In Section IIIA we briefly described our analysis of an equivalent Raman scheme to address the question of possible coherence effects associated with the $\Omega_4$ recycling beam. Beyond this analysis, we have also considered the possibility that various other coherent processes associated with the pump fields might be important. One concern relates to the possibility that 4-wave mixing processes could be important, as, for example, in a wave-mixing process that cycles the atom $3 \to 3' \to 4 \to 4' \to 3$ [46]. From an operational perspective, if there were to be a correlated process involved in the cycling of the atom $3 \to 3' \to 4 \to 4' \to 3$, then two photons would be emitted into the cavity mode (the “signal” on the $3' \to 4$ transition and the “idler” on the $4' \to 3$ transition). In this case, since we employ no filter to block the “idler” field separated by 9.2 GHz, the measured intensity correlation function $g^{(2)}(\tau)$ for the emitted light from the cavity would exhibit bunching around $\tau = 0$, instead of the observed antibunching and sub-Poissonian character. The measured character of $g^{(2)}(\tau)$ therefore argues against a coherent process that cycles the atom from an initial quantum state and back to that state by way of coherent processes involving coupling to the cavity field.
FIG. 14: The intensity correlation function $g^{(2)}(\tau)$ of the one-atom laser. (a,b) $g^{(2)}(\tau)$ for $x \approx 0.83$ as experimentally determined in Ref. [1]. (c,d) Theoretical result for $g^{(2)}(\tau)$ for $x = 0.83$ from a quantum jumps simulation with $\theta = \pi/2$. All traces have been “smoothed” by convolution with a Gaussian function of width $\sigma = 5$ ns.


We also note that the coherent coupling of the cavity field and atom for the $4' \to 3$ transition is greatly suppressed due to the large detuning of about 9.2 GHz, leading to an effective coupling coefficient $(g_{\mathrm{eff}}/2\pi \approx 30\ \mathrm{kHz}) \ll (\gamma_\parallel/2\pi \approx 5.2\ \mathrm{MHz})$. Therefore, for whatever mixing processes, the coupling to the external vacuum modes characterized by the rate $\gamma_\parallel$ should dominate that due to $g_{\mathrm{eff}}$. In this regard, note that we have included in our simulations the effect of off-resonant coupling of the $4'$ excited state (which is only 200 MHz detuned). The relevant process is then excitation $4 \to 4'$ via the pump field, followed by emission into the cavity mode due to the coherent coupling of the transition $4' \to 4$. This coupling increases the intracavity photon number by only about 10%, suggesting that coupling for the $4' \to 3$ transition 9.2 GHz away is negligible.

In support of these comments, our detailed numerical simulations agree reasonably well with the observed behavior of $g^{(2)}(\tau)$ (as in Figures 13 and 14), and do not include any “wave-mixing” effects. This statement is likewise valid for the dependence of photon number versus pump level. Furthermore, as previously discussed, the model calculation for a four-state system agrees well in its essential characteristics with a three-state system where the decay of the ground state $4 \to 3$ is via an ad hoc spontaneous process (as in a Raman laser) rather than by pumping $4 \to 4'$ and decay $4' \to 3$.

A final general comment relates to the nature of phase-matching (e.g., as applied to 4-wave mixing and parametric down conversion) for a single atom in a cavity. For a sample of atoms (or a crystal), there is a geometry that defines directions for which fields from successive atoms might add constructively for various waves (e.g., pump, signal, idler). Cavities can then be placed around these directions to enhance the processes (e.g., the threshold for an optical parametric oscillator is reduced by a factor of the square of the cavity finesse for resonant enhancement of both signal and idler fields). Clearly a cavity would be ineffective if its geometry did not match the preferred geometry defined by the sample and pump beams. However, for a single atom as in our experiment, these considerations do not apply in nearly the same fashion. The relevant issues are the coherent coupling coefficients $g_{ij}$ of the various atomic transitions to the cavity field.


VI.  SUMMARY 

We have presented a simplified four-level model which describes the qualitative features of our experiment. We have shown how decreasing the cavity length causes the model system to move from a regime of weak coupling, where the semiclassical laser theory applies, into a regime of strong coupling, where quantum deviations become important. The four-state model predicts many of the observed features of our experimental system, including the qualitative shape of the curve of intracavity photon number versus pumping intensity, and photon antibunching.

In addition, to predict quantitative values for comparison with our experimental results, we have developed a full multi-level model which correctly describes optical pumping and Larmor precession effects within the Zeeman substructure. We have shown that these effects play an important role in describing the observed input/output characteristics of the system, and that by including a simple model for the motion of the atom we can obtain reasonable agreement with the experimentally observed curve. We have also used the simulation to calculate intensity correlation functions, and have compared these results to measurements of $g^{(2)}(\tau)$ from our experiment.
We  prove  the  security  of  the  Bennett-Brassard  (BB84)  quantum  key  distribution  protocol
in  the  case  where  the  source  and  detector  are  under  the  limited  control  of  an  adversary. 

Our  proof  applies  when  both  the  source  and  the  detector  have  small  basis-dependent 
flaws,  as  is  typical  in  practical  implementations  of  the  protocol.  We  derive  a  general  lower 
bound  on  the  asymptotic  key  generation  rate  for  weakly  basis-dependent  eavesdropping 
attacks,  and  also  estimate  the  rate  in  some  special  cases:  sources  that  emit  weak  coherent 
states  with  random  phases,  detectors  with  basis-dependent  efficiency,  and  misaligned 
sources  and  detectors. 
1  Introduction

The  security  of  quantum  cryptography  is  founded  on  principles  of  fundamental  physics,  rather 
than  assumptions  about  the  resources  available  to  a  potential  adversary.  In  the  BB84  quantum 
key  distribution  protocol  [1],  two  parties  (Alice  and  Bob)  establish  a  secret  key  about  which 
the  eavesdropper  (Eve)  cannot  obtain  a  significant  amount  of  information.  Alice  sends  a  key 
bit  to  Bob  by  preparing  a  qubit  in  one  of  two  conjugate  bases  and  Bob  measures  the  qubit  in 
one  of  the  two  bases;  Eve,  who  does  not  know  the  basis  chosen  by  Alice  or  by  Bob,  cannot 
collect  information  about  the  key  without  producing  a  detectable  disturbance.  This  protocol, 
when  suitably  augmented  by  classical  error  correction  and  privacy  amplification,  is  provably 
secure  against  any  attack  by  Eve  allowed  by  quantum  mechanics  [2,  3,  4,  5,  6]. 
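The statement that Eve cannot gain information without producing a detectable disturbance can be illustrated with a small classical simulation of the sifting step. This is a hedged sketch, not part of the security proof: it assumes ideal sources and detectors, and it models only the simplest intercept-resend attack, in which Eve measures every signal in a randomly chosen basis and resends her result. The function name and parameters are hypothetical.

```python
import random


def bb84_sifted_error_rate(n_signals, eavesdrop, rng):
    """Error rate on the sifted key (signals where Alice's and Bob's bases agree)."""
    kept = 0
    errors = 0
    for _ in range(n_signals):
        bit = rng.randrange(2)       # Alice's key bit
        basis_a = rng.randrange(2)   # Alice's basis: 0 or 1 (conjugate bases)
        state_bit, state_basis = bit, basis_a

        if eavesdrop:
            basis_e = rng.randrange(2)
            if basis_e != state_basis:
                # Measuring in the conjugate basis gives a uniformly random outcome.
                state_bit = rng.randrange(2)
            state_basis = basis_e    # Eve resends the state she observed

        basis_b = rng.randrange(2)   # Bob's basis
        if basis_b == state_basis:
            result = state_bit
        else:
            result = rng.randrange(2)

        if basis_b == basis_a:       # sifting: keep only matching bases
            kept += 1
            errors += int(result != bit)
    return errors / kept


print(bb84_sifted_error_rate(20000, False, random.Random(1)))
print(bb84_sifted_error_rate(20000, True, random.Random(2)))
```

Without Eve the sifted key is error free in this idealized model, while the intercept-resend attack produces errors on about one quarter of the sifted bits, which Alice and Bob can detect by comparing a sample. General attacks are more subtle, which is why the proofs cited above are needed.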

Though  security  can  be  proven  without  imposing  any  restriction  on  Eve’s  attack  (other 
than  the  requirement  that  she  has  no  a  priori  information  about  the  basis  used),  it  is  necessary 
to  place  conditions  on  the  performance  of  the  source  and  detector  employed  in  the  protocol.
In  the  Shor-Preskill  proof  [5],  it  is  assumed  that  any  flaws  in  the  source  and  detector  can 
be  absorbed  into  Eve’s  basis-independent  attack.  In  the  proof  by  Mayers  [2],  the  source  is 
assumed  to  be  perfect,  but  the  detector  is  completely  uncharacterized.  In  the  Koashi-Preskill 
proof  [6],  the  detector  is  perfect,  but  the  source  is  uncharacterized,  aside  from  the  proviso 
that  it  leaks  no  information  about  the  basis  choice  to  Eve.  In  all  of  these  cases,  serious  faults 
in  the  apparatus  can  be  detected  in  the  protocol,  so  that  Alice  and  Bob  will  reject  the  key  if 
the  equipment  performs  badly. 

But none of these proofs apply when both the source and detector have small imperfections
that depend on the basis used in the protocol, the case relevant to typical real-world
implementations of quantum key distribution. Since the BB84 protocol with perfect sources
and detectors is secure, it is intuitively clear that BB84 should remain secure if the imperfections
are “sufficiently small.” We will sharpen this intuition into a quantitative statement, by
calculating  how  the  rate  of  generation  of  private  key  depends  on  the  tolerance  to  which  the 
equipment  is  characterized. 

The  simplest  way  to  analyze  the  consequences  of  characterized  imperfections  is  to  absorb 
the  defective  performance  of  the  equipment  into  the  eavesdropper’s  attack.  Primarily  for  this 
reason,  we  are  led  to  consider  the  security  of  the  BB84  protocol  in  a  different  framework  than 
in  previous  security  proofs:  the  flaws  in  the  source  and  detector  may  depend  on  the  bases 
chosen,  and  furthermore  Eve  may  know  these  bases,  but  her  power  to  exploit  this  knowledge 
is  limited.  We  will  prove  security  under  an  assumption  that  limits  the  basis  dependence  of 
Eve’s  attack. 

It  is  natural  to  ask  whether  this  assumption  can  be  verified  by  conducting  suitable  tests 
on  the  source  and  detector  (perhaps  with  testing  equipment  that  is  also  not  fully  trustworthy, 
as  in  [7]).  For  now  we  put  aside  the  issue  of  testing  the  equipment,  and  we  will  trust  that  our 
equipment  performs  approximately  as  expected.  However,  as  cautious  cryptologists  we  will
assume  that,  within  the  prescribed  limits,  the  performance  of  the  equipment  is  controlled  by 
Eve  and  maximally  exploited  by  her  to  gain  information  about  the  shared  key. 

Our  analysis  follows  the  method  of  Shor  and  Preskill  [5],  who  proved  the  security  of  BB84
by  relating  it  to  an  entanglement  distillation  protocol.  Their  argument  exploited  a  symmetry 
between  the  two  bases  used  in  the  protocol,  whose  consequence  is  that  the  entangled  pairs 
to  be  distilled  have  equal  rates  of  bit  errors  and  phase  errors.  Our  task  in  this  paper  is  to 
extend  the  analysis  to  the  case  where  the  symmetry  between  the  bases  is  broken  because  the 
adversary  has  information  about  the  basis  used.  We  will  give  a  general  argument  showing 
that  if  the  basis-dependence  of  the  attack  is  sufficiently  weak,  then  the  gap  between  the  bit 
error  rate  and  the  phase  error  rate  is  small;  this  argument  allows  us  to  establish  security 
against  arbitrary  attacks  that  satisfy  a  particular  criterion  for  weak  basis  dependence,  and  to 
derive  a  lower  bound  on  the  asymptotic  key  generation  rate. 

To  formulate  our  criterion  for  the  attack  to  be  weakly  basis  dependent,  we  focus  on  the 
coin  that  is  flipped  to  determine  the  basis  —  the  basis  dependence  is  weak  if  the  adversary 
interacts  only  weakly  with  the  coin.  The  Shor-Preskill  argument  shows  that,  for  the  purpose 
of  analyzing  the  security  of  BB84,  it  is  convenient  to  imagine  that  Alice  chooses  each  of  her 
key  bits  by  measuring  half  of  an  entangled  state,  and  that  she  delays  these  measurements  until 
after  Eve  has  attacked  the  signals.  Likewise,  for  analyzing  weakly  basis-dependent  attacks, 
we find it convenient to imagine that the coin flip that determines the basis is realized by
measuring  a  qubit,  and  that  this  measurement  is  delayed  until  after  the  adversary’s  attack. 
Then  we  can  quantify  the  extent  of  the  adversary’s  interactions  with  the  coin  according  to  how 
much  the  state  of  the  coin  is  disturbed.  Our  general  argument  shows  that  if  the  disturbance 
of  the  coin  is  slight,  then  a  secure  key  can  be  generated  at  a  calculable  nonzero  rate. 

Aside  from  presenting  this  general  argument,  we  will  also  apply  our  methods  to  a  few 
specific  scenarios  in  which  quantum  key  distribution  is  executed  with  imperfect  devices.  In 
some  of  these  special  cases,  we  can  derive  tighter  lower  bounds  on  the  key  generation  rate 
than  are  obtained  by  the  general  argument.  The  examples  we  discuss  include: 

Tagging.  A  faulty  source  may  “tag”  some  of  the  qubits  with  information,  readable  by  the 
eavesdropper,  that  reveals  the  basis  used  in  the  preparation.  An  important  special  case, 
also recently analyzed by Inamori, Lütkenhaus and Mayers [8], is a source emitting weak
coherent  states  which  with  nonnegligible  probability  contain  multiple  photons  prepared 
in  the  same  polarization  state.  An  adversary  might  intercept  the  extra  photons  and 
collect  information  about  the  basis  used  without  causing  any  disturbance,  compromising 
security. 

Basis-dependent detector efficiency. If the detector sometimes misfires, the probability that
a  qubit  is  successfully  detected  might  depend  on  the  basis  used.  An  adversary  that  can 
control  whether  the  detector  fires  can  use  this  power  to  disguise  eavesdropping. 

Basis-dependent misalignment in the source or detector. The source or detector might not
be  properly  aligned  to  emit  or  detect  a  qubit  in  the  desired  basis.  The  adversary  can 
exploit  her  freedom  to  rotate  these  devices  to  reduce  the  disturbance  caused  by  her 
eavesdropping. 

Our  results  do  not  subsume,  nor  are  they  subsumed  by,  the  results  of  [2,  6,  8].  Mayers  and 
Koashi-Preskill  assume  that  the  detector  or  source  is  uncharacterized,  but  that  the  adversary 
is  unable  to  influence  the  behavior  of  the  devices  to  suit  her  purposes.  We  assume  that  the 
flaws  in  the  devices  are  limited,  but  that  the  adversary  controls  the  apparatus  within  these 
limits;  furthermore,  our  security  proof  (unlike  the  Koashi-Preskill  proof)  applies  to  a  source 
that  leaks  a  small  amount  of  information  about  the  choice  of  basis.  And  more  important, 
while  Mayers  assumes  that  the  source  is  perfect,  and  Koashi  and  Preskill  assume  that  the 
detector  is  perfect,  our  new  techniques  apply  when  both  the  source  and  the  detector  have 
small  basis-dependent  imperfections,  the  generic  case  in  practical  settings.  In  addition,  while 
Koashi  and  Preskill  assume  that  the  signals  emitted  by  Alice’s  source  are  uncorrelated  with 
one  another  (the  state  describing  the  emission  of  n  signals  is  a  tensor  product  of  n  individual 
signals),  and  while  Mayers  likewise  assumes  that  the  signals  are  detected  individually  rather 
than  collectively  by  Bob,  our  results  are  not  inherently  subject  to  such  limitations.  (However, 
we  do  assume  that  the  signals  are  emitted  and  detected  individually  in  many  of  the  examples 
that  we  analyze.) 

Aspects  of  the  security  of  quantum  key  distribution  in  realistic  settings  have  been  analyzed 
previously  [9,  10,  11,  12,  13,  14].  However,  our  proof  of  security  holds  for  arbitrary  collective 
attacks  by  the  eavesdropper,  while  individual  attacks  were  considered  in  most  previous  work. 
(An important exception is the recent study by Inamori, Lütkenhaus and Mayers [8] of sources
that emit weak coherent states.) Although our results do not yet constitute a definitive
analysis  of  the  security  of  realistic  quantum  cryptography,  we  expect  that  the  tools  we  have 
developed  will  prove  useful  in  further  studies. 

Beyond any of our particular results, we have broadened notably the domain of applicability
of the Shor-Preskill method for proving security. This method has many further applications,
and in particular allows one to easily analyze the effectiveness of various enhancements
of  the  protocol  such  as  two-way  postprocessing  [15]. 

Our  findings  are  of  both  practical  and  conceptual  interest.  It  is  important  to  address 
whether  practical  implementations  of  quantum  key  distribution  are  truly  secure,  and  in  real- 
world  implementations  the  apparatus  is  never  flawless.  And  apart  from  practical  concerns, 
quantum key distribution provides a fascinating theoretical laboratory for quantitatively
exploring the unavoidable tradeoff between collecting information about a quantum system and
disturbing  the  system. 

We note that the security against arbitrary eavesdropping attacks of quantum key
distribution performed with imperfect devices has also been analyzed by Ben-Or [16].

The  rest  of  this  paper  is  organized  as  follows.  In  Sec.  2,  we  clarify  the  setting  of  our  analysis 
by  introducing  Eve’s  collaborator  Fred,  who  controls  the  flaws  in  the  source  and  detector. 
We  review  the  connection  between  the  BB84  protocol  and  entanglement-based  quantum  key 
distribution  in  Sec.  3,  and  reprise  the  Shor-Preskill  argument  which  is  the  foundation  for 
all  that  follows.  We  carefully  formulate  our  models  for  sources  and  detectors  in  Sec.  4,  and 
point  out  in  Sec.  5  some  ways  in  which  these  models  fail  to  capture  fully  the  properties  of 
real  devices.  In  Sec.  6  we  introduce  the  concept  of  a  quantum  coin,  which  is  a  useful  tool 
for  analyzing  the  power  of  Fred’s  basis-dependent  attack  on  the  equipment,  and  in  Sec.  7  we 
present  our  security  proof  for  a  general  class  of  attacks  that  depend  sufficiently  weakly  on 
the  basis.  We  then  proceed  to  explore  various  applications  of  this  result:  In  Sec.  8,  we  prove 
security  for  the  case  where  the  detector  is  perfect  but  the  source  has  small  generic  flaws,  and 
in  Sec.  9,  we  treat  the  case  where  the  detector  has  small  flaws  and  the  source  is  flawed  but 
oblivious;  that  is,  it  leaks  no  information  about  the  basis  used.  Sec.  10  analyzes  the  case  where 
the  source  and  detector  are  both  slightly  misaligned,  and  in  Sec.  11  we  state  without  proof  a 
result for the case where both the source and detector have small generic flaws (where the
source  is  not  necessarily  oblivious).  The  case  in  which  a  fraction  of  the  signals  emitted  by  the 
source  are  tagged  with  basis  information  is  dealt  with  in  Sec.  12;  this  analysis  is  relevant  to 
sources  that  emit  weak  coherent  states  with  random  phases,  sources  that  are  close  to  single 
photon  sources,  and  a  scenario  where  some  of  the  basis  and  key  bits  are  selected  by  flipping 
biased  coins.  Finally,  in  Sec.  13  we  discuss  the  case  of  a  detector  with  imperfect  efficiency 
that  is  controlled  by  the  adversary,  and  Sec.  14  contains  some  concluding  comments. 

2  Alice  and  Bob  and  Eve  and  Fred 

To  clarify  our  assumptions  about  the  source  and  detector  imperfections,  it  is  helpful  to  imagine 
that  two  collaborating  adversaries  are  trying  to  foil  the  key  distribution  protocol:  Eve  and 
Fred.  The  goal  of  Alice  and  Bob  is  to  generate  a  shared  key  not  known  to  the  Eve/Fred 
alliance. 

Fred  knows  the  basis  chosen  by  Alice  and/or  Bob,  and  he  can  tamper  with  the  source 
and/or  detector,  but  only  within  certain  prescribed  limits.  Because  the  basis  dependence  of 


D.  Gottesman,  H.-K.  Lo,  N.  Lutkenhaus,  and  J.  Preskill  329 


his  attack  is  limited,  Fred  can  acquire  only  limited  knowledge  of  what  signal  was  emitted  by 
the  source  and  what  outcome  was  recorded  by  the  detector. 

Eve  on  the  other  hand  has  no  a  priori  knowledge  of  the  basis  chosen  by  Alice  or  by  Bob, 
and  she  has  no  direct  control  over  the  source  or  the  detector.  But  Eve  is  permitted  to  attack  all 
of  the  signals  sent  by  Alice  to  Bob  collectively  in  any  manner  allowed  by  quantum  mechanics. 
For  example,  Eve  may  entangle  an  ancilla  that  she  controls  with  each  signal  after  the  signal 
is  emitted  by  the  source  and  before  it  is  absorbed  by  the  detector.  Then  Eve  may  delay  the 
measurement  of  her  ancilla  until  after  all  public  discussion  by  Alice  and  Bob  is  concluded, 
choosing  her  measurement  to  optimize  her  information  about  the  key. 

While  Eve  can  send  to  Fred  any  quantum  or  classical  message  of  her  choice,  communication 
from  Fred  to  Eve  is  restricted.  Before  Eve  interacts  with  the  signals,  Fred  may  wish  to  notify 
her  about  Alice’s  basis  choice,  but  his  only  means  of  conveying  this  information  is  through 
his  limited  ability  to  control  the  source.  After  Bob  confirms  receipt  of  the  signals,  Fred  is 
permitted  to  share  further  information  with  Eve  by  sending  it  via  a  classical  or  quantum  side 
channel.  Apart  from  this  restriction  on  their  communication,  Eve  and  Fred  are  free  to  choose 
a  common  strategy  that  optimally  exploits  Fred’s  limited  power  to  manipulate  the  source  and 
detector. 

Various  security  proofs  apply  to  settings  that  can  be  distinguished  by  describing  Fred’s 
role.  In  the  setting  considered  by  Mayers  and  by  Koashi  and  Preskill,  Fred  does  not  share 
information  with  Eve,  and  the  goal  of  Alice  and  Bob  is  to  generate  a  shared  key  that  Eve 
does  not  know.  Mayers  assumes  that  the  source  is  perfect,  but  Fred  is  free  to  choose  the 
measurement  performed  by  the  detector,  which  can  depend  at  Fred’s  discretion  on  Bob’s 
declared  basis,  and  to  report  to  Bob  a  portion  of  the  information  collected  in  the  measurement. 
Koashi  and  Preskill  assume  that  the  detector  is  perfect,  but  Fred  is  free  to  choose  the  states 
emitted  by  the  source  except  for  one  proviso:  the  emitted  state,  averaged  over  Alice’s  key  bit, 
is  independent  of  Alice’s  basis.  In  the  setting  considered  in  this  paper,  Eve  again  applies  an 
arbitrary  basis-independent  quantum  operation  to  her  probe  and  the  transmitted  signals.  And 
again,  Fred,  who  has  information  about  the  declared  bases,  can  influence  how  the  equipment 
operates.  But  now,  the  basis  dependence  of  Fred’s  attack  is  limited,  and  Fred  and  Eve  can 
pool  their  knowledge  after  the  signals  are  detected. 

All  of  these  settings  are  interesting.  In  the  Mayers  model,  the  detector  can  be  arbitrarily 
flaky,  and  in  the  Koashi-Preskill  model,  the  source  leaks  no  basis  information  but  is  otherwise 
arbitrary.  In  the  model  we  consider,  both  the  source  and  detector  are  “pretty  good”  but 
controlled  (within  limits)  by  the  adversary.  Our  models  of  the  source  and  the  detector  will 
be  described  in  more  detail  in  Sec.  4. 

3  Distillation  and  privacy 

Our  analysis  follows  the  method  of  proof  used  by  Shor  and  Preskill  [5],  which  we  will  now 
briefly  review.  In  this  method,  security  is  first  established,  following  [3],  for  a  protocol  based 
on an entanglement distillation protocol (EDP). Then the security of a “prepare-and-measure”
protocol,  namely  BB84,  is  established  through  a  reduction  to  the  EDP  protocol. 

We  remark  that  entanglement  distillation  was  first  discussed  in  [17],  that  its  relevance  to 
the  security  of  quantum  key  distribution  was  emphasized  in  [18],  and  that  this  connection 
was  established  rigorously  in  [3].  Entanglement  distillation  protocols  have  also  been  called 
“entanglement purification protocols,” abbreviated EPP. We prefer to say “distillation” rather
than  “purification”  as  “purification”  now  has  another  widely  accepted  meaning  in  quantum 
information  theory. 

In the EDP protocol, Alice creates $n + m$ pairs of qubits, each in the state

$$|\phi^+\rangle = \frac{1}{\sqrt{2}}\left(|00\rangle + |11\rangle\right), \qquad (1)$$

the simultaneous eigenstate with eigenvalue one of the two commuting operators $X \otimes X$ and
$Z \otimes Z$, where

$$X = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad Z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \qquad (2)$$

are the Pauli operators. Then she sends half of each pair to Bob. Alice and Bob sacrifice $m$
randomly selected pairs to test the “error rates” in the X and Z bases by measuring $X \otimes X$
and $Z \otimes Z$. If the error rate is too high, they abort the protocol. Otherwise, they conduct
the EDP, extracting $k$ high-fidelity pairs from the $n$ noisy pairs. Finally, Alice and Bob both
measure Z on each of these pairs, producing a $k$-bit shared random key about which Eve has
negligible  information.  The  protocol  is  secure  because  the  EDP  removes  Eve’s  entanglement 
with  the  pairs,  leaving  her  powerless  to  discern  the  outcome  of  the  measurements  by  Alice 
and  Bob. 
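As a quick sanity check on Eq. (1), the following minimal sketch (our illustration, not part of the original analysis) verifies numerically that the pair state is a simultaneous eigenvalue-one eigenstate of $X \otimes X$ and $Z \otimes Z$. The helper names `kron`, `apply`, and `is_plus_one_eigenstate` are hypothetical, and the basis ordering |00>, |01>, |10>, |11> is an assumption of the sketch.

```python
from math import sqrt, isclose

# Single-qubit Pauli matrices as nested lists.
X = [[0, 1], [1, 0]]
Z = [[1, 0], [0, -1]]

def kron(a, b):
    """Kronecker (tensor) product of two square matrices."""
    return [[x * y for x in row_a for y in row_b]
            for row_a in a for row_b in b]

def apply(m, v):
    """Matrix-vector product."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]

def is_plus_one_eigenstate(op, state):
    """True if op acting on state returns the same state (eigenvalue +1)."""
    return all(isclose(a, b, abs_tol=1e-12)
               for a, b in zip(apply(op, state), state))

# |phi+> = (|00> + |11>)/sqrt(2), in the ordering |00>, |01>, |10>, |11>.
phi_plus = [1 / sqrt(2), 0.0, 0.0, 1 / sqrt(2)]

XX = kron(X, X)
ZZ = kron(Z, Z)

print(is_plus_one_eigenstate(XX, phi_plus))
print(is_plus_one_eigenstate(ZZ, phi_plus))
```

A pair that yields $-1$ for either operator is what the text counts as an error in the corresponding basis.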

If  the  EDP  protocol  has  special  properties,  then  proving  the  security  of  BB84  can  be 
reduced  to  proving  security  of  the  EDP.  Shor  and  Preskill  considered  EDP’s  with  one-way 
communication  [19],  which  are  equivalent  to  quantum  error-correcting  codes,  and  furthermore, 
considered  the  specific  class  of  codes  known  as  Calderbank-Shor-Steane  (CSS)  codes  [20,  21]. 
(Gottesman  and  Lo  [15]  have  described  how  a  similar  reduction  can  be  applied  to  certain 
EDP’s  with  two-way  communication.)  Like  any  quantum  error-correcting  code,  a  CSS  code 
can correct both bit errors (pairs with $Z \otimes Z = -1$) and phase errors (pairs with $X \otimes X = -1$).
But  the  crucial  property  of  a  CSS  code  is  that  the  bit  and  phase  error  correction  procedures 
can  be  decoupled  —  Z  errors  can  be  corrected  without  knowing  anything  about  the  X  errors 
and  vice-versa. 

In  the  EDP  protocol,  the  key  is  affected  by  the  bit  error  correction  but  not  by  the  phase 
error  correction.  The  phase  error  correction  is  important  to  expunge  entanglement  with  Eve 
and  so  ensure  the  privacy  of  the  key.  But  Eve’s  information  about  the  final  key  is  unaffected 
if  Alice  and  Bob  dispense  with  the  phase  error  correction.  What  is  essential  is  not  that  the 
phase  error  correction  is  actually  done,  but  rather  that  it  would  have  been  successful  if  it  had 
been  done. 

With  the  phase  error  correction  removed,  the  extraction  of  the  final  key  from  the  n  noisy 
pairs  is  much  simplified.  Rather  than  first  carrying  out  the  EDP  and  then  measuring  Z  for 
each of the $k$ distilled pairs, Alice and Bob can instead measure Z for each of the $n$ noisy
pairs,  and  then  do  classical  postprocessing  of  their  measurement  results  to  extract  the  final 
key.  In  this  form,  the  entanglement-based  protocol  becomes  equivalent  to  BB84. 

We  can  see  the  equivalence  more  clearly  by  adding  one  further  wrinkle  to  the  entanglement- 
based  protocol.  In  the  BB84  protocol,  Alice  and  Bob  choose  their  bases  at  random,  so  that 
about  half  of  the  sifted  key  bits  are  transmitted  in  the  X  basis  and  about  half  in  the  Z 
basis.  But  in  the  entanglement-based  protocol  as  we  have  described  it,  all  of  the  final  key  bits 
are generated by measuring in the Z basis. To relate the two protocols, suppose that in the
entanglement-based protocol a subset of the pairs is selected at random, and that for each pair
in this subset, Alice and Bob apply the Hadamard transformation $H : X \leftrightarrow Z$ to their qubits
before  measuring  Z.  Equivalently,  we  can  instruct  Alice  and  Bob  to  measure  X  rather  than 
Z  for  these  selected  qubits.  Each  measurement  by  Alice  in  the  entanglement-based  protocol 
prepares  a  qubit  to  be  sent  to  Bob  in  one  of  the  four  BB84  states:  X  =  ±1,  Z  =  ±1,  chosen 
at  random.  In  BB84,  Bob  measures  either  X  or  Z,  and  through  public  discussion  Alice  and 
Bob  reject  the  key  bits  where  they  used  different  bases;  the  remaining  key,  for  which  their 
bases  agree,  is  called  the  “sifted  key.”  As  far  as  an  eavesdropper  is  concerned,  there  is  no 
difference  between  generating  a  bit  of  sifted  key  in  BB84,  where  a  qubit  is  prepared  by  Alice 
in  a  randomly  chosen  eigenstate  of  either  X  or  Z  and  measured  by  Bob  in  the  same  basis, 
and  generating  a  bit  of  key  in  the  entanglement-based  protocol,  where  Alice  and  Bob  both 
measure  their  halves  of  an  entangled  pair  of  qubits. 

A vestige of the CSS code of the EDP survives as a scheme for error correction and
privacy amplification in this prepare-and-measure protocol. In a CSS code, classical linear
codes $C_1$ and $C_2^\perp$ are used for bit and phase error correction respectively, where $C_2 \subset C_1$. The
entanglement-based protocol is secure (whether or not the phase error correction is done) if,
with “high probability” (probability of success exponentially close to unity), $C_1$ can correct
the bit errors and $C_2^\perp$ can correct the phase errors. In the BB84 protocol, $C_1$ is used to
correct bit errors in the key, and $C_2$ to amplify privacy. Specifically, Alice transmits the
random string $w$ through the quantum channel, randomly selects a codeword $u$ of $C_1$, and
announces $u + w$. Bob receives the corrupted string $w + e$, computes $u + e$, and corrects to $u$.
The final key is the coset $u + C_2$ of $C_2$ in $C_1$.
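The classical postprocessing just described can be sketched with a toy pair of codes. This is only an illustration under our own assumptions: we take $C_1$ to be the [7,4] Hamming code and $C_2$ to be its even-weight subcode, so that $C_2 \subset C_1$ and the two cosets of $C_2$ in $C_1$ are labeled by weight parity. The function names are hypothetical, and a realistic protocol would use far longer codes.

```python
import itertools
import random

N = 7  # block length of the [7,4] Hamming code, playing the role of C1

def syndrome(bits):
    """XOR of the 1-based positions holding a 1; zero exactly for codewords of C1."""
    s = 0
    for position, bit in enumerate(bits, start=1):
        if bit:
            s ^= position
    return s

def correct(bits):
    """Correct at most one flipped bit using the Hamming syndrome."""
    fixed = list(bits)
    s = syndrome(fixed)
    if s:
        fixed[s - 1] ^= 1
    return fixed

def xor(a, b):
    return [x ^ y for x, y in zip(a, b)]

# C1: all 16 Hamming codewords. C2: the 8 even-weight codewords (toy assumption).
C1 = [list(v) for v in itertools.product([0, 1], repeat=N) if syndrome(v) == 0]
C2 = [c for c in C1 if sum(c) % 2 == 0]

def key_from_codeword(u):
    """Label of the coset u + C2 in C1; for these toy codes it is the weight parity."""
    return sum(u) % 2

def run_round(rng, error_position=None):
    w = [rng.randint(0, 1) for _ in range(N)]      # string sent over the quantum channel
    u = rng.choice(C1)                             # Alice's random codeword of C1
    announcement = xor(u, w)                       # public message u + w
    e = [0] * N
    if error_position is not None:
        e[error_position] = 1                      # the channel flips one bit
    received = xor(w, e)                           # Bob holds w + e
    bob_u = correct(xor(announcement, received))   # u + e, corrected to u
    return key_from_codeword(u), key_from_codeword(bob_u)

rng = random.Random(0)
print(run_round(rng, error_position=3))
```

Because the key is a coset label rather than $u$ itself, adding any element of $C_2$ to $u$ leaves the key unchanged; this is the sense in which $C_2$ amplifies privacy.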

If  this  method  is  used  to  compute  the  final  key  in  the  BB84  protocol,  and  if  the  key  being 
distributed  is  very  long,  at  what  asymptotic  rate  can  secure  final  key  be  extracted  from  the 
sifted  key?  The  answer  is  the  rate  k/n  at  which  high-fidelity  pairs  can  be  distilled  from  noisy 
pairs  in  the  EDP,  which  depends  on  how  noisy  the  pairs  are.  The  purpose  of  the  verification 
test  included  in  the  protocol  is  to  obtain  a  reliable  estimate  of  the  noise.  In  the  EDP,  a  useful 
way  to  characterize  the  noise  is  to  imagine  that,  after  the  final  Hadamard  transformations 
are applied to the pairs, all $n$ pairs are measured in the Bell basis; that is, both $Z \otimes Z$ and
$X \otimes X$ are measured. If there were no noise at all, we would find $Z \otimes Z = X \otimes X = 1$ for
every pair. Denote by $n\tilde\delta$ the number of pairs for which we have $Z \otimes Z = -1$ instead; we say
that $\tilde\delta$ is the bit error rate of the noisy pairs. Denote by $n\tilde\delta_p$ the number of pairs for which
we have $X \otimes X = -1$; we say that $\tilde\delta_p$ is the phase error rate of the pairs.

For a given state of the $n$ pairs, the rates $\tilde\delta$ and $\tilde\delta_p$ are actually random variables,
because the quantum measurement of the pairs is nondeterministic. But suppose that from
the verification test, we can infer that for sufficiently large $n$ and any $\varepsilon > 0$, the inequalities
$\tilde\delta < \delta + \varepsilon$ and $\tilde\delta_p < \delta_p + \varepsilon$ are satisfied with high probability. Furthermore, we may imagine
that  the  key  bits  are  subjected  in  the  protocol  to  a  publicly  announced  random  permutation 
(or  equivalently  that  the  CSS  code  is  correspondingly  randomized),  so  that  the  bit  and  phase 
errors  are  randomly  distributed  among  the  qubits.  It  can  then  be  shown  [22,  23]  that,  for 
sufficiently large $n$ and any $\varepsilon' > 0$, there exists a CSS code such that the EDP distills $k$
high-fidelity pairs from the $n$ noisy pairs, where

$$k/n > 1 - H_2(\delta + \varepsilon + \varepsilon') - H_2(\delta_p + \varepsilon + \varepsilon') \qquad (3)$$

and $H_2(\delta) = -\delta \log_2 \delta - (1 - \delta) \log_2 (1 - \delta)$ is the binary entropy function. Therefore, in BB84,
we establish an asymptotically achievable rate of extraction of secure final key from sifted key
(“key generation rate”):

$$R = 1 - H_2(\delta) - H_2(\delta_p). \qquad (4)$$

That is, in the BB84 protocol, a fraction $H_2(\delta)$ of the sifted key bits are sacrificed asymptotically
to perform error correction and a fraction $H_2(\delta_p)$ of the sifted key bits are sacrificed to
perform privacy amplification.
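To make Eq. (4) concrete, here is a minimal sketch that evaluates the key generation rate for given bit and phase error rates. The function names `binary_entropy` and `key_rate` are our own, and the sample error rates are arbitrary illustrative inputs rather than values taken from the analysis.

```python
from math import log2

def binary_entropy(delta):
    """H2(delta) = -delta*log2(delta) - (1-delta)*log2(1-delta), with H2(0) = H2(1) = 0."""
    if delta in (0.0, 1.0):
        return 0.0
    return -delta * log2(delta) - (1.0 - delta) * log2(1.0 - delta)

def key_rate(delta, delta_p):
    """Asymptotic key generation rate of Eq. (4): 1 - H2(delta) - H2(delta_p)."""
    return 1.0 - binary_entropy(delta) - binary_entropy(delta_p)

# Fraction of sifted key spent on error correction and on privacy amplification
# for an illustrative bit error rate of 5% and phase error rate of 8%.
delta, delta_p = 0.05, 0.08
print("error correction cost:", binary_entropy(delta))
print("privacy amplification cost:", binary_entropy(delta_p))
print("key rate:", key_rate(delta, delta_p))
```

The two entropy terms enter separately, which reflects the decoupling of bit and phase error correction in a CSS code.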

We  note  that,  although  the  permutation  randomizes  the  positions  of  both  the  bit  errors 
and  the  phase  errors,  correlations  between  bit  errors  and  phase  errors  may  remain.  However, 
these  correlations  do  not  affect  the  achievable  rate,  because  with  CSS  codes  the  bit  error 
correction  and  phase  error  correction  are  performed  separately.  We  also  remark  that  the  code 
$C_1$ used to correct bit errors can be chosen to be efficiently decodable [24]. It may not be
possible to simultaneously choose the code $C_2$ to be efficiently decodable, but this is not
important, since the phase error correction using $C_2$ is not actually carried out in the BB84
protocol  —  it  need  only  be  possible  in  principle. 

Our  arguments  so  far  have  reduced  the  problem  of  demonstrating  the  security  of  BB84  to 
inferring  sufficiently  stringent  upper  bounds  on  both  the  bit  error  rate  and  the  phase  error 
rate  of  the  pairs  used  to  generate  the  key  in  the  corresponding  entanglement-based  protocol, 
based  on  the  results  of  the  verification  test.  Inferring  the  upper  bound  on  the  bit  error  rate 
is  straightforward.  Let  us  consider  the  version  of  the  entanglement-based  protocol  in  which 
Alice  and  Bob  measure  both  the  test  pairs  and  the  key  generating  pairs  in  the  Z  basis,  but  a 
Hadamard  transformation  is  applied  to  randomly  selected  pairs  just  prior  to  the  measurement. 
When  Eve  interacts  with  the  qubits  traveling  from  Alice  to  Bob,  she  has  no  a  priori  knowledge 
concerning  which  pairs  will  be  used  for  the  test  and  which  will  be  used  for  key  generation. 
Therefore,  the  test  pairs  are  a  fair  sample;  it  follows  from  classical  sampling  theory  that  the 
joint probability of observing $n\delta$ errors in the test set and more than $n(\delta + \varepsilon)$ errors in the
key set is exponentially small for any $\varepsilon > 0$ and $n$ sufficiently large. Note that this argument
works  even  if  Eve’s  attack  induces  strong  correlations  among  the  pairs;  all  that  is  required  is 
that  the  sample  selected  for  the  test  is  chosen  randomly. 
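The sampling statement can be explored numerically in a simplified setting. The sketch below is our own illustration and rests on assumptions not made in the text: the test set and the key set have equal size $n$, the errors sit at fixed positions among the $2n$ pairs, and we bound the event "at most `test_max` errors in the test set and at least `key_min` errors in the key set" in the worst case over the total number of errors. The function name is hypothetical.

```python
from math import comb

def worst_case_joint_probability(n, test_max, key_min):
    """Largest value, over the total number of errors E among 2n pairs, of
    P(test errors <= test_max and key errors >= key_min), when n of the 2n
    pairs are drawn uniformly at random to form the test set."""
    ways_to_pick_test = comb(2 * n, n)
    worst = 0.0
    for total_errors in range(2 * n + 1):
        favorable = 0
        for t in range(min(test_max, total_errors) + 1):
            key_errors = total_errors - t
            if key_min <= key_errors <= n:
                # t erroneous pairs and n - t clean pairs land in the test set
                favorable += comb(total_errors, t) * comb(2 * n - total_errors, n - t)
        worst = max(worst, favorable / ways_to_pick_test)
    return worst

# Test error rate at most 5%, key error rate at least 15%, for growing n.
for n in (20, 40, 80):
    print(n, worst_case_joint_probability(n, n // 20, 3 * n // 20))
```

The computation makes no assumption about how the errors are correlated; only the random choice of the test sample matters, in line with the remark above.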

Inferring  an  upper  bound  on  the  phase  error  rate  requires  an  extra  step.  In  the  Shor- 
Preskill  argument,  it  is  assumed  that  the  adversary  has  no  a  priori  knowledge  about  the  basis 
that  Alice  uses  to  send  her  signals  and  Bob  uses  to  detect  them.  In  the  entanglement-based 
protocol,  this  becomes  the  statement  that  the  adversary  does  not  know  to  which  pairs  the 
Hadamard  transformation  is  applied.  But  since  the  Hadamard  interchanges  the  bit  errors  and 
the  phase  errors,  it  enforces  a  symmetry  between  the  two  types  of  errors.  Therefore,  the  error 
rate  measured  in  the  test  serves  as  an  estimate  of  the  phase  error  rate  as  well  as  the  bit  error 
rate:  with  high  probability  the  phase  error  rate  of  the  key  generating  pairs  is  also  less  than 
$\delta + \varepsilon$. We conclude that final key can be extracted from sifted key at the rate $R = 1 - 2H_2(\delta)$.

Now  we  have  sketched  the  complete  proof  of  security  of  BB84,  except  for  one  technicality. 
The sampling theory argument actually shows that the joint probability of an error rate $\delta$ in
the test pairs and an error rate greater than $\delta + \varepsilon$ for the key generating pairs is exponentially
small. For a security analysis, we should show that the conditional probability of an error rate
for the key generating pairs greater than $\delta + \varepsilon$ is exponentially small, given the error rate $\delta$
found in the test. The desired result follows from Bayes’s theorem as long as we assume that
Eve’s attack “passes” the verification test with a probability that is not itself exponentially
small. That is, we exclude strategies by Eve such that extraordinary luck is required to induce
the (small) error rate $\delta$ found in the test. With this caveat in mind, we propose this definition
of  security: 

Definition. Security of quantum key distribution. A quantum key distribution protocol
is secure if for any attack by Eve that passes the verification test with a probability that is
not exponentially small, with high probability Alice and Bob agree on a final key that is nearly
uniformly distributed and Eve’s information about the final key is exponentially small. Here
“exponentially small” means bounded above by $e^{-CN}$ where $N$ is the number of signals transmitted
in the protocol and $C$ is a positive constant, “high probability” means exponentially close
to 1, and “nearly uniformly distributed” means with a probability distribution exponentially
close to the uniform distribution.

And  we  conclude: 

Theorem 1. Security of BB84 against basis-independent attacks. The BB84 protocol
is secure if Eve launches a basis-independent attack. Secure final key can be extracted from
sifted key at the asymptotic rate

$$R = \mathrm{Max}\left(1 - 2H_2(\delta),\, 0\right) \qquad (5)$$

where $\delta$ is the bit error rate found in the verification test (assuming $\delta < 1/2$).
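Equation (5) implies a largest tolerable bit error rate, above which the guaranteed rate vanishes. The following sketch locates that value by bisection; the function names and the tolerance are our own choices, and the code only evaluates the formula rather than adding anything to the proof.

```python
from math import log2

def h2(x):
    """Binary entropy function, with h2(0) = h2(1) = 0."""
    if x in (0.0, 1.0):
        return 0.0
    return -x * log2(x) - (1.0 - x) * log2(1.0 - x)

def theorem1_rate(delta):
    """Asymptotic rate of Eq. (5): Max(1 - 2*H2(delta), 0)."""
    return max(1.0 - 2.0 * h2(delta), 0.0)

def zero_rate_threshold(tol=1e-12):
    """Bit error rate in (0, 1/2) at which 1 - 2*H2(delta) crosses zero."""
    lo, hi = 0.0, 0.5  # the expression is +1 at 0, -1 at 1/2, and decreasing in between
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if 1.0 - 2.0 * h2(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

print("rate at 5% errors:", theorem1_rate(0.05))
print("threshold:", zero_rate_threshold())
```

The threshold comes out near 11%, so for basis-independent attacks Eq. (5) guarantees a positive rate only for bit error rates below roughly that value.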

To reiterate, two error rates are relevant to whether quantum key distribution is successful.
The bit error rate is “measured” by conducting a verification test on a randomly sampled
subset of the sifted key bits; that is, the observed bit error rate $\delta$ found in the test provides an
estimate of the error rate $\tilde\delta$ in the key generating bits that is accurate with high probability. If
$\tilde\delta$ is low enough, we can be confident that error correction will succeed, so that Alice and Bob
share a common key. The phase error rate $\tilde\delta_p$ is not measured by direct sampling; rather,
an upper bound $\tilde\delta_p < \delta_p + \varepsilon$ is inferred from the bit error rate. If the inferred phase error rate
$\delta_p$ is low enough, we can be confident that phase error correction (if done) will succeed, so
that Eve will have a negligible amount of information about the key.

If the adversary has no knowledge of the basis, then with high probability the gap
$|\tilde\delta_p - \tilde\delta|$ between the bit and phase error rates is asymptotically negligible, and the inference is
straightforward. For example, if the effect of Eve’s attack is to apply X to Bob’s qubit, this
action will induce a bit error if Alice and Bob both measure Z to generate a key bit in the
entanglement-based protocol, and it will induce a phase error if Alice and Bob both measure
X. Since Eve doesn’t know the basis, her action generates bit errors and phase errors with
the same probability. But in this paper, going beyond Shor’s and Preskill’s original argument,
we will allow Fred to know the basis, enabling him to enhance $\tilde\delta_p$ relative to $\tilde\delta$. In many cases
of interest, the basis dependence of Fred’s attack is limited; we can infer an upper bound
$\tilde\delta_p < \delta_p + \varepsilon$ and so through Eq. (4) establish an achievable key length.
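The example of an X error on Bob's qubit can be checked directly. In this minimal sketch (our illustration, with hypothetical helper names and real amplitudes assumed throughout), the attacked pair shows a bit error when no Hadamard is applied, and the same attack shows up as a phase error when both parties apply the Hadamard before measuring.

```python
from math import sqrt, isclose

S = 1 / sqrt(2)
I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Z = [[1, 0], [0, -1]]
H = [[S, S], [S, -S]]

def kron(a, b):
    """Kronecker (tensor) product of two square matrices."""
    return [[x * y for x in row_a for y in row_b]
            for row_a in a for row_b in b]

def apply(m, v):
    """Matrix-vector product."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]

def expectation(op, state):
    """<state|op|state> for a state with real amplitudes."""
    return sum(a * b for a, b in zip(state, apply(op, state)))

phi_plus = [S, 0.0, 0.0, S]
attacked = apply(kron(I2, X), phi_plus)   # Eve applies X to Bob's qubit
rotated = apply(kron(H, H), attacked)     # both parties then apply the Hadamard

ZZ, XX = kron(Z, Z), kron(X, X)
print("no Hadamard:   ZZ =", expectation(ZZ, attacked), " XX =", expectation(XX, attacked))
print("with Hadamard: ZZ =", expectation(ZZ, rotated), " XX =", expectation(XX, rotated))
```

An eigenvalue of $-1$ for $Z \otimes Z$ signals a bit error and $-1$ for $X \otimes X$ signals a phase error, so the Hadamard exchanges the two error types for a fixed attack.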

In the BB84 protocol, Alice and Bob can measure both the error rate $\delta_X$ when they use
the X basis and the error rate $\delta_Z$ when they use the Z basis. These rates need not be equal
even if Eve does not know the bases that Alice and Bob use. For example, Eve might measure
in the Z basis each qubit she receives from Alice, and resend to Bob the Z eigenstate found
by her measurement, resulting in expected values $\delta_Z = 0$ and $\delta_X = 1/2$. We emphasize that
$\delta$ and $\delta_p$ should not be confused with $\delta_Z$ and $\delta_X$. The bit error rate $\delta = (\delta_X + \delta_Z)/2$ is
observed in the verification test, but the phase error rate $\delta_p$ is not directly “measured” in the
protocol.
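The intercept-and-resend example can be worked out exactly with a few lines of arithmetic. The sketch below is our own illustration: it assumes ideal single-qubit signals, equiprobable key bits, and an Eve who always measures in the Z basis; the names `overlap_probability` and `error_rate` are hypothetical.

```python
from math import sqrt

S = 1 / sqrt(2)
# Real amplitude vectors for the four BB84 states, indexed by key bit.
Z_BASIS = {0: (1.0, 0.0), 1: (0.0, 1.0)}
X_BASIS = {0: (S, S), 1: (S, -S)}

def overlap_probability(a, b):
    """Probability |<a|b>|^2 of finding state b when measuring state a."""
    return (a[0] * b[0] + a[1] * b[1]) ** 2

def error_rate(basis):
    """Bob's error probability when Alice and Bob both use `basis` and Eve
    measures each qubit in the Z basis, resending the eigenstate she finds."""
    total = 0.0
    for bit, sent in basis.items():
        for eve_state in Z_BASIS.values():
            p_eve = overlap_probability(sent, eve_state)
            p_wrong = overlap_probability(eve_state, basis[1 - bit])
            total += 0.5 * p_eve * p_wrong   # key bits assumed equiprobable
    return total

delta_z = error_rate(Z_BASIS)
delta_x = error_rate(X_BASIS)
print("delta_Z =", delta_z)
print("delta_X =", delta_x)
print("observed bit error rate =", (delta_x + delta_z) / 2)
```

The two basis-resolved rates differ sharply even though this particular attack uses no knowledge of the basis chosen by Alice and Bob.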

4  Model  devices 

Because  the  Shor-Preskill  argument,  both  in  its  original  incarnation  and  in  its  extension 
to  basis-dependent  attacks,  makes  use  of  an  EDP,  there  are  limitations  on  the  sources  and 
detectors  to  which  it  applies.  In  the  entanglement-based  protocol,  Alice  and  Bob  both  measure 
qubits,  in  either  the  X  basis  or  the  Z  basis  —  what  we  will  call  standard  measurements. 
In  the  corresponding  prepare  and  measure  protocol,  Alice’s  source  need  not  emit  a  qubit, 
but  whatever  it  emits  can  be  simulated  by  a  standard  measurement  performed  on  half  of  a 
bipartite  state  [22].  The  state  that  arrives  at  Bob’s  detector  also  might  not  be  a  qubit,  but 
the  measurement  can  be  realized  as  a  standard  measurement  preceded  by  an  operation  that 
“squashes”  the  incoming  state  to  a  two-dimensional  Hilbert  space. 

To be more specific, the source model that we adopt is as follows: Alice’s source emits a
state in a Hilbert space $\mathcal{H}_A$, where $\mathcal{H}_A$ can be arbitrary, and she launches a state by acting on
an auxiliary qubit $A'$. Alice’s basis choice $a \in \{0, 1\}$ is determined by flipping a coin. Then a
state $\rho_a$ of $\mathcal{H}_A \otimes \mathcal{H}_{A'}$, which can depend on the basis $a$, is prepared by Fred. Alice proceeds
to perform a standard measurement on her qubit $A'$ in the basis indicated by $a$; that is, a
Hadamard transformation is performed on $A'$ if and only if $a = 1$, and then Alice measures
the qubit in the Z basis. Her measurement determines her key bit: $g = 0$ for outcome $+1$,
$g = 1$ for outcome $-1$. (Note that, depending on the state $\rho_a$, the key bits $g = 0, 1$ need not
be equiprobable.) If Fred’s states $\rho_0$ and $\rho_1$ are close to one another, then the states emitted
by the source, averaged over the key bit, depend only weakly on the basis.

Actually we can generalize this source model to allow successive emissions to be correlated
with one another. Now let $A'$ denote a system of n qubits, $A$ the system in which Alice’s
n signals reside, and $\rho_a$ a state of $\mathcal{H}_A \otimes \mathcal{H}_{A'}$; the state $\rho_a$ may depend on the n-bit string
$a$ that specifies Alice’s basis choice for each of the n signals. Alice applies a Hadamard
transformation to the ith qubit if and only if $a_i = 1$, then measures the qubits in the Z basis.
The measurement outcomes determine her n-bit key $g$. Some of the results we report in this
paper (Theorem 2, for example) apply to this more general source model.

The n signals emitted by the source are attacked by Eve, who sends to the detector a
state that lives in a Hilbert space $\mathcal{H}_B$. We model the detector as follows: Bob’s basis choice
$b \in \{0, 1\}^n$ for the n signals is determined by flipping n coins. Then Fred applies to the
state received by the detector a quantum channel $\mathcal{E}_b$ that “squashes” $\mathcal{H}_B$ to the n-qubit space
$\mathcal{H}_{B'}$; this squash operation may depend on the basis $b$. Bob proceeds to perform standard
measurements on the qubits; a Hadamard is performed on the ith qubit if and only if $b_i = 1$,
then Bob measures the qubits in the Z basis. The measurements determine his n-bit key $h$:
$h_i = 0$ if the outcome of the measurement of the ith qubit is $+1$ and $h_i = 1$ for outcome
$-1$. Since the channel taking $\mathcal{H}_B$ to $\mathcal{H}_{B'}$ can act collectively on the incoming signals, our
model allows the detector to perform a collective measurement on the n signals it receives.

^a If Alice and Bob perform a refined error analysis [25] (measuring separate error rates for the two bases), they
can improve the key generation rate to $R = 1 - H_2(\delta_Z) - H_2(\delta_X)$ [23].

Fig. 1. Roles of Alice, Bob, Eve, and Fred in the key distribution protocol. Fred prepares an
entangled state $\rho_a$ (which may depend slightly on Alice’s basis choice $a$) of Alice’s n-qubit Hilbert
space $A'$ and the signal space $A$; then Alice triggers the source by performing a standard measurement
on $A'$. Eve applies an arbitrary basis-independent attack, and then Fred applies a channel $\mathcal{E}_b$
(which may depend slightly on Bob’s basis choice $b$) that “squashes” the signals from the Hilbert
space $B$ to the n-qubit space $B'$. Finally, the qubits in $B'$ are subjected to a standard measurement
by Bob. The goal of the protocol is to generate a key that is not known by Eve and Fred,
who may communicate freely after Bob measures.


The basis-dependence in the detector’s performance is encoded in Fred’s channel $\mathcal{E}_b$.

The prepare-and-measure BB84 protocol, for the source and detector models we have
described,  is  depicted  in  Fig.  1.  It  can  be  related  to  a  protocol  in  which  entangled  pairs  of 
qubits  are  prepared  by  Eve  (with  help  from  Fred).  Half  of  each  pair  is  delivered  to  Alice, 
half  to  Bob,  and  they  then  proceed  to  perform  standard  measurements.  The  security  of  this 
latter  protocol  follows  from  the  security  of  the  corresponding  EDP.  Therefore,  for  this  model 
of  source  and  detector,  we  can  use  the  Shor-Preskill  method  to  analyze  the  security  of  BB84. 

It may be instructive to contrast our models of the source and detector with those considered
in the proofs of Mayers [2] and Koashi and Preskill [6]. Mayers allows the detector to
perform  an  arbitrary  two-outcome  POVM  on  each  signal  it  receives,  while  in  our  model  the 
POVM  must  be  one  that  can  be  realized  by  a  squash  followed  by  a  standard  measurement. 
In principle, our model entails no loss of generality, since the Mayers POVM could be followed
by the preparation of a qubit in a state chosen so that the standard measurement will
reproduce the outcome of the POVM. However, our security proof works only if the channel
$\mathcal{E}_b$ applied by Fred depends sufficiently weakly on the basis $b$, while Mayers requires no such
condition. Koashi and Preskill consider a source that can be realized by the preparation of a
basis-independent state of a bipartite system, followed by an arbitrary two-outcome POVM
on  half  of  the  system,  while  in  our  model  the  POVM  must  be  a  standard  measurement  of 
a  qubit.  For  the  signals  emitted  by  a  general  Koashi-Preskill  source,  though  it  would  be 
possible  to  launch  the  same  signals  by  performing  a  standard  measurement  on  a  qubit,  this 
can  be  done  only  by  choosing  bipartite  states  that  depend  strongly  on  the  basis,  and  our 
security  proof  works  only  when  the  dependence  of  the  states  on  the  basis  is  sufficiently  weak. 
Therefore, our analysis of security does not apply to the general Mayers detector or the general
Koashi-Preskill source. On the other hand, Mayers does not allow Fred to attack the
source,  Koashi  and  Preskill  do  not  allow  Fred  to  attack  the  detector,  and  the  signals  emitted 
by  the  Koashi-Preskill  source  reveal  no  information  about  the  basis  used.  In  contrast,  our 
model  allows  the  performance  of  both  the  source  and  the  detector  to  depend  on  the  basis, 
and  allows  the  source  to  leak  some  information  about  the  basis. 

Another noteworthy difference between our model and those of Mayers and Koashi-Preskill
is that our model allows Alice’s source to emit successive signals that are entangled with one
another,  and  allows  the  detector  to  measure  the  signals  collectively.  In  contrast,  Koashi  and 
Preskill  assume  that  the  signals  emitted  by  Alice’s  source  are  unentangled  with  one  another, 
and  Mayers  likewise  assumes  that  the  signals  are  detected  by  Bob  individually  rather  than 
collectively.  This  assumption  is  used  because  a  crucial  step  in  the  Mayers  proof  is  to  show 
that  Eve’s  information  about  Alice’s  key  would  be  unchanged  if  Bob  were  to  flip  the  basis 
in  which  he  measures  the  key  bits,  but  not  the  basis  in  which  he  measures  the  test  bits.  A 
general  collective  measurement  of  the  signals  by  Bob  would  generate  correlations  between  key 
bit  measurements  and  test  bit  measurements;  therefore,  when  Bob  announces  the  outcome 
of  his  measurements  of  the  test  bits  he  might  reveal  to  Eve  some  information  about  his 
choice  of  basis  for  the  measurement  of  the  key  bits.  For  this  reason,  we  do  not  know  how 
to  justify  the  invariance  of  Eve’s  information  under  the  basis  flip  in  the  case  of  a  collective 
measurement  (though  it  is  not  inconceivable  that  the  argument  can  be  extended  to  cover  that 
case).  Similarly,  the  Koashi-Preskill  proof  uses  the  property  that  Eve’s  information  about 
Bob’s  key  would  be  unchanged  if  Alice  were  to  flip  the  basis  in  which  she  sends  the  key  bits 
but  not  the  basis  in  which  she  sends  the  test  bits,  which  cannot  be  justified  unless  Alice’s 
signals  are  unentangled. 

Our model of the detector can easily be generalized by endowing the detector with imperfect
efficiency, so that it sometimes misfires and fails to record an outcome. One simple
modification attaches an additional flag bit $d_i$ to each of Bob’s qubits. If $d_i = 0$, then the ith
qubit is measured as above, but if $d_i = 1$ then the ith qubit is discarded and no outcome is
recorded.

Detector  inefficiencies  and  other  types  of  losses  can  be  incorporated  into  the  Shor-Preskill 
security  analysis  easily  enough.  Through  public  discussion,  Alice  and  Bob  can  eliminate 
from  their  sifted  key  all  signals  for  which  Bob  failed  to  record  a  measurement  result.  In  the 
entanglement-based  protocol,  then,  we  consider  an  EDP  applied  to  all  the  pairs  from  which 
sifted  key  bits  will  be  successfully  extracted  when  the  measurements  are  performed.  That  is, 
before  the  EDP  is  applied  we  discard  all  pairs  for  which  Alice  and  Bob  chose  different  bases 
or  for  which  the  detector  misfired,  as  well  as  the  pairs  consumed  by  the  verification  test. 
Security  is  then  proven  if  we  can  infer  from  the  test  that,  with  high  probability,  the  remaining 
pairs  have  sufficiently  low  rates  of  bit  errors  and  phase  errors.  However,  this  inference  must 
take  into  account  any  basis  dependence  in  the  detector  efficiency  that  might  contribute  to 
the gap between $\delta$ and $\delta_p$, as we will discuss further in Sec. 13. Basis-dependent detector
inefficiencies  are  more  problematic  for  the  Mayers  argument,  since  the  basis  dependence  may 
spoil  the  invariance  of  Eve’s  information  about  Alice’s  key  when  Bob  flips  his  basis  for  the 
key  bits  (but  not  the  test  bits). 

5  Real  devices 

We are interested in analyzing the security of quantum key distribution with imperfect equipment
because we seek assurance that our protocols are secure not just in an ideal world but
also  in  the  real  world.  Therefore,  the  inherent  limitations  of  our  source  and  detector  models 
should  be  soberly  contemplated. 

For example, real sources typically emit not qubits but bosonic modes of the electromagnetic
field, and if the likelihood that a mode is multiply occupied is too high, security may
be compromised. To evaluate this security threat in our limited framework, we will need to
adjust  our  source  model  (as  we  will  discuss  in  Sec.  12)  to  incorporate  the  relevant  features, 
even  if  not  all  the  detailed  physics,  of  the  real  source. 

A  similar  comment  applies  to  detectors.  In  a  typical  detector  setup  for  BB84,  the  incoming 
photonic mode encounters a polarizing beam splitter that routes the $Z = 1$ and $Z = -1$
polarization  states  (or  the  X  =  ±1  states)  to  two  different  photon  detectors  —  threshold 
detectors  that  do  not  distinguish  one  photon  from  many.  If  one  or  the  other  detector  fires, 
the  polarization  state  is  identified.  But  if  more  than  one  photon  is  present,  both  detectors 
might  fire,  an  ambiguous  result.  If  Bob  is  equipped  with  such  a  detector,  Eve  can  trigger  the 
ambiguous  result  at  will  by  flooding  the  detector  with  photons.  Even  more  troubling,  Eve  can 
arrange  that  Bob  receive  the  ambiguous  result  if  he  chooses  one  basis  but  not  the  other.  For 
example,  Eve  can  intercept  and  measure  in  the  Z  basis  the  signal  emitted  by  Alice,  and  then 
send  on  to  Bob  many  Z-polarized  photons  in  the  state  she  detects.  Then  Bob  will  reproduce 
Eve’s  result  if  he  measures  in  the  Z  basis,  but  will  obtain  the  ambiguous  result  if  he  measures 
in  the  X  basis  [27].  Thus,  by  exploiting  the  flaw  in  the  design  of  the  detector,  Eve  can  launch 
a  “Trojan  horse”  attack,  in  effect  switching  Bob’s  detector  off  when  it  is  poised  to  detect 
eavesdropping  [28].  Although  our  detector  model  may  not  fully  incorporate  all  the  physics  of 
the  polarization  beam  splitter,  we  will  nonetheless  be  able  to  investigate  in  Sec.  13  the  power 
of  a  Trojan  horse  attack  within  an  EDP  framework. 
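To get a feel for how reliably Eve can trigger the ambiguous result, one can estimate the double-click probability under a simplified model. The sketch below assumes m photons in a single polarization mode, ideal lossless threshold detectors, and a perfect polarizing beam splitter; in the conjugate basis each photon then exits either port with probability 1/2. These assumptions and the function name are illustrative and are not taken from the paper's detector model.

```python
def ambiguous_probability(m, same_basis):
    """Probability that both threshold detectors fire when m identically
    polarized photons arrive (ideal, lossless detectors assumed)."""
    if m < 1:
        return 0.0
    if same_basis:
        return 0.0                   # every photon is routed to the same port
    return 1.0 - 2.0 ** (1 - m)      # 1 - P(all m photons exit the same port)

for m in (1, 2, 5, 20):
    print(m, ambiguous_probability(m, True), ambiguous_probability(m, False))
```

In this toy model a single photon never produces a double click, while a bright pulse produces one almost surely when Bob's basis differs from Eve's.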

6  Choosing  the  basis  quantumly 

For  a  security  analysis  that  is  applicable  to  BB84  performed  with  imperfect  equipment,  we 
wish  to  bound  the  adversary’s  information  in  the  case  of  an  attack  that  depends  weakly  on 
the  basis  used  to  send  and  detect  the  signals.  For  this  purpose,  we  should  find  a  precise 
formulation  of  what  it  means  for  the  basis  dependence  to  be  “weak.”  Therefore,  let  us  focus 
attention  on  the  coins  that  Alice  and  Bob  flip  to  determine  their  random  choices  of  basis.  An 
attack  that  depends  weakly  on  the  basis  is  one  that  depends  only  slightly  on  the  outcomes  of 
the  coin  flips. 

In  the  entanglement-based  protocol  as  we  have  described  it  up  to  now,  the  coin  flip  is 
treated  classically,  and  the  outcome  of  the  flip  determines  whether  a  Hadamard  transformation 
is applied to a qubit before it is measured in the Z basis. Denote by $a_i \in \{0, 1\}$ the outcome
of the flip of the ith coin and by the length-n string $a$ the outcome of the flip of n coins. (In
the BB84 protocol, Alice and Bob flip separate coins. But for our security analysis we may
confine our attention to the sifted key, for which their coin flips agree; therefore in effect there
is only one basis choice $a_i$ for each signal.) Denote by $H(a)$ the operation which applies a
Hadamard to the ith qubit if $a_i = 1$ and the identity to the ith qubit if $a_i = 0$. Then in the
setting  where  Eve  knows  nothing  about  the  basis  choice,  the  effect  of  the  randomly  applied 
Hadamards  by  Alice  and  Bob  (after  the  attack  by  Eve)  is  to  transform  the  state  of  the  n  pairs 
according  to 

$$\rho \to \rho' = \frac{1}{2^n} \sum_{a=0}^{2^n-1} \left( H(a) \otimes H(a) \right) \rho \left( H(a) \otimes H(a) \right) . \qquad (6)$$

Then since $H(b)H(a) = H(a \oplus b)$, $\rho'$ has the property of Hadamard invariance: for any bit
string $b$,

$$\left( H(b) \otimes H(b) \right) \rho' \left( H(b) \otimes H(b) \right) = \rho' . \qquad (7)$$

In the Shor-Preskill argument, this symmetry of $\rho'$ is used to infer that the bit error rate and
phase  error  rate  of  the  key  generating  pairs  are,  with  high  probability,  nearly  the  same. 
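The Hadamard invariance of the averaged state can be verified numerically for a small case. The following is a minimal sketch for n = 2 pairs using plain Python lists; the qubit ordering (Alice's n qubits followed by Bob's n qubits) and the test state are arbitrary choices made for illustration.

```python
import itertools
import math

S = 1 / math.sqrt(2)
ID = [[1.0, 0.0], [0.0, 1.0]]
HAD = [[S, S], [S, -S]]

def kron(A, B):
    """Kronecker product of two matrices given as nested lists."""
    return [[a * b for a in ra for b in rb] for ra in A for rb in B]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def hadamards(a):
    """H(a): Hadamard on qubit i where a[i] == 1, identity elsewhere."""
    M = [[1.0]]
    for bit in a:
        M = kron(M, HAD if bit else ID)
    return M

def conjugate(a, rho):
    """(H(a) x H(a)) rho (H(a) x H(a)); H(a) is real, symmetric, self-inverse."""
    U = kron(hadamards(a), hadamards(a))
    return matmul(matmul(U, rho), U)

def twirl(rho, n):
    """Average over all 2^n basis strings, as in the averaging formula above."""
    dim = len(rho)
    out = [[0.0] * dim for _ in range(dim)]
    for a in itertools.product((0, 1), repeat=n):
        term = conjugate(a, rho)
        for i in range(dim):
            for j in range(dim):
                out[i][j] += term[i][j] / 2 ** n
    return out

def max_diff(A, B):
    return max(abs(x - y) for ra, rb in zip(A, B) for x, y in zip(ra, rb))

n = 2
v = [math.sin(k + 1) for k in range(4 ** n)]   # arbitrary real test vector
norm = math.sqrt(sum(x * x for x in v))
v = [x / norm for x in v]
rho = [[x * y for y in v] for x in v]          # pure-state density matrix

rho_prime = twirl(rho, n)
worst = max(max_diff(conjugate(b, rho_prime), rho_prime)
            for b in itertools.product((0, 1), repeat=n))
print("largest deviation from invariance:", worst)
```

The deviation should vanish up to floating-point rounding for every string b, reflecting the group property $H(b)H(a) = H(a \oplus b)$.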

In  order  to  analyze  (weakly)  basis-dependent  attacks,  it  is  convenient  to  treat  the  coin 
flip  quantumly  rather  than  classically  —  we  can  imagine  that  each  coin  is  in  a  coherent 
superposition  of  heads  and  tails,  and  that  the  Hadamard  transform  is  conditioned  on  the 
state  of  the  coins.  In  the  ideal  protocol,  the  n  coins  are  prepared  in  the  state 

$$\left[ \frac{1}{\sqrt{2}} \left( |0\rangle + |1\rangle \right) \right]^{\otimes n} \qquad (8)$$

and the Hadamard is applied to the ith pair if $a_i = 1$; therefore, if $|\psi\rangle$ is the state of the
pairs, then the effect of the random basis choice can be expressed as

$$|\psi\rangle \otimes |a\rangle \to \left( H(a) \otimes H(a) \right) |\psi\rangle \otimes |a\rangle . \qquad (9)$$

When  we  trace  over  the  state  of  the  coin,  the  effect  on  the  quantum  state  of  the  pairs  is  just 
as  in  eq.  (6). 
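As a sanity check on this statement, the sketch below builds the joint state of one pair and one coin after the coin-controlled Hadamards, traces out the coin, and compares the result with the classical average over the two basis choices. The test state and the index convention (index = 2 * pair + coin) are illustrative assumptions.

```python
import math

S = 1 / math.sqrt(2)
HAD = [[S, S], [S, -S]]

def kron(A, B):
    return [[a * b for a in ra for b in rb] for ra in A for rb in B]

def apply(M, v):
    return [sum(M[i][k] * v[k] for k in range(len(v))) for i in range(len(M))]

def outer(v):
    return [[x * y for y in v] for x in v]

HH = kron(HAD, HAD)
psi = [0.6, 0.0, 0.0, 0.8]            # arbitrary normalized real pair state

# Coin starts in (|0> + |1>)/sqrt(2); H x H acts on the pair iff coin = 1.
branch = {0: psi, 1: apply(HH, psi)}
joint = [S * branch[c][p] for p in range(4) for c in (0, 1)]

# Partial trace over the coin qubit.
reduced = [[sum(joint[2 * p + c] * joint[2 * q + c] for c in (0, 1))
            for q in range(4)] for p in range(4)]

# Classical mixture over the two basis choices (n = 1).
r0, r1 = outer(branch[0]), outer(branch[1])
mixture = [[0.5 * (r0[p][q] + r1[p][q]) for q in range(4)] for p in range(4)]

gap = max(abs(reduced[p][q] - mixture[p][q])
          for p in range(4) for q in range(4))
print("difference between traced-out and averaged states:", gap)
```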

Now,  in  this  formulation,  it  is  easy  to  describe  the  distinction  between  Eve’s  basis- 
independent  attack  and  Fred’s  basis-dependent  attack.  Eve  interacts  only  with  the  pairs, 
but  Fred  is  permitted  to  tamper  with  both  the  pairs  and  the  coins,  as  in  Fig.  2.  In  the  actual 
protocol,  the  coin  is  classical,  but  it  will  not  make  Fred  any  less  powerful  if  we  allow  him  to 
attack  a  quantum  coin  instead.  (When  we  say  that  the  coin  is  “classical,”  we  mean  Fred’s 
attack  is  a  quantum  operation  applied  to  the  pairs  that  is  conditioned  on  the  state  of  the  coin 
in  a  preferred  basis.  We  will  prove  security  for  general  attacks  by  Fred  with  weak  dependence 
on  the  state  of  the  coin,  so  our  results  will  apply  in  particular  to  the  case  of  a  classical  coin.) 
Furthermore,  it  is  easy  to  state  precisely  what  it  means  for  the  attack  to  depend  only  weakly 
on  the  basis:  the  basis  dependence  is  weak  if  Fred’s  attack  disturbs  the  coin  only  slightly. 
This  notion  of  weak  basis  dependence  applies  even  if  we  allow  Fred  to  attack  the  signals  twice, 
at  the  source  (before  Eve’s  attack)  and  at  the  detector  (after  Eve’s  attack).  Actually,  once 
we  introduce  the  quantum  coin  in  this  way,  it  is  not  so  important  to  keep  Fred  in  the  picture 
at  all  —  we  can  go  back  to  the  usual  picture  in  which  there  is  only  one  adversary,  but  limit 
Eve’s  attack  on  the  coin. 

Definition. $\Delta$-balanced attack. Suppose that after n pairs and the n corresponding coins
are attacked by the adversary (but before the final Hadamard transformations, conditioned
on the coins, that precede the measurement of the pairs in the Z basis), the n coins are all
measured in the X basis. The attack is $\Delta$-balanced if, with high probability, the number of
coins for which the measurement outcome is $X = -1$ is less than $n\Delta$.

If $\Delta \approx 0$, the attack is balanced, that is, basis-independent. We will prove in Sec. 7 that if
the attack is $\Delta$-balanced and $\Delta$ is sufficiently small, then secure quantum key distribution is
possible, and we will obtain a lower bound on the achievable key generation rate. Later we
will discuss some more specific examples of $\Delta$-balanced attacks, and for some of those attacks
we will obtain stronger lower bounds on the rate.
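A toy example may help make the definition concrete. Suppose, purely for illustration, that the attack leaves the n coins independent, each in the real state cos(θ)|0⟩ + sin(θ)|1⟩; this product-state model is an assumption and is far more restrictive than the general attacks considered here. The count of X = −1 outcomes is then binomial, and the sketch below evaluates the probability that it reaches nΔ.

```python
import math

def p_minus(theta):
    """P(X = -1) for a coin in the state cos(theta)|0> + sin(theta)|1>."""
    return (1 - math.sin(2 * theta)) / 2

def tail_probability(n, p, Delta):
    """P(at least n*Delta of n independent coins give X = -1)."""
    k_min = math.ceil(n * Delta)
    if p <= 0.0:
        return 1.0 if k_min <= 0 else 0.0
    if p >= 1.0:
        return 1.0
    total = 0.0
    for k in range(max(k_min, 0), n + 1):
        # Binomial pmf evaluated in log space to avoid huge integers.
        log_pmf = (math.lgamma(n + 1) - math.lgamma(k + 1)
                   - math.lgamma(n - k + 1)
                   + k * math.log(p) + (n - k) * math.log1p(-p))
        total += math.exp(log_pmf)
    return total

p_ideal = p_minus(math.pi / 4)        # unperturbed coin: X = +1 with certainty
p_tilt = p_minus(math.pi / 4 - 0.1)   # slightly tilted coin (hypothetical)
tail_1000 = tail_probability(1000, p_tilt, 0.02)
tail_4000 = tail_probability(4000, p_tilt, 0.02)
print(p_ideal, p_tilt, tail_1000, tail_4000)
```

In this toy model the failure probability shrinks as n grows whenever Δ exceeds the single-coin probability of X = −1, which is the sense of "with high probability" in the definition.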

We  emphasize  again  that  for  a  security  analysis  it  suffices  to  imagine  that  Alice  and  Bob 
share  a  single  quantum  coin  that  determines  the  choice  of  basis  for  each  signal.  Of  course, 
in the BB84 protocol, Alice and Bob use separate classical coins to determine whether to use

[Fig. 2 diagram: circuit wires labeled “coin” and “signal”.]


Fig.  2.  The  quantum  coin.  The  basis  choice  for  the  detector  (and  the  source)  is  determined 
by  measuring  a  qubit  (the  coin)  in  the  Z  basis.  To  generate  each  bit  of  sifted  key,  a  conditional 
Hadamard  transformation,  controlled  by  the  coin,  is  applied  to  the  signal  qubit,  and  then  the  signal 
qubit  is  measured  in  the  Z  basis.  Fred’s  basis-dependent  attack  on  the  signal  can  be  described  as  a 
joint  attack  on  the  coin  and  the  signal.  To  quantify  how  the  coin  is  disturbed  by  Fred’s  attack,  we 
consider  measuring  the  coin  in  the  X  basis  after  the  attack  and  before  the  conditional  Hadamard. 


the  Z  basis  or  the  X  basis.  But  the  quantum  coin  is  not  intended  to  provide  an  accurate 
portrayal  of  the  actual  protocol;  it  is  a  mathematical  device  for  analyzing  the  impact  of  the 
basis  dependence  of  the  attack.  For  the  purpose  of  this  analysis,  we  replace  the  two  classical 
coins  by  a  single  quantum  coin  only  after  discarding  the  cases  in  which  Alice’s  classical  coin 
flip  and  Bob’s  classical  coin  flip  yield  different  outcomes. 

We  have  seen  that  for  the  analysis  of  the  BB84  protocol,  it  is  convenient  to  imagine  that 
Alice  delays  the  measurement  that  launches  her  signals  until  after  Eve’s  attack.  That  way, 
we  can  relate  the  classical  privacy  amplification  in  BB84  to  an  EDP,  and  so  establish  security. 
Here  we  are  taking  this  idea  a  step  further.  It  is  convenient  to  imagine  that  the  measurement 
of  the  coin  that  determines  the  basis  is  also  delayed  until  after  the  attack  by  Eve  and  Fred. 
That  way,  we  can  infer  a  bound  on  the  asymmetry  between  the  bit  error  rate  and  the  phase 
error  rate  for  pairs  subjected  to  the  EDP,  and  so  establish  that  the  EDP  will  be  effective. 

7  Security  proof  for  small  basis-dependent  flaws 

To analyze security, we’d like to relate the asymmetry of the coin (as parametrized by $\Delta$) to
the gap between the bit error rate $\delta$ and the phase error rate $\delta_p$ when the pairs are measured
in the Bell basis. First we need to write down a convenient expression for this gap.

In the entanglement-based protocol, the random variables $n\delta$ and $n\delta_p$ are defined as the
number of bit errors and phase errors that would be found if the n key generating pairs were
all measured in the Bell basis. For a particular pair, consider the observable

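$$\tfrac{1}{2} \left( Z \otimes Z - X \otimes X \right) = |\phi^-\rangle\langle\phi^-| - |\psi^+\rangle\langle\psi^+| , \qquad (10)$$

where $|\phi^-\rangle$ and $|\psi^+\rangle$ are the Bell states

$$|\phi^-\rangle = \frac{1}{\sqrt{2}} \left( |00\rangle - |11\rangle \right) , \qquad |\psi^+\rangle = \frac{1}{\sqrt{2}} \left( |01\rangle + |10\rangle \right) . \qquad (11)$$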

This observable has eigenvalues $\{+1, 0, -1\}$. If the pair is to be measured in the Z basis, we
say that there is a bit error if $Z \otimes Z = -1$ and that there is a phase error if $X \otimes X = -1$.
Therefore, the eigenvalue of $\frac{1}{2}(Z \otimes Z - X \otimes X)$ is $-1$ if there is a bit error but no phase error,
$+1$ if there is a phase error but no bit error, and 0 if either there are no errors or both a bit


340  Security  of  quantum  key  distribution  with  imperfect  devices 


error and a phase error. Similarly, if the pair is to be measured in the X basis, we say that
there is a phase error if $Z \otimes Z = -1$ and that there is a bit error if $X \otimes X = -1$. Therefore,
the eigenvalue of $\frac{1}{2}(Z \otimes Z - X \otimes X)$ is $+1$ if there is a bit error but no phase error, $-1$ if
there is a phase error but no bit error, and 0 if either there are no errors or both a bit error
and a phase error.
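The eigenvalue assignments can be confirmed directly on the four Bell states. The sketch below evaluates the expectation value of ½(Z⊗Z − X⊗X) in each Bell state; because the Bell states are simultaneous eigenstates of Z⊗Z and X⊗X, each expectation value is the corresponding eigenvalue. The dictionary keys are just labels chosen for this illustration.

```python
import math

X = [[0, 1], [1, 0]]
Z = [[1, 0], [0, -1]]

def kron(A, B):
    return [[a * b for a in ra for b in rb] for ra in A for rb in B]

ZZ, XX = kron(Z, Z), kron(X, X)
OBS = [[0.5 * (ZZ[i][j] - XX[i][j]) for j in range(4)] for i in range(4)]

s = 1 / math.sqrt(2)
BELL = {
    "phi+": [s, 0, 0, s],    # no error
    "phi-": [s, 0, 0, -s],   # X (x) X = -1 only
    "psi+": [0, s, s, 0],    # Z (x) Z = -1 only
    "psi-": [0, s, -s, 0],   # both Z (x) Z = -1 and X (x) X = -1
}

def expectation(M, v):
    return sum(v[i] * M[i][j] * v[j] for i in range(4) for j in range(4))

values = {name: round(expectation(OBS, v), 12) for name, v in BELL.items()}
print(values)
```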

Suppose that the basis choice is decided by flipping a coin, where the pairs are to be
measured in the Z basis if the outcome of the coin flip is $|0\rangle$ ($Z_{\rm coin} = 1$), and the pairs are
to be measured in the X basis if the outcome of the coin flip is $|1\rangle$ ($Z_{\rm coin} = -1$). Then the
observable

$$\tfrac{1}{2} \left( Z \otimes Z - X \otimes X \right)_{\rm pair} \otimes Z_{\rm coin} \qquad (12)$$

has  the  eigenvalue  —1  if  the  pair  has  a  bit  error  but  no  phase  error,  the  eigenvalue  +1  if  the 
pair  has  a  phase  error  but  no  bit  error,  and  the  eigenvalue  0  otherwise.  We  see,  then,  that 
for  the  n  key  generating  pairs,  the  gap  between  the  number  of  phase  errors  and  the  number 
of  bit  errors  can  be  expressed  as 

$$n_{\rm gap} = n\delta_p - n\delta = \sum_{i=1}^{n} \tfrac{1}{2} \left( Z \otimes Z - X \otimes X \right)_{{\rm pair},i} \otimes Z_{{\rm coin},i} . \qquad (13)$$

Eq. (13) means that $n_{\rm gap}$ is a random variable whose probability distribution is the distribution
of outcomes if the observable on the right-hand side of eq. (13) is measured. We might imagine
that $Z \otimes Z$ and $X \otimes X$ are measured for every pair (this is a complete Bell measurement)
and that $Z_{\rm coin}$ is measured for every coin; then $n_{\rm gap}$ is found by summing up all the results
of these measurements. But since the Bell measurements and the coin measurements all
commute with our expression for $n_{\rm gap}$ in eq. (13), we could just as well imagine that $n_{\rm gap}$
is measured first, and that the other measurements are completed later; the probability
distribution for $n_{\rm gap}$ will be the same either way. In any case, our expression for $n_{\rm gap}$ is valid
even if there are strong correlations among the pairs and the coins.

If we imagine that all of the coins are measured in the X basis (as in the definition of a
$\Delta$-balanced attack), then the random variable that represents the number of coins for which
the outcome is $X = -1$ can be expressed as

$$n^{(X)}_{\rm coin} = \sum_{i=1}^{n} (I \otimes I)_{{\rm pair},i} \otimes \tfrac{1}{2} (I - X)_{{\rm coin},i} . \qquad (14)$$

We wish to obtain a bound on $n_{\rm gap}$ that will hold with high probability for any possible
state of the pairs and the coins such that $n^{(X)}_{\rm coin}$ is less than $n\Delta$ with high probability. It is
convenient to express the gap as a sum of two terms, $n_{\rm gap} = n^{(Z)}_{\rm gap} + n^{(X)}_{\rm gap}$, and to bound each
term separately. First, consider

$$n^{(Z)}_{\rm gap} = \frac{1}{2} \sum_{i=1}^{n} (Z \otimes Z)_{{\rm pair},i} \otimes Z_{{\rm coin},i} . \qquad (15)$$

For each value of i, imagine that we perform two successive controlled-NOT gates, one with
Alice’s qubit as the control and the coin as the target, and the other with Bob’s qubit as the
control and the coin as the target. Acting by conjugation, the effect of these gates is

$$(Z \otimes Z)_{\rm pair} \otimes Z_{\rm coin} \to (I \otimes I)_{\rm pair} \otimes Z_{\rm coin} ,$$
$$(I \otimes I)_{\rm pair} \otimes X_{\rm coin} \to (I \otimes I)_{\rm pair} \otimes X_{\rm coin} . \qquad (16)$$


Therefore, this change of basis has no effect on the statistics of the observable $n^{(X)}_{\rm coin}$, while
transforming the observable $n^{(Z)}_{\rm gap}$ according to

$$n^{(Z)}_{\rm gap} \to \frac{n}{2} - n^{(Z)}_{\rm coin} , \qquad (17)$$

where

$$n^{(Z)}_{\rm coin} = \sum_{i=1}^{n} (I \otimes I)_{{\rm pair},i} \otimes \tfrac{1}{2} (I - Z)_{{\rm coin},i} \qquad (18)$$

(the number of coins for which $Z = -1$, if all n are measured in the Z basis).
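The conjugation rules for the two controlled-NOT gates can be checked by explicit matrix multiplication on one pair and one coin. The following is a minimal sketch; the qubit ordering (Alice, Bob, coin) is a convention chosen here for illustration.

```python
def kron(A, B):
    return [[a * b for a in ra for b in rb] for ra in A for rb in B]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def kron3(A, B, C):
    return kron(kron(A, B), C)

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Z = [[1, 0], [0, -1]]

# Qubit order: (Alice, Bob, coin); basis index = 4*a + 2*b + c.
# The two CNOTs together map |a, b, c> to |a, b, c XOR a XOR b>.
U = [[0] * 8 for _ in range(8)]
for a in (0, 1):
    for b in (0, 1):
        for c in (0, 1):
            U[4 * a + 2 * b + (c ^ a ^ b)][4 * a + 2 * b + c] = 1

def conjugate(O):
    """U O U^dagger; U is a real symmetric permutation with U*U = identity."""
    return matmul(matmul(U, O), U)

zz_zcoin_image = conjugate(kron3(Z, Z, Z))    # image of (Z x Z)_pair x Z_coin
xcoin_image = conjugate(kron3(I2, I2, X))     # image of (I x I)_pair x X_coin

print(zz_zcoin_image == kron3(I2, I2, Z))
print(xcoin_image == kron3(I2, I2, X))
```

Since X on the coin is left unchanged, so is ½(I − X) on the coin, which is why the statistics of the X-basis coin count are unaffected.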

We are interested in analyzing how the statistics of $n^{(Z)}_{\rm gap}$ is related to the statistics of $n^{(X)}_{\rm coin}$.
Let $\rho$ denote the state of the n coins and the n pairs, and suppose that the controlled-NOT
gates described above transform this state to a new state $\rho'$. We see that the statistics of
$n^{(X)}_{\rm coin}$ and $n^{(Z)}_{\rm gap}$ in the state $\rho$ is identical to the statistics of $n^{(X)}_{\rm coin}$ and $\frac{n}{2} - n^{(Z)}_{\rm coin}$ in the state $\rho'$.
Therefore, to derive a relation between $n^{(X)}_{\rm coin}$ and $n^{(Z)}_{\rm gap}$ that holds with high probability for an
arbitrary state $\rho$, it suffices to analyze how $n^{(X)}_{\rm coin}$ and $n^{(Z)}_{\rm coin}$ are related for an arbitrary state
$\rho'$. For this we appeal to the following lemma, which asserts that if $n^{(X)}_{\rm coin}$ is small, then $n^{(Z)}_{\rm coin}$
is close to n/2:

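Lemma 1 For a quantum state of n coins, suppose that with high probability $n^{(X)}_{\rm coin} < n\Delta$.
Then, for any positive $\epsilon$, with high probability

$$\left| n/2 - n^{(Z)}_{\rm coin} \right| < n \left( f(\Delta) + \epsilon \right) , \qquad (19)$$

where $\Delta$ and $f(\Delta)$ are related by

$$H_2(1/2 - f(\Delta)) + H_2(\Delta) = 1 . \qquad (20)$$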

Proof: The proof for the case of a pure quantum state $|\psi\rangle$ of n coins, where $n^{(X)}_{\rm coin} < n\Delta$ with
probability 1, is in Appendix A of [26]. But if instead $n^{(X)}_{\rm coin} < n\Delta$ with high probability, we
can write $|\psi\rangle = |\psi\rangle_{\rm good} + |\psi\rangle_{\rm bad}$, where the (unnormalized) state $|\psi\rangle_{\rm good}$ has $n^{(X)}_{\rm coin} < n\Delta$ with
probability 1, and $\| |\psi\rangle_{\rm bad} \|$ is exponentially small. Hence $|\psi\rangle_{\rm good}$, and therefore also $|\psi\rangle$, has
the property eq. (19) with high probability. Therefore, Lemma 1 holds for pure states. Now,
a mixed state can be realized as an ensemble of pure states. By the hypothesis of Lemma
1, all of the pure states in this ensemble, except for those occurring with exponentially small
probability, satisfy $n^{(X)}_{\rm coin} < n\Delta$ with high probability, and therefore also satisfy eq. (19) with
high probability. This proves Lemma 1.

We note that by expanding $H_2(1/2 - f)$ as a power series in $f$, and using the convexity
of $H_2$, we can derive from eq. (20) a useful inequality satisfied by $f(\Delta)$:

$$\left( f(\Delta) \right)^2 \le \tfrac{1}{2} (\ln 2) \, H_2(\Delta) . \qquad (21)$$

Expanding this expression for small $\Delta$ and using convexity again, we obtain

$$f(\Delta) \le \sqrt{\frac{\Delta}{2} \ln\left( \frac{e}{\Delta} \right)} . \qquad (22)$$
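Since f(Δ) is defined only implicitly, it can be helpful to evaluate it numerically. The sketch below solves $H_2(1/2 - f) + H_2(\Delta) = 1$ by bisection and compares the result with the two closed-form upper bounds $\sqrt{\tfrac{1}{2}(\ln 2) H_2(\Delta)}$ and $\sqrt{(\Delta/2)\ln(e/\Delta)}$. The second expression is our reading of the garbled printed formula for eq. (22) and should be treated as an assumption; the function names are illustrative.

```python
import math

def H2(x):
    """Binary entropy in bits, with H2(0) = H2(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

def f(Delta, tol=1e-12):
    """Solve H2(1/2 - f) + H2(Delta) = 1 for f in [0, 1/2] by bisection."""
    if Delta <= 0.0:
        return 0.0
    target = 1.0 - H2(Delta)
    lo, hi = 0.0, 0.5      # H2(1/2 - f) falls from 1 to 0 as f goes 0 -> 1/2
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if H2(0.5 - mid) > target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def bound_entropy(Delta):
    """Upper bound on f from the quadratic inequality involving H2(Delta)."""
    return math.sqrt(0.5 * math.log(2) * H2(Delta))

def bound_small_delta(Delta):
    """Assumed closed form of the small-Delta bound."""
    return math.sqrt(0.5 * Delta * math.log(math.e / Delta))

table = [(D, f(D), bound_entropy(D), bound_small_delta(D))
         for D in (0.001, 0.01, 0.0289, 0.1, 0.5)]
for row in table:
    print("Delta=%.4f  f=%.5f  bound21=%.5f  bound22=%.5f" % row)
```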

From Lemma 1 and eq. (17), we infer that, for a $\Delta$-balanced attack, $|n^{(Z)}_{\rm gap}| < n(f(\Delta) + \epsilon)$
with high probability. A similar argument shows that also $|n^{(X)}_{\rm gap}| < n(f(\Delta) + \epsilon)$ with high
probability, where

$$n^{(X)}_{\rm gap} = -\frac{1}{2} \sum_{i=1}^{n} (X \otimes X)_{{\rm pairs},i} \otimes Z_{{\rm coin},i} . \qquad (23)$$

(For this argument, we apply Hadamard transformations to all pairs before applying the
CNOT gates.) Since $n_{\rm gap} = n^{(Z)}_{\rm gap} + n^{(X)}_{\rm gap}$, we have proved:

Lemma 2 For a $\Delta$-balanced attack on n pairs and n coins, the state of the pairs has the
property

$$|\delta_p - \delta| < 2 \left( f(\Delta) + \epsilon \right) \qquad (24)$$

with high probability, for any positive $\epsilon$.

With Lemma 2 in hand, we can now complete the proof of security following the steps
outlined in Sec. 3. If the error rate found in the test is $\delta$, then the number of bit errors
in the key-generating pairs is less than $n(\delta + \epsilon)$ with high probability (assuming that Eve’s
attack passes the test with a probability that is not exponentially small). For a $\Delta$-balanced
attack, we infer that the number of phase errors in the key-generating pairs is less than
$n(\delta + 2f(\Delta) + \epsilon)$ with high probability. By introducing a random permutation (not known
by Eve or Fred) we can ensure that the errors are randomly distributed among the pairs.
Therefore, for a suitable CSS code, high fidelity pairs (and hence secure key) can be extracted
at a rate $1 - H_2(\delta + \epsilon) - H_2(\delta + 2f(\Delta) + \epsilon)$, for any positive $\epsilon$. We have proved:

Theorem 2. Security of BB84 against weakly basis-dependent attacks. The BB84
protocol is secure if Eve and Fred launch a $\Delta$-balanced attack. Secure final key can be extracted
from sifted key at the asymptotic rate

$$R = {\rm Max}\left( 1 - H_2(\delta) - H_2(\delta + 2f(\Delta)),\ 0 \right) \qquad (25)$$

where $\delta$ is the bit error rate found in the verification test and $f(\Delta)$ is defined as in eq. (20).
(We assume $\delta + 2f(\Delta) < 1/2$.)

We note that the key generation rate found in Theorem 2 is nonzero only for $2f(\Delta) < 1/2$, or $\Delta < 0.0289$.
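The rate of eq. (25) is easy to evaluate once a value of $f(\Delta)$ is in hand. The following is a minimal Python sketch, not part of the original analysis. The names `h2` and `theorem2_rate` are illustrative, and the value of $f(\Delta)$ from eq. (20) is supplied by the caller rather than recomputed here. Returning zero when $\delta + 2f(\Delta) \ge 1/2$ is a conservative convention assumed for this sketch.

```python
import math


def h2(x: float) -> float:
    """Binary entropy in bits, with the convention H2(0) = H2(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1.0 - x) * math.log2(1.0 - x)


def theorem2_rate(delta: float, f_of_balance: float) -> float:
    """Evaluate eq. (25): Max(1 - H2(delta) - H2(delta + 2 f(Delta)), 0).

    delta        -- bit error rate found in the verification test
    f_of_balance -- the number f(Delta) of eq. (20), passed in by the caller
    """
    phase_bound = delta + 2.0 * f_of_balance
    if phase_bound >= 0.5:
        # Outside the regime assumed in Theorem 2; report no key (assumption).
        return 0.0
    return max(1.0 - h2(delta) - h2(phase_bound), 0.0)


if __name__ == "__main__":
    for delta in (0.0, 0.02, 0.05):
        print(delta, theorem2_rate(delta, 0.05))
```

With `f_of_balance = 0` the function reduces to the familiar rate $1 - 2H_2(\delta)$ for perfect devices.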

Theorem  2  is  our  central  result  concerning  security  for  equipment  with  generic  flaws.  In 
the  remainder  of  this  paper,  we  will  analyze  some  specific  examples.  As  we  will  see,  for  some 
special  cases  we  can  establish  a  key  generation  rate  exceeding  the  rate  eq.  (25)  found  for  the 
general case.

8 Individual source flaws and a perfect detector

As our first application of Theorem 2, we consider the case where the detector is perfect, but the source is subject to individual flaws that may leak some information to Eve about Alice’s basis choice. We will prove security by showing that the attack is $\Delta$-balanced.

Suppose  that  Alice’s  source  emits  one  of  four  possible  states  of  a  single  qubit.  In  the  ideal 
protocol,  these  states  are  the  four  BB84  states,  chosen  equiprobably.  Suppose,  though,  that 
the  source  is  imperfect,  so  that  the  four  states  differ  from  the  corresponding  BB84  states,  but 
only  slightly. 

Let $a \in \{0,1\}$ denote Alice’s declared basis choice (ideally, the Z basis for $a = 0$ and the X basis for $a = 1$) and let $g \in \{0,1\}$ denote Alice’s key bit. Suppose that $a$ and $g$ are chosen with the joint probability $p_{a,g}$, and that once the values of $a$ and $g$ are chosen, Alice’s source emits a state $\rho_{a,g}$. The Koashi-Preskill analysis applies if $p_{0,0}\rho_{0,0} + p_{0,1}\rho_{0,1} = p_{1,0}\rho_{1,0} + p_{1,1}\rho_{1,1}$, the case in which the source does not reveal any information about $a$. We will say that the source is oblivious when it has this property. Now we are interested in the case where the source is nonoblivious: it leaks a small amount of information about the basis choice.

We can characterize the flawed source by imagining that Alice prepares her states by performing an ideal measurement on half of an entangled pair. The state of the pair (prior to Alice’s measurement) is $\rho_0$ for $a = 0$ and $\rho_1$ for $a = 1$. The basis-dependence of the source is weak in the sense that the states $\rho_0$ and $\rho_1$ differ only slightly; their fidelity is close to one:

$$ \sqrt{F(\rho_0, \rho_1)} = \left\| \sqrt{\rho_0}\sqrt{\rho_1} \right\|_{\rm tr} \ge 1 - 2\epsilon_s . \tag{26} $$

If $n$ signals are sent, the state that Fred prepares is a product state $\bigotimes_{i=1}^{n} \rho^{(i)}_{a_i}$, where $a_i$ denotes the basis choice for the $i$th signal, and $\sqrt{F(\rho^{(i)}_0, \rho^{(i)}_1)} \ge 1 - 2\epsilon_s$ for each $i$. Thus we say that Fred’s attack on the source is individual, and that the basis-dependence is characterized by $\epsilon_s$. We will suppose for now that any flaws in the detector are basis independent, so that Fred attacks only the source.

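The states $\rho_0$ and $\rho_1$ may be mixed in general, but they can be “purified” by introducing a suitable “environment” $E$; that is, there are pure states $|\Psi_0\rangle$ and $|\Psi_1\rangle$ such that

$$ {\rm tr}_E\left(|\Psi_0\rangle\langle\Psi_0|\right) = \rho_0 , \qquad {\rm tr}_E\left(|\Psi_1\rangle\langle\Psi_1|\right) = \rho_1 . \tag{27} $$

Furthermore, it follows from eq. (26) that the purifications can be chosen to have a large overlap [29, 30]:

$$ {\rm Re}\,\langle\Psi_1|\Psi_0\rangle \ge 1 - 2\epsilon_s . \tag{28} $$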

Now suppose that, as in Sec. 6, we imagine that the basis choice is determined by a “quantum coin.” Then, the state of the coin, the pair, and the environment can be described as a pure state

$$ \frac{1}{\sqrt{2}} \left( |\Psi_0\rangle \otimes |0\rangle + |\Psi_1\rangle \otimes |1\rangle \right) . \tag{29} $$

If the state of the pair used by Alice to prepare her signal depends on the choice of basis, then the coin will be entangled with the pair and environment, and the strength of this entanglement will depend on how much $|\Psi_0\rangle$ and $|\Psi_1\rangle$ differ. Of course, the quantum coin is merely a mathematical fiction that we invoke for the purpose of analyzing the basis dependence of the pairs that are used to generate the key in the entanglement-based key distribution protocol. Furthermore, the state of the pairs does not depend on how we choose the purifications of $\rho_0$ and $\rho_1$. But the state of the coins does depend on this choice, and we may exploit our freedom in choosing the purifications to obtain the strongest possible bound on the basis dependence of the pairs.

Since we are assuming that any flaws in the detector are basis independent, these may be absorbed into Eve’s basis-independent attack. Then since Eve’s attack has no effect on the coins, the state of any coin can be completely characterized by tracing out the pair and environment from eq. (29). If the state of the coin is now measured in the X basis, the outcome $X = -1$ occurs with probability

$$ p = \frac{1}{4} \left\| |\Psi_0\rangle - |\Psi_1\rangle \right\|^2 = \frac{1}{2} \left( 1 - {\rm Re}\,\langle\Psi_1|\Psi_0\rangle \right) \le \epsilon_s . \tag{30} $$
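The two expressions for $p$ in eq. (30) can be checked numerically. The sketch below is only an illustration: the two-component vectors standing in for the purifications $|\Psi_0\rangle$ and $|\Psi_1\rangle$ are hypothetical, chosen as real unit vectors separated by a small angle, and the function names are ours.

```python
import math


def coin_minus_probability(psi0, psi1):
    """Probability of X = -1 for the coin in the state of eq. (29).

    Projecting the coin onto (|0> - |1>)/sqrt(2) leaves the unnormalized
    vector (psi0 - psi1)/2 on the pair and environment.
    """
    return sum(abs(a - b) ** 2 for a, b in zip(psi0, psi1)) / 4.0


def coin_minus_from_overlap(psi0, psi1):
    """Same probability written as (1 - Re<psi1|psi0>)/2."""
    overlap = sum(b.conjugate() * a for a, b in zip(psi0, psi1))
    return 0.5 * (1.0 - overlap.real)


# Hypothetical purifications: unit vectors at an angle of 0.2 rad.
ANGLE = 0.2
PSI0 = [1.0 + 0j, 0.0 + 0j]
PSI1 = [math.cos(ANGLE) + 0j, math.sin(ANGLE) + 0j]

if __name__ == "__main__":
    print(coin_minus_probability(PSI0, PSI1))
    print(coin_minus_from_overlap(PSI0, PSI1))
```

For these vectors the overlap is $\cos(0.2)$, so setting $1 - 2\epsilon_s = \cos(0.2)$ saturates the bound $p \le \epsilon_s$.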

Because the attack is individual, the coins are independent and this bound on $p$ applies to each one of the $n$ coins; therefore we conclude that the attack is $(\epsilon_s + \epsilon)$-balanced, for any positive $\epsilon$. Hence from Theorem 2 we obtain

Theorem 3. Security of BB84 for a source with individual weakly basis-dependent flaws. Suppose that the flaws in the detector are basis-independent, and that the flaws in the source are individual. The $i$th signal sent by Alice is prepared by performing a standard qubit measurement on half of an entangled state; this state is $\rho^{(i)}_0$ when the Z basis is declared and $\rho^{(i)}_1$ when the X basis is declared, where $\sqrt{F(\rho^{(i)}_0, \rho^{(i)}_1)} \ge 1 - 2\epsilon_s$ for all $i$. Then the BB84 protocol is secure, and secure final key can be extracted from sifted key at the asymptotic rate

$$ R = {\rm Max}\left(1 - H_2(\delta) - H_2(\delta + 2f(\epsilon_s)),\ 0\right) \tag{31} $$

where $\delta$ is the bit error rate found in the verification test and $f(\epsilon_s)$ is defined as in eq. (20). (We assume $\delta + 2f(\epsilon_s) < 1/2$.)

Note that in the formulation of Theorem 3 we have assumed that all signals are detected; we have not considered the effects of loss in the channel or imperfect detector efficiency. In principle, Eve can amplify the basis-dependence of Fred’s attack by eliminating some of the signals. In the worst case, the coin is an $X = 1$ eigenstate for each of the signals that Eve removes. Then, if a fraction $f$ of all the signals are lost, $\Delta$ is enhanced according to

$$ \Delta \to \Delta' \le \Delta/(1 - f) . \tag{32} $$

The effects of loss will be discussed further in Sec. 12 and Sec. 13.
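To get a feeling for eq. (32), the following short Python sketch evaluates the worst-case enhancement of $\Delta$ under loss, and the largest loss fraction for which the enhanced $\Delta'$ stays below the threshold $\Delta < 0.0289$ quoted after Theorem 2. The function names and the example numbers are illustrative assumptions, not taken from the analysis above.

```python
def balance_after_loss(balance: float, lost_fraction: float) -> float:
    """Worst-case Delta' = Delta / (1 - f) from eq. (32)."""
    if not 0.0 <= lost_fraction < 1.0:
        raise ValueError("lost_fraction must satisfy 0 <= f < 1")
    return balance / (1.0 - lost_fraction)


def max_tolerable_loss(balance: float, threshold: float = 0.0289) -> float:
    """Largest f with Delta / (1 - f) <= threshold (0 if already above it)."""
    if balance >= threshold:
        return 0.0
    return 1.0 - balance / threshold


if __name__ == "__main__":
    print(balance_after_loss(0.01, 0.5))   # Delta doubles when half the signals are lost
    print(max_tolerable_loss(0.00289))     # about 90% loss can be tolerated in this example
```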

9  Imperfect  oblivious  source  and  imperfect  detector 

We recall that Koashi and Preskill [6] proved the security of BB84 in the case where the detector is perfect and the signals emitted by the source, when averaged over the key bits, are basis independent (an oblivious source). The situation they considered can be depicted as in Fig. 3. In effect, Eve prepares an entangled state of $n$ qubits, which are delivered to Bob, and $n$ general signals, which are delivered to Alice. To generate the sifted key, Alice performs an uncharacterized measurement on each of her $n$ signals, and Bob performs a standard measurement on each of his $n$ qubits. By simply reversing the roles of Alice and Bob, we obtain the situation considered by Mayers, in which the source is perfect and the detector is uncharacterized [2].

Fig. 3. An uncharacterized oblivious source and a perfect detector. Eve prepares an entangled state of $n$ signals and $n$ qubits. Alice performs an uncharacterized measurement on the $n$ signals and Bob performs standard measurements on the $n$ qubits. Interchanging the roles of Alice and Bob, we obtain the case of a perfect source and an uncharacterized detector.

Fig. 4. An oblivious source and an imperfect detector.

We will now consider a special case of the Koashi-Preskill source: the source is oblivious, but we further assume that the source can be realized by the preparation of a basis-independent entangled state of the signal space and a qubit, followed by a basis-dependent channel applied to the qubit, and finally a standard measurement of the qubit. However, we will go beyond Koashi and Preskill by allowing the detector to have basis-dependent flaws, as shown in Fig. 4. Actually, it will be no harder to analyze the more general case shown in Fig. 5: Eve prepares an arbitrary state of $n$ entangled signals, which is mapped by Fred to a state of $n$ pairs of qubits; then the pairs are distributed to Alice and Bob, who perform standard measurements. An important feature of this setting is that, although Fred’s channel can depend on the basis in which Alice and Bob measure, there is no way for Fred to convey any information about the basis to Eve. In this sense the source is oblivious.

We will further assume that the channel applied by Fred is a product of $n$ individual channels, and that each of these $n$ channels depends only weakly on the basis. For analyzing the impact on the quantum coin, it will be convenient to characterize the basis dependence of Fred’s attack as follows: A channel $\mathcal{E}$ that takes Eve’s arbitrary pair to a pair of qubits can be realized by its dilation, an isometric embedding $U$ of Eve’s space into the space of the qubit pair and a suitable ancilla. Thus Fred’s basis-dependent individual attack can be expressed as the tensor product $U_a = \bigotimes_{i=1}^{n} U^{(i)}_{a_i}$, where $i$ labels the pairs, and $a_i$ denotes the basis choice for the $i$th signal. Furthermore the attack depends only weakly on the basis, in the sense that $\frac{1}{4}\|U^{(i)}_0 - U^{(i)}_1\|^2_{\rm sup} < \epsilon$ for each $i$.

Fig. 5. Stronger version of the attack in Fig. 4.

Using this characterization, we can analyze how Fred’s attack affects the coins that determine the basis. Suppose that the basis choice is determined by $n$ quantum coins, each a qubit initially prepared in the $X = 1$ eigenstate, and that the initial state of the $n$ pairs and their environment (before Fred’s attack) is the pure state $|\phi\rangle$. Then after Fred’s attack, the state of the coins, the pairs, and the environment can be written as

$$ \frac{1}{2^{n/2}} \sum_a U_a |\phi\rangle \otimes |Z;a\rangle , \tag{33} $$

where $a$ is the $n$-bit string indicating the basis choice, and the states $\{|Z;a\rangle\}$ are the basis states of the coins in the Z basis. After Fred’s attack, suppose that all of the $n$ coins are measured in the X basis. Let $x$ be an $n$-bit string, and let $|X;x\rangle$ denote a product of $n$ X eigenstates, such that $X_i = 1$ for $x_i = 0$ and $X_i = -1$ for $x_i = 1$. Then the probability that the measurement of the coins yields the outcome $x$ is


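$$ P(x) = \left\| \frac{1}{2^n} \sum_a (-1)^{a \cdot x}\, U_a |\phi\rangle \right\|^2 \le \left\| \frac{1}{2^n} \sum_a (-1)^{a \cdot x}\, U_a \right\|^2_{\rm sup} . \tag{34} $$

The sum in this expression can be factorized:

$$ \frac{1}{2^n} \sum_a (-1)^{a \cdot x}\, U_a = \bigotimes_{i=1}^{n} \frac{1}{2} \left( U^{(i)}_0 + (-1)^{x_i} U^{(i)}_1 \right) . \tag{35} $$

Furthermore, the sup norm of a tensor product is a product of sup norms. Since for each $i$

$$ \left\| \frac{1}{2}\left( U^{(i)}_0 - U^{(i)}_1 \right) \right\|^2_{\rm sup} < \epsilon , \qquad \left\| \frac{1}{2}\left( U^{(i)}_0 + U^{(i)}_1 \right) \right\|^2_{\rm sup} \le 1 , \tag{36} $$

we have

$$ \left\| \frac{1}{2^n} \sum_a (-1)^{a \cdot x}\, U_a \right\|^2_{\rm sup} \le \epsilon^{|x|} , \tag{37} $$

where $|x|$ denotes the Hamming weight of $x$, and therefore

$$ P(x) \le \epsilon^{|x|} . \tag{38} $$

From this bound on $P(x)$, it is elementary to show that the probability that $|x| > (e\epsilon + \epsilon')n$ is exponentially small for any positive $\epsilon'$, where $e = 2.71828\cdots$. Therefore we have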

Lemma 3. A weakly basis-dependent individual oblivious attack by Fred is $\Delta$-balanced. Consider an individual attack by Fred, in which Fred applies $U^{(i)}_0$ to the $i$th pair if the basis choice is $a_i = 0$ and applies $U^{(i)}_1$ if the basis choice is $a_i = 1$, where $\frac{1}{4}\|U^{(i)}_0 - U^{(i)}_1\|^2_{\rm sup} < \epsilon$ for each $i$. This attack is $\Delta$-balanced for any $\Delta > e\epsilon$.
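The counting step behind Lemma 3 can be illustrated numerically. If every outcome $x$ obeys $P(x) \le \epsilon^{|x|}$, then a union bound over the $\binom{n}{k}$ strings of weight $k$ gives ${\rm Pr}[|x| \ge k_{\min}] \le \sum_{k \ge k_{\min}} \binom{n}{k}\epsilon^k$. The Python sketch below evaluates this sum for $k_{\min} = \lceil (e\epsilon + \epsilon')n \rceil$. It is only an illustration of that elementary estimate; the function name and the parameter values are hypothetical.

```python
import math


def weight_tail_bound(n: int, eps: float, eps_prime: float) -> float:
    """Union bound on Pr[|x| >= (e*eps + eps')*n], using P(x) <= eps**|x|."""
    k_min = math.ceil((math.e * eps + eps_prime) * n)
    total = 0.0
    for k in range(k_min, n + 1):
        # Work with logarithms so that large binomials do not overflow.
        log_term = math.log(math.comb(n, k)) + k * math.log(eps)
        total += math.exp(log_term)
    return min(total, 1.0)


if __name__ == "__main__":
    for n in (100, 200, 400, 800):
        print(n, weight_tail_bound(n, eps=0.01, eps_prime=0.02))
```

The printed bound shrinks rapidly as $n$ grows, which is the sense in which the attack is $\Delta$-balanced for any $\Delta > e\epsilon$.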

And  from  Theorem  2  we  obtain: 

Theorem 4. Security of entanglement-based key distribution against weakly basis-dependent individual oblivious attacks. Consider an individual attack by Fred, in which Fred applies $U^{(i)}_0$ to the $i$th pair if the basis choice is $a_i = 0$ and applies $U^{(i)}_1$ if the basis choice is $a_i = 1$, where $\frac{1}{4}\|U^{(i)}_0 - U^{(i)}_1\|^2_{\rm sup} < \epsilon$ for each $i$. Then the entanglement-based key distribution protocol is secure, and secure final key can be extracted from sifted key at the asymptotic rate

$$ R = {\rm Max}\left(1 - H_2(\delta) - H_2(\delta + 2f(e\epsilon)),\ 0\right) \tag{39} $$

where $\delta$ is the bit error rate found in the verification test and $f(e\epsilon)$ is defined as in eq. (20). (We assume $\delta + 2f(e\epsilon) < 1/2$.)

In our formulation of Theorem 4, we have chosen to characterize the basis-dependence of the attack in terms of the sup norm distance between the two isometric embeddings $U_0$ and $U_1$ that realize Fred’s channels $\mathcal{E}_0$ and $\mathcal{E}_1$. It would be more natural to use the intrinsic distance $\|\mathcal{E}_0 - \mathcal{E}_1\|_\diamond$ defined by the “diamond norm” [31]. But the proof of Lemma 3 uses the property that $U_0$ and $U_1$ are close in the sup norm; therefore if we want to reformulate Theorem 4 using the characterization that the channels are close in the diamond norm, we need to show that if two channels are close to one another in the diamond norm, then the dilations of the channels can be chosen to be close in the sup norm. The following lemma, proved in Appendix A, partially solves this problem:

Lemma 4. Similar channels have similar dilations. Suppose that $\mathcal{E}_0$ and $\mathcal{E}_1$ are quantum channels mapping a $d$-dimensional system $S$ to a $d'$-dimensional system $T$, such that $\|\mathcal{E}_0 - \mathcal{E}_1\|_\diamond < \epsilon$. Then there are dilations $U_0$ and $U_1$ of the channels (isometric embeddings of $S$ in $TE$, where $E$ is $dd'$-dimensional) such that $\|U_0 - U_1\|_{\rm sup} < d\epsilon$.

However, Lemma 4 has the unpleasant property that the dimension $d$ appears in the upper bound on $\|U_0 - U_1\|_{\rm sup}$. In principle, the state that Eve delivers to the detector could have arbitrarily high dimension, and Theorem 4 no longer applies if we fix $\epsilon$ while allowing the dimension to grow without bound. For that reason, we prefer to formulate the statement of Theorem 4 in terms of the sup norm, rather than inferring a bound on the distance between dilations in the sup norm from a bound on the distance between channels in the diamond norm.

10 Misalignment

Suppose that Bob is unable to control the orientation of his detector perfectly. When he tries to measure the polarization of his qubit along the z-axis, he actually measures along an axis that lies somewhere in a cone around the z-axis with opening half-angle $\theta$; similarly when he tries to measure along the x-axis, he can only guarantee that his axis is within angle $\theta$ of the desired axis. This scenario is equivalent to one in which Bob’s measurement is perfect, but Fred rotates the polarization of the qubit by an angle up to $\theta$ right before the measurement. Furthermore the rotation Fred applies may depend on whether Bob is trying to measure Z or X, possibly enhancing the phase error rate relative to the bit error rate.

Suppose, in addition, that Alice is unable to control the orientation of her source perfectly; it too might be rotated by an angle up to $\theta$ from the ideal orientation. Equivalently, we may suppose that Alice’s source is perfect, but that Fred rotates the qubit slightly (exploiting his knowledge of the basis) immediately after it is emitted by the source. One way to realize such a source is for Alice to prepare a perfect Bell pair $|\phi^+\rangle$ and give half to Fred (who rotates his half); then Alice performs a standard measurement on her half. But a unitary transformation $U$ applied by Fred to his qubit is equivalent to $U^T$ applied to Alice’s (where $U^T$ denotes the transpose of $U$); therefore it would make no difference if Alice’s qubit were rotated instead of Fred’s. Looked at another way, the reason we can replace Fred’s rotation by a rotation acting on Alice’s qubit is that the source is oblivious: the emitted state, averaged over the key bits, is maximally mixed, and Fred’s attack does not change this property.
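The statement that $U$ acting on Fred’s half of $|\phi^+\rangle$ is equivalent to $U^T$ acting on Alice’s half can be verified directly. The following Python sketch does so for one example unitary; the parametrization in `example_unitary` and all function names are illustrative choices made for this check.

```python
import cmath
import math

SQRT_HALF = 1.0 / math.sqrt(2.0)
# |phi+> = (|00> + |11>)/sqrt(2), stored as amplitudes m[a][b]
# for Alice's bit a and Fred's bit b.
PHI_PLUS = [[SQRT_HALF, 0.0], [0.0, SQRT_HALF]]


def transpose(u):
    return [[u[col][row] for col in range(2)] for row in range(2)]


def act_on_fred(u, m):
    """Amplitudes of (I tensor u) applied to the state m."""
    return [[sum(u[b][c] * m[a][c] for c in range(2)) for b in range(2)]
            for a in range(2)]


def act_on_alice(v, m):
    """Amplitudes of (v tensor I) applied to the state m."""
    return [[sum(v[a][c] * m[c][b] for c in range(2)) for b in range(2)]
            for a in range(2)]


def example_unitary(t, phi, lam):
    """A generic 2x2 unitary (hypothetical example parameters)."""
    return [[math.cos(t), -cmath.exp(1j * lam) * math.sin(t)],
            [cmath.exp(1j * phi) * math.sin(t),
             cmath.exp(1j * (phi + lam)) * math.cos(t)]]


def max_difference(m1, m2):
    return max(abs(m1[a][b] - m2[a][b]) for a in range(2) for b in range(2))


if __name__ == "__main__":
    u = example_unitary(0.4, 0.7, -1.1)
    print(max_difference(act_on_fred(u, PHI_PLUS),
                         act_on_alice(transpose(u), PHI_PLUS)))
```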

In the entanglement-based protocol, then, the attack in which Fred rotates the orientation of the source and detector is equivalent to an attack in which pairs of qubits are prepared by Eve however she pleases and distributed to Alice and Bob, and then Fred rotates both Alice’s and Bob’s qubits slightly (by no more than $\theta$) just before standard measurements are performed. Furthermore, we are assuming that Fred’s attack is individual: the rotation he applies to the $i$th pair is controlled by only the outcome of the flip of the $i$th coin. Therefore, Theorem 4 applies. We can estimate the rate of generation of secure key by calculating the maximum value of

$$ \frac{1}{4} \|U_0 - U_1\|^2_{\rm sup} = \sup_{|\psi\rangle} \frac{1}{2} \left( 1 - {\rm Re}\,\langle\psi| U_1^{-1} U_0 |\psi\rangle \right) , \tag{40} $$

where $U_0$ and $U_1$ are unitary transformations applied to the pairs that are consistent with our characterization of the source and detector.

It is not hard to see that the supremum occurs for $|\psi\rangle$ a maximally entangled state, which, after a suitable choice of basis and phase conventions, we may choose to be $|\phi^+\rangle$. Fred applies separate single-qubit rotations to Alice’s qubit and to Bob’s; acting on $|\phi^+\rangle$, the combined effect of the two is equivalent to a rotation applied to Bob’s qubit alone, by an angle no larger than $2\theta$. The overlap $\langle\phi^+|U_1^{-1}U_0|\phi^+\rangle$ is minimized (for $|\theta| < \pi/4$) if $U_1^{-1} = U_0$; we may choose $U_0$ to be the transformation $I \otimes \tilde{U}_0$, where $\tilde{U}_0$ is the single-qubit rotation

$$ \tilde{U}_0 = \begin{pmatrix} e^{i\theta} & 0 \\ 0 & e^{-i\theta} \end{pmatrix} . \tag{41} $$


We find that

$$ \langle\phi^+| I \otimes \tilde{U}_0^{2} |\phi^+\rangle = \cos 2\theta , \tag{42} $$

which implies

$$ \frac{1}{4} \|U_0 - U_1\|^2_{\rm sup} = \sin^2\theta . \tag{43} $$

From Lemma 3, then, we find that Fred’s attack is $(e \cdot \sin^2\theta + \epsilon)$-balanced for any positive $\epsilon$, and we therefore obtain

Theorem 5. Security of BB84 against individual misalignment of the source and detector. Suppose that, for each signal, Fred can perform a basis-dependent adjustment of the polarization axes of the source and detector by any angle up to $\theta$. Then the BB84 protocol is secure, and secure final key can be extracted from sifted key at the asymptotic rate

$$ R = {\rm Max}\left(1 - H_2(\delta) - H_2(\delta + 2f(e \cdot \sin^2\theta)),\ 0\right) \tag{44} $$

where $\delta$ is the bit error rate found in the verification test and $f(e \cdot \sin^2\theta)$ is defined as in eq. (20). (We assume $\delta + 2f(e \cdot \sin^2\theta) < 1/2$ and $\theta < \pi/4$.)

Thus for $\delta = 0$ we obtain a nonzero rate of key generation for $\theta < 5.92°$.
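The numbers in this section can be reproduced with a few lines of Python. The sketch below checks that the overlap of eq. (42) equals $\cos 2\theta$ for a diagonal rotation with phases $e^{\pm i\theta}$, that eq. (43) then gives $\sin^2\theta$, and that the threshold $\Delta < 0.0289$ quoted after Theorem 2, combined with $\Delta = e \sin^2\theta$, corresponds to an angle of about $5.92°$. The function names are ours, and the diagonal form of the rotation is the assumption stated above.

```python
import cmath
import math


def bell_overlap(theta: float) -> complex:
    """<phi+| I tensor D^2 |phi+> for D = diag(e^{i theta}, e^{-i theta}).

    For a maximally entangled state this equals tr(D^2) / 2.
    """
    d_squared = [cmath.exp(2j * theta), cmath.exp(-2j * theta)]
    return (d_squared[0] + d_squared[1]) / 2.0


def quarter_sup_norm_squared(theta: float) -> float:
    """Right-hand side of eq. (40) evaluated on |phi+>."""
    return 0.5 * (1.0 - bell_overlap(theta).real)


def max_misalignment_degrees(balance_threshold: float = 0.0289) -> float:
    """Angle at which e * sin^2(theta) reaches the threshold on Delta."""
    return math.degrees(math.asin(math.sqrt(balance_threshold / math.e)))


if __name__ == "__main__":
    print(bell_overlap(0.3).real, math.cos(0.6))
    print(quarter_sup_norm_squared(0.3), math.sin(0.3) ** 2)
    print(max_misalignment_degrees())
```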

We remark again that in the formulation of Theorem 5 the misalignment of the detector or source is assumed to be adversarial, within the angular tolerance specified in our characterization of the device: Alice and Bob wish to conceal the key from the Eve/Fred alliance. The arguments of Mayers [2] (for detectors) and Koashi-Preskill [6] (for sources) apply to an uncharacterized misalignment that is not adversarial: Alice and Bob wish to conceal the key from Eve and don’t care what Fred knows. In that case, the large potential misalignments do not reduce the key generation rate below that achievable with perfect devices, given a specified bit error rate $\delta$ observed in the test. However, the conclusion of [2] about security in the case of an uncharacterized detector applies only if the source is perfect, and likewise the conclusion of [6] about the case of an uncharacterized source applies only if the detector is perfect. In contrast, our analysis applies to the case where both the detector and the source are subject to a characterized misalignment.

11  Generic  individual  flaws  in  source  and  detector 

Suppose that the source and detector are both subject to individual flaws that depend weakly on the basis. The source can be modeled as in Sec. 8: For each signal to be sent, Fred first prepares a joint state of a qubit $A'$ and a general system $A$. The state that Fred prepares can depend on the basis. Alice then launches the signal by performing a standard measurement on the qubit. If $n$ signals are to be sent, Fred prepares a product state $\bigotimes_{i=1}^{n} \rho^{(i)}_{a_i}$, where $a_i$ denotes the basis choice for the $i$th signal. Thus we say that Fred’s attack on the source is individual. The basis dependence of the source is weak in the sense that $\sqrt{F(\rho^{(i)}_0, \rho^{(i)}_1)} \ge 1 - 2\epsilon_s$ for each $i$.

We model the detector as follows: Each signal, after Eve’s basis-independent attack, is a state of a general system $B$. The signal is received by Fred, who applies a channel that “squashes” the signal to a qubit $B'$; Fred’s channel may depend on the basis in which Bob will conduct his measurement. Then Bob performs a standard single-qubit measurement on the qubit. Fred’s basis-dependent squash can be realized as a basis-dependent isometric embedding of $B$ in $B'E$, where $E$ is a suitable ancilla. If $n$ signals are received by the detector, this transformation can be expressed as the tensor product $\bigotimes_{i=1}^{n} U^{(i)}_{a_i}$. Thus we say that Fred’s attack on the detector is individual. Furthermore the attack depends only weakly on the basis, in the sense that $\frac{1}{4}\|U^{(i)}_0 - U^{(i)}_1\|^2_{\rm sup} < \epsilon_d$ for each $i$.

By simultaneously allowing basis-dependent flaws in the source and in the detector, we are going beyond the analysis in Sec. 8 and Sec. 9. But we may anticipate that, as in those cases considered previously, we can show that the attack is $\Delta$-balanced for small $\Delta$, if $\epsilon_s$ and $\epsilon_d$ are small. Indeed, this is the case; for example, if $\epsilon_s = \epsilon_d = \epsilon$ we can show

Lemma 5. Suppose that the source and detector are subject to basis-dependent flaws. The $i$th signal sent by Alice is prepared by performing a standard qubit measurement on half of an entangled state; this state is $\rho^{(i)}_0$ when the Z basis is declared and $\rho^{(i)}_1$ when the X basis is declared, where $\sqrt{F(\rho^{(i)}_0, \rho^{(i)}_1)} \ge 1 - 2\epsilon$ for all $i$. The $i$th signal received by the detector is first squashed to a qubit and then a standard measurement is performed. The squash is described by a channel that can be realized by the isometric embedding $U^{(i)}_0$ when the Z basis is declared and by $U^{(i)}_1$ when the X basis is declared, where $\frac{1}{4}\|U^{(i)}_0 - U^{(i)}_1\|^2_{\rm sup} < \epsilon$ for each $i$. This attack is $\Delta$-balanced for any $\Delta > 8\sqrt{\epsilon} + 4\epsilon$.

Lemma  5,  together  with  Theorem  2,  provides  a  proof  of  security  for  generic  individual  flaws 
in  the  source  and  detector  that  depend  sufficiently  weakly  on  the  basis.  We  omit  the  proof  of 
Lemma  5,  which  is  rather  long  and  unenlightening. 

A surprising feature of Lemma 5 is the term scaling like $\sqrt{\epsilon}$ in our bound on $\Delta$. One might reasonably have expected a stronger result, that the attack is $\Delta$-balanced for some $\Delta$ linear in $\epsilon$. However, we have not succeeded in proving a linear bound.

12  Tagged  signals 

Suppose that a fraction $\Delta$ of the qubits emitted by the source are tagged by Fred. The tag informs Eve which basis was used, so that she can measure the qubit without disturbing it. Eve has no information about the basis used for the untagged qubits (a fraction $1 - \Delta$ of the total).

Note that tagged qubits arise in QKD with weak coherent states. The phase of a signal emitted by a coherent light source may be regarded as random if Eve has no information about the phase [11, 32], so that the signal state is a mixture of photon number eigenstates. If the source emits more than one photon, we pessimistically assume that Eve stores the extra photons until after the bases are broadcast, and then measures in the proper basis to learn the key bit without introducing any disturbance. Then the tagging probability is $\Delta = p_M/p_D$, where $p_M$ is the probability of emitting a multiphoton, and $p_D$ is the probability that an emitted photon is detected (we pessimistically assume that all of the photons that fail to arrive were emitted as single photons). Arguably we know $p_M$ if we understand our source well, and $p_D$ can be measured. Hence $\Delta$ is a known (or at least knowable) parameter characterizing a practical implementation of quantum key distribution.
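As an illustration of how $\Delta = p_M/p_D$ might be estimated, the following Python sketch assumes that the phase-randomized source has a Poissonian photon-number distribution with mean photon number `mu`, so that the multiphoton probability is $1 - e^{-\mu}(1 + \mu)$. The Poisson model, the function names, and the example values of `mu` and `p_detect` are assumptions made for this sketch; the text above only requires that $p_M$ be known and $p_D$ be measured.

```python
import math


def multiphoton_probability(mu: float) -> float:
    """Pr[two or more photons] for a Poisson distribution with mean mu."""
    return 1.0 - math.exp(-mu) * (1.0 + mu)


def tagging_fraction(mu: float, p_detect: float) -> float:
    """Pessimistic tagging probability Delta = p_M / p_D, capped at 1."""
    if not 0.0 < p_detect <= 1.0:
        raise ValueError("p_detect must lie in (0, 1]")
    return min(multiphoton_probability(mu) / p_detect, 1.0)


if __name__ == "__main__":
    # Hypothetical operating point: mean photon number 0.1, detection probability 5%.
    print(multiphoton_probability(0.1))
    print(tagging_fraction(0.1, 0.05))
```

Because $p_M$ scales like $\mu^2/2$ for small $\mu$ while $p_D$ scales at best linearly in $\mu$, lowering the mean photon number reduces $\Delta$ in this model.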

We can incorporate tagging into our source model by allowing Fred to append to each qubit emitted by Alice’s source an auxiliary qutrit that conveys information about the basis to Eve. For a fraction $\Delta$ of the signals (Fred gets to decide which ones), he sets the value of the qutrit to $|a\rangle$, where $a = 0$ indicates the Z basis and $a = 1$ indicates the X basis. For the remaining fraction $1 - \Delta$ of the qubits sent by Alice, Fred sets the qutrit to $|2\rangle$, passing no basis information to Eve. Eve can read the auxiliary qutrit to learn the basis for each tagged qubit, and so measure the key bit without introducing any disturbance. If each coin that determines the basis choice is a qubit initially prepared in the $X = 1$ eigenstate $(|0\rangle + |1\rangle)/\sqrt{2}$, then Fred’s attack causes the coin to decohere in the $\{|0\rangle, |1\rangle\}$ basis if the corresponding signal is tagged, but leaves the coin undisturbed if the signal is untagged.

It follows that the attack is $(\Delta/2 + \epsilon)$-balanced for any positive $\epsilon$, and we could prove security by applying Theorem 2. But in this case it is possible to prove a stronger result, because we know more about the quantum state of the coins. Suppose that, as in Sec. 7, we apply controlled-NOT gates from the pairs to the coins, transforming $n'_{\rm gap}$ to $\frac{1}{2}\sum_{i} Z_{{\rm coin},i}$. The action on the coin of a controlled-NOT gate preserves an X-eigenstate. Therefore, the probability distribution governing the value of $n'_{\rm gap}$ is the same as the probability distribution governing $\frac{1}{2}\sum_{i} Z_{{\rm coin},i}$ in a state of the $n$ coins with the property that $n(1 - \Delta)$ of the coins are in eigenstates of $X$ with eigenvalue $+1$. Hence with high probability $|n'_{\rm gap}| < n(\Delta/2 + \epsilon)$ for any positive $\epsilon$. A similar argument applies to $|n''_{\rm gap}|$, and we find that $|n_{\rm gap}| = |n'_{\rm gap} + n''_{\rm gap}| < n(\Delta + \epsilon)$ for any positive $\epsilon$. We conclude that secure key can be extracted from sifted key at the asymptotic rate

$$ R = 1 - H_2(\delta) - H_2(\delta + \Delta) , \tag{45} $$

where we have assumed that $\delta + \Delta < 1/2$.

Note that to obtain the upper bound on $n_{\rm gap}$, all that we needed was the property that Fred interacts with no more than $n\Delta$ of the coins. Therefore, the argument can be applied more broadly than to the particular tagging model that we have defined above. For example, it applies to a setting where there are flaws in the random number generators used by Alice and Bob to select the basis and the key bits. Suppose that for $n(1 - \Delta)$ of the signals, the basis choice and the key bit are chosen by flipping fair coins, but for $n\Delta$ of the signals, Fred is free to choose the basis and the key bit however he chooses. In this model, if the source and the detector are perfect otherwise, Fred need not touch $n(1 - \Delta)$ of the coins, and secure key can be generated at the rate eq. (45). (In this estimate of the rate, however, we have continued to assume that the qubits selected for the verification test are a fair sample, and so provide an accurate estimate of the error rate for the key generating pairs. The argument can be extended further to cover the case where Fred is permitted to select a small portion of the test set, by adjusting the estimate of the error rate to take into account the bias in the test.)

With a more sophisticated argument we can obtain a higher rate of secure key generation than eq. (45). After correcting errors in the sifted key (sacrificing a fraction $H_2(\delta)$ of the key, asymptotically) we imagine executing privacy amplification on two different strings, the sifted key bits arising from the tagged qubits and the sifted key bits arising from the untagged qubits. Since the privacy amplification scheme described in Sec. 3 is linear (the private key can be computed by applying the $C_2$ parity check matrix to the sifted key after error correction), the key obtained is the bitwise XOR

$$ s_{\rm untagged} \oplus s_{\rm tagged} \tag{46} $$

of keys that could be obtained from the tagged and untagged bits separately. If $s_{\rm untagged}$ is private and random, then it doesn’t matter if Eve knows everything about $s_{\rm tagged}$: the sum is still private and random.
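The linearity used in eq. (46) can be made concrete with a toy example. In the Python sketch below, a random binary matrix stands in for the $C_2$ parity check matrix (an assumption made purely for illustration; no actual CSS code is constructed), and the sifted key is split into its tagged and untagged parts by zeroing the complementary positions. Applying the matrix to the full key then gives the XOR of the results obtained from the two parts.

```python
import random


def parity_checks(matrix, bits):
    """Apply a binary matrix to a bit vector over GF(2)."""
    return [sum(r & b for r, b in zip(row, bits)) % 2 for row in matrix]


def xor(u, v):
    return [a ^ b for a, b in zip(u, v)]


rng = random.Random(0)           # fixed seed so the example is reproducible
N_BITS, N_CHECKS = 12, 4         # hypothetical toy sizes

# Stand-in for the C2 parity check matrix (random, for illustration only).
matrix = [[rng.randint(0, 1) for _ in range(N_BITS)] for _ in range(N_CHECKS)]
sifted = [rng.randint(0, 1) for _ in range(N_BITS)]
tagged = [rng.random() < 0.25 for _ in range(N_BITS)]

# Split the sifted key into tagged and untagged parts of full length.
sifted_tagged = [b if t else 0 for b, t in zip(sifted, tagged)]
sifted_untagged = [0 if t else b for b, t in zip(sifted, tagged)]

key = parity_checks(matrix, sifted)
key_from_parts = xor(parity_checks(matrix, sifted_untagged),
                     parity_checks(matrix, sifted_tagged))

if __name__ == "__main__":
    print(key, key_from_parts)
```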




Therefore we ask if privacy amplification is successful when applied to the untagged bits alone. Under the worst case assumption that the bit error rate is zero for tagged qubits, the overall bit error rate $\delta$ is related to the bit error rate $\delta_{\rm untagged}$ for the untagged qubits by

$$ \delta = (1 - \Delta)\,\delta_{\rm untagged} . \tag{47} $$

Since the bit errors and phase errors are related by symmetry for the untagged qubits, the phase error rate $\delta_{p,{\rm untagged}}$ for the untagged qubits satisfies

$$ \delta_{p,{\rm untagged}} < \delta_{\rm untagged} + \epsilon = \frac{\delta}{1 - \Delta} + \epsilon \tag{48} $$

with high probability. Since the error rate $\delta$ observed in the test provides a good estimate of $\delta$ for the key generating pairs, we conclude that

$$ \delta_{p,{\rm untagged}} < \frac{\delta}{1 - \Delta} + \epsilon' \tag{49} $$

with high probability, for any positive $\epsilon'$ and sufficiently large $n$. If there are $n$ bits of sifted key, then $(1 - \Delta)n$ of these bits come from untagged qubits, and (since bit errors are already corrected) we can extract a private key by sacrificing a fraction $H_2(\delta_{p,{\rm untagged}} + \epsilon'')$ of these for privacy amplification. Thus we have proved:

Theorem 6. Security of BB84 against tagging. Suppose that Fred interacts with only
$n\Delta$ of the $n$ coins that determine the basis used by Alice and Bob. Then the BB84 protocol is
secure, and secure final key can be extracted from sifted key at the asymptotic rate

$$ R = {\rm Max}\left( (1-\Delta) - H_2(\delta) - (1-\Delta)\,H_2\!\left(\frac{\delta}{1-\Delta}\right),\ 0 \right) \tag{50} $$

where $\delta$ is the bit error rate found in the verification test (assuming $\delta/(1-\Delta) < 1/2$). In
particular, this rate of key generation is achievable, assuming that the source and the detector
are perfect otherwise, if Fred reveals the basis to Eve for $n\Delta$ of the signals, or if Fred chooses
the basis and key bits for $n\Delta$ of the signals.
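
As a numerical illustration of the rate in eq. (50), the following minimal Python sketch evaluates $R$ for a given test error rate and tagged fraction. The function names `h2` and `tagging_rate` are ours and purely illustrative, and returning zero when $\delta/(1-\Delta) \ge 1/2$ is an assumption made for convenience, since the theorem is stated only for $\delta/(1-\Delta) < 1/2$.

```python
import math


def h2(x: float) -> float:
    """Binary entropy H_2(x) in bits, with H_2(0) = H_2(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1.0 - x) * math.log2(1.0 - x)


def tagging_rate(delta: float, tagged: float) -> float:
    """Asymptotic rate of eq. (50) for bit error rate delta and tagged fraction Delta."""
    if not 0.0 <= tagged < 1.0:
        raise ValueError("tagged fraction must lie in [0, 1)")
    untagged = 1.0 - tagged
    # The theorem assumes delta / (1 - Delta) < 1/2; outside that range we
    # simply report zero rate (an assumption of this sketch).
    if delta / untagged >= 0.5:
        return 0.0
    rate = untagged - h2(delta) - untagged * h2(delta / untagged)
    return max(rate, 0.0)


if __name__ == "__main__":
    for tagged in (0.0, 0.05, 0.1, 0.2):
        print(f"Delta = {tagged:.2f}: R = {tagging_rate(0.03, tagged):.4f}")
```

With no tagging ($\Delta = 0$) the expression reduces to $1 - 2H_2(\delta)$, and the rate decreases as the tagged fraction grows.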

In the case where the source emits weak coherent states with random phases, a rate of key
generation similar to eq. (50) was established by Inamori, Lütkenhaus, and Mayers (ILM) [8].
Actually, the rate quoted by ILM is below $R$ in Eq. (50): in their Eq. (18) the argument
of $H_2$ in the last term is $2\delta/(1-\Delta)$ rather than $\delta/(1-\Delta)$. However, we believe that their
argument can be refined to match the rate Eq. (50). With that refinement the ILM result
is stronger in a sense than what we have derived here, as it applies to the case of a general
uncharacterized detector.

Theorem 6 can be applied if there is loss in the quantum channel connecting Alice and Bob
and/or if Bob's detector has imperfect efficiency, provided that the loss is basis-independent.
For example, suppose that each signal emitted by Alice's source is a phase-randomized weak
coherent state with mean photon number $\mu \ll 1$, so that the signal is a single photon with
probability $p_1 = \mu + O(\mu^2)$, and more than one photon with probability $p_M = \frac{1}{2}\mu^2 +
O(\mu^3)$. We can describe these signals by imagining a source that never emits multiple photons,
followed by a basis-dependent attack by Fred in which Fred interacts with a fraction $p_M$ of
all the coins. Now suppose that Eve's attack can be modeled by a basis-independent lossy
channel, such that a fraction $\eta$ of all the nonvacuum signals are detected. (Here by "basis-independent"
we mean that Eve's attack has no a priori dependence on the basis, though
of course Eve can exploit the multiphotons to acquire some information about the basis; the
important thing is that Eve can launch her attack without interacting with the coins.) Then
a fraction $p_D = \eta\,(\mu + O(\mu^2))$ of all the signals are detected, and of the coins associated with
detected signals, Fred interacts with at most a fraction

$$ \Delta = p_M/p_D = \frac{1}{2\eta}\left(\mu + O(\mu^2)\right) . \tag{51} $$


Sifted key is generated at the rate

$$ \frac{1}{2}\nu p_D \approx \nu\eta^2\Delta , \tag{52} $$

where $\nu$ is the repetition frequency of the source. Therefore, if $\Delta$ (and hence also the rate
$R$ of generation of final key from sifted key) is held fixed as $\eta$ gets small, then the overall
key generation rate $\frac{1}{2}\nu p_D R$ is $O(\eta^2)$, as ILM observed [8]. This scaling of the rate with
$\eta$ holds approximately as long as dark counts in the detector are not too important, so
that the bit error rate $\delta$ is roughly independent of $\eta$. In some current implementations of
quantum key distribution using weak coherent states transmitted through optical fibers, dark
counts are relatively unimportant, and our analysis of security is applicable, up to a range of
approximately 20 km.
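
The scaling argument around eqs. (51) and (52) can be checked with a short leading-order calculation. The sketch below is illustrative only: it keeps just the lowest-order terms in $\mu$, and the function name `weak_coherent_leading_order` and its arguments are hypothetical.

```python
def weak_coherent_leading_order(eta: float, tagged: float, rep_rate: float):
    """Leading-order source parameters for fixed tagged fraction Delta.

    eta      : fraction of nonvacuum signals that are detected
    tagged   : target tagged fraction Delta, eq. (51)
    rep_rate : repetition frequency nu of the source (signals per second)
    Returns (mu, p_detect, sifted_rate), dropping O(mu^2) corrections.
    """
    mu = 2.0 * eta * tagged               # invert Delta ~ mu / (2 eta)
    p_detect = eta * mu                   # p_D ~ eta * mu
    sifted_rate = 0.5 * rep_rate * p_detect   # eq. (52): ~ nu * eta^2 * Delta
    return mu, p_detect, sifted_rate


if __name__ == "__main__":
    for eta in (0.1, 0.05, 0.025):
        mu, p_d, sifted = weak_coherent_leading_order(eta, tagged=0.1, rep_rate=1.0e6)
        print(f"eta = {eta}: mu = {mu:.4f}, sifted key rate = {sifted:.1f} bits/s")
```

Holding $\Delta$ fixed forces $\mu$ to shrink in proportion to $\eta$, which is why halving $\eta$ reduces the sifted key rate by a factor of four in this approximation.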

Theorem 6 applies if Fred tags any $n\Delta$ of the signals. But it does not apply to a coherent
superposition of such attacks. Suppose, for example, that in the entanglement-based protocol,
Fred's attack on the pairs and the coins produces a state

$$ |\Psi\rangle = \sum_{S:\,|S|\le n\Delta} a_S\,|\Psi_S\rangle ; \tag{53} $$

here the sum is over subsets $S$ that contain no more than $n\Delta$ of the $n$ pairs, and $|\Psi_S\rangle$ is the
state resulting from tagging the pairs in the set $S$. Although Theorem 6 does not apply to a
general superposition of tagged states as in eq. (53), Theorem 2 does apply to this case. After
we trace out Fred's labeling qutrits, the state of the coin can be realized as an ensemble of
states, where for each state in the ensemble at least $n(1-\Delta)$ of the coins are $X = 1$ eigenstates
and the rest are mixtures of $Z$ eigenstates. Therefore, if the coins are all measured in the
$X$ basis, with high probability the number of coins for which the outcome $X = -1$ is found
will be less than $n(\Delta/2 + \varepsilon)$. Thus the attack is $(\Delta/2 + \varepsilon)$-balanced for any positive $\varepsilon$ and
sufficiently large $n$, and it follows from Theorem 2 that secure key can be generated at the
corresponding rate (a lower rate than found in Theorem 6).

In  particular,  then,  Theorem  2  can  be  applied  to  a  general  source  that  emits  signals  that 
are  sufficiently  close  to  perfect  single  photon  pulses,  even  if  the  multiphotons  occur  with 
nonrandom  phases.  Unfortunately,  though,  our  arguments  do  not  allow  us  to  address  the 
case  where  the  source  emits  weak  coherent  states  with  nonrandom  phases  —  in  that  case  the 
states  are  dominated  by  the  amplitude  to  emit  the  vacuum  state,  and  the  tagging  model  we 
have  analyzed  here  does  not  apply.  This  difficulty  seems  to  be  more  than  a  mere  shortcoming 
of our model; the deeper problem is that weak coherent states with nonrandom phases leak a
significant  amount  of  basis  information,  which  may  compromise  security. 

As for all of the cases that we consider in this paper, the crux of our analysis of tagging is
a bound on the phase error rate $\tilde\delta_p$ of the key generating pairs that holds with high probability;
it does not suffice for $\tilde\delta_p$ to be bounded after averaging over Fred's strategy. Therefore, our
security proof need not apply for a highly correlated basis-dependent attack on the signals,
even if the bit error rate $\tilde\delta$ and phase error rate $\tilde\delta_p$ resulting from the attack have mean values
that are nearly equal.$^b$

For  example,  suppose  that  with  a  small  probability  r,  Fred  tells  Eve  the  basis  for  every 
signal,  while  with  probability  1  —  r,  Fred  tells  Eve  nothing.  Then  on  average  the  disparity 
between  the  bit  error  rate  and  the  phase  error  rate  is  small.  However,  with  a  fixed  probability 
r  that  does  not  depend  on  the  key  length,  Eve  can  learn  the  whole  key.  Therefore,  the 
quantum  key  distribution  protocol  is  insecure  for  a  source  of  this  type. 


13  Trojan  pony 


Suppose that the detector is not perfectly efficient. A fraction $\Delta$ of the signals that enter the
detector fail to trigger it, resulting in no recorded outcome. Suppose further that Fred, who
knows Bob's basis, controls whether the detector fires or not, subject to the constraint that
only a fraction $\Delta$ of the detection events can be eliminated. Note that the parameter $\Delta$ can
be measured in the protocol.

Fred  can  use  his  power  to  disguise  Eve’s  attack,  enhancing  the  detection  rate  when  Bob 
measures  in  the  same  basis  as  Eve  did  and  suppressing  the  detection  rate  when  Bob  measures 
in  a  different  basis  than  Eve’s.  This  is  a  limited  version  of  the  “Trojan  horse”  attack  [28]  — 
we  call  it  the  “Trojan  pony.”  As  we  remarked  in  Sec.  5,  one  version  of  the  Trojan  pony  attack 
can  be  launched  if  Bob’s  detector  is  configured  as  a  polarization  beam  splitter  that  directs 
the  signals  to  a  pair  of  threshold  detectors;  Eve  can  ensure  that  the  detector  fails  to  register  a 
conclusive  result  by  flooding  it  with  many  photons.  We  will  analyze  this  attack  in  a  different 
setting,  in  which  Bob’s  detector  receives  qubits  rather  than  bosonic  modes. 

In the EDP setting, we allow Fred to eliminate a fraction $\Delta$ of the pairs (corresponding
to the qubits for which he "turns off" Bob's detector). In the worst case, every pair that he
eliminates has a bit error and no phase error. Before any pairs were eliminated, the error rate
was essentially the same in both bases; call this rate $p$. After eliminating the undetected
pairs, the error rates are

$$ \tilde\delta \approx \frac{p-\Delta}{1-\Delta} , \qquad \tilde\delta_p \approx \frac{p}{1-\Delta} \tag{54} $$

(assuming $\Delta < p < 1-\Delta$). Note that, for ease of presentation, we have not included the
$\varepsilon$'s in eq. (54); instead we have used the symbol $\approx$ to indicate relations that are satisfied to
arbitrarily good accuracy with high probability asymptotically. Eliminating $p$ we find

$$ \tilde\delta_p \approx \tilde\delta + \frac{\Delta}{1-\Delta} \tag{55} $$


and, since the error rate $\delta$ measured in the test provides a reliable estimate of $\tilde\delta$, we infer that
final key can be generated from sifted key at the achievable rate

$$ R = 1 - H_2(\delta) - H_2\!\left(\delta + \frac{\Delta}{1-\Delta}\right) , \tag{56} $$

where we have assumed that

$$ \delta + \frac{\Delta}{1-\Delta} < 1/2 . \tag{57} $$

$^b$ We thank Dominic Mayers for a helpful discussion of this point.

We can use similar reasoning if the detector efficiency is low, but we trust that most
of the instances where the detector fails to fire are chosen at random, and only a small
percentage of all the detector failures are due to Fred's intervention. In the absence of other
imperfections, random misfires merely reduce the number of sifted key bits, but without
breaking the symmetry between the bases. Eq. (56) still applies if a fraction $f$ of detection
events are removed by random errors, and a fraction $\Delta$ of the remaining events are removed
adversarially, resulting in an overall efficiency $\eta = (1-f)(1-\Delta)$. Thus we have proved

Theorem 7. Security of BB84 against basis-dependent detector efficiency. Suppose
that of the signals that arrive at Bob's detector, a fraction $f$ chosen at random are removed,
and of those that remain a fraction $\Delta$ chosen adversarially by Fred are also removed, so that
the overall efficiency of the detector is $\eta = (1-f)(1-\Delta)$. Then the BB84 protocol is secure,
and secure final key can be extracted from the (detected) sifted key at the asymptotic rate

$$ R = {\rm Max}\left( 1 - H_2(\delta) - H_2\!\left(\delta + \frac{\Delta}{1-\Delta}\right),\ 0 \right) \tag{58} $$

where $\delta$ is the bit error rate found in the verification test (assuming $\delta + \Delta/(1-\Delta) < 1/2$).
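
To get a feeling for how quickly adversarial detector suppression erodes the rate in eq. (58), one can evaluate it numerically. The following is a minimal sketch; the names `h2`, `trojan_pony_rate` and `overall_efficiency` are illustrative, and returning zero when $\delta + \Delta/(1-\Delta) \ge 1/2$ is our assumption, since the theorem is stated only below that threshold.

```python
import math


def h2(x: float) -> float:
    """Binary entropy H_2(x) in bits, with H_2(0) = H_2(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1.0 - x) * math.log2(1.0 - x)


def trojan_pony_rate(delta: float, adversarial: float) -> float:
    """Asymptotic rate of eq. (58) for bit error rate delta and adversarial fraction Delta."""
    if not 0.0 <= adversarial < 1.0:
        raise ValueError("adversarial fraction must lie in [0, 1)")
    phase_bound = delta + adversarial / (1.0 - adversarial)  # bound of eq. (55)
    if phase_bound >= 0.5:
        return 0.0  # outside the range assumed in the theorem (sketch convention)
    return max(1.0 - h2(delta) - h2(phase_bound), 0.0)


def overall_efficiency(random_loss: float, adversarial: float) -> float:
    """Overall detector efficiency eta = (1 - f)(1 - Delta)."""
    return (1.0 - random_loss) * (1.0 - adversarial)


if __name__ == "__main__":
    for adversarial in (0.0, 0.01, 0.05, 0.1):
        print(f"Delta = {adversarial:.2f}: R = {trojan_pony_rate(0.03, adversarial):.4f}")
```

Note that the random loss fraction $f$ enters only the overall efficiency, not the rate per detected sifted bit, in agreement with the statement of Theorem 7.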

Note that we can measure the efficiency $\eta$ in the protocol, but can determine $\Delta$ only by
acquiring a good understanding of the vulnerability of the detector to tampering. In fact,
in current implementations the typical efficiency for detection of single photons at
telecommunication wavelengths is about 15% [33]. Theorem 7 can also be applied to the case where
basis-dependent losses occur in the quantum channel connecting Alice and Bob, with $\Delta$
parametrizing the basis dependence.

ILM [8, 27] discussed the specific type of Trojan pony attack in which Eve floods Bob's
polarization beam splitter with many photons of the same polarization, generating a "double
click" in Bob's two photon detectors when he tries to measure the polarization in the conjugate
basis. For this case they proposed that Bob choose his key bit randomly each time he
encounters a double click event. Security of this scheme is ensured by the result of Mayers
[2]: the POVM that assigns a random outcome to the "double-click" subspace is a possible
measurement that Fred could arrange, and Mayers proved security for an arbitrary detector
POVM. If double clicks occur a fraction $\Delta$ of the time, and the bit error rate is $\delta$ when single
clicks occur, then the overall error rate under the ILM prescription will be $(1-\Delta)\delta + \Delta/2$,
resulting in a key generation rate

$$ R = 1 - 2H_2\left((1-\Delta)\delta + \Delta/2\right) . \tag{59} $$

The rate is further enhanced by the factor $(1-\Delta)^{-1}$ relative to Eq. (56), since all detection
events, including the double clicks, contribute to the sifted key. Thus the achievable rate
established by ILM exceeds the rate we have derived, except for relatively large $\Delta$ and
relatively small $\delta$. However, the two results cannot be compared directly, because they apply to
two different models of the adversary. The ILM result Eq. (59) applies to a particular Trojan
pony attack that can be launched by Eve if the Bob/Fred POVM has suitable properties;
it provides a condition for Eve (but not Fred) to have negligible information about the key.
Eq. (56) is the rate at which key can be extracted under a Trojan pony attack in which Fred
receives qubits, and can prevent some of the qubits from registering in Bob's detector. But
in this case the key is kept secret not just from Eve but from the Eve/Fred alliance.

A  different  type  of  issue  relating  to  detector  efficiency  arises  if  the  detector  failures  occur  at 
different  rates  when  measuring  in  the  X  and  Z  bases,  but  are  otherwise  randomly  distributed. 
The  bias  in  the  detector  efficiency  breaks  the  symmetry  between  the  bases,  but  a  simple  variant 
of  the  Shor-Preskill  argument  still  applies.  In  the  entanglement  distillation  picture,  we  may 
imagine  that  Alice  and  Bob  at  first  share  many  noisy  pairs;  furthermore,  after  a  random 
permutation  unknown  to  the  adversary  is  applied,  the  pairs  are  symmetrized  so  that  all  have 
the  same  marginal  density  operator.  Then  some  of  the  pairs  are  removed  from  the  sample 
by a random process. Though the probability of removal may depend on the basis used to
generate the key bit, Alice and Bob can still infer the phase error rate from the bit error rate
if they conduct a refined data analysis [25], measuring separate error rates $\delta_X$ and $\delta_Z$ for
the $X$ and $Z$ bases respectively. If a fraction $p_X$ of the sifted key bits are generated in the
$X$ basis and a fraction $p_Z$ in the $Z$ basis (where $p_X + p_Z = 1$), so that the bit error rate is
$\delta = p_X\delta_X + p_Z\delta_Z$, then the phase error rate to insert in Eq. (4) becomes $\delta_p = p_X\delta_Z + p_Z\delta_X$.
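
The refined data analysis described above amounts to swapping the roles of the two measured error rates when forming the phase error estimate. A minimal Python sketch, with the hypothetical helper name `refined_error_rates`, makes the bookkeeping explicit:

```python
def refined_error_rates(p_x: float, delta_x: float, delta_z: float):
    """Bit and phase error rates when a fraction p_x of sifted bits uses the X basis.

    delta_x, delta_z : error rates measured separately in the X and Z bases
    Returns (delta, delta_p) with
        delta   = p_x * delta_x + p_z * delta_z   (bit error rate)
        delta_p = p_x * delta_z + p_z * delta_x   (phase error rate)
    """
    if not 0.0 <= p_x <= 1.0:
        raise ValueError("p_x must lie in [0, 1]")
    p_z = 1.0 - p_x
    delta = p_x * delta_x + p_z * delta_z
    delta_p = p_x * delta_z + p_z * delta_x
    return delta, delta_p


if __name__ == "__main__":
    delta, delta_p = refined_error_rates(p_x=0.8, delta_x=0.02, delta_z=0.06)
    print(f"bit error rate = {delta:.3f}, phase error rate = {delta_p:.3f}")
```

When the two bases contribute equally ($p_X = p_Z = 1/2$) the two rates coincide; otherwise the phase error rate is weighted toward the error rate of the less frequently used basis.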

14  Conclusions 

We  have  shown  that  the  BB84  quantum  key  distribution  protocol  is  secure  when  the  source 
and/or  detector  are  subject  to  small  errors  that  are  controlled  by  an  adversary  who  knows 
the  basis  used  by  Alice  and  Bob.  We  have  formulated  a  method  for  estimating  the  key 
generation  rate  in  the  presence  of  such  errors,  and  we  have  applied  the  method  to  various 
model  sources  and  detectors.  Our  results  are  complementary  to  earlier  proofs  of  security  [2,  6] 
that  apply  to  flaws  in  the  apparatus  that  may  be  large  but  are  nonadversarial;  furthermore, 
our results, unlike those of [2, 6], apply when both the source and the detector have small
basis-dependent flaws, as will be the case in typical real-world implementations of quantum
key  distribution.  We  have  argued  that  the  security  holes  of  real  sources  and  detectors  can 
be  usefully  investigated  within  our  framework,  and  we  expect  that  the  methods  we  have 
developed  will  find  further  applications. 

However,  the  model  sources  and  detectors  to  which  our  analysis  applies  are  not  completely 
general.  In  our  model  of  the  source,  each  signal  is  launched  by  preparing  an  entangled  state 
of  a  qubit  and  a  general  system,  followed  by  an  ideal  measurement  of  the  qubit.  To  establish 
security,  we  require  that  the  entangled  state  depend  only  weakly  on  the  basis  used  in  the 
protocol.  With  this  model,  we  are  unable  to  treat,  for  example,  the  case  where  the  source 
emits  weak  coherent  states  with  nonrandom  phases.  Likewise,  we  model  the  detector  as  a 
quantum  channel  followed  by  an  ideal  measurement  of  a  qubit,  and  to  establish  security  we 
require  that  the  quantum  channel  depend  only  weakly  on  the  basis.  In  particular,  we  are 
unable  to  treat  the  case  where  the  signals  received  by  the  detector  reside  in  a  Hilbert  space 
of  arbitrarily  high  dimension. 
Various other issues regarding the security of BB84 and other quantum key distribution
protocols  have  not  been  addressed  here.  We  have  not  considered  how  to  characterize  devices 
reliably  using  testing  equipment  that  is  itself  untrustworthy  (as  in  [7]).  We  have  not  discussed 
how to improve the rate of key generation beyond the rate in Eq. (4) through privacy amplification
schemes that use two-way communication between Alice and Bob [15]. Finally, our
security  analysis  applies  to  the  asymptotic  limit  of  an  infinite  key  —  we  have  not  analyzed  the 
practical  aspects  of  error  correction  and  privacy  amplification  in  the  case  of  finite  key  length. 
Appendix A

Here  we  will  prove: 

Lemma 4. Similar channels have similar dilations. Suppose that $\mathcal{E}_0$ and $\mathcal{E}_1$ are quantum
channels mapping a $d$-dimensional system $S$ to a $d'$-dimensional system $T$, such that
$\|\mathcal{E}_0 - \mathcal{E}_1\|_\diamond < \varepsilon$. Then there are dilations $U_0$ and $U_1$ of the channels (isometric embeddings of $S$ in
$TE$, where $E$ is $dd'$-dimensional) such that $\|U_0 - U_1\|_{\sup} < \sqrt{d\varepsilon}$.

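It is convenient to characterize a quantum channel $\mathcal{E}$ mapping system $S$ to system $T$ by
considering the action of $I \otimes \mathcal{E}$ on a reference system $R$ and the system $S$, where $\dim R =
\dim S = d$. Let

$$ |\Phi\rangle = \sum_i |i\rangle_R \otimes |i\rangle_S \tag{A.1} $$

denote an unconventionally normalized maximally entangled pure state on $RS$, satisfying
$\langle\Phi|\Phi\rangle = d$. We may define

$$ \rho = I \otimes \mathcal{E}\left(|\Phi\rangle\langle\Phi|\right) , \tag{A.2} $$

where $\rho$ is an unconventionally normalized density operator on $RT$, satisfying ${\rm tr}\,\rho = d$. The
action of $\mathcal{E}$ on a pure state $|\varphi\rangle = \sum_i a_i |i\rangle$ on $S$ can then be expressed as

$$ \mathcal{E}\left(|\varphi\rangle\langle\varphi|\right) = \langle\varphi^*|\rho|\varphi^*\rangle \tag{A.3} $$

where $|\varphi^*\rangle = \sum_i a_i^* |i\rangle$ is the "index state" on $R$ corresponding to $|\varphi\rangle$. If we introduce an
additional system $E$ (the "environment," of dimension $dd'$), we can construct a purification
$|\Phi'\rangle$ of $\rho$ on $RTE$ such that $\langle\Phi'|\Phi'\rangle = d$. This purification defines a "dilation" $U$ of the
channel $\mathcal{E}$ that realizes $\mathcal{E}$ as an isometric embedding of $S$ in $TE$. The action of the dilation
on the pure state $|\varphi\rangle$ is

$$ |\varphi\rangle \to U|\varphi\rangle = \langle\varphi^*|\Phi'\rangle . \tag{A.4} $$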

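Now suppose that $\mathcal{E}_0$ and $\mathcal{E}_1$ are two channels acting on $S$, satisfying the inequality

$$ \|\mathcal{E}_0 - \mathcal{E}_1\|_\diamond < \varepsilon , \tag{A.5} $$

and that $\rho_0$ and $\rho_1$ are the corresponding states obtained from the action of $I \otimes \mathcal{E}_0$ and $I \otimes \mathcal{E}_1$
on $|\Phi\rangle$. The diamond norm [31] is defined by

$$ \|\mathcal{E}\|_\diamond = \sup_{X \neq 0} \frac{\|I \otimes \mathcal{E}(X)\|_{\rm tr}}{\|X\|_{\rm tr}} ; \tag{A.6} $$

since $\langle\Phi|\Phi\rangle = d$, it follows that the trace distance between $\rho_0$ and $\rho_1$ satisfies

$$ \|\rho_0 - \rho_1\|_{\rm tr} < d\varepsilon . \tag{A.7} $$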

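Since, for conventionally normalized density operators, the fidelity and trace distance are
related by

$$ \sqrt{F(\rho_0, \rho_1)} \ge 1 - \frac{1}{2}\|\rho_0 - \rho_1\|_{\rm tr} , \tag{A.8} $$

it follows [29, 30] that $\rho_0$ and $\rho_1$ have purifications $|\Phi'_0\rangle$ and $|\Phi'_1\rangle$ on $RTE$ with norm $\sqrt{d}$ and
overlap satisfying

$$ {\rm Re}\,\langle\Phi'_1|\Phi'_0\rangle > d\left(1 - \frac{\varepsilon}{2}\right) . \tag{A.9} $$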

This large overlap of the purifications $|\Phi'_0\rangle$ and $|\Phi'_1\rangle$ implies that the corresponding
dilations $U_0$ and $U_1$ of the channels are close to one another in the sup norm. Given any $|\varphi\rangle$ on
$S$, we may regard it as one element of a basis $\{|\varphi_i\rangle\}$ for $S$. Then eq. (A.9) may be rewritten
as

$$ {\rm Re}\sum_{i=1}^{d} \langle\varphi_i|U_1^\dagger U_0|\varphi_i\rangle = {\rm Re}\,\langle\Phi'_1|\Phi'_0\rangle > d\left(1 - \frac{\varepsilon}{2}\right) . \tag{A.10} $$

But  each  of  the  d  terms  in  the  sum  is  no  larger  than  1,  and  since  the  sum  is  greater  than 
Abstract

We  propose  an  atom-cavity  chip  that  combines  laser  cooling  and  trapping  of 
neutral  atoms  with  magnetic  microtraps  and  waveguides  to  deliver  a  cold 
atom  to  the  mode  of  a  fibre  taper  coupled  photonic  bandgap  (PBG)  cavity. 

The  feasibility  of  this  device  for  detecting  single  atoms  is  analysed  using 
both  a  semiclassical  treatment  and  an  unconditional  master  equation 
approach.  Single-atom  detection  seems  achievable  in  an  initial  experiment 
involving  the  non-deterministic  delivery  of  weakly  trapped  atoms  into  the 
mode  of  the  PBG  cavity. 

(Some  figures  in  this  article  are  in  colour  only  in  the  electronic  version) 


1.  Introduction 

The  development  of  techniques  necessary  to  manipulate  single 
atoms  and  photons  and  to  control  their  interactions  is  an 
important  addition  to  the  toolbox  of  nanotechnology.  An 
important  advance  would  be  the  development  of  a  compact 
and  integrable  device  to  serve  as  a  single-atom  detector  [1,2]. 
The  system  comprised  of  a  strongly  interacting  atom  and 
photon — cavity  quantum  electrodynamics  (QED)  [3-5] — 
provides the basis for realizing such a device. These single-atom
detectors could play as important a role in the burgeoning
field  of  atom  optics  [6]  as  single-photon  detectors  do  in 
conventional  optics.  The  advent  of  Bose-Einstein  condensates 
(BECs)  of  neutral  atoms  and  the  production  of  degenerate 
fermionic  condensates  [7]  further  highlights  the  importance 
of  developing  single-atom  read-out  devices. 

To  achieve  these  goals  in  cavity  QED,  a  neutral  atom  must 
be  inside  the  mode  of  a  high  finesse  cavity  with  small  mode 
volume:  the  atom-cavity  system  must  be  in  the  strong  coupling 
regime. Strong coupling requires the atom-cavity coupling,
$g_0$, to be much larger than both the atomic dipole decay rate,
$\gamma_\perp$, and the decay rate of the cavity field, $\kappa$. Specifically, the
saturation photon number, $m_0 = \gamma_\perp^2/2g_0^2$, and the critical atom
number, $N_0 = 2\gamma_\perp\kappa/g_0^2$, must both be much less than unity.


State-of-the-art cavity QED experiments have achieved
strong coupling parameters as small as $[m_0, N_0] \approx
[10^{-4}, 10^{-3}]$ by either dropping [8] or vertically tossing [9]
a  cold  neutral  atom  between  the  mirrors  of  a  high  finesse, 
low  mode  volume  Fabry-Perot  cavity.  Recently,  intracavity 
atom  trapping  for  durations  up  to  3  s  has  been  demonstrated 
by  coupling  a  secondary  optical  beam  into  the  Fabry-Perot 
cavity  to  form  a  far  off  resonance  trap  (FORT)  [10]. 

The  intent  of  this  paper  is  to  introduce  a  cavity  QED 
system  based  on  magnetostatic  delivery  of  atoms  to  a 
photonic  bandgap  cavity,  and  to  discuss  the  ability  of  this 
system  to  detect  single  atoms.  This  experimental  system — 
magnetostatic  confinement  of  atoms  inside  the  field  modes  of 
photonic  bandgap  cavities — raises  the  possibility  of  achieving 
an  experimentally  robust,  integrated,  and  scalable  system. 
Mastering the integration of a single atom and photons,
quintessentially quantum components, presents an entirely
new prospect for technology: quantum computation and
communication.  Cavity  QED  provides  a  rich  experimental 
setting  for  quantum  information  processing  (QIP),  both  in  the 
implementation  of  quantum  logic  gates  and  in  the  development 
of quantum networks [11, 12]. While not necessary for
single-atom detection, confining the atom in the Lamb-Dicke
regime inside the cavity for long periods of time is
an important step towards accomplishing QIP using cavity
QED. An atom is trapped in the Lamb-Dicke regime when its
recoil energy is less than the trap's vibrational level spacing,
$\eta = (E_{\rm recoil}/E_{\rm vib})^{1/2} < 1$. This regime has been achieved
using a FORT [10], and magnetic microwire traps, such as
those discussed in this paper, may also be capable of trapping
atoms three-dimensionally inside a cavity in the Lamb-Dicke
regime [13, 12].

2.  Magnetic  microtraps  and  photonic  bandgap 
cavities 

Patterns of micron-sized wires can create magnetic field
gradients and curvatures sufficiently large to accurately guide
and trap atoms above the surface of the substrate [13].
These  magnetic  microtrap  devices — commonly  known  as 
atom  chips  [14,  15] — can  be  fabricated  using  standard 
photolithography  techniques  [16,  17]  and  have  been 

successfully  used  not  only  to  trap  and  waveguide  neutral 
atoms,  but  also  to  create  and  manipulate  Bose-Einstein 
condensates  [18,  19]. 

Atom chips exploit the interaction potential, $V = -\vec{\mu}\cdot\vec{B}$,
between an atom's magnetic moment, $\vec{\mu}$, and a wire's magnetic
field, $\vec{B}$, to trap or guide weak field seeking states of a neutral
atom. The simplest example of a magnetic microtrap involves
the combination of the field from a U-shaped wire with a
homogeneous bias field, $B_{\rm bias}$ [20]. The bias field, parallel to
the wire substrate and perpendicular to the base of the U-wire,
serves to cancel the curling field of the wire to form a
two-dimensional quadrupole trap for the weak field seeking atoms.
The atoms are confined in the third dimension by the fields from
the side wires of the U-trap, forming a cigar-shaped trap above
the wire surface. The position of the trap minimum above the
wire surface, $r$, and the gradient of the trap are completely
determined by the magnitude of $B_{\rm bias}$ and the current, $I$, in the
U-wire,

$$ r = \frac{\mu_0}{2\pi}\frac{I}{B_{\rm bias}} , \qquad \nabla B = \frac{2\pi}{\mu_0}\frac{B_{\rm bias}^2}{I} . \tag{1} $$

For example, with a wire current of 1 A and a bias field of
10 G, the atoms are trapped 200 $\mu$m above the surface in
a field gradient (perpendicular to the base of the U-wire)
of 500 G cm$^{-1}$. Ioffe traps, which are not susceptible
to trap losses due to Majorana spin flips, may be formed
either by a similar Z-trap [20] or by using wires forming
patterns of nested arcs [13]. Although this latter Ioffe trap
is more complicated, it does allow the possibility of trapping
atoms three-dimensionally in the Lamb-Dicke regime inside a
photonic bandgap cavity coplanar with the wires [12]. Simple
waveguides for the atoms can be formed from the Z-trap
by extending the base of the Z-wire, allowing the atoms
to ballistically expand along the field minimum above the
elongated wire. Beam splitters and conveyor belts have been
demonstrated using similar techniques [14, 15].
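
The relations between trap height, field gradient, wire current and bias field quoted above for the U-wire are easy to evaluate numerically. The sketch below is illustrative: it uses the idealized infinite thin wire expressions $r = \mu_0 I / (2\pi B_{\rm bias})$ and $\nabla B = B_{\rm bias}/r$, and the function name `u_wire_trap` and its unit conventions are our own choices.

```python
import math

MU_0 = 4.0e-7 * math.pi  # vacuum permeability in T m / A


def u_wire_trap(current_a: float, bias_field_gauss: float):
    """Trap height (micrometres) and field gradient (G/cm) above a thin wire.

    current_a        : wire current I in amperes
    bias_field_gauss : homogeneous bias field B_bias in gauss
    """
    b_bias_t = bias_field_gauss * 1.0e-4                       # gauss -> tesla
    height_m = MU_0 * current_a / (2.0 * math.pi * b_bias_t)   # r = mu0 I / (2 pi B)
    gradient_t_per_m = b_bias_t / height_m                     # = 2 pi B^2 / (mu0 I)
    # 1 T/m = 1e4 G per 100 cm = 100 G/cm
    return height_m * 1.0e6, gradient_t_per_m * 100.0


if __name__ == "__main__":
    height_um, gradient_g_per_cm = u_wire_trap(current_a=1.0, bias_field_gauss=10.0)
    print(f"trap height = {height_um:.0f} um, gradient = {gradient_g_per_cm:.0f} G/cm")
```

For a 1 A current and a 10 G bias field this reproduces the 200 $\mu$m height and 500 G cm$^{-1}$ gradient quoted in the text; finite wire width and the side wires of the U are neglected here.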

Standard  laser  cooling  and  trapping  techniques  [21]  are 
used  to  load  cold  atoms  into  the  magnetic  microtraps  and 
waveguides.  Typically,  atoms  are  collected  in  a  variant  of  the 
magneto-optical  trap  (MOT)  that  uses  the  atom  chip  surface 
as  a  mirror  to  form  four  of  the  six  required  laser  cooling 
beams  [20].  This  mirror  MOT  and  subsequent  sub-Doppler 


cooling allow the collection of approximately $10^6$ atoms
of temperature 10 $\mu$K a few millimetres above the chip's
surface. Conveniently, the quadrupole field from the U-trap
is in the same orientation as the magnetic field required to
form a mirror MOT. In the simplest case, the atoms can
be transferred to the U-trap by replacing the mirror MOT's
quadrupole field with that of the U-trap while maintaining
the cooling lasers in the same configuration: this creates a
U-MOT using the microwire magnetic field. An alternative
and  more  experimentally  compact  and  robust  method — and  the 
one  employed  in  our  lab — traps  the  atoms  directly  from  vapour 
using  a  large  copper  U-shaped  block  carrying  30  A  and  located 
underneath  the  atom  chip  [22].  The  atoms  in  this  macro-U- 
MOT  are  subsequently  transferred  to  smaller,  magnetostatic 
U-traps  on  the  atom  chip  surface. 

The  proximity  of  the  atoms  to  the  chip’s  surface  naturally 
facilitates  the  integration  of  magnetically  trapped  atoms  with 
on-chip cavities such as microdiscs or photonic crystals.
Two-dimensional photonic bandgap (PBG) cavities, perforated
semiconductor structures that confine light through the dual
action of distributed Bragg reflection and internal reflection,
are in many respects ideal for cavity QED [23]. Their small
mode volume and modest quality factors open the possibility
of achieving extremely small strong coupling parameters:
$[m_0, N_0] = [10^{-8}, 10^{-4}]$. With regard to atom-cavity
coupling,  these  cavities  have  the  advantage  over  microdiscs 
and  microspheres  in  that  the  mode’s  field  maximum  can  be 
located  in  the  holes  rather  than  inside  the  dielectric  material. 
As  an  inherently  stable,  monolithic  structure,  PBG  cavities 
will  not  need  the  support  structure  for  active  stabilization  that 
Fabry-Perot  cavities  require.  Moreover,  their  compactness 
and  compatibility  with  fibre  optics-based  input  and  output 
couplers  [24,  25]  allow  one  to  envision  an  array  of  PBG 
cavities,  atom  microtraps,  input/output  couplers,  and  other 
processing devices all on the same integrated chip.

We  plan  to  use  PBG  cavities  of  the  graded  defect  design 
discussed  in  [26],  which  consist  of  a  rectangular  lattice 
of  airholes  in  an  optically  thin,  high  refractive  index  slab 
waveguide. The holes gradually decrease in diameter towards
the cavity centre, and such cavities fabricated in silicon
membranes (see figure 1(a)) and operating at λ ≈ 1.6 μm have
measured Qs as high as 40 000 with modal volumes of
$V_{\mathrm{eff}} \approx 0.9$ cubic wavelengths $(\lambda/n)^3$ [27].
In future experiments with single atoms, cavities
will  be  etched  in  a  thin  AlGaAs  membrane,  chosen  for  its 
transparency  at  the  wavelength  of  caesium’s  D2  transition, 
852  nm.  For  the  Q  and  Veff  values  mentioned  above, 
the atom-cavity coupling can be as high as $g_0 = 2\pi \times 17$ GHz
while the decoherence rates are
$[\kappa, \gamma_\perp]/2\pi = [4.4\ \mathrm{GHz}, 2.6\ \mathrm{MHz}]$.
This gives strong coupling parameters of
$[m_0, N_0] = [1.2 \times 10^{-8}, 8.4 \times 10^{-5}]$, which are much
smaller than those achieved in recent experiments using Fabry-Perot
cavities, $[m_0, N_0] = [2.8 \times 10^{-4}, 6.1 \times 10^{-3}]$ [8]. The
central  hole  diameter  is  ~  100  nm  and  the  membrane  thickness 
is  ~170  nm.  An  atom  in  this  small  hole  will  be  affected  by 
the  Casimir-Polder  potential  [29],  and  cavity  QED  dynamics 
in  the  presence  of  this  force  will  need  to  be  investigated. 
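
The quoted rates can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the common conventions $\kappa = \omega/2Q$ for the cavity field decay rate, $m_0 = \gamma_\perp^2/2g_0^2$ for the critical photon number and $N_0 = 2\kappa\gamma_\perp/g_0^2$ for the critical atom number, none of which is spelled out in this section, and the function names are hypothetical.

```python
C_LIGHT = 299_792_458.0  # speed of light in m/s


def cavity_field_decay_hz(wavelength_m, q_factor):
    """Return kappa/2pi in Hz, ASSUMING the convention kappa = omega / (2 Q)."""
    optical_frequency_hz = C_LIGHT / wavelength_m
    return optical_frequency_hz / (2.0 * q_factor)


def critical_numbers(g0, kappa, gamma_perp):
    """Return (m0, N0) under the ASSUMED definitions
    m0 = gamma_perp^2 / (2 g0^2) and N0 = 2 kappa gamma_perp / g0^2.
    All three rates must share the same units; the 2*pi factors cancel."""
    m0 = gamma_perp**2 / (2.0 * g0**2)
    n0 = 2.0 * kappa * gamma_perp / g0**2
    return m0, n0


# Values quoted in the text: Q = 40000 at 852 nm, [g0, kappa, gamma]/2pi
kappa_hz = cavity_field_decay_hz(852e-9, 40_000)
m0, n0 = critical_numbers(17e9, 4.4e9, 2.6e6)
print(f"kappa/2pi = {kappa_hz / 1e9:.2f} GHz, m0 = {m0:.2e}, N0 = {n0:.2e}")
```

Under these assumed conventions the results land close to the values quoted above; the small difference in $N_0$ presumably reflects rounding of the input rates.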

The  cavity  is  coupled  to  a  photonic  crystal  waveguide, 
which  in  turn  is  evanescently  coupled  to  an  optical  fibre  taper. 
By  positioning  the  fibre  taper — whose  minimum  diameter  is  of 

Figure 1. (a) A scanning electron microscope (SEM) image of a
photonic bandgap cavity and waveguide (WG) fabricated in silicon.
(b) A schematic diagram of the fibre taper coupler. (c) The finite
difference time-domain calculated electric field amplitude of the
cavity mode taken in the centre of the membrane. (d) An SEM
image of an optical fibre taper aligned above a photonic crystal
waveguide.

the  order  of  a  micron — in  the  near  field  of  the  photonic  crystal 
waveguide  and  aligned  along  its  axis  (see  figures  1(b)  and  (d)), 
highly  efficient  (greater  than  98%)  fibre  coupling  into  and  out 
of  the  photonic  crystal  waveguide  can  be  achieved  [25].  Light 
coupled  into  the  photonic  crystal  waveguide  is  reflected  by  the 
PBG  cavity  and  recollected  in  the  backward  propagating  fibre 
taper  mode  [28].  Figure  1(a)  shows  the  boundary  between 
the  waveguide  and  the  cavity:  the  top  four  rows  of  holes 
are  the  end  of  the  waveguide,  which  is  formed  in  a  similar 
fashion  to  the  cavity,  except  that  the  holes  are  graded  in 
only  the  lateral  dimension.  This  design  maximizes  the  mode 
matching  between  the  waveguide  and  the  cavity  modes  [24]. 
The  waveguide  may  be  bent  to  allow  access  to  the  cavity 
unencumbered  by  the  fibre. 

3.  Experimental  proposal 

As  a  first-generation  experiment,  we  would  like  to  bring  a 
trapped  cloud  of  cold  neutral  atoms — caesium  in  our  case — 
into  contact  with  a  PBG  cavity,  simultaneously  demonstrating 
the  integration  of  a  cavity  with  an  atom  chip  and  the  strong 
coupling  of  a  neutral  atom  to  a  PBG  cavity.  Figure  2  shows  a 
rough  schematic  diagram  of  the  atom-cavity  chip  experiment. 
The  chip  is  divided  into  two  regions,  one  for  laser  trapping 
and  cooling  of  the  atoms  in  a  U-MOT  and  U-traps,  and  the 
other  for  the  PBG  cavity  and  its  tapered  fibre  and  photonic 
crystal  waveguide  couplers.  The  two  regions  are  connected 
by  a  microwire  waveguide  to  transport  the  atoms  from  the 
laser  cooling  region  to  the  PBG  cavity.  These  regions  must 
be  separated  by  1-2  cm  in  order  for  the  bulk  of  the  cavity 
to not obstruct the 1 cm² U-MOT beams. Furthermore, to
position  the  cavity  outside  of  the  horizontal  U-MOT  beam  that 
grazes  the  substrate  surface,  the  waveguide  must  convey  the 
atoms  around  a  90°  turn.  This  will  be  accomplished  either  by 
using  a  two-wire  guide  (chosen  for  depiction  in  figure  2  for 


Figure  2.  A  schematic  diagram  of  the  atom-cavity  chip  experiment. 
The  microwire  U-traps  and  atomic  waveguides  are  shown  as  yellow 
wires  (light  grey),  and  the  light  red  area  (lightest  grey)  centred  about 
the  U-traps  represents  the  footprint  of  the  reflected  trapping  lasers. 
The  atoms  are  the  red  (dark  grey)  cylinders,  pictured  as  they  are 
transported  towards  the  PBG  cavity  which  is  shown  as  the  black 
chip  glued  to  the  substrate’s  surface.  The  grey  line  is  the  optical 
fibre  and  fibre  taper. 

simplicity  of  illustration)  [30]  or  by  rotating  the  atoms  in  a  P- 
trap — similar  to  a  U-trap  but  with  the  base  wire  bent  allowing  a 
rotating  bias  field  to  change  the  orientation  of  the  atoms  [31] — 
before  transferring  the  atoms  into  a  Z-trap  waveguide  aligned 
perpendicular  to  the  initial  U-trap.  This  latter  design  has  the 
advantage  that  the  simple  addition  of  a  few  coplanar  wires  can 
serve  to  loosely  confine  the  atoms  once  they  reach  the  PBG 
cavity. 

In the PBG region, the atoms are suspended a few hundred
microns above the surface of the waveguide’s microwires, and
this allows enough room for the ~200 μm thin PBG substrate
to be placed in the gap between the atoms and the microwires.
Once  the  atoms  are  transported  to  a  position  above  the  PBG 
cavity,  the  current  and  bias  field  of  the  guide  are  adjusted 
to  lower  the  cold  atom  cloud  into  the  surface  of  the  PBG 
cavity. A thermoelectric cooler (TEC) is located near the PBG
cavity to counteract heating due to the microwire waveguide,
maintaining a specific cavity detuning from the driving laser
frequency and the atomic resonance. We estimate a cavity
tunability of 20 GHz °C⁻¹, and with TEC control of 10⁻² °C,
we should be able to achieve a 200 MHz tuning resolution. This
resolution  is  sufficient,  as  we  expect  to  operate  with  detunings 
of  the  order  of  1-10  GHz. 

The  delivery  scheme  described  above  provides  a  non- 
deterministic  source  of  weakly  trapped  atoms  to  the  cavity 
mode.  The  field  of  the  cavity  mode  is  concentrated  in  the 
central  ~10  holes  (see  figure  1(c)).  The  field  maximum  is 
offset  by  45  nm  from  the  axis  of  each  of  the  two  centre  holes. 
We expect to transport 10⁵ atoms in a cigar-shaped cloud of
density 10¹¹ cm⁻³. The cross-sectional area of this cloud
parallel to the chip is larger than the 0.4 μm² area of the PBG
cavity  that  is  occupied  by  the  field,  and  we  estimate  that  there  is 
a  ~  10%  probability  of  an  atom  encountering  one  of  the  central 
10  holes  per  cloud  interaction.  With  an  experimental  repetition 
once  every  ~5  s — limited  by  the  U-MOT  replenishing  time — 
we  foresee  the  accumulation  of  a  significant  number  of  events 
in  a  reasonable  amount  of  time,  and  as  discussed  in  section  4 
below,  we  expect  to  detect  strong  signals  during  single-atom 
transits  through  the  PBG  cavity’s  central  holes.  If  we  assume 
a caesium cloud temperature of 10 μK, then a caesium atom
whose velocity is parallel to the axis, z, of a central hole will
interact with the mode for a time duration of ~10 μs.
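
The ~10 μs interaction time follows from the thermal velocity and the axial extent of the mode. A minimal sketch of that estimate, assuming the one-dimensional rms thermal velocity $\sqrt{k_B T/M_{\mathrm{Cs}}}$ is the relevant speed and taking the ~225 nm Gaussian width quoted in section 4 as the interaction length (both are illustrative choices, and the helper names are hypothetical):

```python
import math

K_B = 1.380649e-23                   # Boltzmann constant, J/K
ATOMIC_MASS_UNIT = 1.66053907e-27    # kg
M_CS = 132.905 * ATOMIC_MASS_UNIT    # mass of a caesium-133 atom, kg


def thermal_velocity(temperature_k, mass_kg=M_CS):
    """One-dimensional rms thermal velocity sqrt(k_B T / M) in m/s."""
    return math.sqrt(K_B * temperature_k / mass_kg)


def transit_time(length_m, velocity_m_s):
    """Time to cross a region of the given length at constant velocity."""
    return length_m / velocity_m_s


v = thermal_velocity(10e-6)          # 10 uK cloud
t = transit_time(225e-9, v)          # ~225 nm mode width (assumed length scale)
print(f"v = {100 * v:.2f} cm/s, transit time = {1e6 * t:.1f} us")
```

With these inputs the velocity comes out near the 2.5 cm s⁻¹ used in the transit simulations of section 4.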

Figure 3. (a) The transmission of the cavity as a function of drive
strength (measured in intracavity photon numbers for a resonant
and empty cavity), calculated from equation (2) (black line) and
from equation (3) (points). The empty cavity transmission is shown
as a dashed red line. (b) The difference in output, during the
expected 10 μs of atom-cavity interaction, between a cavity with
one atom and an empty cavity. The detunings are
$[\Delta, \Theta]/2\pi = [10, 10]$ GHz.


4.  Single-atom  detectability 

To  investigate  the  PBG  cavity’s  response  to  a  strongly  coupled 
atom  falling  through  a  central  hole,  we  solve — using  a  two- 
level  atom — the  semiclassical  optical  bistability  equation  for 
a  qualitative  understanding  of  the  interaction  and  the  quantum 
master  equation  to  obtain  a  more  quantitative  description. 
Although  neither  of  these  treatments  fully  encompasses  the 
complexity  of  the  system,  we  presume  that  they  are  sufficient 
for demonstrating the feasibility of the device for single-atom
detection. These calculations ignore the fact that $g_0$ and the
detunings  are  of  the  same  order  or  much  larger  than  both  the 
hyperfine  ground-state  and  excited-state  splittings,  which  for 
caesium  are  9.2  GHz  and  151-251  MHz,  respectively.  In  other 
words,  the  atom-photon  coupling  is  much  stronger  than  the 
coupling  between  the  electron  and  nuclear  spins.  This  is  an 
unusual  situation  and  requires  a  full  quantum  calculation  of  the 
atom-PBG  cavity  interaction  that  includes  the  full  caesium  D2 
manifold  of  states.  We  are  in  the  process  of  performing  this 
computation. 

The  optical  bistability  equation  is  a  semiclassical 
description  of  the  transmission  of  a  cavity  containing 
atoms  [32], 

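$$y = \frac{x}{\left\{\left[1 + \dfrac{2}{N_0\left(1 + (\Delta/\gamma_\perp)^2 + y^2\right)}\right]^2 + \left[\dfrac{\Theta}{\kappa} - \dfrac{2\Delta}{\gamma_\perp N_0\left(1 + (\Delta/\gamma_\perp)^2 + y^2\right)}\right]^2\right\}^{1/2}}. \qquad (2)$$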

In the above equation, $x$ is the input field, $E/\sqrt{m_0}$, where
$E$ is the amplitude of the driving field; $y$ is the output field,
$\alpha/\sqrt{m_0}$, where $\alpha$ is the intracavity coherent state
amplitude; $\Delta$ is the atom-laser detuning; and $\Theta$ is the
cavity-laser detuning. The black curves in figures 3(a) and 4(a) show
the solution to equation (2) for $[\Delta, \Theta]/2\pi = [10, 10]$ GHz
and $[\Delta, \Theta]/2\pi = [10, 0]$ GHz, respectively. These two sets
of detunings are

Figure 4. (a) and (b) are the same as figures 3(a) and (b) except with
detunings of $[\Delta, \Theta]/2\pi = [10, 0]$ GHz.


chosen  to  highlight  different  atom-cavity  response  regimes 
where  we  expect  to  be  able  to  detect  single  atoms.  In  both 
plots, $[g_0, \kappa, \gamma_\perp]/2\pi = [17\ \mathrm{GHz}, 4.4\ \mathrm{GHz}, 2.6\ \mathrm{MHz}]$. The
horizontal  dashed  lines  are  the  empty  cavity  transmissions. 
Both  semiclassical  solutions  show  signs  of  bistability  in  the 
region  around  one  intracavity  photon.  Within  the  context 
of  the  approximation  of  equation  (2),  figure  3(a)  shows  that 
for  10  GHz  detunings  of  the  atom  and  cavity  from  the  probe 
laser,  an  excess  of  photons  transmitted  through  the  cavity — an 
‘up-transit’ — can  be  detected  for  a  drive  of  a  few  intracavity 
photons.  Figure  4(a)  shows  that  with  the  cavity  on  resonance 
with  the  laser  and  the  atom  10  GHz  detuned,  a  deficit  of 
photons — a  ‘down-transit’ — can  be  detected  for  similar  drive 
strengths  of  a  few  intracavity  photons. 
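
A semiclassical input-output curve of this kind is easiest to explore numerically by computing the input $x$ as a single-valued function of the output $y$ and looking for regions where $x(y)$ turns over. The sketch below assumes the standard form of the bistability state equation, $x = y\,[(1 + 2/(N_0 s))^2 + (\Theta/\kappa - 2\Delta/(\gamma_\perp N_0 s))^2]^{1/2}$ with $s = 1 + (\Delta/\gamma_\perp)^2 + y^2$; the function names and the dimensionless arguments `delta` ($\Delta/\gamma_\perp$) and `phi` ($\Theta/\kappa$) are hypothetical.

```python
import numpy as np


def input_field(y, delta, phi, n0):
    """Input amplitude x that yields output amplitude y (assumed state equation).

    delta = atom-laser detuning / gamma_perp, phi = cavity-laser detuning / kappa,
    n0 = critical atom number (n0 = np.inf describes the empty cavity).
    """
    y = np.asarray(y, dtype=float)
    saturation = n0 * (1.0 + delta**2 + y**2)
    absorptive = 1.0 + 2.0 / saturation          # atom-induced extra loss
    dispersive = phi - 2.0 * delta / saturation  # atom-shifted cavity detuning
    return y * np.hypot(absorptive, dispersive)


def is_bistable(y_grid, delta, phi, n0):
    """True if x(y) is non-monotonic on the grid, i.e. some x has several y."""
    x = input_field(y_grid, delta, phi, n0)
    return bool(np.any(np.diff(x) < 0.0))


# Example call with the parameters quoted in the text (Delta = Theta = 2pi x 10 GHz)
y_grid = np.logspace(-2, 6, 4001)
print(is_bistable(y_grid, delta=10e9 / 2.6e6, phi=10.0 / 4.4, n0=8.4e-5))
```

Scanning `y_grid` rather than `x` avoids root finding, since every output amplitude corresponds to exactly one input amplitude even where the inverse relation is multivalued.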

The  solutions  to  the  unconditional  master  equation 
paint  a  more  accurate  picture  of  the  atom-cavity  system. 
Under  the  two-level  atom,  electric  dipole,  and  rotating-wave 
approximations,  the  equation  for  the  density  matrix,  p,  of  the 
joint  state  of  the  atom  and  cavity  is  as  follows: 

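$$\dot{\rho} = -\frac{i}{\hbar}[H_0, \rho] + \gamma_\perp(2\sigma\rho\sigma^\dagger - \sigma^\dagger\sigma\rho - \rho\sigma^\dagger\sigma) + \kappa(2a\rho a^\dagger - a^\dagger a\rho - \rho a^\dagger a), \qquad (3)$$

$$H_0 = \hbar\Delta\sigma^\dagger\sigma + \hbar\Theta a^\dagger a + i\hbar E(a^\dagger - a) + H_{\mathrm{int}}, \qquad (4)$$

$$H_{\mathrm{int}} = i\hbar g_0\psi(\mathbf{r})\left[a^\dagger\sigma - \sigma^\dagger a\right]. \qquad (5)$$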

In this equation, $\sigma$ is the atomic lowering operator and $a$ is the
cavity field annihilation operator. Along the axis of the central
cavity hole, the mode function, $\psi(z)$, closely approximates a
Gaussian of width ~225 nm, centred about the mid-point of
the ~170 nm thick cavity membrane. The steady-state density
operator, $\rho_{ss}$, as a function of various drive strengths, coupling
strengths, and detunings, is found by solving equation (3) with
$\dot{\rho}_{ss} = 0$. Operator expectations are
$\langle O \rangle = \mathrm{Tr}[\rho_{ss} O]$. The expected cavity output
in photons per detector integration time, $\Delta t$, is

$$N = \kappa\Delta t\langle a^\dagger a\rangle, \qquad (6)$$

with noise fluctuations of variance

$$(\Delta N)^2 = \kappa\Delta t\left(\langle a^\dagger a a^\dagger a\rangle - \langle a^\dagger a\rangle^2\right). \qquad (7)$$

Figure 5. Simulated photon counts due to atom transits through the
axis of the cavity’s central hole. Blue dots (left axis) are the photon
counts, and the green, dashed curve (right axis) is the Gaussian
variation of $g(t)/g_0 = \psi(z(t))$ experienced by the atom during its
transit. Calculations are for detunings of
(a) $[\Delta, \Theta]/2\pi = [10, 10]$ GHz and (b) $[\Delta, \Theta]/2\pi = [10, 0]$ GHz.

Note  that  instead  of  photon  counting,  heterodyne  detection 
may be used, in which case expectations of $a$ rather than $a^\dagger a$
are  the  relevant  quantities.  The  results  presented  in  figures  3-5 
are  qualitatively  similar  for  the  two  cases. 
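
The steady-state calculation described above can be sketched with plain NumPy by writing the master equation as a linear map on the vectorised density matrix and imposing unit trace. This is only an illustration under stated assumptions: $\hbar = 1$, all rates in the same angular-frequency units, a small truncated Fock basis, the interaction term taken as $i g (a^\dagger\sigma - \sigma^\dagger a)$, and hypothetical function names. It is not the code used for the figures.

```python
import numpy as np


def steady_state(n_fock, g, kappa, gamma, delta, theta, drive):
    """Steady state of a driven two-level atom plus cavity mode (hbar = 1).

    Returns (rho, a) with rho the steady-state density matrix and a the
    truncated annihilation operator on the joint cavity (x) atom space.
    """
    a = np.kron(np.diag(np.sqrt(np.arange(1, n_fock)), k=1), np.eye(2))
    sm = np.kron(np.eye(n_fock), np.array([[0.0, 1.0], [0.0, 0.0]]))  # |g><e|
    ad, sp = a.conj().T, sm.conj().T
    ham = (delta * sp @ sm + theta * ad @ a
           + 1j * drive * (ad - a) + 1j * g * (ad @ sm - sp @ a))

    dim = 2 * n_fock
    eye = np.eye(dim)
    # Row-major vectorisation: vec(A rho B) = kron(A, B.T) vec(rho)
    liouv = -1j * (np.kron(ham, eye) - np.kron(eye, ham.T))
    for rate, c in ((gamma, sm), (kappa, a)):
        cdc = c.conj().T @ c
        liouv += rate * (2.0 * np.kron(c, c.conj())
                         - np.kron(cdc, eye) - np.kron(eye, cdc.T))

    # Replace one (redundant) equation by the normalisation Tr[rho] = 1
    liouv[0, :] = eye.reshape(-1)
    rhs = np.zeros(dim * dim, dtype=complex)
    rhs[0] = 1.0
    rho = np.linalg.solve(liouv, rhs).reshape(dim, dim)
    return rho, a


def photon_statistics(rho, a, kappa, dt):
    """Mean count and variance per integration time, following Eqs. (6), (7)."""
    n_op = a.conj().T @ a
    n1 = np.trace(rho @ n_op).real
    n2 = np.trace(rho @ n_op @ n_op).real
    return kappa * dt * n1, kappa * dt * (n2 - n1**2)
```

A useful sanity check is the empty cavity ($g = 0$), where the intracavity photon number should equal $E^2/(\kappa^2 + \Theta^2)$. The Liouvillian grows as the fourth power of the basis size, which is one reason a truncated Fock basis limits the accessible drive strengths.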

The points in figures 3(a) and 4(a) represent $N$ calculated
from solutions to equation (3) for various drive strengths and
for the same $g_0$, $\kappa$, and detunings as used in the semiclassical
calculation. These points do not extend past a drive strength
of 80 intracavity photons because our limited computational
resources necessitate the use of a truncated Fock basis. The
cavity transmission as a function of drive qualitatively follows
the semiclassical solutions; however, there is no longer a sign
of bistability, which is to be expected since the unconditional
master equation is linear in the state variables, $\rho$, and we plot
only $\langle N \rangle$. We also see that for a drive of 1-10 photons,
up-transits occur for a probe laser detuned 10 GHz from both the
atom and the cavity (figure 3(a)), and down-transits for a probe
laser and cavity 10 GHz detuned from the atom (figure 4(a)).
Figures 3(b) and 4(b) show the change in the output of the
cavity, computed from the master equation solutions, during the
10 μs in which we expect the atom to interact with the cavity mode.
The black dots show the up-transits and blue triangles the
down-transits. For drive powers of ~1 nW (1-10 intracavity
photons), photon excesses of 10⁵-10⁶ can be seen in the
up-transits of the $[\Delta, \Theta]/2\pi = [10, 10]$ GHz case (figure 3(b)),
and photon deficits of 10⁵-10⁶ in the down-transits of the
$[\Delta, \Theta]/2\pi = [10, 0]$ GHz case (figure 4(b)). For both sets
of detunings, we see that for drive strengths less (greater) than
one intracavity photon, there are super- (sub-) Poissonian noise
fluctuations of the photon number. Plots of the Q-function [33]
in the sub-Poissonian regions show excess spread (and even
a bifurcation in the $[\Delta, \Theta]/2\pi = [10, 0]$ GHz case) of the
phase quadrature corresponding to photon number squeezing.

Simulated photon counts during atom transits are shown
in figures 5(a) and (b). We assume that the atom moves with
constant velocity, v = 2.5 cm s⁻¹, through the axis of the
cavity mode $\psi(z)$, making a full transit of the Gaussian waist
in 10 μs. In both plots the drive strength is two intracavity
photons. As the atom traverses the cavity, the coupling
$g(t) = g_0\psi(vt)$ also varies as a Gaussian, which modulates
the output photon flux. The mean photon count, $N$, and
variance, $(\Delta N)^2$, are found by solving for $\rho_{ss}$ for each $g(t)$
in time steps of $\Delta t = 1$ μs, chosen to simulate a finite
bandwidth photodetector. Each point includes additional shot
noise selected randomly from a normal distribution of standard
deviation $\Delta N$. Figures 5(a) and (b) show that even with shot
noise, up- and down-transits of single atoms through the axis of
the central PBG cavity hole are clearly detectable. Moreover,
it seems possible to detect atom transits that only experience
20%-30% of $g_0$. During an experiment, we expect to detect a
low background of signals from marginally coupled atoms
(such as those grazing the field extending from the surface
of the PBG membrane or slipping into holes away from the
central region), punctuated by sharp spikes representing atoms
fully coupled to the field inside the central holes. It should be
noted that the mean photon numbers and noise in figures 5(a)
and (b) are not derived from a quantum trajectory calculated
from the conditional master equation [34], but are simply
calculated using $\rho_{ss}$ from the unconditional equation (3). This
is acceptable given the inherent limitations of the model as
mentioned at the beginning of this section.

The atom will experience a force,

$$\langle f \rangle = -i\hbar\nabla g(\mathbf{r})\langle a^\dagger\sigma - a\sigma^\dagger\rangle, \qquad (8)$$

as it encounters the cavity mode. The maximum acceleration
on an atom dragged through the cavity mode at velocity
2.5 cm s⁻¹, for either of the sets of detunings used above, is
$|\langle f_{\max}\rangle|/M_{\mathrm{Cs}} = 2.4 \times 10^8$ m s⁻², corresponding to a change
in velocity of

$$\Delta v \approx \sqrt{\frac{|\langle f_{\max}\rangle|\,\Delta z}{M_{\mathrm{Cs}}}} \approx 5\ \mathrm{m\,s^{-1}} \qquad (9)$$

over half the length of the cavity mode, $\Delta z = 100$ nm. In
the above equations, $M_{\mathrm{Cs}}$ is the mass of a caesium atom. This
agrees to within a factor of two with a simple estimate using

$$\hbar g_0 = 0.5\,M_{\mathrm{Cs}}(\Delta v)^2, \qquad (10)$$

which yields $\Delta v = 10$ m s⁻¹. Fabry-Perot experiments
have detected effects of the cavity interaction on the atomic
motion [35]. The simple estimate using equation (10) gives
a smaller value of $\Delta v \approx 0.7$ m s⁻¹ for the Fabry-Perot
experiments, implying that the motion of the atom traversing
the mode of the PBG cavity will also be significantly affected.
A  more  detailed  calculation  [36]  of  the  force  and  momentum 
diffusion  using  a  master  equation  beyond  the  two-level  atom 
approximation  is  necessary  to  make  predictions  about  the 
behaviour  of  an  atom  in  an  attractive,  red-detuned  cavity  mode 
or  in  a  repulsive,  blue-detuned  mode.  The  close  proximity 
of the atom to the sides of the PBG cavity’s holes will
surely  affect  the  system’s  dynamics  due  to  the  Casimir-Polder 
potential  [29],  and  this  will  need  to  be  addressed  in  more 
detailed  simulations. 
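
The velocity-change estimates in this section are simple enough to reproduce directly. The sketch below evaluates the energy-balance estimate $\hbar g_0 = \tfrac{1}{2}M_{\mathrm{Cs}}(\Delta v)^2$ and, as an assumed reading of the acceleration-based estimate, $\Delta v \approx \sqrt{a\,\Delta z}$ with $a = 2.4 \times 10^8$ m s⁻² and $\Delta z = 100$ nm. The function names are hypothetical.

```python
import math

HBAR = 1.054571817e-34               # reduced Planck constant, J s
M_CS = 132.905 * 1.66053907e-27      # mass of a caesium-133 atom, kg


def dv_from_coupling(g0_hz):
    """Velocity change from hbar * g0 = 0.5 * M * dv^2, with g0 = 2*pi*g0_hz."""
    return math.sqrt(2.0 * HBAR * 2.0 * math.pi * g0_hz / M_CS)


def dv_from_acceleration(accel_m_s2, dz_m):
    """Order-of-magnitude velocity change sqrt(a * dz) (ASSUMED form)."""
    return math.sqrt(accel_m_s2 * dz_m)


dv_energy = dv_from_coupling(17e9)                 # PBG cavity, g0/2pi = 17 GHz
dv_force = dv_from_acceleration(2.4e8, 100e-9)     # a and dz quoted in the text
print(f"energy estimate: {dv_energy:.1f} m/s, force estimate: {dv_force:.1f} m/s")
```

Both numbers are hundreds of times larger than the 2.5 cm s⁻¹ approach velocity, which is why the atomic motion cannot be treated as unperturbed in a more detailed model.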


5.  Conclusion 

The  integration  of  atom  trapping  and  cooling  with  photonic 
bandgap  cavities  on  a  chip  introduces  a  robust  and  scalable
cavity  QED  system  to  the  toolbox  of  nanotechnology.  A  device
allowing  cooled  neutral  atoms  to  be  delivered  via  a  magnetic 
microtrap  and  waveguide  to  the  mode  of  a  graded  lattice  PBG 
cavity  is  feasible  given  present  technology.  Calculations  using 
the semiclassical optical bistability equation and the unconditional
master equation indicate that it will be possible to detect
single  strongly  coupled  atoms  with  this  atom-cavity  chip. 

We characterize the long-time projective behavior of the stochastic master equation describing a continuous,
collective  spin  measurement  of  an  atomic  ensemble  both  analytically  and  numerically.  By  adding  state-based 
feedback,  we  show  that  it  is  possible  to  prepare  highly  entangled  Dicke  states  deterministically. 

I. INTRODUCTION

It has long been recognized that measurement can be used
as a nondeterministic means of preparing quantum states that
are otherwise difficult to obtain. With projective measurements
that are truly discrete in time, the only way an experimentalist
can direct the outcome of the measurement is by
preparing the initial state to make the desired result most
probable. Generally, it is impossible to make this probability
equal to 1, as the measurement will, with some nonzero
probability, result in other undesirable states. If the experimentalist
can afford to be patient, then accepting a low efficiency
is not a problem, but this is not always the case. In
recent years, a theory of continuous quantum measurement
has been developed that fundamentally changes the nature of
state preparation via measurement [1]. When a measurement
and the corresponding acquisition of information are sufficiently
gradual, there exists a window of opportunity for the
experimentalist to affect the outcome of the measurement by
using feedback control [2]. In this paper, we demonstrate that
it is possible to deterministically prepare highly entangled
Dicke states [3,4] of an atomic spin ensemble by adding
state-based feedback to a continuous projective measurement.

It has been shown that models of quantum state reduction
exist that exhibit the usual rules of projective measurement
except the state reduction occurs in a continuous, stochastic
manner [5]. These models are not without physical relevance
as they are the same as those derived to describe the conditional
evolution of atomic spin states under continuous quantum
nondemolition (QND) measurement [6-11]. By measuring
the collective angular momentum operator $J_z$ of an
initially polarized coherent spin state via the phase shift of an
off-resonant probe beam, conditional spin-squeezed states
have been experimentally produced [12,13]. These states are
of considerable interest for applications in quantum information
processing and precision metrology [14,15].

In these models, the reduction in variance that initially
leads to conditional spin squeezing is the precursor of the
projection onto a random eigenstate of $J_z$ at longer times.
Figure 1 demonstrates the projection process for a single
numerically simulated measurement trajectory.¹ Like
spin-squeezed states, these Dicke states offer potential for quantum
information applications because of their unique entanglement
properties [16]. Although the experimental
difficulties in obtaining these states via QND measurement
or other experimental methods [17-19] are considerable, the
details of the continuous projective process that leads to
them are of fundamental interest.

Whenever the measurement is sufficiently slow, an experimentalist
may steer the result by feeding back the measurement
results in real time to a Hamiltonian parameter.
Indeed, the measurement process, as a state preparation process,
can be made deterministic with the use of feedback
control. Recently, we have experimentally demonstrated this
concept by modulating a compensation magnetic field with
the measurement record to deterministically prepare
spin-squeezed states [12] as proposed in [6,7]. This is just one
example of the growing confluence of quantum measurement
with classical estimation and control theory [20,21]. Other
applications of quantum feedback include parameter estimation,
metrology, and quantum error correction [22-26].

In this paper, we focus on the long-time limit of the QND
measurement and feedback process. Just as spin-squeezed
states can be deterministically prepared at short times, we
numerically demonstrate that individual Dicke states can be
deterministically prepared at long times with the use of
state-based feedback [27]. While our proposed feedback laws are
nonoptimal, they demonstrate the adequacy of intuitive controllers
with finite gain for directing the diffusion of the
quantum state towards desirable regions of Hilbert space
with unity efficiency. This is in contrast to other proposed
schemes using measurements to prepare Dicke states
probabilistically [17,18]. A more systematic approach utilizing
stochastic notions of stability and convergence in the continuous
measurement and control of a single spin is presented in
Ref. [28].

This paper is organized as follows. In Sec. II, we introduce
the stochastic master equation which represents the rule
for updating the system state in real time via the incoming
measurement record. Here we discuss the various representations
of the dynamics in both the short- and long-time limits.
Section III describes the probabilistic preparation of
Dicke states via observation alone. The numerical demonstration
of the open-loop projection process reveals statistical
features that clarify the details of the projection. Feedback is
added to the procedure in Sec. IV, where we show that
state-based control allows one to prepare the same Dicke state
deterministically on every measurement. Finally, in Sec. V,
we discuss future directions and imminent challenges regarding
quantum-state preparation via measurement and control.

¹All numerical simulations shown were performed using the
parameters {$N = 10$, $M = 1$, $T = 5$, $dt = 0.001$}. The stochastic integrator
used the norm-preserving, nonlinear SSE of Eq. (5) and a weak
second-order derivative-free predictor-corrector structure as can be
found in [39].

FIG. 1. The results of a single numerical simulation¹ of the stochastic Schrödinger equation (SSE), Eq. (5), with $M = 1$, $\eta = 1$, and $N = 10$
spins initially aligned along the x axis. (A) In a quantization axis perpendicular to the polarization, the level distribution of a coherent
spin state (CSS) is Gaussian for large $N$. Under conditional measurements the state evolves at short times into a spin-squeezed state and,
eventually, into a random eigenstate of $J_z$. (B) A map of the state’s angular distribution on the Bloch sphere in spherical coordinates. The
uncertainty in the transverse direction to the measurement axis grows until there is no information about the perpendicular component
direction. (C) At long times, the population is at most divided among two levels that compete to be the final winner, which in this case
appears to be $m = 1$. (D) All of the state information is obtained by properly filtering the noisy photocurrent.

II. REPRESENTATIONS OF THE CONDITIONAL
EVOLUTION

The physical system we will consider is an ensemble of $N$
spin-1/2 particles contained within a cavity and interacting
with a far off-resonant single-mode field. We will denote the
conditional state of the spin ensemble as $\rho(t)$ and the homodyne
measurement record of the output as $y(t)$. The stochastic
master equation (SME) describing the conditional evolution
is [6,7]

$$d\rho(t) = -i[H(t), \rho(t)]\,dt + \mathcal{D}[\sqrt{M}J_z]\rho(t)\,dt + \sqrt{\eta}\,\mathcal{H}[\sqrt{M}J_z]\rho(t)\left(2\sqrt{M\eta}\,[y(t)\,dt - \langle J_z\rangle\,dt]\right), \qquad (1)$$

where $H(t) = \gamma J_y b(t)$ is the control Hamiltonian that we will
allow ourselves [without feedback $b(t) = 0$], $\gamma$ is the gyromagnetic
ratio, $M$ is the probe-parameter-dependent measurement
rate, and

$$\mathcal{D}[c]\rho = c\rho c^\dagger - (c^\dagger c\rho + \rho c^\dagger c)/2, \qquad (2)$$

$$\mathcal{H}[c]\rho = c\rho + \rho c^\dagger - \mathrm{Tr}[(c + c^\dagger)\rho]\rho. \qquad (3)$$

The (scaled) difference photocurrent is represented as

$$y(t)\,dt = \langle J_z\rangle(t)\,dt + dW(t)/2\sqrt{M\eta}. \qquad (4)$$

The stochastic quantity $dW(t) = 2\sqrt{M\eta}\,[y(t)\,dt - \langle J_z\rangle(t)\,dt]$ is a
Wiener increment and $dW(t)/dt$ is a Gaussian white noise
that can be identified with the shot noise of the homodyne
local oscillator. [See [29,30] for an introduction to stochastic
differential equations (SDE’s).] The sensitivity of the
photodetection per $\sqrt{\mathrm{Hz}}$ is represented by $1/2\sqrt{M\eta}$, where the
quantity $\eta \in [0,1]$ represents the quantum efficiency of the
detection. If $\eta = 0$, we are essentially ignoring the measurement
result and the conditional SME becomes a deterministic
unconditional master equation. If $\eta = 1$, the detectors are
maximally efficient. In this latter case, the conditioned state
will remain pure for the entire measurement, thus we can use
a state vector description, and the SME can be replaced with
a SSE

$$d|\psi(t)\rangle = \left\{-iH(t) - M[J_z - \langle J_z\rangle(t)]^2/2\right\}|\psi(t)\rangle\,dt + \sqrt{M}\,[J_z - \langle J_z\rangle(t)]\,|\psi(t)\rangle\,dW(t). \qquad (5)$$

This SSE was considered in [5] where the motivation was
more abstract and less concerned with the experimental filtering
perspective presented here. We emphasize that the
SME or SSE is physically derived and is an explicit function
of a measured photocurrent variable $y(t)$, through which the
randomness enters. The states are considered as states of
knowledge and, in practice, an experimentalist updates the
description of the system, $\rho(t)$ [Figs. 1(A)-1(C)], as the
measurement results, $y(t)$ [Fig. 1(D)], arrive in time.
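
The projective behavior of the SSE without feedback can be illustrated with a very small simulation. The sketch below is not the integrator described in the footnote (a weak second-order predictor-corrector); it swaps in a plain Euler-Maruyama step with renormalisation, sets $H(t) = 0$ and $\eta = 1$, and works in the $J_z$ eigenbasis, where the update reduces to rescaling each amplitude. The function name and the random seed are hypothetical.

```python
import numpy as np
from math import comb


def simulate_sse(n_spins=10, meas_rate=1.0, t_final=5.0, dt=1e-3, seed=0):
    """Euler-Maruyama integration of the SSE with H = 0 in the Jz eigenbasis.

    Returns (m, psi, variances): the Jz eigenvalues, the final state
    amplitudes, and the Jz variance recorded before each step and at the end.
    """
    rng = np.random.default_rng(seed)
    m = n_spins / 2.0 - np.arange(n_spins + 1)          # J, J-1, ..., -J
    # Coherent spin state along x: binomial amplitudes in the Jz basis
    psi = np.sqrt([comb(n_spins, k) for k in range(n_spins + 1)])
    psi = psi / 2.0 ** (n_spins / 2.0)

    steps = int(round(t_final / dt))
    variances = np.empty(steps + 1)
    for k in range(steps + 1):
        prob = psi**2
        mean = prob @ m
        variances[k] = prob @ m**2 - mean**2
        if k == steps:
            break
        dw = rng.normal(0.0, np.sqrt(dt))               # Wiener increment
        dev = m - mean                                  # Jz - <Jz>
        psi = psi + (-0.5 * meas_rate * dev**2 * dt
                     + np.sqrt(meas_rate) * dev * dw) * psi
        psi = psi / np.linalg.norm(psi)                 # keep the state normalised
    return m, psi, variances


m, psi, variances = simulate_sse()
print("initial Var(Jz):", variances[0], " final Var(Jz):", variances[-1])
print("most populated level m =", m[np.argmax(psi**2)])
```

With the footnote's parameters the variance starts at $N/4$ and collapses as the population concentrates on one or two neighboring levels; which level wins depends on the noise realisation.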

The stochastic master equation (1) describes only the dispersive
part of the atom-field interaction. Physically, however,
any dispersive phase shift must be accompanied by
some degree of decohering absorption and spontaneous
emission from the auxiliary excited-state level(s). Generally,
the dispersive SME will be valid until some time, at which
point spontaneous emission catches up to destroy the validity
of the above description. The resulting cutoff time will impose
a limit on the amount of observable squeezing or projection.

In free space measurements, e.g., free space Faraday rotation
[12,13,31,32], the effects of spontaneous emission
make this cutoff time relatively short. By surrounding the
atomic cloud with a cavity, however, spontaneous emission
can be suppressed and the validity of the SME correspondingly
extended.

For a cavity with decay rate $\kappa$, $N$ atoms with a decay rate
$\gamma$, and an atom-cavity coupling constant $g$, the requirement
to see any spin squeezing is only $g^2/\kappa\gamma > 1/N$. On the edge
of the strong-coupling regime, with $g^2/\kappa\gamma \sim 1$, spin variances
can be further decreased from initial values $\langle\Delta J_z^2\rangle \sim N$
to lower levels [7,9,10]. (In free space, it is in principle possible
to achieve this degree of squeezing with a maximally
focused probe beam, but one can do no better because of the
diffraction limit.) To further reduce the uncertainty to the
point where $\langle\Delta J_z^2\rangle \sim 1$ (i.e., the Heisenberg limit of spin
squeezing) the cavity needs to be in the very strong-coupling
regime with $g^2/\kappa\gamma > N$. If one requires that a single $J_z$ eigenstate
becomes resolvable ($\langle\Delta J_z^2\rangle < 1$), the cavity coupling
requirements become even more stringent depending on the
degree of projection desired.

While there are currently few experimental systems even in the strong-coupling regime, we expect this very-strong-coupling regime to eventually be reached for moderate numbers of atoms. With this attitude we continue to focus on the long-time limit of the pure dispersive SME in the interest of understanding the idealized limits of continuous projective measurement. For a more complete discussion of the realistic physical limits of this type of QND measurement, see Refs. [6-11].

A.  Hilbert  space,  coherent  spin  states,  and  Dicke  states 

Under certain idealizations, we can considerably reduce the size of the Hilbert space needed to describe the conditionally measured ensemble. Throughout this paper, the initial state $\rho(0)$ will be made equal to a coherent spin state (CSS) polarized along an arbitrary direction [4]. For example, a CSS pointing along the z axis is denoted $|\uparrow_1\uparrow_2\cdots\uparrow_N\rangle_z$ and all others can be prepared by rotating this state with the angular momentum operators $J_i$ with $i \in \{x,y,z\}$. A CSS, typically obtained via a dissipative optical pumping process, is an eigenstate of $\mathbf{J}^2$ with maximal eigenvalue $J(J+1)$, where $J=N/2$. Because the SME works under the QND approximation of negligible absorption (i.e., the large detuning dispersive limit), no angular momentum will be exchanged between the probe beam and the ensemble. The only other allowed dynamics are rotations of the angular momentum induced by applied magnetic fields; thus, the state will maintain maximal $\langle\mathbf{J}^2\rangle$ over the course of the measurement.

The Dicke states are defined [4] as the states $|l,m\rangle$ that are simultaneous eigenstates of both $\mathbf{J}^2$ and $J_z$:

$$J_z|l,m\rangle = m|l,m\rangle, \qquad (6)$$

$$\mathbf{J}^2|l,m\rangle = l(l+1)|l,m\rangle, \qquad (7)$$

where

$$|m| \le l \le J = N/2. \qquad (8)$$

Under the above approximations, we can neglect any state with $l \neq J$. We then shorten the labeling of our complete basis from $|J,m\rangle$ to $|m\rangle$ so that

$$J_z|m\rangle = m|m\rangle, \qquad (9)$$

$$\mathbf{J}^2|m\rangle = J(J+1)|m\rangle, \qquad (10)$$

where $m \in \{-N/2, -N/2+1, \ldots, N/2-1, N/2\}$.

When the physical evolution is such that the $|m\rangle$ states remain complete, we can limit ourselves to a density matrix of size $(N+1)\times(N+1)$ rather than the full size $2^N \times 2^N$. This reduced space is referred to as the symmetric subspace, as its states are invariant to particle exchange [33,34]. For the case of two spins, the symmetric subspace contains the triplet states, but not the singlet. States contained within the symmetric subspace can be described as a pseudospin of size $J=N/2$.
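As an illustration of the reduced description, the following sketch builds the collective spin operators directly in the $(N+1)$-dimensional symmetric subspace and checks that every state there has the maximal $\mathbf{J}^2$ eigenvalue $J(J+1)$. The function name `spin_operators` and the choice $N=4$ are assumptions made for this example; they do not come from the paper.

```python
import numpy as np

def spin_operators(N):
    """Return (Jx, Jy, Jz) as (N+1)x(N+1) matrices in the Dicke basis |m>.

    Basis ordering (an assumption of this sketch): index i <-> m = i - N/2.
    """
    J = N / 2.0
    m = np.arange(N + 1) - J                      # m = -J, ..., +J
    Jz = np.diag(m)
    # Raising operator: <m+1| J_+ |m> = sqrt(J(J+1) - m(m+1))
    ladder = np.sqrt(J * (J + 1) - m[:-1] * (m[:-1] + 1))
    Jp = np.diag(ladder, k=-1)                    # element [i+1, i]
    Jm = Jp.T                                     # lowering operator (real matrix)
    Jx = (Jp + Jm) / 2.0
    Jy = (Jp - Jm) / 2.0j
    return Jx, Jy, Jz

N = 4
Jx, Jy, Jz = spin_operators(N)
J2 = Jx @ Jx + Jy @ Jy + Jz @ Jz                  # total angular momentum squared
```

For $N=4$ the matrices are $5\times 5$ instead of $16\times 16$, and `J2` is proportional to the identity, which is the statement that the whole subspace shares the eigenvalue $J(J+1)=6$.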

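In the z basis, the extremal values of $m$, $\pm N/2$, are simply the coherent spin states pointing along the z axis:

$$|m = +N/2\rangle = |\uparrow_1\uparrow_2\cdots\uparrow_N\rangle, \qquad (11)$$

$$|m = -N/2\rangle = |\downarrow_1\downarrow_2\cdots\downarrow_N\rangle. \qquad (12)$$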

In terms of the constituent spins, these states are obviously unentangled. In contrast, consider the state with $m = 0$ (for $N$ even):

$$|m = 0\rangle = C \sum_i P_i\left(|\uparrow_1\cdots\uparrow_{N/2}\downarrow_{N/2+1}\cdots\downarrow_N\rangle\right), \qquad (13)$$

where the $P_i$ represent all permutations of the spins and $C$ is a normalization constant. This state is highly entangled in a way that is robust to particle loss [16]. Even though the expectation values $\langle J_i\rangle$ vanish for this state, it still has maximal $\mathbf{J}^2$ eigenvalue. Loosely, this state represents a state of knowledge where the length of the spin vector is known and the z component is known to be zero, but the direction of the spin vector in the x-y plane is completely indeterminate. Similarly, the entangled states with $0 < |m| < N/2$ can be imagined as residing on cones aligned along the z axis with projection $m$. The loss of pointing angle information from the measurement process is diagrammed in Fig. 1(B).

Along with their unique entanglement and uncertainty properties, Dicke states are also of interest for the important role they play in descriptions of collective radiation processes [4] and for their potential role in quantum information processing tasks [17,18,35].


B.  Short-time  limit 

Even when working within the symmetric subspace, for a large number of spins the size of $\rho(t)$ may be too unwieldy for computational efficiency. Because it is often desirable to update our state description in real time (e.g., for optimal feedback procedures), finding simple but sufficient descriptors is of considerable importance.

We can derive a reduced model by employing a moment expansion for the observable of interest. Extracting the conditional expectation values of the first two moments of $J_z$ from the SME gives the following scalar stochastic differential equations:

$$d\langle J_z\rangle(t) = \gamma\langle J_x\rangle(t)\,b(t)\,dt + 2\sqrt{M\eta}\,\langle\Delta J_z^2\rangle(t)\,dW(t), \qquad (14)$$

$$d\langle\Delta J_z^2\rangle(t) = -4M\eta\,\langle\Delta J_z^2\rangle^2(t)\,dt - i\gamma\langle[\Delta J_z^2, J_y]\rangle(t)\,b(t)\,dt + 2\sqrt{M\eta}\,\langle\Delta J_z^3\rangle(t)\,dW(t). \qquad (15)$$

Note that these equations are not closed because higher-order moments couple to them.

At short times, $t \ll 1/\eta M$, we can make this set of equations closed with the following approximations. If the spins are initially fully polarized along x, then by using the evolution equation for the x component, we can show $\langle J_x\rangle(t) \approx J\exp[-Mt/2]$. Making the Gaussian approximation at short times, the third-order terms $\langle\Delta J_z^3\rangle$ and $-i\gamma\langle[\Delta J_z^2, J_y]\rangle(t)\,b(t)$ can be neglected. The Holstein-Primakoff transformation makes it possible to derive this Gaussian approximation as an expansion in $1/J$ [36]. Both of the removed terms can be shown to be approximately $1/J\sqrt{J}$ smaller than the retained nonlinear term. Thus we can approximate the optimal solution with

$$d\langle J_z\rangle_s(t) = \gamma J\exp[-Mt/2]\,b(t)\,dt + 2\sqrt{M\eta}\,\langle\Delta J_z^2\rangle_s(t)\,dW_s(t), \qquad (16)$$

$$d\langle\Delta J_z^2\rangle_s(t) = -4M\eta\,\langle\Delta J_z^2\rangle_s^2(t)\,dt, \qquad (17)$$

where the $s$ subscript denotes the short-time solution and $dW_s(t) = 2\sqrt{M\eta}\,[y(t)\,dt - \langle J_z\rangle_s(t)\,dt]$. Also $b(t)$ is assumed to be of a form that keeps the total state nearly pointing along x. The differential equation for the variance $\langle\Delta J_z^2\rangle_s(t)$ is now deterministic. It can be solved to give


$$\langle\Delta J_z^2\rangle_s(t) = \frac{\langle\Delta J_z^2\rangle(0)}{1 + 4M\eta\,\langle\Delta J_z^2\rangle(0)\,t}. \qquad (18)$$


The deterministically shrinking value of $\langle\Delta J_z^2\rangle_s(t)$ represents the squeezing about the initially fluctuating value of $\langle J_z\rangle_s(t)$ as shown in the first two frames of Figs. 1(A) and 1(B). If feedback is added, then the value of $\langle J_z\rangle_s(t)$ can be zeroed via Larmor precession due to a control field along y and the same centered spin-squeezed state can be prepared on every trial [6,7,12].
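The closed form of Eq. (18) can be checked against a direct numerical integration of the deterministic variance equation, Eq. (17). The sketch below is illustrative only: the parameter values ($N=100$ spins, so $\langle\Delta J_z^2\rangle(0)=N/4$, with $M=\eta=1$) and the function names are assumptions chosen for the example.

```python
def variance_short_time(t, var0, M=1.0, eta=1.0):
    """Closed-form short-time variance, Eq. (18)."""
    return var0 / (1.0 + 4.0 * M * eta * var0 * t)

def integrate_eq17(t_final, var0, M=1.0, eta=1.0, n_steps=100_000):
    """Forward-Euler integration of dV/dt = -4 M eta V^2, Eq. (17)."""
    dt = t_final / n_steps
    v = var0
    for _ in range(n_steps):
        v += -4.0 * M * eta * v * v * dt
    return v

N = 100
var0 = N / 4.0                      # variance of an x-polarized CSS along z
t_final = 0.1                       # in units of 1/M (hypothetical choice)
closed_form = variance_short_time(t_final, var0)
numerical = integrate_eq17(t_final, var0)
```

With these values the variance has already dropped from 25 to 25/11 by $t = 0.1/M$, which shows how quickly the squeezing sets in when $\langle\Delta J_z^2\rangle(0)$ is large.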

The resulting spin-squeezed states can be used in subsequent precision measurements [14,15]. It is also worth pointing out that a precision measurement can be performed during the production of the conditional spin squeezing. For example, we have shown that by properly estimating both the spin state and an unknown classical field simultaneously with continuous measurement and Kalman filtering techniques, the field estimation can be improved over conventional limits by the presence of the simultaneous squeezing [22,23].


C.  Long-time  limit 

The approximations made in the previous section are no longer valid at times $t \gg 1/\eta M$. The third-order terms become non-negligible at long times; hence, the variance becomes stochastic. Subsequently, other high-order moments couple to the problem and we are forced to consider the stochastic differential equation for each. Eventually, any finite-numbered moment description is no longer useful and it initially appears that we must fall back on the full symmetric density matrix and the SME, Eq. (1), as our primary description.

Fortunately, we can take another approach and describe the state in terms of other sufficient statistics. Without a field, the only statistic of the photocurrent needed to describe the state at time $t$ is its integral $\int_0^t y(s)\,ds$ (see the Appendix or [37]). Knowing that the state is only a function of this variable and the initial state (prior information) makes the design of a real-time estimator experimentally convenient. For example, we could use an analog integrator to create this sufficient statistic from the photocurrent, then feed it into a possibly nonlinear device (like an FPGA [33]) to perform the estimation.

With the integrated photocurrent and the initial state

$$|\psi(0)\rangle = \sum_{m=-J}^{J} c_m|m\rangle, \qquad (19)$$

we can calculate (see the Appendix) the conditional expectation value of any power of $J_z$ with the expression

$$\mathrm{Tr}[J_z^k\tilde\rho(t)] = \sum_{m=-J}^{J} m^k\,|c_m|^2 \exp\left[-2M\eta m^2 t + 4mM\eta\int_0^t y(s)\,ds\right], \qquad (20)$$

where $\tilde\rho(t)$ is the unnormalized density matrix, and setting $k=0$ represents its trace, so

$$\langle J_z^k\rangle(t) = \mathrm{Tr}[J_z^k\tilde\rho(t)]/\mathrm{Tr}[J_z^0\tilde\rho(t)]. \qquad (21)$$


Consider the case when the system starts in the x-polarized spin-coherent state. To very good approximation (with reasonably large $J$) we can write for this state in the z basis,

$$|c_m|^2 \propto \exp\left[-\frac{m^2}{J}\right]. \qquad (22)$$

Using these coefficients, we now have the rule for mapping the photocurrent to the expectation of $J_z$:

$$\langle J_z\rangle(t) = \mathrm{Tr}[J_z^1\tilde\rho(t)]/\mathrm{Tr}[J_z^0\tilde\rho(t)]. \qquad (23)$$

Other than the minor approximation of the initial coefficients, using this estimate is essentially the same as using the solution to the full SME, so we do not give it a new subscript.
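The mapping from the integrated photocurrent to $\langle J_z^k\rangle(t)$ in Eqs. (20)-(23) is short enough to write out directly. The following is a minimal sketch, not the authors' implementation: the function name `conditional_moment`, the log-domain evaluation (used only to avoid overflow in the exponentials), and the parameter values are assumptions of this example.

```python
import numpy as np

def conditional_moment(k, populations, m_values, M, eta, t, Y):
    """<J_z^k>(t) from Eqs. (20)-(21).

    populations : |c_m|^2 (need not be normalized)
    Y           : integrated photocurrent, int_0^t y(s) ds
    """
    log_w = (np.log(populations)
             - 2.0 * M * eta * m_values ** 2 * t
             + 4.0 * m_values * M * eta * Y)
    log_w -= log_w.max()            # common factor cancels in the ratio of Eq. (21)
    w = np.exp(log_w)
    return np.sum(m_values ** k * w) / np.sum(w)

N = 20
J = N / 2
m_values = np.arange(N + 1) - J
populations = np.exp(-m_values ** 2 / J)      # Gaussian weights of Eq. (22)

# No net signal: the estimate stays at the symmetric value.
est_zero = conditional_moment(1, populations, m_values, M=1.0, eta=1.0, t=1.0, Y=0.0)

# A long record whose average is 3: the estimate locks onto the m = 3 level.
t_long = 50.0
est_three = conditional_moment(1, populations, m_values, 1.0, 1.0, t_long, 3.0 * t_long)
var_three = conditional_moment(2, populations, m_values, 1.0, 1.0, t_long, 3.0 * t_long) - est_three ** 2
```

Only the two numbers `t` and `Y` enter the estimate, which is the sense in which the integrated photocurrent is a sufficient statistic when no field is applied.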

To simplify further, we can change the sums to integrals, giving

$$\mathrm{Tr}[J_z^k\tilde\rho(t)] = \int_{-J}^{J} m^k e^{-Am^2 + 2Bm}\,dm, \qquad (24)$$

with

$$A = \frac{1}{J} + 2M\eta t, \qquad B = 2M\eta\int_0^t y(s)\,ds. \qquad (25)$$

This approximation produces an estimate

$$\langle J_z\rangle_i(t) = \frac{\int_{-J}^{J} m\,e^{-Am^2 + 2Bm}\,dm}{\int_{-J}^{J} e^{-Am^2 + 2Bm}\,dm}, \qquad (26)$$

which performs suboptimally when the distribution of states becomes very narrow at long times. Interestingly, the integral approximation here numerically appears to give the same estimate as the one derived previously for short times when no field is present, i.e.,

$$\langle J_z\rangle_i(t) = \langle J_z\rangle_s(t). \qquad (27)$$

This is not entirely surprising as both of these estimators ignore the discreteness of the Dicke levels. Also, at long times, it turns out that both of these estimates appear to be numerically equivalent to the simplest of all estimates: averaging the photocurrent. In other words, one simple and intuitive approximation to the optimal $\langle J_z\rangle(t)$ would be

$$\langle J_z\rangle_a(t) = \frac{\int_0^t y(s)\,ds}{t}, \qquad (28)$$

which is an estimate one might guess from the form of the photocurrent, Eq. (4). From simulation, it appears that this estimate is the same as both $\langle J_z\rangle_i(t)$ and $\langle J_z\rangle_s(t)$ for $t \gg 1/\eta M$. Despite the nonoptimality of these simple estimators, they perform well enough to resolve the discretization of the Dicke levels at long times.
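One way to see why these estimators agree is to evaluate the Gaussian integrals of Eq. (26) in closed form. The following is a sketch under two assumptions that are not made in the text: the integration limits are extended from $\pm J$ to $\pm\infty$ (reasonable once the distribution is narrow compared with $J$), and the initial state is x-polarized, so that $\langle J_z\rangle(0)=0$ and $\langle\Delta J_z^2\rangle(0)=J/2$.

```latex
% Integral estimator, Eq. (26), with limits extended to infinity:
\langle J_z\rangle_i(t) \approx \frac{B}{A}
  = \frac{2M\eta\int_0^t y(s)\,ds}{1/J + 2M\eta t}
  = \frac{2M\eta J t}{1 + 2M\eta J t}\,\langle J_z\rangle_a(t)
  \;\longrightarrow\; \langle J_z\rangle_a(t)
  \quad (t \gg 1/2M\eta J).

% Short-time filter, Eqs. (16)-(18), with b(t) = 0:
\langle\Delta J_z^2\rangle_s(t) = \frac{J/2}{1 + 2M\eta J t} = \frac{1}{2A},
\qquad
\langle J_z\rangle_s(t)
  = 4M\eta\,\langle\Delta J_z^2\rangle_s(t)\int_0^t y(s)\,ds
  = \frac{B}{A}.
```

The last expression can be verified by differentiating it and using Eq. (17), which reproduces Eq. (16) with $b(t)=0$. Under the stated assumptions, the integral and short-time estimators therefore coincide, and both reduce to the photocurrent average once $2M\eta J t \gg 1$.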


Unfortunately, the addition of a feedback field makes these simplified estimators inadequate at long times, and deriving simple reduced models with a field present is difficult, thus forcing us to use the full SME in our state-based controller. Despite this difficulty, during our subsequent feedback analysis we assume sufficient control bandwidth that the SME can be evolved by the observer in real time.


III.  MEASUREMENT  EVOLUTION  WITHOUT  FEEDBACK 

In this section, our goal is to describe how the estimates of the last section probabilistically evolve at long times into Dicke states via observation alone. First, we discuss steady-state and statistical properties of the SME, Eq. (1). Then, we examine the unconditional dynamical solution with $\eta = 0$, which gives the average state preparation behavior when $\eta \neq 0$. We then consider in detail how individual trajectories behave when $\eta \neq 0$. Finally, we discuss the performance of the nonoptimal estimators relative to the optimal projective estimator.


A.  Steady  states  of  the  SME  and  martingale  properties 

The fact that the SME eventually prepares eigenstates of $J_z$ is rather intuitive from a projection postulate perspective because $J_z$ is the quantity being measured. If we insert the pure Dicke state $\rho = |m\rangle\langle m|$ into the SME with no Hamiltonian (or only a field along z), we find that it is a steady state, $d\rho = 0$, no matter what happens with the subsequent measurement record. Of course, this does not yet prove that the state will eventually be obtained, as we have not discussed the stability of attractors in stochastic systems.

Without a field present, the SME has several convenient properties. First of all, from the evolution equation for the variance notice that the variance is a stochastic process that decreases on average. In fact it is a supermartingale, in that for times $s \le t$ we have

$$E_s[\langle\Delta J_z^2\rangle(t)] \le \langle\Delta J_z^2\rangle(s), \qquad (29)$$

where the notation $E[x(t)]$ denotes the average of the stochastic variable $x(t)$ at time $t$ and the $s$ subscript represents conditional expectation given a particular stochastic trajectory up to the time $s$. Additionally, it can be shown [5] that the average variance obeys the equation

$$E[\langle\Delta J_z^2\rangle(t)] = \frac{\langle\Delta J_z^2\rangle(0)}{1 + 4M\eta\,\langle\Delta J_z^2\rangle(0)\,[t + \xi(t)]}, \qquad (30)$$

where

$$\xi(t) = \int_0^t \frac{E[\{\langle\Delta J_z^2\rangle(s) - E[\langle\Delta J_z^2\rangle(s)]\}^2]}{E[\langle\Delta J_z^2\rangle(s)]^2}\,ds. \qquad (31)$$


A more explicit solution of $\xi(t)$ is not necessarily needed as its positivity ensures that $\langle\Delta J_z^2\rangle(t)$ stochastically approaches zero. This implies that a Dicke state is eventually prepared. The numerical simulation of Figs. 1 and 2 demonstrates this behavior for an initially x-polarized state. As expected, $E[\langle\Delta J_z^2\rangle(t)]$ in Fig. 2(A) appears to be less than the short-time solution $\langle\Delta J_z^2\rangle_s(t)$, Eq. (18), at long times.

FIG. 2. Many open-loop moment trajectories (footnote 1) of the SSE, Eq. (5). The trajectory of Fig. 1 is darkened. (A) At short times, the evolution of the variance (shown on a log scale) is deterministic and given by $\langle\Delta J_z^2\rangle_s(t)$. At long times, the variances become stochastic but bounded (above by 1/4 and below by $\exp[-2(Mt-1)]/4$). The average of all 10 000 trajectories (only 10 are shown) gives $E[\langle\Delta J_z^2\rangle(t)]$. (B) The projective nature of the measurement is made clear by the evolution of 100 trajectories of $\langle J_z\rangle(t)$. The distribution of the final results is given by the first histogram of Fig. 1(A). (C) The evolution of the 100 trajectories all starting in an x-polarized CSS. When $\eta = 1$, certain regions of Hilbert space are forbidden by the evolution.

Other useful properties of the stochastic evolution are evident from the moment equations. For example, we can show that

$$d\langle J_z^n\rangle = 2\sqrt{M\eta}\,(\langle J_z^{n+1}\rangle - \langle J_z^n\rangle\langle J_z\rangle)\,dW(t) \qquad (32)$$

for integer $n$; hence,

$$dE[\langle J_z^n\rangle] = 0 \qquad (33)$$

and for times $s \le t$ we have the martingale condition

$$E_s[\langle J_z^n\rangle(t)] = \langle J_z^n\rangle(s). \qquad (34)$$

This equation for $n = 1$ gives us the useful identity

$$E[\langle J_z\rangle(t)\langle J_z\rangle(s)] = E[\langle J_z\rangle(s)^2] \qquad (35)$$

for $s \le t$. Also, we can rewrite the expression for $n = 2$ as

$$E_s[\langle J_z\rangle(t)^2 + \langle\Delta J_z^2\rangle(t)] = \langle J_z\rangle(s)^2 + \langle\Delta J_z^2\rangle(s). \qquad (36)$$

This implies a sort of conservation of uncertainty as the diffusion in the mean, shown in Fig. 1(B), makes up for the decreasing value of the variance.
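These martingale properties can be checked with a small Monte Carlo experiment. The sketch below relies on an assumption that is not stated in the text: for this field-free QND measurement, the statistics of the integrated photocurrent are taken to be those of a classical hidden value $m$, drawn with probability $|c_m|^2$ and observed in additive white noise, so that $\int_0^t y\,ds$ is Gaussian with mean $mt$ and variance $t/4M\eta$. The conditional moments are then computed from Eqs. (20)-(21). All names and parameter values are hypothetical.

```python
import numpy as np
from math import comb

def posterior_moments(Y, t, m_values, prior, M=1.0, eta=1.0):
    """Conditional mean and variance of J_z, Eqs. (20)-(21), for each record Y."""
    log_w = (np.log(prior)[None, :]
             - 2.0 * M * eta * m_values[None, :] ** 2 * t
             + 4.0 * M * eta * m_values[None, :] * Y[:, None])
    log_w -= log_w.max(axis=1, keepdims=True)     # numerical stability only
    w = np.exp(log_w)
    w /= w.sum(axis=1, keepdims=True)
    mean = w @ m_values
    second = w @ m_values ** 2
    return mean, second - mean ** 2

N, M, eta, t, n_traj = 10, 1.0, 1.0, 2.0, 20000
m_values = np.arange(N + 1) - N / 2
# Exact J_z populations of an x-polarized CSS: a symmetric binomial distribution.
prior = np.array([comb(N, k) for k in range(N + 1)]) / 2.0 ** N

rng = np.random.default_rng(1)
m_hidden = rng.choice(m_values, size=n_traj, p=prior)
Y = m_hidden * t + rng.normal(0.0, np.sqrt(t / (4.0 * M * eta)), size=n_traj)

mean, var = posterior_moments(Y, t, m_values, prior, M, eta)
avg_mean = mean.mean()                    # Eq. (34), n = 1: remains near <J_z>(0) = 0
avg_second = (mean ** 2 + var).mean()     # Eq. (36): remains near N/4
avg_var = var.mean()                      # Eq. (29): falls below N/4
```

Across trials the average of $\langle J_z\rangle^2 + \langle\Delta J_z^2\rangle$ stays at its initial value $N/4$ while the average variance drops, which is the "conservation of uncertainty" described above.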

B.  $\eta = 0$

It is insightful to examine the behavior of the master equation with $\eta = 0$, which corresponds to ignoring the measurement results and turns the SME, Eq. (1), into a deterministic unconditional master equation. We continue to consider only those initial states that are polarized. This is because these states are experimentally accessible (via optical pumping) and provide some degree of selectivity for the final prepared state. To see this, let us consider a spin-1/2 ensemble polarized in the x-z plane, making angle $\theta$ with the positive z axis, such that

$$\langle J_x\rangle(0) = \sin(\theta)N/2,$$

$$\langle J_y\rangle(0) = 0,$$

$$\langle J_z\rangle(0) = \cos(\theta)N/2,$$

$$\langle\Delta J_x^2\rangle(0) = \cos^2(\theta)N/4,$$

$$\langle\Delta J_y^2\rangle(0) = N/4,$$

$$\langle\Delta J_z^2\rangle(0) = \sin^2(\theta)N/4. \qquad (37)$$

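Solving the unconditional moment equations and labeling them with $u$ subscripts, we get

$$\langle J_x\rangle_u(t) = \sin(\theta)\exp(-Mt/2)N/2,$$

$$\langle J_y\rangle_u(t) = 0,$$

$$\langle J_z\rangle_u(t) = \cos(\theta)N/2,$$

$$\langle\Delta J_x^2\rangle_u(t) = \sin^2(\theta)[N^2 - N - 2N^2\exp(-Mt) + (N^2 - N)\exp(-2Mt)]/8 + N/4 \to \sin^2(\theta)(N^2 - N)/8 + N/4,$$

$$\langle\Delta J_y^2\rangle_u(t) = \sin^2(\theta)[N^2 - N + (N - N^2)\exp(-2Mt)]/8 + N/4 \to \sin^2(\theta)(N^2 - N)/8 + N/4,$$

$$\langle\Delta J_z^2\rangle_u(t) = \sin^2(\theta)N/4. \qquad (38)$$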

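Note that, because the unconditional solutions represent the average of the conditional solution, i.e., $\rho_u(t) = E[\rho(t)]$, we have

$$E[\langle J_z\rangle(t)] = \langle J_z\rangle_u(t) = \langle J_z\rangle(0) = \cos(\theta)N/2. \qquad (39)$$

This also follows from the martingale condition for $n = 1$. From the martingale condition for $n = 2$ we get

$$E[(\langle J_z\rangle(t) - E[\langle J_z\rangle(t)])^2] = \langle\Delta J_z^2\rangle(0) - E[\langle\Delta J_z^2\rangle(t)] \to \langle\Delta J_z^2\rangle(0) = \sin^2(\theta)N/4. \qquad (40)$$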

Thus, when $0 < \eta \le 1$, we expect the final random conditional Dicke state on a given trial to fall within the initial z distribution. Given $\theta$, the distribution will have spread $|\sin(\theta)|\sqrt{N}/2$ about the value $\cos(\theta)N/2$. Although the final state is generally random, starting with a polarized state clearly gives us some degree of selectivity for the final Dicke state because $\sqrt{N} \ll N$.
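The mean and spread quoted above follow from the $J_z$ populations of a coherent spin state, which can be tabulated explicitly. The sketch below assumes the standard result that each of the $N$ independent spins of a CSS tilted by $\theta$ from the z axis is found "up" with probability $\cos^2(\theta/2)$, so the populations are binomial; the function name and the values $N=100$, $\theta=\pi/3$ are illustrative assumptions.

```python
from math import comb, cos, sin, pi

def css_z_populations(N, theta):
    """Populations |c_m|^2 of a CSS at polar angle theta, keyed by m = k - N/2."""
    q = cos(theta / 2.0) ** 2                 # single-spin probability of "up"
    return {k - N / 2.0: comb(N, k) * q ** k * (1.0 - q) ** (N - k)
            for k in range(N + 1)}

N, theta = 100, pi / 3
pops = css_z_populations(N, theta)

mean_m = sum(m * p for m, p in pops.items())
var_m = sum((m - mean_m) ** 2 * p for m, p in pops.items())

expected_mean = cos(theta) * N / 2.0          # <J_z>(0) of Eq. (37)
expected_var = sin(theta) ** 2 * N / 4.0      # <Delta J_z^2>(0) of Eq. (37)
```

For these values the final Dicke states are expected to scatter around $m = 25$ with standard deviation $\sqrt{18.75} \approx 4.3$, a small fraction of the full range $-50 \le m \le 50$.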
C.  $0 < \eta \le 1$

When $\eta \neq 0$, the measurement record is used to condition the state, and we can determine which Dicke state the system diffuses into. Given the task of preparing the state $|m_d\rangle$, the above analysis suggests the following experimental procedure. First, polarize the ensemble (via optical pumping) into an unentangled coherent state along any direction. Then rotate the spin vector (with a magnetic field) so that the z component is approximately equal to $m_d$. Finally, continuously measure z until a time $t \gg 1/\eta M$. The final estimate will be a random Dicke state in the neighborhood of $m_d$. When the trial is repeated, the final states will make up a distribution described by the initial moments of $J_z$ ($\langle J_z\rangle(0)$, $\langle\Delta J_z^2\rangle(0)$, ...). To reduce the effects of stray field fluctuations and gradients, a strong holding field could be applied along the z axis. Because this Hamiltonian commutes with the observable $J_z$, the final open-loop measurement results would be unchanged.

This process (with zero field) is shown schematically in Fig. 1 for $m_d = 0$, where the initial state is polarized along x. Because $\langle J_z\rangle(0) = 0$, the final state with the highest probability is the entangled Dicke state $m_d = 0$. In contrast, if $\langle J_z\rangle(0) = J$, the state would start in an unentangled CSS polarized along z and would not subsequently evolve.

One way of characterizing how close the state is to a Dicke state is through the variance $\langle\Delta J_z^2\rangle(t)$. Figure 2(A) displays many trajectories for the variance as a function of time. For times $t \ll 1/\eta M$ the variance is approximately deterministic and obeys the short-time solution of Eq. (18). During this period, the mean $\langle J_x\rangle(t)$ is decreasing at rate $M/2$. Before this mean has completely disappeared, a conditional spin-squeezed state is created. However, for larger times the mean and variance stochastically approach zero, and the state, while still entangled, no longer satisfies the spin squeezing criterion [16].

There are several features to notice about the approach to a Dicke state that are evident in Figs. 1 and 2. The variance at time $t \approx 1/\eta M$ is already of order unity. Thus, at this point, only a few neighboring $m$ levels contain any population, as can be seen in Fig. 1(C). Also, it can be numerically shown that, for x-polarized initial states, the diffusion of the variance at long times $t \gg 1/\eta M$ is bounded above and below by

$$\exp[-2(Mt - 1)]/4 \le \langle\Delta J_z^2\rangle(t) \le 1/4, \qquad (41)$$

which is evident from Fig. 2(A). These facts indicate that the population is divided among at most two levels at long times which "compete" to be the final winner. If we assume that only two neighboring levels are occupied and apply the SSE (with $\eta = 1$), the probability $p$ to be in one level obeys the stochastic equation

$$dp = -2\sqrt{M}\,p(1 - p)\,dW(t) \qquad (42)$$

and the variance takes the form $\langle\Delta J_z^2\rangle(t) = p(1 - p)$. As simple as it looks, this SDE is not analytically solvable [29,30]. The maximum variance is 1/4 and it can be shown that, for $p = 1 - \epsilon$, with $\epsilon$ small, the lower bound is of the exponential form stated above, so the two-level assumption seems to be a good one. The fact that occupied Hilbert space becomes small at long times is also evident in Fig. 2(C), where the allowed states are seen to be excluded from certain regions when $\eta = 1$. The arclike boundaries of the forbidden space are where the two-level competition occurs.
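The two-level competition of Eq. (42) is easy to explore numerically. The following sketch integrates it with a plain Euler-Maruyama scheme; the step size, trajectory count, seed, and the starting value $p(0)=1/2$ are assumptions chosen for illustration, and the scheme is a generic one rather than the integrator used for the paper's figures.

```python
import numpy as np

def simulate_two_level(p0=0.5, M=1.0, t_final=5.0, dt=1e-3, n_traj=4000, seed=0):
    """Euler-Maruyama integration of dp = -2 sqrt(M) p (1 - p) dW, Eq. (42)."""
    rng = np.random.default_rng(seed)
    p = np.full(n_traj, p0)
    n_steps = int(round(t_final / dt))
    for _ in range(n_steps):
        dW = rng.normal(0.0, np.sqrt(dt), size=n_traj)   # Wiener increments
        p = p - 2.0 * np.sqrt(M) * p * (1.0 - p) * dW
    return p

p_final = simulate_two_level()
variance = p_final * (1.0 - p_final)      # <Delta J_z^2> in the two-level picture
mean_p = p_final.mean()                   # conserved on average (no drift term)
```

Each trajectory drifts toward $p=0$ or $p=1$, so the variance $p(1-p)$ collapses from its maximum of 1/4, while the ensemble average of $p$ stays near its initial value because the equation has no drift term.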

In practice, an experimentalist does not always have an infinite amount of time to prepare a state. Eventually spontaneous emission and other decoherence effects will destroy the dispersive QND approximation that the present analysis is based upon. Suppose our task were to prepare a Dicke state with, on average, a desired uncertainty, $\langle\Delta J_z^2\rangle_d \ll 1$, such that one level was distinguishable from the next. From Eq. (30), we see that the time that it would take to do this on average is given by

$$t_d + \xi(t_d) = \frac{1}{4M\eta}\left[\frac{1}{\langle\Delta J_z^2\rangle_d} - \frac{1}{\langle\Delta J_z^2\rangle(0)}\right]. \qquad (43)$$

Thus, if $\langle\Delta J_z^2\rangle_d \ll 1$ is our goal, then $t_d$ is how long the state must remain coherent. The larger $\langle\Delta J_z^2\rangle(0)$ is, the more entangled the final states are likely to be ($m \approx 0$) [16], and hence, by Eq. (43), the longer it takes to prepare the state for a given $\langle\Delta J_z^2\rangle_d$. Hence, we arrive at the intuitively satisfying conclusion that conditional measurement produces entangled states more slowly than unentangled states. Of course, Eq. (43) is an average performance limit. In a best case scenario, the variance would attain the lower bound of Eq. (41) where the state reduction happens exponentially fast.
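To get a feel for the time scales involved, Eq. (30) can be rearranged at the moment the average variance reaches its target value, which gives $t_d + \xi(t_d) = (1/4M\eta)[1/\langle\Delta J_z^2\rangle_d - 1/\langle\Delta J_z^2\rangle(0)]$. Because $\xi \ge 0$, the right-hand side is an upper bound on $t_d$. The sketch below evaluates that bound; the function name and the numbers ($N=100$, target variance 0.1, $M=\eta=1$) are hypothetical.

```python
def preparation_time_bound(var_target, var_initial, M=1.0, eta=1.0):
    """Right-hand side of the rearranged Eq. (30): an upper bound on t_d."""
    return (1.0 / var_target - 1.0 / var_initial) / (4.0 * M * eta)

N = 100
# x-polarized start (theta = pi/2): <Delta J_z^2>(0) = N/4, final states near m = 0.
bound_entangled = preparation_time_bound(0.1, N / 4.0)
# Nearly z-polarized start with a much smaller initial variance (hypothetical value).
bound_near_pole = preparation_time_bound(0.1, 1.0)
```

The bound is larger for the larger initial variance (2.49 versus 2.25 in units of $1/M$), which matches the observation that states near $m = 0$ take longer to prepare.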


D.  Performance  of  suboptimal  estimators 

Now we consider the performance of the suboptimal estimators discussed previously, in particular the current average $\langle J_z\rangle_a(t)$ of Eq. (28). It makes sense to associate the overall "error" of this estimator, denoted $V_a$, to be the average squared distance of the estimator from the optimal estimator plus the average uncertainty of the optimal estimator itself, $E[\langle\Delta J_z^2\rangle(t)]$. Using the martingale properties of the optimal estimate and the definition of the photocurrent gives this quantity as

$$V_a \equiv E[(\langle J_z\rangle_a(t) - \langle J_z\rangle(t))^2] + E[\langle\Delta J_z^2\rangle(t)] = \frac{1}{4M\eta t}. \qquad (44)$$

This is just the error in estimating a constant masked by additive white noise with the same signal-to-noise ratio [23]. The optimal estimator is better than this suboptimal estimator at long times only through the quantity $\xi(t)$, Eq. (31).

In the open-loop experimental procedure described at the beginning of the last section, the above observation indicates that we can replace the optimal estimator with the photocurrent average and still resolve the projective behavior (given sufficient elimination of extraneous noise). The price paid for the simplicity of the averaging estimator is that it converges more slowly and it only works when a field is not present (hence without control).


IV.  CLOSED-LOOP  EVOLUTION 

The primary problem with the open-loop state preparation scheme (and other approaches [17-19]) is that it is probabilistic. For a single measurement, there exists some degree of control, by adjusting the initial angle of rotation $\theta$, but the final state is a priori unpredictable within the variance of the initial state. In this section, we show that the state preparation can be made deterministic with the use of feedback. Just as the control scheme of [6,7] produces deterministically centered spin-squeezed states, we present a simple feedback controller that will prepare the same desired Dicke state (particularly $m_d = 0$) on every measurement trial.

We choose to work with a y-axis magnetic field actuator corresponding to the Hamiltonian $H(t) = \gamma b(t)J_y$. If the CSS initial state begins in the x-z plane, this will ensure that the vector $\langle\mathbf{J}\rangle(t)$ remains in this plane. This actuator is natural for the control of spin-squeezed states at short times, where the linear moments of $\langle\mathbf{J}\rangle(t)$ are large and allow intuitive rotation of the spin vector. However, at long times the field will mostly be affecting nonlinear terms in the moment expansion and the dynamics are less intuitive, as can be seen by the structure near the z axis in Fig. 2(C). Still, we continue to give ourselves only these rotations to work with as they are the most experimentally accessible actuation variable.

In principle, the fact that Dicke states can be prepared deterministically with feedback should not be surprising. Given the aforementioned characteristics of the noncontrolled measurement, one could imagine preparing a particular state by alternating measurement and control periods. For example, an initial measurement (lasting for a time $\Delta t \ll 1/\eta M$) would determine the fluctuation of $\langle J_z\rangle$ while the uncertainty $\langle\Delta J_z^2\rangle$ simultaneously decreased (on average). Then the measurement would be turned off and the state would be rotated with a control field to nullify the conditional quantity $\langle J_z\rangle - m_d$ (if preparing $|m_d\rangle$). The process of alternating measurement and control could then be repeated and would eventually clamp down on the desired state. Notice that, unlike the preparation of spin-squeezed states [6,7], this procedure could not be performed with a single measurement and control cycle. In other words, if we measure for a time $t \gg 1/\eta M$ and prepare a probabilistic Dicke state, then a single postmeasurement rotation cannot prepare a different desired Dicke state in the same basis.

With this intuitive picture in mind, now consider the continuous limit of this process, where the measurement and control are performed simultaneously. We wish to find a mapping from the photocurrent history to the control field that prepares our state of interest in a satisfactory manner on every trial. For simplicity, we work with $\eta = 1$ and use the SSE of Eq. (5) for all simulations (footnote 1). In selecting a controller, we could choose one of several strategies, including either direct current feedback or a feedback rule based on the state (i.e., what has been called Markovian and Bayesian feedback, respectively [27,38]). While direct current feedback possesses certain advantages, mainly simplicity that allows practical implementation, and is capable of working adequately at short times, any constant gain procedure would never prepare a Dicke state with confidence. If the current is directly fed back, a finite amount of noise will unnecessarily drive the system away from its target, even if the state starts there. Of course the gain could be ramped to zero in time, but unlike the short-time case, it is not clear how to tailor the gain intelligently.

Another alternative would be to prepare a spin-squeezed state with this approach and then turn off the feedback at some intermediate time. This would certainly enhance the probability of obtaining a certain Dicke state, but the process would remain probabilistic to some degree. For these reasons, we continue considering only state-based feedback, despite the fact that updating the state in real time is experimentally challenging.


A.  Defining  a  cost 

A useful first step in the design of any controller is to
define the quantity that the ideal controller should minimize:
the cost function. For example, consider a state preparation
application where the controller aims to produce the desired
target state $|\psi_d\rangle$. In this case, one possible cost function is the
quantity

$$U_f = 1 - \langle\psi_d|\rho|\psi_d\rangle \ge 0, \qquad (45)$$

evaluated at the stopping time, which is zero iff the fidelity
of the state with respect to the target is unity. In the current
application, where we desire a final Dicke state $|m_d\rangle$, we
wish to minimize a different quantity

$$U = (\langle J_z\rangle - m_d)^2 + \langle\Delta J_z^2\rangle = \sum_m \langle m|\rho|m\rangle (m - m_d)^2 \ge 0, \qquad (46)$$

which is zero iff $\rho = |m_d\rangle\langle m_d|$. Notice that $U$ gives a higher
penalty than $U_f$ to states that are largely supported by Dicke
states far removed from the target. In general, $U$ will evolve
stochastically and we may be more interested in the mean
behavior, denoted $E[U]$. In the uncontrolled case, it can be
shown that this quantity remains constant, $E[U(t)] = U(0)$.
For the controlled case, we wish for $E[U] \to 0$ as time advances,
which, because $U \ge 0$, implies that every trajectory
approaches the target state $|m_d\rangle$.
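
To make the difference between the two costs concrete, here is a minimal Python sketch that evaluates $U_f$ and $U$ for density matrices written in the Dicke basis. The function names, the choice $J = 2$, and the two example states are illustrative assumptions, not part of the original analysis. The last function checks numerically that the moment form $(\langle J_z\rangle - m_d)^2 + \langle\Delta J_z^2\rangle$ agrees with the population-weighted sum.

```python
import numpy as np

def jz_eigenvalues(J):
    """J_z eigenvalues m = -J, ..., J labelling the Dicke states |m>."""
    return np.linspace(-J, J, int(round(2 * J)) + 1)

def fidelity_cost(rho, m_d):
    """U_f = 1 - <m_d|rho|m_d> for a Dicke target state |m_d>."""
    J = (rho.shape[0] - 1) / 2
    idx = int(round(m_d + J))
    return 1.0 - float(np.real(rho[idx, idx]))

def variance_cost(rho, m_d):
    """U = sum_m <m|rho|m> (m - m_d)^2."""
    J = (rho.shape[0] - 1) / 2
    m = jz_eigenvalues(J)
    populations = np.real(np.diag(rho))
    return float(np.sum(populations * (m - m_d) ** 2))

def variance_cost_from_moments(rho, m_d):
    """The same U written as (<J_z> - m_d)^2 + <Delta J_z^2>."""
    J = (rho.shape[0] - 1) / 2
    m = jz_eigenvalues(J)
    p = np.real(np.diag(rho))
    mean = np.sum(p * m)
    variance = np.sum(p * m**2) - mean**2
    return float((mean - m_d) ** 2 + variance)

# Illustrative states for J = 2 (basis order m = -2, -1, 0, 1, 2), target m_d = 0.
rho_near = np.diag([0.0, 0.5, 0.5, 0.0, 0.0])  # half the weight on m = -1
rho_far = np.diag([0.5, 0.0, 0.5, 0.0, 0.0])   # half the weight on m = -2

for name, rho in [("near", rho_near), ("far", rho_far)]:
    print(name, fidelity_cost(rho, 0), variance_cost(rho, 0))
```

Both example states have the same fidelity cost, while the variance-type cost is four times larger for the state whose stray population sits two Dicke levels away from the target.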

In general, the cost function could also include an integral
of the quantity $U(t)$ instead of just the final value. As in
classical control theory [23], it is also practical to include a
function of $b(t)$ in the cost as a way of expressing our experimental
feedback gain and bandwidth constraints. Analytically
proceeding in this way by optimizing the average
cost is too difficult for the current problem, but with this
perspective in mind, we proceed by proposing controllers
according to related considerations.


B.  Control  law  1 


Now consider the average evolution of the above cost
function, which is given by

$$dE[U(t)] = -2\gamma E\left[b(t)\left(\frac{\langle J_xJ_z + J_zJ_x\rangle(t)}{2} - m_d\langle J_x\rangle(t)\right)\right]dt. \qquad (47)$$


Because  we  want  this  function  to  continuously  decrease,  the 
right-hand  side  should  be  negative  at  all  times.  If  we  have 
full  access  to  the  density  matrix  and  minimal  feedback  delay, 
we  could  use  the  controller 




DETERMINISTIC  DICKE-STATE  PREPARATION  WITH... 


PHYSICAL  REVIEW  A  70,  022106  (2004) 


[Figure 3 axes: time; $\langle J_x\rangle(t)$]

FIG. 3. One hundred closed-loop moment trajectories¹ of the SSE with feedback law $b(t) = \lambda\langle J_xJ_z + J_zJ_x\rangle(t)/2$ and $\lambda = 10$ chosen from
numerical considerations. (A), (B) If the control is successful, the quantity $\langle J_z^2\rangle(t)$ should go to zero on every trial. For this controller the
number of successful trajectories is increased significantly (from 25% to 90%), but the remaining fraction is attracted to neighboring fixed
points, causing the mean $E[\langle J_z^2\rangle(t)]$ to saturate at a nonzero value. Although the successful fraction converges exponentially, the fastest
converging trajectories are slower than in the open-loop case. This is evident in (C) as the converging trajectories have visibly not yet
reached $\langle J_x\rangle = 0$ at time $t = 5$.


$$b_1(t) = \lambda\left(\frac{\langle J_xJ_z + J_zJ_x\rangle(t)}{2} - m_d\langle J_x\rangle(t)\right), \qquad (48)$$

where $\lambda$ is a constant positive gain factor. This law guarantees
that $dE[U(t)] \le 0$. Still, this does not yet prove that $U = 0$
is obtained because $dE[U(t)] = 0$ for states other than the
target state. Furthermore, even with this control law applied,
all Dicke states remain fixed points.
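
The fixed-point remark can be checked numerically. The sketch below assumes a feedback signal of the form $b_1 = \lambda(\langle J_xJ_z + J_zJ_x\rangle/2 - m_d\langle J_x\rangle)$, builds the spin operators in the Dicke basis for a small spin, and evaluates the signal on a few states. The helper names, the spin size $J = 2$, and the test states are assumptions chosen for illustration only.

```python
import numpy as np

def spin_operators(J):
    """Return (Jx, Jz) in the Dicke basis ordered m = -J, ..., J."""
    m = np.linspace(-J, J, int(round(2 * J)) + 1)
    Jz = np.diag(m)
    # J_+ |m> = sqrt(J(J+1) - m(m+1)) |m+1>, so the coefficients sit on
    # the first subdiagonal in this basis ordering.
    raising = np.sqrt(J * (J + 1) - m[:-1] * (m[:-1] + 1))
    Jp = np.diag(raising, k=-1)
    Jx = (Jp + Jp.T) / 2
    return Jx, Jz

def control_law_1(rho, m_d, gain):
    """gain * ( <JxJz + JzJx>/2 - m_d <Jx> ) for the state rho."""
    J = (rho.shape[0] - 1) / 2
    Jx, Jz = spin_operators(J)
    symmetrized = np.real(np.trace(rho @ (Jx @ Jz + Jz @ Jx))) / 2
    mean_jx = np.real(np.trace(rho @ Jx))
    return float(gain * (symmetrized - m_d * mean_jx))

def pure_state(psi):
    """Density matrix |psi><psi| for a (not necessarily normalized) vector."""
    psi = np.asarray(psi, dtype=complex)
    psi = psi / np.linalg.norm(psi)
    return np.outer(psi, psi.conj())

J = 2
dim = int(2 * J) + 1
# Every Dicke state gives zero feedback, whatever the target m_d.
dicke_signals = [control_law_1(pure_state(np.eye(dim)[i]), m_d=0, gain=10.0)
                 for i in range(dim)]
# A superposition of m = 0 and m = 1 gives a nonzero signal.
superposition = pure_state([0, 0, 1, 1, 0])
print(dicke_signals, control_law_1(superposition, m_d=0, gain=10.0))
```

Because the signal vanishes on every Dicke state, the controller cannot push a trajectory away from a wrong Dicke state once it has arrived there, which is consistent with the statement that all Dicke states remain fixed points.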

Regardless of these issues, we proceed by analyzing the
performance of this control law numerically with $m_d = 0$. In
principle, the gain could be chosen arbitrarily large. Here we
choose to work with a gain that is large enough to be effective
but small enough to keep the numerical simulation results
valid.¹ The choice of a limited gain is a necessity in
both simulation and experiment; thus, we wish to find a control
law that works within this constraint. For the parameters
used in our simulation, we use a gain of $\lambda = 10$, which produces
the results shown in Fig. 3.

In Fig. 3(A), we now plot the figure of merit for $m_d = 0$,
$U(t) = \langle J_z^2\rangle(t)$. In open-loop configuration, only 25% of all
trajectories are attracted to $m = 0$, whereas with this controller
the percentage reaches 90%. Furthermore, most of these trajectories
approach the state at an exponential rate close to $M$,
as indicated by the curve under which 80% of the trajectories
lie. Interestingly, this is at the expense of those trajectories
that in open loop approached the target state at an exponential
rate of $2M$. There is a trade-off by which the control
slightly compromises the convergence of the best-case trajectories.

Unfortunately, because all other Dicke states are still fixed
points of the controlled SSE and the gain is finite, a small
fraction (10%) of trajectories are attracted to those states
neighboring the target state. Thus this controller does not
appear to deterministically prepare all trajectories into the
target state and the mean $E[\langle J_z^2\rangle(t)]$ flattens at a level determined
by the unsuccessful fraction of trials.

C.  Control  law  2 

The obvious solution to the above problem is to try a
controller that ensures the target state is the only fixed point
of the SME or SSE. In this section we propose the control
law

$$b_2(t) = \lambda[\langle J_z\rangle(t) - m_d], \qquad (49)$$

for which the state $|m_d\rangle$ is the only fixed point. However,
unlike $b_1(t)$ this controller lacks the x symmetry that ensures
$dE[U(t)] \le 0$. Also, while the symmetry of $b_1(t)$ will allow it
to lock to both sides of the Bloch sphere, $b_2(t)$ will only lock
to one side of the sphere.
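
As a small illustration, the following Python sketch evaluates a feedback signal of the form $b_2 = \lambda(\langle J_z\rangle - m_d)$ on each Dicke state of a $J = 2$ spin. The function names and parameter values are assumptions made for this example. It only shows that the signal is nonzero on every Dicke state other than the target; it is not a proof that the target is the unique fixed point of the full stochastic dynamics.

```python
import numpy as np

def control_law_2(rho, m_d, gain):
    """gain * ( <Jz> - m_d ), using the Dicke-basis populations of rho."""
    J = (rho.shape[0] - 1) / 2
    m = np.linspace(-J, J, rho.shape[0])
    mean_jz = float(np.sum(np.real(np.diag(rho)) * m))
    return gain * (mean_jz - m_d)

def dicke_state(J, m):
    """Density matrix |m><m| in the basis ordered m = -J, ..., J."""
    dim = int(round(2 * J)) + 1
    rho = np.zeros((dim, dim))
    idx = int(round(m + J))
    rho[idx, idx] = 1.0
    return rho

J, m_d, gain = 2, 0, 10.0
signals = {m: control_law_2(dicke_state(J, m), m_d, gain) for m in range(-J, J + 1)}
print(signals)
```

With these assumed values the signal is proportional to the distance $m - m_d$, so a trajectory that lands on a neighboring Dicke state is still driven by the field rather than left at rest.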

Again, we proceed by numerically analyzing the performance
of this controller for $m_d = 0$, with the results displayed
in Fig. 4. The gain is chosen in the same manner as before,
which leads to the same reasonable choice of $\lambda = 10$. In Fig.
4(C) the fundamental nature of the dynamics can be seen.
Close to 90% of the trajectories are directly transported towards
the target state, but the remaining fraction “misses” on the first
pass. Instead of being attracted towards other fixed points
though, this unsuccessful fraction is recycled and rotated
back onto the positive x axis, where it can reattempt convergence
onto the target state. These large excursions can be
seen in Figs. 4(A) and 4(B) as well, but they do not appear to
dominate the net flow. The average of 10 000 trajectories
gives a quantity $E[\langle J_z^2\rangle(t)]$ which appears to exponentially
descend towards zero, implying that the state preparation has
been made deterministic. As with the control of $b_1(t)$ there is
again a trade-off: the trajectories that previously descended
at the exponential rate of $2M$ converge more slowly, but still
exponentially.

V.  CONCLUSION 

The purpose of this paper is to demonstrate the fact that
the process of continuous projective measurement can be
made deterministic with a theoretically simple and intuitive
state-based control law. In the context of an atomic spin ensemble,
the resulting Dicke states are highly entangled and
otherwise difficult to reliably produce from an initially unentangled
state.

However, there is much work to be done in the general
field of quantum-state estimation and control, of which this
is one example. In this pursuit, it is helpful to utilize and
adapt methods from the developed fields of classical stochastic
estimation and control theory. In [28], for example, the
problem of this paper is considered for a single spin with
greater emphasis on technical notions of stochastic stability
and convergence. Ultimately, we would like to discover constructive
methods for deriving optimal control laws given a
cost function and realistic actuators.

FIG. 4. One hundred closed-loop moment trajectories¹ of the SSE with feedback law $b(t) = \lambda\langle J_z\rangle(t)$ and $\lambda = 10$ chosen from numerical
considerations. (A) The average over 10 000 trajectories suggests that with this control law the mean $E[\langle J_z^2\rangle(t)]$ descends to zero exponentially
and the target state is deterministically prepared. (B) Despite a number of early excursions, all 100 trajectories shown converge to the
desired value of $m = 0$. (C) Those trajectories that do not descend to the goal directly (about 10 of 100) are recycled and rotated back into the
attractive region of the target state. Again, the control slightly compromises the best-case convergence rate and the trajectories have a
nonzero (but still decreasing) $\langle J_x\rangle$ at $t = 5$.

Even with an optimal control law in hand, there is no
guarantee that experimental implementation will be possible.
Any analysis should incorporate, among other constraints,
nonunity detection efficiencies and finite controller resources
(bandwidth, memory, etc.). For experimental application of
quantum feedback, the controller complexity needs to be reduced
to the point where the delay is minimal compared to
other dynamical time scales [33]. As in classical control, effective
model reduction techniques are indispensable when it
comes to implementation.

Despite these difficulties, the increasing number of physical
systems that can be measured reliably at the quantum
limit will surely hasten the effort to solve many of these
technical challenges. By respecting the physical basis of
measurement dynamics, experimentalists will be able to
more efficiently use measurement itself, in tandem with more
traditional techniques, to actuate quantum systems into desirable
states.


APPENDIX: SOLUTION OF THE SME WITHOUT A FIELD

An explicit solution to the SME, Eq. (1), can easily be
found in the case $H(t) = 0$. First, the SME is rewritten as

$$d\tilde\rho(t) = \mathcal{D}[\sqrt{M}J_z]\tilde\rho(t)\,dt + 2M\eta[J_z\tilde\rho(t) + \tilde\rho(t)J_z]y(t)\,dt. \qquad (A1)$$

This equation, known as the unnormalized or linear SME, is
equivalent to Eq. (1) with the identification

$$\rho(t) = \tilde\rho(t)/\mathrm{Tr}[\tilde\rho(t)]. \qquad (A2)$$

Introducing the notation

$$\mathcal{G}_1\rho = J_z\rho J_z, \quad \mathcal{G}_2\rho = J_z^2\rho + \rho J_z^2, \quad \mathcal{G}_3\rho = J_z\rho + \rho J_z, \qquad (A3)$$

Eq. (A1) can be written in the more suggestive form

$$d\tilde\rho(t) = M\left[\mathcal{G}_1 - \tfrac{1}{2}\mathcal{G}_2\right]\tilde\rho(t)\,dt + 2M\eta\,\mathcal{G}_3\tilde\rho(t)y(t)\,dt. \qquad (A4)$$

Now note that Eq. (A4) is a linear Itô SDE [29] for $\tilde\rho(t)$, and
moreover $\mathcal{G}_{1,2,3}$ all commute with each other in the sense that
$\mathcal{G}_i\mathcal{G}_j\rho = \mathcal{G}_j\mathcal{G}_i\rho$. Such SDE's have a simple explicit solution
[30]

$$\tilde\rho(t) = \exp\left\{\left[M(1-\eta)\mathcal{G}_1 - M(1+\eta)\mathcal{G}_2/2\right]t + 2M\eta\,\mathcal{G}_3\int_0^t y(s)\,ds\right\}\rho(0), \qquad (A5)$$

as is easily verified by taking the time derivative of this
expression, where care must be taken to use Itô's rule for the
stochastic term.

Now consider an initial pure state of the form

$$|\psi(0)\rangle = \sum_{m=-J}^{J} c_m|m\rangle. \qquad (A6)$$

The associated initial density matrix is then

$$\rho(0) = |\psi(0)\rangle\langle\psi(0)| = \sum_{m,m'=-J}^{J} c_m c^*_{m'}|m\rangle\langle m'|. \qquad (A7)$$

Substituting into Eq. (A5) gives

$$\tilde\rho(t) = \sum_{m,m'=-J}^{J} c_m c^*_{m'}\exp\left\{\left[M(1-\eta)mm' - \tfrac{1}{2}M(1+\eta)\left[m^2 + (m')^2\right]\right]t + 2M\eta(m+m')\int_0^t y(s)\,ds\right\}|m\rangle\langle m'|. \qquad (A8)$$

Hence

$$\mathrm{Tr}[J_z^k\tilde\rho(t)] = \sum_{m=-J}^{J} m^k|c_m|^2\exp\left[-2M\eta m^2 t + 4mM\eta\int_0^t y(s)\,ds\right], \qquad (A9)$$

which is the result used in the text, Eq. (20).
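
The closed-form moments are straightforward to evaluate numerically. The Python sketch below assumes the diagonal weights $|c_m|^2\exp[-2M\eta m^2 t + 4mM\eta\int_0^t y(s)\,ds]$ and normalizes by the $k = 0$ moment, i.e., by $\mathrm{Tr}[\tilde\rho(t)]$. The function name, the argument `Y` standing for the integrated photocurrent, and the example parameters are assumptions made for illustration.

```python
import numpy as np

def conditional_jz_moment(c, k, M, eta, t, Y):
    """Normalized <J_z^k> at time t from the diagonal weights of the linear SME.

    c : amplitudes c_m of the initial pure state, ordered m = -J, ..., J
    Y : integrated photocurrent, int_0^t y(s) ds
    """
    c = np.asarray(c, dtype=complex)
    J = (len(c) - 1) / 2
    m = np.linspace(-J, J, len(c))
    exponent = -2.0 * M * eta * m**2 * t + 4.0 * m * M * eta * Y
    # Subtracting the largest exponent rescales numerator and denominator
    # by the same factor, which avoids overflow without changing the ratio.
    weights = np.abs(c) ** 2 * np.exp(exponent - exponent.max())
    return float(np.sum(m**k * weights) / np.sum(weights))

# Illustrative case: J = 5 with equal amplitudes on all Dicke states.
c_uniform = np.ones(11) / np.sqrt(11)
mean_initial = conditional_jz_moment(c_uniform, 1, M=1.0, eta=1.0, t=0.0, Y=0.0)
mean_late = conditional_jz_moment(c_uniform, 1, M=1.0, eta=1.0, t=10.0, Y=20.0)
print(mean_initial, mean_late)
```

Completing the square in the exponent shows that the weights form a Gaussian in $m$ centered at $Y/t$ whose width shrinks with time, so in this assumed example a record with $Y/t = 2$ concentrates the conditional state on the Dicke state $m = 2$.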
We propose a trap for confining cold neutral atoms in a microscopic ring using a magnetoelectrostatic
potential. The trapping potential is derived from a combination of a repulsive magnetic field from a hard drive
atom mirror and the attractive potential produced by a charged disk patterned on the hard drive surface. We
calculate a trap frequency of [29.7, 42.6, 62.8] kHz and a depth of [16.1, 21.8, 21.8] MHz for [$^{133}$Cs, $^{87}$Rb,
$^{40}$K], and discuss a simple loading scheme and a method for fabrication. This device provides a one-dimensional
potential in a ring geometry that may be of interest to the study of trapped quantum degenerate
one-dimensional gases.

DOI: 10.1103/PhysRevA.70.053616  PACS number(s): 03.75.Be, 05.30.Jp, 03.75.Lm, 32.80.Pj


Traps are a standard tool for the study and manipulation
of cold neutral atoms, allowing the investigation of fundamental
quantum dynamics as well as providing a basis for
quantum information processing. The manipulation of
trapped atoms on “atom chips” allows the implementation of
many different atom optics elements for trapping, waveguiding,
interferometry, etc. [1,2]. Most atom chips use micron-sized
current-carrying wires to generate the magnetic trapping
fields. We propose to construct a magnetoelectrostatic
ring trap, consisting of a hard drive atom mirror that provides
a repulsive force on low-field seeking atoms [3] and electric
pads that attract polarizable atoms via the Stark effect [4-6].
Schmiedmayer and Hinds and Hughes have proposed a range
of such traps, including large-area two-dimensional traps,
wire-based waveguides, and quantum-dot-like single state
traps. Such traps could be used to construct beam splitters or
to implement collisional quantum gates [7]. Here we propose
a ring trap for cold neutral atoms constructed from a conducting
disk placed above the atom mirror surface, which
produces a trap with a deep ring potential around the edge of
the disk.

Let us first examine the trapping potential from a charged
conducting disk above a hard drive atom mirror. The hard
drive's sinusoidal pattern of magnetization results in a repulsive
potential (for atoms in weak-field seeking states) in
the form of a decaying exponential [8]

$$U_{\rm mag} = m_F g_F \mu_B B_0 \exp[-2\pi z/a]. \qquad (1)$$

The amplitude depends on the remnant magnetization of the
mirror, $B_0$, as well as the magnetic sublevel $m_F$ and Landé
$g_F$-factor of the atomic ground state. The decay length is
proportional to the periodicity $a$ of the magnetization pattern.
A small externally applied magnetic field perpendicular to
the magnetization of the hard disk eliminates zones of zero
magnetic field which would allow Majorana spin-flip losses.
The atom's low velocity allows the spin to follow the magnetic
field adiabatically, and thus the trapping potential depends
only on the field magnitude.
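
For a sense of scale, the sketch below evaluates a mirror potential of the form $U_{\rm mag} = m_F g_F \mu_B B_0 \exp(-2\pi z/a)$ in frequency units. The surface field of 2 kG, the 3 μm periodicity, and the two atomic states are the values quoted in the worked example later in the text. The Landé factors $g_F = -1/4$ for the Cs $F = 3$ level and $g_F = +1/2$ for the $^{87}$Rb $F = 2$ level are standard ground-state values that the text does not state explicitly, and the function name is hypothetical.

```python
import math

MU_B_OVER_H = 1.399624  # Bohr magneton divided by h, in MHz per gauss

def mirror_potential_mhz(z_um, m_f, g_f, b0_gauss=2000.0, period_um=3.0):
    """Repulsive mirror potential U_mag / h in MHz at height z above the surface."""
    amplitude = m_f * g_f * MU_B_OVER_H * b0_gauss
    return amplitude * math.exp(-2.0 * math.pi * z_um / period_um)

# Assumed states: Cs |F=3, m_F=-3> (g_F = -1/4) and 87Rb |F=2, m_F=2> (g_F = +1/2).
for label, m_f, g_f in [("133Cs", -3, -0.25), ("87Rb", 2, 0.5)]:
    for z_um in (0.0, 1.0, 2.0):
        print(label, z_um, round(mirror_potential_mhz(z_um, m_f, g_f), 1))
```

Both states have a positive product $m_F g_F$, so the potential is repulsive, and it falls by a factor $e^{-2\pi}$ over one period of the magnetization pattern.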
In order to create a trap, the repulsive force from the
mirror is balanced by an attractive force due to the dc Stark
effect. The atomic potential due to an electric field is

$$U_{\rm Stark} = -\tfrac{1}{2}\alpha|E|^2, \qquad (2)$$

where we assume that we are working with atoms such as
cesium or rubidium which possess only a scalar polarizability
in the ground state. A charged conducting disk creates
high electric fields near its edge, resulting in a strong short-range
attractive potential.

The mirror is made out of an etched hard drive whose
aluminum substrate is grounded. The boundary conditions
consist of a ground at the mirror surface, and a constant
potential on the surface of the thin conducting disk which is
placed a distance $d$, typically on the order of a micron, above
the mirror. The electric fields are calculated from the solution
to the Poisson equation with these boundary conditions. The
combined atomic potential due to the charged disk and mirror
creates a trap above the conducting disk, which is deepest
near the edge of the disk.

As an example, consider a conducting disk of radius
10 μm, placed $d = 0.6$ μm above a hard drive atom mirror.
Let the hard drive have a field at its surface of 2 kG (a
typical number for a commercial hard drive), and a periodicity
of 3 μm in the magnetization. The trapping potential
for cesium in the $F = 3$, $m_F = -3$ state near the edge of the disk
has a depth of 16.1 MHz (770 μK) when the potential on the
conducting disk is 14.2 V. For $^{87}$Rb in the $F = 2$, $m_F = 2$ state,
the trap has a depth of 21.8 MHz (1.05 mK) when 18.5 V is
applied to the disk. These two atomic states will be used in
all examples for the remainder of the paper. See Fig. 1 for
the $^{133}$Cs potential. The $^{87}$Rb potential looks qualitatively the
same, with a slightly deeper minimum. See Table I for trap
parameters for a range of geometries for $^{133}$Cs and $^{87}$Rb,
respectively. For $^{40}$K, the optimal applied voltage is 4%
larger than that for the $^{87}$Rb trap, and trap frequencies scale
up by a factor of $(m_{\rm Rb}/m_{\rm K})^{1/2} = 1.48$ relative to the $^{87}$Rb case.
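
The unit conversions and the mass scaling quoted above can be reproduced with a few lines of Python. This is only an arithmetic check: the conversion uses $T = hf/k_B$, and the atomic masses (in atomic mass units) are standard values that are not given in the text.

```python
import math

H = 6.62607015e-34   # Planck constant, J s
KB = 1.380649e-23    # Boltzmann constant, J/K

def mhz_to_microkelvin(depth_mhz):
    """Convert a trap depth from MHz (energy / h) to microkelvin (energy / k_B)."""
    return H * depth_mhz * 1e6 / KB * 1e6

# Assumed atomic masses in atomic mass units.
MASS_RB87 = 86.909
MASS_K40 = 39.964

cs_depth_uk = mhz_to_microkelvin(16.1)
rb_depth_uk = mhz_to_microkelvin(21.8)
frequency_scaling = math.sqrt(MASS_RB87 / MASS_K40)
print(round(cs_depth_uk), round(rb_depth_uk), round(frequency_scaling, 2))
```

The results agree with the rounded values in the text: roughly 770 μK for the cesium trap, about 1.05 mK for the rubidium trap, and a factor of about 1.48 for the potassium trap frequencies.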

[Figure 1 axes: r (μm); z (μm)]

FIG. 1. The atomic potential for Cs with 14.2 V on the disk. (a)
A cross section of the atomic potential in the plane containing the
axis of the disk. The contour lines are spaced 4 MHz apart. The
distance r along a diameter of the disk and the distance z above the
disk are plotted on the horizontal and vertical axes, respectively. (b)
The potential along slice no. 1 in (a). (c) The potential along slice
no. 2 in (a).

The curvature of the trap is large enough that the atom is
confined in the Lamb-Dicke regime. The Lamb-Dicke regime
is defined as the regime in which $\eta = (E_{\rm recoil}/E_{\rm trap})^{1/2} < 1$.
For the parameters of Fig. 1, the effective
harmonic frequencies for [$^{133}$Cs, $^{87}$Rb, $^{40}$K] in the radial
direction are [29.7, 42.6, 62.8] kHz, and
[40.6, 56.8, 83.8] kHz in the direction perpendicular to the
substrate. We obtain a Lamb-Dicke parameter of $\eta \approx 0.26$ for
$^{133}$Cs, $\eta \approx 0.30$ for $^{87}$Rb, and $\eta \approx 0.37$ for $^{40}$K. Significantly
higher trap frequencies are possible with the use of custom
magnetic materials, which can have remnant magnetic fields
of up to 2.4 T [9]. For the same trap geometry as Fig. 1, but
using this custom magnetic material with a correspondingly
higher applied voltage, the harmonic frequencies for $^{133}$Cs,
for instance, are 103 kHz in the radial direction and 137 kHz
in the perpendicular direction. The higher remnant magnetic
field also allows the disk to be placed further from the hard


TABLE I. $^{133}$Cs and $^{87}$Rb trap parameters for several disk radii
r and disk-hard drive separations d.

                                   Trap depth   Trap frequencies
Atom    d (μm)   r (μm)   V (V)    (MHz)        ω_r/2π (kHz)   ω_⊥/2π (kHz)

133Cs   0.6      5        13.3     17.0         24.4           44.1
        0.6      10       14.2     16.1         29.7           40.6
        0.6      20       14.8     15.4         30.6           37.0
        1.0      5        9.4      8.5          18.0           31.1
        1.0      10       10.2     8.2          21.2           28.0
        1.0      20       10.8     8.1          22.1           26.3

87Rb    0.6      5        17.3     22.9         36.0           63.4
        0.6      10       18.5     21.8         42.6           56.8
        0.6      20       19.2     20.6         43.7           52.7
        1.0      5        12.2     11.4         25.7           44.0
        1.0      10       13.3     11.2         30.8           40.4
        1.0      20       14.0     10.7         31.5           37.5

[Figure 2 labels: r; 0.35 μm; 0.6 μm; ~0.2 μm; Insulator; Hard Drive; 1 μm; 2 μm]

FIG. 2. (Color online) (a) Schematic of the magnetoelectrostatic
ring trap drawn to scale. The disk is 20 μm in diameter, with a
1 μm wide lead connected via a central stem. The dotted lines show
the hard drive atom mirror's 2:1 etch pattern with a 3 μm periodicity.
(b) Cross section of the disk (with the vertical direction scaled
up by a factor of 5), showing, from top to bottom, the disk, stem,
lead, insulating layer, and etched hard drive.

drive  while  maintaining  significant  trap  depth. 
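
The quoted Lamb-Dicke parameters can be reproduced under two assumptions that the text does not spell out: that $E_{\rm trap}$ is taken as $h$ times the radial trap frequency, and that the recoil energy is that of a photon on the D2 line of each species. The wavelengths and masses below are standard values, and the function names are hypothetical.

```python
import math

H = 6.62607015e-34        # Planck constant, J s
AMU = 1.66053907e-27      # atomic mass unit, kg

def recoil_frequency_hz(mass_amu, wavelength_nm):
    """Single-photon recoil energy divided by h: h / (2 m lambda^2)."""
    wavelength_m = wavelength_nm * 1e-9
    return H / (2.0 * mass_amu * AMU * wavelength_m**2)

def lamb_dicke_parameter(mass_amu, wavelength_nm, trap_frequency_khz):
    """eta = sqrt(E_recoil / E_trap), with E_trap = h * trap frequency (assumed)."""
    return math.sqrt(recoil_frequency_hz(mass_amu, wavelength_nm)
                     / (trap_frequency_khz * 1e3))

# (mass in amu, D2 wavelength in nm, radial trap frequency in kHz from the text)
species = {
    "133Cs": (132.905, 852.35, 29.7),
    "87Rb": (86.909, 780.24, 42.6),
    "40K": (39.964, 766.70, 62.8),
}
eta = {name: lamb_dicke_parameter(*params) for name, params in species.items()}
print({name: round(value, 2) for name, value in eta.items()})
```

Under these assumptions the sketch gives values that round to 0.26, 0.30, and 0.37 for the three species, matching the numbers quoted for the radial direction.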

A thin lead running along the hard drive surface may be
used to connect the disk to a voltage source. The maximum
possible voltage on the disk is limited by the breakdown
electric field of the dielectric material separating the lead
from the conducting hard drive surface. In order to minimize
the perturbation that the lead produces on the atomic potential
from the disk, the lead should be as narrow as is practical
(~1 μm) and placed much closer to the hard drive surface
than to the disk. At this location, the repulsive force from the
mirror is much stronger and no trap forms due to the charge
on the lead. In order to connect the lead to the disk, the disk
is placed on a thin stem, with the lead connected to the
bottom of the stem (see Fig. 2).

Three-dimensional solutions to the Poisson equation indicate
that the effect from the lead on the trapping potential is
minimized if the stem connecting the lead to the disk is
located at the center of the disk. For the previously used trap
parameters and the lead placed 0.25 μm above the hard drive
surface (0.35 μm below the surface of the disk), the trap
minimum for a $^{133}$Cs atom rises to ~11.5 MHz above the
lead, which is a ~30% loss of trapping potential compared
to the unperturbed trap. The width of the perturbation is a
few μm, slightly wider than the lead. A shallower trap in
which the electric pad is placed further from the mirror surface
is perturbed less by the lead. Use of custom magnetic
materials would allow deeper traps to be constructed further
from the lead, thereby minimizing the height of the perturbation.

We intend to fabricate the device as follows. The hard
drive atom mirror is etched in the manner described in Ref.
[3], maintaining the 2:1 ratio of magnetization stripe spacing
to minimize higher harmonics. The stripe periodicity will be
~3 μm. A deposition of an ~200 nm thick insulating layer
of silicon dioxide or silicon nitride is necessary to prevent
shorting between the electric pads and the hard drive surface
(see Fig. 2). This layer is thick enough both to support the
voltage difference between the pads and underlying surface
and to help planarize the 100 nm deep corrugations of the
etched hard drive. The ~50 nm tall, ~1 μm wide gold leads
are patterned on the insulator surface using standard photolithography
and thermal evaporation of the adhesion metal
and gold layers [10,11]. To create the stems, the surface is
spin-coated with photoresist to a predetermined thickness to
achieve optimal disk to atom mirror spacing. Photolithography
is again used to create vertical, cylindrical holes of
1 μm diameter in the photoresist located at the terminals of
the gold leads. The gold stems are electroplated from the
gold leads through the cylindrical guide holes to the top of
the photoresist. A third photolithographic process and thermal
evaporation pattern the 20 μm diameter gold disks attached
to the tops of the stems. Finally, the photoresist is
removed using standard techniques, leaving behind the
mushroomlike structures. Field simulations show that, to the
percent level, leaving the photoresist under the disk does not
disturb the electric field, and perturbations to the trap due to
disk edge roughness or due to the hard drive trench corrugations
are both negligible.

The trap is conservative once the voltage is established,
and the kinetic energy of the atoms must be lowered for them
to stay in the trap. A simple, but inefficient, method of loading
this trap is to drop a cloud of cold atoms from a magneto-optical
trap (MOT), sub-Doppler cooled to 10 μK, onto
the device. The atoms are captured by turning on the voltage
on the electric pads as the atoms are passing through their
classical turning point above the atom mirror. Simulations
indicate that this scheme can capture 1 to 2% of the dropped
atoms. The fraction is small because the voltage ramp must
be quite fast (~2 × 10^-4 seconds) in order to remove enough
energy from the atoms to trap them, while the atom cloud
takes roughly 2 × 10^-2 seconds to pass through the trapping
volume. This scheme has many different parameters over
which loading can be optimized, including the initial position,
size and density of the MOT before it is dropped, and
the shape and speed of the voltage ramp. Ramping up the
voltage on the conducting disk is the simplest scheme for
trapping the atoms, but it is possible that another procedure,
involving atomic transitions or other degrees of freedom in
the system, could be more effective; this is currently being
investigated.

Given this loading efficiency, a 10 micron-radius ring trap
will capture roughly 30-50 atoms from a dropped cloud of
10^7 atoms and temperature 10 μK. In order to capture more
atoms, disks can be arranged in an array covering a larger
surface area. The volume of the trap deeper than 200 μK is
1 to 2 × 10^-9 cm^3. Simulations indicate that these traps can
be placed roughly 20 μm apart without significantly disturbing
each other. Therefore, roughly 20% of the surface can be
covered with the traps. Combining the loading efficiency
with this surface coverage, roughly a few 10^3 atoms can be
trapped. The leads can be routed through spaces between the
disks with either a separate lead for each disk or a shared
network of leads.

Several undesired effects, such as heating, fragmentation
of Bose-Einstein condensates, and a reduction of trap lifetimes,
have been detected in microtrap experiments involving
atoms near room-temperature surfaces. The trap proposed
here is insusceptible to heating due to technical noise
on the currents in the microwires and to the fragmentation
problems caused by the spatial variation of these currents
[12,13]. However, the trap remains susceptible to atom loss
due to spin flips induced by magnetic field fluctuations from
thermal currents in the metal forming the electric pads, as
detected in several experiments [14-16]. Surface effects in
this system will most closely resemble those in Lin et al.,
wherein the skin depth for the transition frequency between
trapped and untrapped magnetic sublevels of the atoms is
much larger than both the distance of the atoms from the
metal surface and the thickness of the metal conductor. As
reported in Lin et al., at a distance of 2 μm this Johnson
noise limits the lifetime of $^{87}$Rb atoms above a 2 μm thick
copper conductor to a few 100 ms, ample time for detecting
atoms in the ring trap. The metal film used for the electric
disk pad in the ring trap will be ten to a hundred times
thinner than that used for the above experiment, and we expect
this to further minimize the trap's loss rate [16,17].

There are several possibilities for minimizing or eliminating
the perturbation due to the lead, or tuning it to be of a
particular height, other than simply adjusting the trap geometry.
The most versatile possibility is to add an additional
photolithography step to insert another electric pad directly
above the lead, separated by a thin insulation layer. The voltage
applied to this separate pad can be used to compensate
for the effects of the lead. In particular, the voltage on a pad
the same width as the lead and placed 100 nm above it could
be tuned to completely eliminate the perturbation (to within
the percent-level accuracy of our calculations) or turn it into
a dip rather than a bump. Complete elimination of any perturbation
is possible by expanding such a pad to cover the
entire surface, with a hole to allow the stem to reach from the
lead to the disk. Another possibility is to charge the disk not
with a lead but with an intrachamber electron beam. Such a
system would be hard to charge and discharge quickly, requiring
a loading scheme that does not require a rapid
change to the charge distribution.

Several future improvements or extensions of this trapping
concept are possible. For example, the decoherence effects
due to the proximity of a conductor could be mitigated
with the use of a dielectric magnetic film in place of the hard
drive, and dielectric pads charged via an electron beam in
place of the conducting disks. In addition, disks, rings, wires,
and other shapes could be used to trap and manipulate the
atoms just above the surface, and voltages adjusted to shift
the atoms from one potential into another. Integration of
these traps with magnetic microtraps based on current-carrying
wires on the surface is also possible. Small single-atom
traps with additional electrostatic pads to control the
barrier heights could enable a system for quantum logic gates
[7].

In the past several years, there has been much experimental
and theoretical interest in trapped one-dimensional (1D)
quantum degenerate gases (see Refs. [18-22] and the citations
within). Trapped 1D gases require $k_BT, \mu < \hbar\omega_\perp$, where
$T$ is the temperature, $\mu$ is the chemical potential, and $\omega_\perp$ is
the transverse trapping frequency. Various regimes of quantum
degeneracy (of which a 1D gas of impenetrable bosons,
the Tonks-Girardeau (TG) regime, is of particular interest)
can be explored by changing the density of trapped atoms or
by modifying the interactions between atoms via Feshbach
resonances. In the latter case, a magnetic bias field for adjusting
the s-wave scattering length, $a$, can be added parallel
to the magnetization stripes of the atom mirror without affecting
the potential of the magnetoelectrostatic ring trap.
With respect to $^{87}$Rb, a common alkali used for BEC,
$k_BT/\hbar\omega_\perp$ is smaller than 0.05 for temperatures below
100 nK. The TG regime requires that the mean interparticle
separation, $1/n$, be much larger than the correlation length,
$l_c = (\hbar/2mn\omega_\perp a)^{1/2}$, where $m$ is the atom's mass and $n = N/L$ is the
number density [21]. This constraint limits the number of
$^{87}$Rb atoms in the ring trap to $N < 2m\omega_\perp aL/\hbar = 250$ atoms
for a device of circumference $L = 2\pi \times 20$ μm, $\omega_\perp = 2\pi \times 40$ kHz,
and an $a$ unmodified by Feshbach resonances (the
field at the trap minimum is ~12 G). Overcoming the challenge
of detecting so few atoms may be possible through the
incorporation of microwire traps [22].

The ring geometry adds a unique element to the many-body physics of the 1D trap. Josephson effects in trapped BECs have been investigated theoretically for the case of a double well (see Ref. [23] and citations within) and investigated experimentally in an optical standing wave [24]. A BEC in this magnetoelectrostatic ring trap system with interspersed Josephson junctions formed from the addition of micron-sized perturbations to the trapping potential (such as those caused by wire leads, possibly tuned using additional pads) is reminiscent of superconducting electronic systems. The ratio of the chemical potential to the perturbation barrier height can be adjusted with the trap parameters such as $d$, $r$, atom number, and disk potential, as well as the use of additional electric pads, to cause the perturbation to act as either an impenetrable wall, a tunnel junction, or a scattering center. The utility of this 1D ring trap is highlighted by recent proposals for using a BEC in a double ring to create a SQUID-like device for neutral atoms [25] and for investigating quantum chaos in the system of the quantum kicked rotor [26].

This  magnetoelectrostatic  trap  for  cold  neutral  atoms — 
derived  from  balancing  the  repulsive  force  of  an  atom  mirror 
with  the  attractive  force  from  a  charged  disk — introduces  a 
ring  trapping  geometry  for  cold  neutral  atoms.  Fabrication  of 
this  trap  is  straightforward,  and  an  array  of  such  traps  can 
trap  a  significant  number  of  atoms.  Furthermore,  such  a  trap 
may  allow  the  exploration  of  interesting  many-body  physics 
in  a  one-dimensional  ring  trap.  This  device  is  an  example  of 
the  rich  potential  for  developing  atom  optical  elements 
through  the  integration  of  a  hard  drive  atom  mirror,  charged 
pads, and microwires.
Several proposed techniques for distinguishing between optical coherent states are analyzed under a physically realistic model of photodetection. Quantum error probabilities are derived for the Kennedy receiver, the Dolinar receiver, and the unitary rotation scheme proposed by Sasaki and Hirota for subunity detector efficiency. Monte Carlo simulations are performed to assess the effects of detector dark counts, dead time, signal processing bandwidth, and phase noise in the communication channel. The feedback strategy employed by the Dolinar receiver is found to achieve the Helstrom bound for subunity detection efficiency and to provide robustness to these other detector imperfections, making it more attractive for laboratory implementation than previously believed.

DOI: 10.1103/PhysRevA.70.062303  PACS number(s): 03.67.Hk, 03.65.Wj


I.  INTRODUCTION 

Communication is subject to quantum mechanical indeterminism even when the transmitted information is entirely classical. This potentially counterintuitive property results from the fact that information must be conveyed through a physical medium (a communication channel) that is unavoidably governed by quantum mechanics. From this perspective, the sender encodes information by preparing the channel into a well-defined quantum state $\rho$ selected from a predetermined alphabet $\mathcal{A} = \{\rho_0, \ldots, \rho_M\}$ of codewords. The receiver, following any relevant signal propagation, performs a measurement on the channel to ascertain which state was transmitted by the sender.

A  quantum  mechanical  complication  arises  when  the 
states  in  A  are  not  orthogonal,  as  no  measurement  can  dis¬ 
tinguish  between  overlapping  quantum  states  without  some 
ambiguity  [1-4].  This  uncertainty  in  determining  the  channel 
state  translates  into  a  nonzero  probability  that  the  receiver 
will  misinterpret  the  transmitted  codeword  and  produce  a 
communication  error.  While  it  would  seem  obvious  that  the 
sender  should  simply  adopt  an  alphabet  of  orthogonal  states, 
it  is  rarely  practicable  to  communicate  under  such  ideal  con¬ 
ditions  [5,6].  Even  when  it  is  possible  for  the  sender  to  trans¬ 
mit  orthogonal  codewords,  inevitable  imperfections  in  the 
channel  including  decoherence  and  energy  dissipation 
quickly  damage  that  orthogonality.  In  some  cases,  the  classi¬ 
cal  information  capacity  of  a  noisy  channel  is  actually  maxi¬ 
mized  by  a  nonorthogonal  alphabet  [7]. 

When  developing  a  communication  system  to  operate  at 
the  highest  feasible  rate  given  fixed  channel  properties  and  a 
constrained  capability  for  state  preparation,  the  objective  is 
to  minimize  the  communication  error  by  designing  a  “good” 
receiver.  Distinguishing  between  nonorthogonal  states  is  a 
pervasive  problem  in  quantum  information  theory  [8,9]  ad¬ 
dressed  mathematically  by  optimizing  a  state-determining 
measurement  over  all  positive  operator  valued  measures 
(POVMs)  [3,10,11].  This  general  approach  can  be  applied  to 
communication; however, arbitrary POVMs are rarely
straightforward  to  implement  in  the  laboratory.  Therefore,  a 
“good”  receiver  must  balance  quantum  mechanical  optimal¬ 
ity  with  implementability  and  robust  performance  under  re¬ 
alistic  experimental  conditions. 

For example, the optical field produced by a laser provides a convenient quantum system for carrying information. Of course, optical coherent states are not orthogonal and cannot be distinguished perfectly by photodetection. While the overlap between different coherent states can be reduced by employing large amplitudes, power limitations often restrict $\mathcal{A}$ to the small-amplitude regime where quantum effects dominate. This is especially true in situations (such as optical fibers) where the communication medium behaves nonlinearly at high power, as well as for long distance communication where signals are substantially attenuated, including deep space transmission.

Motivated by these experimental considerations, optimizing a communication process based on small-amplitude optical coherent states and photodetection has been an active subject since the advent of the laser [6,12-14]. Kennedy initially proposed a receiver based on simple photon counting to distinguish between two different coherent states [12]. However, the Kennedy receiver error probability lies above the quantum mechanical minimum [3] (or Helstrom bound) and this prompted Dolinar to devise a measurement scheme capable of achieving the quantum limit [13]. Dolinar's receiver, while still based on photon counting, approximates an optimal POVM by adding a local feedback signal to the channel; but this procedure has often been deemed impractical [15] due to the need for real-time adjustment of the local signal following each photon arrival. As a result, Sasaki and Hirota later proposed an alternative receiver that applies an open-loop unitary transformation to the incoming coherent state signals to render them more distinguishable by simple photon counting [6,15,16].

However, recent experimental advances in real-time quantum-limited feedback control [17-19] suggest that the Dolinar receiver may be more experimentally practical than previously believed. The opinion that feedback should be avoided in designing an optical receiver is grounded in the now antiquated premise that real-time adaptive quantum measurements are technologically inaccessible. Most arguments in favor of passive devices have been based on idealized receiver models that assume, for example, perfect photon counting efficiency. A fair comparison between open- and closed-loop receivers should take detection error into account; feedback generally increases the robustness of the measurement device in exchange for the added complexity.

Here,  we  consider  the  relative  performance  of  the 
Kennedy,  Dolinar,  and  Sasaki-Hirota  receivers  under  realis¬ 
tic  experimental  conditions  that  include  (1)  subunity  quan¬ 
tum  efficiency,  where  it  is  possible  for  the  detector  to  mis¬ 
count  incoming  photons,  (2)  nonzero  dark  counts,  where  the 
detector  can  register  photons  even  in  the  absence  of  a  signal, 
(3)  nonzero  dead  time,  or  finite  detector  recovery  time  after 
registering  a  photon  arrival,  (4)  finite  bandwidth  of  any  sig¬ 
nal  processing  necessary  to  implement  the  detector,  and  (5) 
fluctuations  in  the  phase  of  the  incoming  optical  signal. 

II.  BINARY  COHERENT  STATE  COMMUNICATION 

An optical binary communication protocol can be implemented via the alphabet consisting of two pure coherent states $\rho_0 = |\Psi_0\rangle\langle\Psi_0|$ and $\rho_1 = |\Psi_1\rangle\langle\Psi_1|$. Without loss of generality, we will assume that logical 0 is represented by the vacuum,

$$\Psi_0(t) = 0, \tag{1}$$

and that logical 1 is represented by

$$\Psi_1(t) = \psi_1(t)\exp[-i(\omega t + \varphi)] + \mathrm{c.c.}, \tag{2}$$

where $\omega$ is the frequency of the optical carrier and $\varphi$ is (ideally) a fixed phase. The envelope function $\psi_1(t)$ is normalized such that

$$\int_0^T |\psi_1(t)|^2\,dt = N, \tag{3}$$

where $N$ is the mean number of photons to arrive at the receiver during the measurement interval $0 \le t \le T$. That is, $\hbar\omega|\psi_1(t)|^2$ is the instantaneous average power of the optical signal for logical 1.

This alphabet $\mathcal{A} = \{\rho_0, \rho_1\}$ is applicable to both amplitude and phase-shift keyed communication protocols as it is always possible to transform between the two by combining the incoming signal with an appropriate local oscillator. That is, amplitude keying with $\mathcal{A} = \{|0\rangle, |\alpha\rangle\}$ (for some coherent state $|\alpha\rangle$ with amplitude $\alpha$) is equivalent to the phase-shift keyed alphabet $\{|{-\tfrac{1}{2}\alpha}\rangle, |\tfrac{1}{2}\alpha\rangle\}$ via a displacement, $D[-\tfrac{1}{2}\alpha] = \exp[-\tfrac{1}{2}(\alpha\hat{a}^\dagger - \alpha^*\hat{a})]$, where $\hat{a}^\dagger$ and $\hat{a}$ are the creation and annihilation operators for the channel mode. Similarly, if $|\Psi_0\rangle \ne |0\rangle$, a simple displacement can be used to restore $|\Psi_0\rangle$ to the vacuum state.
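The equivalence between the two keying schemes can be checked numerically, since a common displacement leaves the overlap between the two codewords unchanged. The following is a minimal sketch, assuming real amplitudes and a truncated Fock-basis sum; the function name `coherent_overlap`, the cutoff `n_max`, and the amplitude value are illustrative choices, not part of the original analysis.

```python
import math

def coherent_overlap(alpha: float, beta: float, n_max: int = 60) -> float:
    """<alpha|beta> for real amplitudes, summed term by term in the Fock basis."""
    prefactor = math.exp(-(alpha ** 2 + beta ** 2) / 2.0)
    series = sum((alpha * beta) ** n / math.factorial(n) for n in range(n_max))
    return prefactor * series

alpha = 1.3  # hypothetical amplitude chosen for illustration

# Amplitude keying {|0>, |alpha>} versus phase-shift keying {|-alpha/2>, |alpha/2>}
overlap_ook = coherent_overlap(0.0, alpha) ** 2
overlap_psk = coherent_overlap(-alpha / 2.0, alpha / 2.0) ** 2

print(overlap_ook, overlap_psk, math.exp(-alpha ** 2))
```

Both squared overlaps should agree with $e^{-|\alpha|^2}$, which is why any error bound that depends only on the overlap applies equally to the two alphabets.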

A. The quantum error probability

The coherent states $\rho_0$ and $\rho_1$ are not orthogonal, so it is impossible for a receiver to identify the transmitted state without sometimes making a mistake. That is, the receiver attempts to ascertain which state was transmitted by performing a quantum measurement, $\Upsilon$, on the channel. $\Upsilon$ is described by an appropriate POVM represented by a complete set of positive operators [20],

$$\sum_n \Upsilon_n = \hat{1}, \qquad \Upsilon_n \ge 0, \tag{4}$$

where $n$ indexes the possible measurement outcomes. For binary communication, it is always possible (and optimal) for the receiver to implement the measurement as a decision between two hypotheses: ($H_0$), that the transmitted state is $\rho_0$, selected when the measurement outcome corresponds to $\Upsilon_0$, and ($H_1$), that the transmitted state is $\rho_1$, selected when the measurement outcome corresponds to $\Upsilon_1$.

Given the positive operators, $\Upsilon_0 + \Upsilon_1 = \hat{1}$, there is some chance that the receiver will select the null hypothesis $H_0$ when $\rho_1$ is actually present,

$$p(H_0|\rho_1) = \mathrm{tr}[\Upsilon_0\rho_1] = \mathrm{tr}[(\hat{1} - \Upsilon_1)\rho_1], \tag{5}$$

and it will sometimes select $H_1$ when $\rho_0$ is present,

$$p(H_1|\rho_0) = \mathrm{tr}[\Upsilon_1\rho_0]. \tag{6}$$

The total receiver error probability depends upon the choice of $\Upsilon_0$ and $\Upsilon_1$ and is given by

$$p[\Upsilon_0, \Upsilon_1] = \xi_0\,p(H_1|\rho_0) + \xi_1\,p(H_0|\rho_1). \tag{7}$$

Here, $\xi_0 = p_0(\rho_0)$ and $\xi_1 = p_0(\rho_1)$ are the probabilities that the sender will transmit $\rho_0$ and $\rho_1$ respectively; they reflect the prior knowledge that enters into the hypothesis testing process implemented by the receiver, and in many cases $\xi_0 = \xi_1 = 1/2$.

Minimizing the receiver error over POVMs (over $\Upsilon_0$ and $\Upsilon_1$) leads to a quantity known as the quantum error probability,

$$P_H \equiv \min_{\Upsilon_0, \Upsilon_1} p[\Upsilon_0, \Upsilon_1], \tag{8}$$

also referred to as the Helstrom bound. $P_H$ is the smallest physically allowable error probability, given the overlap between $\rho_0$ and $\rho_1$.
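To make the minimization in Eq. (8) concrete, the sketch below evaluates Eq. (7) for a family of projective measurements on two pure states and searches for the smallest error by brute force. It assumes the two states are embedded in a real two-dimensional space, $|\Psi_0\rangle = (1, 0)$ and $|\Psi_1\rangle = (c, \sqrt{1 - c^2})$ with real overlap $c$, and takes $\Upsilon_1 = |m\rangle\langle m|$ with $|m\rangle = (\cos\phi, \sin\phi)$; the function names, the grid search, and the parameter values are illustrative assumptions. The result is compared with the pure-state closed form derived in the following subsection.

```python
import math

def error_probability(phi: float, c: float, xi0: float, xi1: float) -> float:
    """Eq. (7) for Upsilon_1 = |m><m| with |m> = (cos phi, sin phi)."""
    s = math.sqrt(1.0 - c * c)
    p_h1_given_rho0 = math.cos(phi) ** 2                                  # tr[Upsilon_1 rho_0]
    p_h0_given_rho1 = 1.0 - (c * math.cos(phi) + s * math.sin(phi)) ** 2  # tr[Upsilon_0 rho_1]
    return xi0 * p_h1_given_rho0 + xi1 * p_h0_given_rho1

def minimize_over_measurements(c: float, xi0: float, xi1: float, grid: int = 20000) -> float:
    """Brute-force search over the measurement angle phi in [0, pi)."""
    return min(error_probability(math.pi * k / grid, c, xi0, xi1) for k in range(grid))

c, xi0, xi1 = 0.6, 0.5, 0.5   # hypothetical overlap and priors
p_grid = minimize_over_measurements(c, xi0, xi1)
p_closed = 0.5 * (1.0 - math.sqrt(1.0 - 4.0 * xi0 * xi1 * c * c))
print(p_grid, p_closed)
```

The grid search is deliberately naive: it is meant to show that no choice of measurement angle beats the closed-form value, not to be an efficient optimizer.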

The  Helstrom  bound 

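Helstrom demonstrated that minimizing the receiver error probability

$$p[\Upsilon_0, \Upsilon_1] = \xi_0\,\mathrm{tr}[\Upsilon_1\rho_0] + \xi_1\,\mathrm{tr}[(\hat{1} - \Upsilon_1)\rho_1] \tag{9}$$

$$= \xi_1 + \mathrm{tr}[\Upsilon_1(\xi_0\rho_0 - \xi_1\rho_1)] \tag{10}$$

is accomplished by optimizing

$$\min_{\Upsilon_1} \mathrm{tr}[\Upsilon_1\Gamma], \qquad \Gamma = \xi_0\rho_0 - \xi_1\rho_1, \tag{11}$$

over $\Upsilon_1$ subject to $0 \le \Upsilon_1 \le \hat{1}$ [3]. Given the spectral decomposition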
$$\Gamma = \sum_n \lambda_n |n\rangle\langle n|, \tag{12}$$

where the $\lambda_n$ are the eigenvalues of $\Gamma$, the resulting Helstrom bound can be expressed as [21]

$$P_H = \xi_1 + \sum_{\lambda_n < 0} \lambda_n. \tag{13}$$

For pure states, where $\rho_0 = |\Psi_0\rangle\langle\Psi_0|$ and $\rho_1 = |\Psi_1\rangle\langle\Psi_1|$, $\Gamma$ has two eigenvalues of which only one is negative,

$$\lambda_- = \tfrac{1}{2}\left(1 - \sqrt{1 - 4\xi_0\xi_1|\langle\Psi_0|\Psi_1\rangle|^2}\right) - \xi_1 < 0, \tag{14}$$

and the quantum error probability is therefore [3]

$$P_H = \tfrac{1}{2}\left(1 - \sqrt{1 - 4\xi_0\xi_1|\langle\Psi_1|\Psi_0\rangle|^2}\right). \tag{15}$$

The Helstrom bound is readily evaluated for coherent states by employing the relation [22]

$$|\alpha\rangle = e^{-|\alpha|^2/2}\sum_{n=0}^{\infty}\frac{\alpha^n}{\sqrt{n!}}|n\rangle, \tag{16}$$

to compute the overlap between $|\Psi_1\rangle$ and $|\Psi_0\rangle$ [3,22],

$$c_0 \equiv \langle\Psi_1|\Psi_0\rangle = e^{-N/2}. \tag{17}$$

It is further possible to evaluate the Helstrom bound for imperfect detection. Coherent states have the convenient property that subunity quantum efficiency is equivalent to an ideal detector masked by a beam splitter with transmission coefficient $\eta \le 1$, to give

$$P_H(\eta) = \tfrac{1}{2}\left(1 - \sqrt{1 - 4\xi_0\xi_1 c_0^{2\eta}}\right). \tag{18}$$

This result and Eq. (15) indicate that there is a finite quantum error probability for all choices of $|\Psi_1\rangle$, even when an optimal measurement is performed.

B. The Kennedy receiver

Kennedy proposed a near-optimal receiver that simply counts the number of photon arrivals registered by the detector between $t = 0$ and $T$. It decides in favor of $H_0$ when the number of clicks is zero, otherwise $H_1$ is chosen. This hypothesis testing procedure corresponds to the measurement operators

$$\Upsilon_0 = |0\rangle\langle 0|, \tag{19}$$

$$\Upsilon_1 = \sum_{n=1}^{\infty}|n\rangle\langle n|, \tag{20}$$

where $|n\rangle$ are the eigenvectors of the number operator $\hat{N} = \hat{a}^\dagger\hat{a}$.

The Kennedy receiver has the property that it always correctly selects $H_0$ when the channel is in $\rho_0$, since the photon counter will never register photons when the vacuum state is present (ignoring background light and detector dark counts for now). Therefore, $p(H_1|\rho_0) = 0$; however,

$$p(H_0|\rho_1) = \mathrm{tr}[\Upsilon_0\rho_1] = |\langle 0|\Psi_1\rangle|^2 \tag{21}$$

is nonzero due to the finite overlap of all coherent states with the vacuum. The Poisson statistics of coherent state photon numbers allows for the possibility that zero photons will be recorded even when $\rho_1$ is present.

Furthermore, an imperfect detector can misdiagnose $\rho_1$ if it fails to generate clicks for photons that do arrive at the detector. The probability for successfully choosing $H_1$ when $\rho_1$ is present is given by

$$P_\eta(H_1|\rho_1) = \sum_{n=1}^{\infty}\sum_{k=1}^{\infty} p(n,k)\,|\langle n|\alpha\rangle|^2, \tag{22}$$

where the Bernoulli distribution

$$p(n,k) = \frac{n!}{k!\,(n-k)!}\,\eta^k(1-\eta)^{n-k} \tag{23}$$

gives the probability that a detector with quantum efficiency $\eta$ will register $k$ clicks when the actual number of photons is $n$. The resulting Kennedy receiver error

$$P_K(\eta) = \xi_1\left[1 - P_\eta(H_1|\rho_1)\right] = \xi_1 c_0^{2\eta} \tag{24}$$

asymptotically approaches the Helstrom bound for large signal amplitudes, but is larger for small photon numbers.

C. The Sasaki-Hirota receiver

Sasaki and Hirota proposed that it would be possible to achieve the Helstrom bound using simple photon counting by applying a unitary transformation to the incoming signal states prior to detection [6,15,16]. They considered rotations

$$U[\theta] = \exp\left[\theta\left(|\Psi_0'\rangle\langle\Psi_1'| - |\Psi_1'\rangle\langle\Psi_0'|\right)\right], \tag{25}$$

generated by the transformed alphabet $\mathcal{A}'$,

$$|\Psi_0'\rangle = |\Psi_0\rangle, \qquad |\Psi_1'\rangle = \frac{|\Psi_1\rangle - c_0|\Psi_0\rangle}{\sqrt{1 - c_0^2}}, \tag{26}$$

obtained from Gram-Schmidt orthogonalization of $\mathcal{A}$. The rotation angle $\theta \in \mathbb{R}$ is a parameter that must be optimized in order to achieve the Helstrom bound.

Application of $U[\theta]$ on the incoming signal states (which belong to the original alphabet, $\mathcal{A}$) leads to the transformed states

$$U[\theta]|\Psi_0\rangle = \left(\cos\theta + \frac{c_0\sin\theta}{\sqrt{1 - c_0^2}}\right)|\Psi_0\rangle - \frac{\sin\theta}{\sqrt{1 - c_0^2}}|\Psi_1\rangle \tag{27}$$

and

$$U[\theta]|\Psi_1\rangle = \frac{\sin\theta}{\sqrt{1 - c_0^2}}|\Psi_0\rangle + \frac{\cos\theta\sqrt{1 - c_0^2} - c_0\sin\theta}{\sqrt{1 - c_0^2}}|\Psi_1\rangle. \tag{28}$$
Since $|\Psi_0'\rangle$ is the vacuum state, hypothesis testing can still be performed by simple photon counting. However, unlike the Kennedy receiver, it is possible to misdiagnose $\rho_0$ since $U[\theta]|\Psi_0\rangle$ contains a nonzero contribution from $|\Psi_1\rangle$. The probability for a false-positive detection by a photon counter with efficiency $\eta$ is given by

$$P_\eta^\theta(H_1|\rho_0) = \sum_{n=1}^{\infty}\sum_{k=1}^{\infty} p(n,k)\,|\langle n|U[\theta]|\Psi_0\rangle|^2 \tag{29}$$

$$= \frac{c_0^{2\eta} - 1}{c_0^2 - 1}\sin^2\theta, \tag{30}$$

which is evaluated by recognizing that

$$\langle n|U[\theta]|\Psi_0\rangle = \left(\cos\theta + \frac{c_0\sin\theta}{\sqrt{1 - c_0^2}}\right)\delta_{n,0} - \frac{c_0\,\alpha^n\sin\theta}{\sqrt{n!\,(1 - c_0^2)}}, \tag{31}$$

where $\alpha$ is the (complex) amplitude of $|\Psi_1\rangle$. The probability for correct detection can be similarly obtained to give

$$P_\eta^\theta(H_1|\rho_1) = \sum_{n=1}^{\infty}\sum_{k=1}^{\infty} p(n,k)\,|\langle n|U[\theta]|\Psi_1\rangle|^2 \tag{32}$$

$$= \frac{c_0^{2\eta} - 1}{c_0^2 - 1}\left[c_0\sin\theta - \sqrt{1 - c_0^2}\cos\theta\right]^2 \tag{33}$$

by employing the relationship

$$\langle n|U[\theta]|\Psi_1\rangle = \left[c_0\cos\theta - \frac{c_0^2\sin\theta}{\sqrt{1 - c_0^2}}\right]\frac{\alpha^n}{\sqrt{n!}} + \frac{\sin\theta}{\sqrt{1 - c_0^2}}\,\delta_{n,0}. \tag{34}$$

The total Sasaki-Hirota receiver error is given by the weighted sum

$$P_{SH}(\eta, \theta) = \xi_0 P_\eta^\theta(H_1|\rho_0) + \xi_1\left[1 - P_\eta^\theta(H_1|\rho_1)\right] \tag{35}$$

and can be minimized over $\theta \in \mathbb{R}$ to give

$$\theta = -\tan^{-1}\sqrt{\frac{\sqrt{1 - 4\xi_0\xi_1 c_0^2} - 1 + 2\xi_1 c_0^2}{\sqrt{1 - 4\xi_0\xi_1 c_0^2} + 1 - 2\xi_1 c_0^2}}. \tag{36}$$

For perfect detection efficiency, $\eta = 1$, Eq. (35) is equivalent to the Helstrom bound; however, for $\eta < 1$, it is larger.

D. The Dolinar receiver

The Dolinar receiver takes a different approach to achieving the Helstrom bound with a photon counting detector; it utilizes an adaptive strategy to implement a feedback approximation to the Helstrom POVM [13,23]. Dolinar's receiver operates by combining the incoming signal $\Psi(t)$ with a separate local signal,

$$U(t) = u(t)\exp[-i(\omega t + \varphi)] + \mathrm{c.c.}, \tag{37}$$

such that the detector counts photons with total instantaneous mean rate

$$\Phi(t) = |\psi(t) + u(t)|^2. \tag{38}$$

Here, $\psi(t) = 0$ when the channel is in the state $\rho_0$, and $\psi(t) = \psi_1(t)$ when the channel is in $\rho_1$ [refer to Eqs. (1) and (2)].

The receiver decides between hypotheses $H_0$ and $H_1$ by selecting the one that is more consistent with the record of photon arrival times observed by the detector given the choice of $u(t)$. $H_1$ is selected when the ratio of conditional arrival time probabilities,

$$\Lambda = \frac{p_\eta[\rho_1|t_1, \ldots, t_n, u(t)]}{p_\eta[\rho_0|t_1, \ldots, t_n, u(t)]}, \tag{39}$$

is greater than 1; otherwise it is assumed that $\rho_0$ was transmitted. The conditional probabilities $p_\eta[\rho_i|t_1, \ldots, t_n, u(t)]$ reflect the likelihood that $n$ photon arrivals occur precisely at the times $\{t_1, \ldots, t_n\}$, given that the channel is in the state $\rho_i$, the feedback amplitude is $u(t)$, and the detector quantum efficiency is $\eta$.

We see that this decision criterion based on $\Lambda$ is immediately related to the error probabilities

$$\Lambda = \frac{p_\eta[H_1|\rho_1, u(t)]}{p_\eta[H_1|\rho_0, u(t)]} = \frac{1 - p_\eta[H_0|\rho_1, u(t)]}{p_\eta[H_1|\rho_0, u(t)]} \tag{40}$$

when $\Lambda > 1$ (i.e., the receiver definitely selects $H_1$), and

$$\Lambda = \frac{p_\eta[H_0|\rho_1, u(t)]}{p_\eta[H_0|\rho_0, u(t)]} = \frac{p_\eta[H_0|\rho_1, u(t)]}{1 - p_\eta[H_1|\rho_0, u(t)]} \tag{41}$$

when $\Lambda < 1$ (i.e., the receiver definitely selects $H_0$). Similarly, the likelihood ratio $\Lambda$ can be reexpressed in terms of the photon counting distributions frequently encountered in quantum optics by employing Bayes' rule,

$$\Lambda = \frac{p_\eta[t_1, \ldots, t_n|\rho_1, u(t)]\,p_0(\rho_1)}{p_\eta[t_1, \ldots, t_n|\rho_0, u(t)]\,p_0(\rho_0)} \tag{42}$$

$$= \frac{\xi_1\,p_\eta[t_1, \ldots, t_n|\rho_1, u(t)]}{\xi_0\,p_\eta[t_1, \ldots, t_n|\rho_0, u(t)]}, \tag{43}$$

where the $p_\eta[t_1, \ldots, t_n|\rho_i, u(t)]$ are the exclusive counting densities

$$p_\eta[t_1, \ldots, t_n|\rho_i, u(t)] = \prod_{k=1}^{n+1} w_\eta[t_k|\rho_i, u(t)]. \tag{44}$$

Here, $t_0 = 0$, $t_{n+1} = T$, and $w_\eta[t_k|\rho_i, u(t)]$ is the exponential waiting time distribution

$$w_\eta[t_k|\rho_i, u(t)] = \eta\,\Phi(t_k)\exp\left(-\eta\int_{t_{k-1}}^{t_k}\Phi(t')\,dt'\right) \tag{45}$$

for optical coherent states, or the probability that a photon will arrive at time $t_k$ and that it will be the only click during the half-closed interval $(t_{k-1}, t_k]$ [22].
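The likelihood-ratio test of Eqs. (39)-(45) can be illustrated for the simplest case of a time-independent signal envelope and a time-independent local field. The sketch below is a hedged example under several assumptions: `psi1` and `u` are real constants with `u != 0` and `psi1 + u != 0`, the final interval $(t_n, T]$ is treated as click-free so that it contributes only its exponential factor, and the function name and parameter values are hypothetical. With constant rates, only the number of clicks matters, not their arrival times.

```python
import math

def log_likelihood_ratio(arrival_times, T, eta, psi1, u, xi0, xi1):
    """ln(Lambda) for constant real psi1 and constant real feedback amplitude u."""
    phi0 = u ** 2             # counting rate of Eq. (38) with psi = 0     (channel in rho_0)
    phi1 = (psi1 + u) ** 2    # counting rate of Eq. (38) with psi = psi_1 (channel in rho_1)
    n = len(arrival_times)
    # Each click contributes a factor eta * Phi_i; the click-free time contributes
    # exp(-eta * Phi_i * T) in total because the rates are constant.
    log_p1 = n * math.log(eta * phi1) - eta * phi1 * T
    log_p0 = n * math.log(eta * phi0) - eta * phi0 * T
    return math.log(xi1 / xi0) + log_p1 - log_p0

# Hypothetical parameters: T = 1, 50% efficiency, equal priors
llr_no_click = log_likelihood_ratio([], 1.0, 0.5, 1.0, 0.5, 0.5, 0.5)
llr_one_click = log_likelihood_ratio([0.3], 1.0, 0.5, 1.0, 0.5, 0.5, 0.5)
print(llr_no_click, llr_one_click)
```

A negative value favors $H_0$ and a positive value favors $H_1$. For this particular choice of `u`, the counting rate is higher when the signal is present, so an empty record points toward the vacuum and a single click tips the decision toward $\rho_1$.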


1.  Optimal  control  problem 
The Dolinar receiver error probability

$$P_D[u(t)] = \xi_0\,p_\eta[H_1|\rho_0, u(t)] + \xi_1\,p_\eta[H_0|\rho_1, u(t)] \tag{46}$$

depends upon the amplitude of the locally applied feedback field, so the objective is to minimize $P_D$ over $u(t)$. This optimization can be accomplished [23] via the technique of dynamic programming [24], where we adopt an effective state-space picture given by the conditional error probabilities

$$\mathbf{p}(t) = \begin{pmatrix} p_\eta[H_1|\rho_0, u(t)](t) \\ p_\eta[H_0|\rho_1, u(t)](t) \end{pmatrix} \tag{47}$$

and define the control cost as

$$J[u(t)] = P_D[u(t)] = \boldsymbol{\xi}^T\mathbf{p}. \tag{48}$$

The optimal control policy $u^*(t)$ is identified by solving the Hamilton-Jacobi-Bellman equation

$$\min_{u(t)}\left[\frac{\partial}{\partial t}J[u(t)] + \nabla_{\mathbf{p}}J[u(t)]^T\frac{\partial}{\partial t}\mathbf{p}(t)\right] = 0, \tag{49}$$

which is a partial differential equation for $J$ based on the requirement that $\mathbf{p}(t)$ and $u(t)$ are smooth (continuous and differentiable) throughout the entire receiver operation. However, like all quantum point processes, our conditional knowledge of the system state evolves smoothly only between photon arrivals.

When a click is recorded by the detector, the system probabilities $\mathbf{p}$ can jump in a nonsmooth manner. Therefore, the photon arrival times divide the measurement interval $0 \le t \le T$ into segments that are only piecewise continuous and differentiable. Fortunately, the dynamic programming optimality principle [24] allows us to optimize $u^*(t)$ in a piecewise manner that begins by minimizing $J[u(t)]$ on the final segment $[t_n, T]$. Of course, the system state at the beginning of this segment $\mathbf{p}(t_n)$ depends upon the detection history at earlier times and therefore the choice of $u(t)$ in earlier intervals. As such, the Hamilton-Jacobi-Bellman optimization for the final segment must hold for all possible starting states $\mathbf{p}(t) \in \mathbb{R}^2_{[0,1]}$. Once this is accomplished, $u(t)$ can be optimized on the preceding segment $[t_{n-1}, t_n)$ with the assurance that any final state for that segment will be optimally controlled on the next interval $[t_n, T]$. This procedure is iterated in reverse order for all of the measurement segments until the first interval $t \in [0, t_1)$, where the initial value $\mathbf{p}(0) = (1\ 0)^T$ can be unambiguously specified.

Solving the Hamilton-Jacobi-Bellman equation in each smooth segment between photon arrivals requires the time derivatives $\dot{\mathbf{p}}(t)$ which assume a different form when $\Lambda > 1$ versus when $\Lambda < 1$. Using Eqs. (40) and (41), the coherent state waiting time distribution, and

$$\boldsymbol{\Phi}(t) = \begin{pmatrix} \Phi_0(t) \\ \Phi_1(t) \end{pmatrix} = \begin{pmatrix} |u(t)|^2 \\ |u(t) + \psi_1(t)|^2 \end{pmatrix}, \tag{50}$$

we see that the smooth evolution of $\mathbf{p}(t)$ between photon arrivals is given by

Pi(t)  =  vpM 
when  A  >  1  and 

PoW  =  VPo(t) 

Pi(t)  =  VPi(t) 


-  -In  ^(r) 
at 


“  r  ln 
at 


-In 

at 


(51) 


(52) 


when  A  <  1 . 

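Performing the piecewise minimization in Eq. (49) over each measurement segment with initial states provided by the iterative point-process probabilities in Eq. (44) and combining the intervals (this is straightforward but eraser demanding) leads to the control policy

$$u_1^*(t) = -\psi_1(t)\left(1 + \frac{J[u_1^*(t)]}{1 - 2J[u_1^*(t)]}\right) \tag{53}$$

for $\Lambda > 1$, where $p_\eta[H_0|\rho_1, u_1^*(t)] = 0$ and

$$J[u_1^*(t)] = \xi_0\,p_\eta[H_1|\rho_0, u_1^*(t)] = \tfrac{1}{2}\left(1 - \sqrt{1 - 4\xi_0\xi_1 e^{-\eta\bar{n}(t)}}\right). \tag{54}$$

Here, $\bar{n}(t)$ represents the average number of photons expected to arrive at the detector by time $t$ when the channel is in the state $\rho_1$,

$$\bar{n}(t) = \int_0^t |\psi_1(t')|^2\,dt'. \tag{55}$$

Conversely, the optimal control takes the form

$$u_0^*(t) = \psi_1(t)\left(\frac{J[u_0^*(t)]}{1 - 2J[u_0^*(t)]}\right) \tag{56}$$

for $\Lambda < 1$, where $p_\eta[H_1|\rho_0, u_0^*(t)] = 0$ and

$$J[u_0^*(t)] = \xi_1\,p_\eta[H_0|\rho_1, u_0^*(t)] = \tfrac{1}{2}\left(1 - \sqrt{1 - 4\xi_0\xi_1 e^{-\eta\bar{n}(t)}}\right). \tag{57}$$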
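A short numerical sketch may help make the control policy tangible. It assumes a real, constant signal envelope `psi1` (so that $\bar{n}(t) = \psi_1^2 t$) and takes the control law in the form $u_1^* = -\psi_1(1 + J/(1 - 2J))$ while $H_1$ is favored and $u_0^* = \psi_1 J/(1 - 2J)$ while $H_0$ is favored, with $J$ given by Eqs. (54) and (57). The function names and parameter values are illustrative, and unequal priors are used because $1 - 2J$ vanishes at $t = 0$ when $\xi_0 = \xi_1$.

```python
import math

def control_cost(t, eta, photon_rate, xi0, xi1):
    """J(t) of Eqs. (54) and (57) with nbar(t) = photon_rate * t."""
    return 0.5 * (1.0 - math.sqrt(1.0 - 4.0 * xi0 * xi1 * math.exp(-eta * photon_rate * t)))

def feedback_amplitude(t, favored, eta, psi1, xi0, xi1):
    """Feedback amplitude while hypothesis `favored` (0 or 1) is currently selected."""
    j = control_cost(t, eta, psi1 ** 2, xi0, xi1)
    ratio = j / (1.0 - 2.0 * j)
    return -psi1 * (1.0 + ratio) if favored == 1 else psi1 * ratio

# Hypothetical parameters: one photon on average over T = 1, unequal priors
eta, psi1, xi0, xi1 = 0.8, 1.0, 0.4, 0.6
t = 0.5
j = control_cost(t, eta, psi1 ** 2, xi0, xi1)
u1 = feedback_amplitude(t, 1, eta, psi1, xi0, xi1)
u0 = feedback_amplitude(t, 0, eta, psi1, xi0, xi1)

# Total field at the detector for each (channel state, favored hypothesis) pair
print("favoring H1:", "rho_0 ->", u1, " rho_1 ->", psi1 + u1)
print("favoring H0:", "rho_0 ->", u0, " rho_1 ->", psi1 + u0)
```

In both branches the field reaching the detector is small when the favored hypothesis is correct and large when it is wrong, so a click is strong evidence against the current decision.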


2.  Dolinar  hypothesis  testing  procedure 

The Hamilton-Jacobi-Bellman solution leads to a conceptually simple procedure for estimating the state of the channel. The receiver begins at $t = 0$ by favoring the hypothesis that is more likely based on the prior probabilities $p_0(0) = \xi_0$ and $p_1(0) = \xi_1$.¹ Assuming that $\xi_1 \ge \xi_0$ (for $\xi_0 > \xi_1$, the opposite reasoning applies), the Dolinar receiver always selects $H_1$ during the initial measurement segment. The probability of deciding on $H_0$ is exactly zero prior to the first photon arrival such that an error only occurs when the channel is actually in $\rho_0$.


Po(t)  =  rjp0(t) 


d_ 

dt 


in  <&„(/)  -  d>0(r) 


¹If $\xi_0 = \xi_1$ then neither hypothesis is a priori favored and the Dolinar receiver is singular with $P_D = 1/2$.




JM  GEREMIA 


PHYSICAL  REVIEW  A  70,  062303  (2004) 


To see what happens when a photon does arrive at the detector, it is necessary to investigate the behavior of $\Lambda(t)$ at the boundary between two measurement segments. Substituting the optimal control policy $u^*(t)$, which alternates between $u_1^*(t)$ and $u_0^*(t)$, into the photon counting distribution leads to


n+ 1 


p(h ,  = vn  ^,[^2(0.^]] 

k= 0 

Xexpf-  yj  $,[m  1  (t'k_i ,  tjj] dt' 

- - ijj'  $>i[un\2{CT']W^j- 


(58) 


This expression can be used to show that the limit of $\Lambda(t)$ approaching a photon arrival time $t_k$ from the left is the reciprocal of the limit approaching from the right,

$$\lim_{t \to t_k^-}\Lambda(t) = \left[\lim_{t \to t_k^+}\Lambda(t)\right]^{-1}. \tag{59}$$

That is, if $\Lambda > 1$ such that $H_1$ is favored during the measurement interval ending at $t_k$, the receiver immediately swaps its decision to favor $H_0$ when the photon arrives. Evidently, the optimal control policy $u^*(t)$ engineers the feedback such that the photon counter is least likely to observe additional clicks if it is correct based on its best knowledge of the channel state at that time. Each photon arrival invalidates the current hypothesis and the receiver completely reverses its decision on every click. This result implies that $H_1$ is selected when the number of photons $n$ is even (or zero) and $H_0$ when the number of photons is odd.
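The parity rule lends itself to a compact Monte Carlo check. The sketch below is not the simulation code used for the figures; it is a minimal time-discretized model under stated assumptions: a real constant envelope with $T = 1$, ideal detection apart from the efficiency `eta`, unequal priors $\xi_1 > \xi_0$ so that the control stays finite at $t = 0$, and the control law $u_1^* = -\psi_1(1 + J/(1 - 2J))$, $u_0^* = \psi_1 J/(1 - 2J)$. The function name, step count, trial count, and seed are hypothetical choices.

```python
import math
import random

def simulate_dolinar(n_mean=1.0, eta=1.0, xi0=0.4, xi1=0.6, steps=300, trials=6000, seed=1):
    """Estimate the Dolinar error probability by time-discretized photon counting."""
    rng = random.Random(seed)
    dt = 1.0 / steps                  # measurement interval T = 1
    psi1 = math.sqrt(n_mean)          # constant real envelope, psi1^2 * T = N
    # Per-step click probabilities, indexed [true state][favored hypothesis][step]
    click = [[[0.0] * steps for _ in range(2)] for _ in range(2)]
    for k in range(steps):
        t = (k + 0.5) * dt
        j = 0.5 * (1.0 - math.sqrt(1.0 - 4.0 * xi0 * xi1 * math.exp(-eta * n_mean * t)))
        ratio = j / (1.0 - 2.0 * j)
        u = {1: -psi1 * (1.0 + ratio), 0: psi1 * ratio}   # feedback for each favored hypothesis
        for favored in (0, 1):
            for state, psi in ((0, 0.0), (1, psi1)):
                rate = eta * (psi + u[favored]) ** 2      # detected photon rate
                click[state][favored][k] = 1.0 - math.exp(-rate * dt)
    errors = 0
    for _ in range(trials):
        state = 1 if rng.random() < xi1 else 0
        favored = 1                                       # xi1 >= xi0: start by favoring H1
        for k in range(steps):
            if rng.random() < click[state][favored][k]:
                favored = 1 - favored                     # every click reverses the decision
        errors += favored != state
    return errors / trials

estimate = simulate_dolinar()
bound = 0.5 * (1.0 - math.sqrt(1.0 - 4.0 * 0.4 * 0.6 * math.exp(-1.0)))
print(estimate, bound)
```

Because the decision flips on every click, the final choice depends only on the parity of the click count, and the estimated error should scatter around the Helstrom value within the statistical uncertainty of the sample.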

Despite the discontinuities in the conditional probabilities $p_\eta[H_1|\rho_0, u^*(t)]$ and $p_\eta[H_0|\rho_1, u^*(t)]$ at the measurement segment boundaries, the total Dolinar receiver error probability

$$P_D(\eta, t) = \tfrac{1}{2}\left(1 - \sqrt{1 - 4\xi_0\xi_1 e^{-\eta\bar{n}(t)}}\right) \tag{60}$$

evolves smoothly since

$$\lim_{t \to t_k^-} J[u^*(t)] = \lim_{t \to t_k^+} J[u^*(t)] \tag{61}$$

at the boundaries. Recognizing that $\bar{n}(T) = N$ leads to the final Dolinar receiver error

$$P_D(\eta) = \tfrac{1}{2}\left(1 - \sqrt{1 - 4\xi_0\xi_1 c_0^{2\eta}}\right), \tag{62}$$

which is equal to the Helstrom bound for all values of the detector efficiency $0 < \eta \le 1$.
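The three analytic error probabilities can be compared side by side. The following sketch evaluates Eq. (18) (equivalently Eq. (62)), Eq. (24), and Eq. (35) at the rotation angle of Eq. (36), assuming $c_0^2 = e^{-N}$ from Eq. (17). The function names and the parameter values (one photon on average, equal priors) are illustrative assumptions.

```python
import math

def helstrom(eta, n_mean, xi0=0.5, xi1=0.5):
    """Eq. (18): Helstrom bound, which is also the Dolinar error of Eq. (62)."""
    return 0.5 * (1.0 - math.sqrt(1.0 - 4.0 * xi0 * xi1 * math.exp(-eta * n_mean)))

def kennedy(eta, n_mean, xi1=0.5):
    """Eq. (24): xi1 * c0^(2 eta) with c0^2 = exp(-N)."""
    return xi1 * math.exp(-eta * n_mean)

def sasaki_hirota(eta, n_mean, xi0=0.5, xi1=0.5):
    """Eq. (35) evaluated at the rotation angle of Eq. (36)."""
    c0_sq = math.exp(-n_mean)
    c0, s = math.sqrt(c0_sq), math.sqrt(1.0 - c0_sq)
    root = math.sqrt(1.0 - 4.0 * xi0 * xi1 * c0_sq)
    theta = -math.atan(math.sqrt((root - 1.0 + 2.0 * xi1 * c0_sq)
                                 / (root + 1.0 - 2.0 * xi1 * c0_sq)))
    scale = (math.exp(-eta * n_mean) - 1.0) / (c0_sq - 1.0)             # prefactor of Eqs. (30), (33)
    false_positive = scale * math.sin(theta) ** 2                        # Eq. (30)
    correct = scale * (c0 * math.sin(theta) - s * math.cos(theta)) ** 2  # Eq. (33)
    return xi0 * false_positive + xi1 * (1.0 - correct)

for eta in (1.0, 0.75, 0.5):
    print(eta, helstrom(eta, 1.0), sasaki_hirota(eta, 1.0), kennedy(eta, 1.0))
```

At unit efficiency the Sasaki-Hirota value coincides with the Helstrom bound, while for lower efficiencies it falls between the Dolinar (Helstrom) and Kennedy values.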
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FIG.  1.  Monte  Carlo  simulation  of  the  Kennedy,  Dolinar,  and 
Sasaki-Hirota  receivers  as  a  function  of  the  signal  amplitude  for 
perfect  photon  counting  with  fo-  ft  =  1/2.  As  expected,  the  Dolinar 
and  Sasaki-Hirota  protocols  both  achieve  the  Helstrom  bound  while 
the  Kennedy  receiver  is  approximately  a  factor  of  2  worse. 

additional  detector  imperfections.  Figure  1  shows  benchmark 
simulation  results  for  perfect  photodetection.  The  three  re¬ 
ceivers  perform  as  expected  in  the  small-amplitude  regime; 
both  the  Sasaki-Hirota  and  Dolinar  protocols  achieve  the 
Helstrom  bound  while  the  Kennedy  receiver  is  approxi¬ 
mately  a  factor  of  2  worse,  at  the  shot-noise  limit.2  Statistics 
were  accumulated  for  10  000  Monte  Carlo  samples  in  which 
p0  and  pi  were  randomly  selected  with  ^=^  =  1/2. 

Detector  imperfections,  however,  will  degrade  the  perfor¬ 
mance  of  each  of  the  three  receivers,  and  here  we  investigate 
the  relative  degree  of  that  degradation  for  conditions  to  be 
expected  in  practice.  The  analysis  is  based  on  the  observation 
that  single  photon  counting  in  optical  communications  is  of¬ 
ten  implemented  with  an  avalanche  photodiode  (APD),  as 
APDs  generally  provide  the  highest  detection  efficiencies.  In 
the  near  infrared,  for  example,  high-gain  silicon  diodes  pro¬ 
vide  a  quantum  efficiency  of  77—  50%.  Additional  APD  non¬ 
idealities  include  a  dead  time  following  each  detected  photon 
during  which  the  receiver  is  unresponsive,  dark  counts  in  the 
absence  of  incoming  photons  due  to  spontaneous  breakdown 
events  in  the  detector  medium,  a  maximum  count  rate  above 
which  the  detector  saturates  (and  can  be  damaged),  and  oc¬ 
casional  ghost  clicks  following  a  real  photon  arrival — a  pro¬ 
cess  referred  to  as  “after  pulsing.”  For  the  Dolinar  receiver, 
which  requires  high-speed  signal  processing  and  actuation  in 
order  to  modulate  the  adaptive  feedback  field,  delays  must 
also  be  considered.  That  is,  the  optical  modulators  used  to 
adjust  the  phase  and  amplitude  of  the  feedback  signal  as  well 
as  the  digital  signal  processing  technology  necessary  [25]  for 
implementing  all  real-time  computations  display  finite  band- 
widths. 


III.  SIMULATIONS 

Monte  Carlo  simulations  of  the  Kennedy,  Sasaki-Hirota, 
and  Dolinar  receivers  were  performed  to  verify  the  above 
quantum  efficiency  analysis  and  to  analyze  the  effects  of 


²In some contexts, Eq. (24) is referred to as the standard quantum
limit despite the fact that there is no measurement back action, as
$\hat{a}|\alpha\rangle = \alpha|\alpha\rangle$. We prefer the term shot-noise limit in order to avoid
such confusion.

FIG. 2. Performance of the Kennedy, Sasaki-Hirota, and Dolinar
receivers as a function of the detector quantum efficiency $\eta$. The
simulations were performed for $\xi_0 = \xi_1 = 1/2$, and data points reflect
the result of Monte Carlo simulations of the three receivers. Dotted
lines correspond to the analytic results derived in Sec. II and
illustrate that the Dolinar receiver achieves the Helstrom bound even for
subunity quantum efficiency.

Figure 2 compares the error probabilities of the three
receivers for subunity quantum efficiency but otherwise ideal
detection. The mean photon number of the signal, $\Psi_1(t)$, in
this simulation is N = 1 with $\Psi_0(t) = 0$ and $\xi_0 = \xi_1 = 1/2$. Data
points in the figure were generated by accumulating statistics
for 10 000 Monte Carlo simulations of the three receivers,
and the dotted lines correspond to the error probabilities
derived in Sec. II. The simulations agree well with the analytic
expressions, and it is evident that the Dolinar receiver is
capable of achieving the Helstrom bound for $\eta < 1$ while the
Sasaki-Hirota receiver performance lies between that of the
Kennedy and Dolinar receivers.
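
The comparison in Fig. 2 can be sanity-checked numerically. The sketch below evaluates the textbook Helstrom bound for discriminating the vacuum from a coherent state of mean photon number N, together with the error of plain photon counting, which is what the Kennedy strategy reduces to when one of the two signal states is the vacuum. Both closed forms are standard results used here as assumptions (they are not copied from Sec. II), detector inefficiency is modeled only as the rescaling N -> ηN, and the function names are illustrative.

```python
import math


def helstrom_bound(n_mean, eta=1.0, prior1=0.5):
    """Minimum error probability for vacuum vs. coherent state (assumed textbook form)."""
    overlap_sq = math.exp(-eta * n_mean)  # |<0|alpha>|^2 with |alpha|^2 = eta * N
    return 0.5 * (1.0 - math.sqrt(1.0 - 4.0 * prior1 * (1.0 - prior1) * overlap_sq))


def photon_counting_error(n_mean, eta=1.0, prior1=0.5):
    """Click/no-click receiver: an error occurs only if the coherent state gives no click."""
    return prior1 * math.exp(-eta * n_mean)


for eta in (1.0, 0.5):
    p_h = helstrom_bound(1.0, eta)
    p_c = photon_counting_error(1.0, eta)
    print(f"eta = {eta}: Helstrom = {p_h:.4f}, counting = {p_c:.4f}, ratio = {p_c / p_h:.2f}")
```

For large N the ratio of the two expressions tends to 2, while for small N the two errors approach each other.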

Figure 3 compares the error probabilities for the three
receivers with the additional detector and feedback nonidealities
taken into account. Based on the performance data of
the Perkin-Elmer SPCM-AQR-13 Si APD single photon
counting module, we assumed a maximum count rate of
$10^7$ photons/s, a detector dead time of 50 ns, a dark count
rate of 250 clicks/s, and an after-pulsing probability of 1%.
For the Dolinar receiver, it was assumed that there was a
100 ns feedback delay resulting from a combination of digital
processing time and amplitude/phase modulator bandwidth.
The data points in Fig. 3 correspond to the error probabilities
generated from 10 000 Monte Carlo simulations
with $\xi_0 = \xi_1 = 1/2$. The lower dotted line indicates the appropriate
Helstrom bound as a function of the mean photon
number N for a detector with quantum efficiency $\eta = 0.5$, and
the upper curve indicates the analogous Kennedy receiver
error. Evidently, technical imperfections can have a large
negative effect on the performance of passive detection protocols
like the Kennedy and Sasaki-Hirota receivers, while
the Dolinar receiver is more robust.
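
To illustrate how such a Monte Carlo estimate is assembled, the following minimal sketch simulates only the simplest open-loop case: a click/no-click receiver for on-off signaling with subunity quantum efficiency and dark counts. It is not the simulation used for Fig. 3. Dead time, after-pulsing, saturation, and feedback delay are omitted, and `dark_mean` (mean dark counts per bit) is a hypothetical parameter, since the bit duration needed to convert 250 clicks/s into a per-bit number is not stated in this section.

```python
import math
import random


def poisson(rng, mean):
    """Draw a Poisson variate with Knuth's product method (adequate for small means)."""
    limit = math.exp(-mean)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def counting_receiver_error(n_mean, eta, dark_mean, trials=10_000, seed=1):
    """Estimate the error rate of a click/no-click receiver with equal priors."""
    rng = random.Random(seed)
    errors = 0
    for _ in range(trials):
        sent_one = rng.random() < 0.5                      # choose the transmitted bit
        mean_clicks = dark_mean + (eta * n_mean if sent_one else 0.0)
        decided_one = poisson(rng, mean_clicks) > 0        # any click -> decide "1"
        errors += decided_one != sent_one
    return errors / trials


for dark in (0.0, 0.05):
    estimate = counting_receiver_error(1.0, 0.5, dark)
    # Closed form for this simplified model: missed signal + false alarm.
    exact = 0.5 * math.exp(-(0.5 + dark)) + 0.5 * (1.0 - math.exp(-dark))
    print(f"dark = {dark}: Monte Carlo {estimate:.3f}, closed form {exact:.3f}")
```

With 10 000 samples the statistical uncertainty of each estimate is of order 0.005, which sets the scale of agreement one should expect with the closed form.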

Unlike open-loop procedures, however, the feedback nature
of the Dolinar receiver additionally requires precise
knowledge of the incoming signal phase $\varphi$, so that U*(t) can
be properly applied. Fluctuations in the index of refraction of
the communication medium generally lead to some degree of
phase noise in the incoming signal $\Psi_1(t)$. Adequately setting
the phase of U(t) necessarily requires that some light from
the channel be used for phase-locking the local oscillator, a
task that reduces the data transmission bandwidth. Therefore,
operating a communication system based on the Dolinar receiver
at the highest feasible rate requires that the number of
photons diverted from the data stream to track phase variations
in the channel be minimized. This optimization in turn
requires knowledge of how signal phase noise propagates
into the receiver error probability.

FIG. 3. Performance of the Kennedy, Sasaki-Hirota, and Dolinar
receivers as a function of the mean number of photons in the signal
under realistic experimental conditions, including a quantum efficiency
of 50%, a dark-count rate of 250 clicks/s, an after-pulsing
probability of 1%, and feedback delays of 100 ns.

Figure 4 shows the error probability $P_D(\delta\varphi)$ as a function
of the phase difference $\delta\varphi$ between the incoming signal and
the local oscillator. Data points correspond to results from
10 000 Monte Carlo simulations per photon number and
phase angle, and the solid curves reflect numerical fits to the
Monte Carlo points. An exact comparison between the open-loop
and Dolinar receivers requires information regarding
the specific phase-error density function for the actual communication
channel being utilized. However, we do note that
at N = 1 photon, the phase error of the local oscillator could be as
large as $\delta\varphi \approx 25^\circ$ before the Dolinar error probability increased to that
of the Kennedy receiver. Additionally, it appears that the
slope of $P_D(\delta\varphi)$ is zero at $\delta\varphi = 0$, which implies that the
Dolinar receiver conveniently displays minimal sensitivity to
small phase fluctuations in the channel.

IV.  DISCUSSION  AND  CONCLUSIONS 

The Dolinar receiver was found to be robust to the types
of detector imperfections likely to exist in any real implementation
of a binary communication scheme based on optical
coherent state signaling and photon counting. This robustness
seemingly results from the fact that the Dolinar
receiver can correct itself after events that cause an open-loop
receiver to irreversibly misdiagnose the transmitted
state. For example, imperfect detection efficiency introduces
a failure mode where the probability $p(H_0|\rho_1)$ is increased
above the value set by quantum mechanical vacuum fluctuations.
However, the optimal structure of the Dolinar receiver
feedback ensures that it still achieves the quantum mechanical
minimum because it has control over the counting rate.
That is, if the Dolinar receiver selects the wrong hypothesis
at some intermediate time $t < T$, the structure of the feedback
ensures that the receiver achieves the highest allowable
probability for invalidating that incorrect decision during the
remainder of the measurement.

FIG. 4. Error probabilities for the Dolinar receiver as a function
of the phase error in the signal state corresponding to logical 1 for
different mean photon numbers. Monte Carlo data points were generated
by accumulating statistics for 10 000 random bits. Solid lines
are a fit to the data.

In the opposite situation, where dark counts or background
light produce detector clicks when there is no signal
light in the channel, open-loop receivers will decide in favor
of $\rho_1$ without any possibility for self-correction. This type of
error leads to an irreparable open-loop increase in $p(H_1|\rho_0)$.
But the Dolinar receiver has the potential to identify and fix
such a mistake since selecting the wrong hypothesis at intermediate
times increases the probability that a future click
will invalidate the incorrect decision. When background light
is present, poor phase coherence between stray optical fields
and the signal provides no enhanced open-loop discrimination
as there is no local oscillator to establish a phase reference;
a received photon is a received photon (assuming that
any spectral filtering failed to prevent the light from hitting
the detector). The Dolinar receiver is more immune to such
an error since incoherent addition of the stray field to the
local oscillator will generally reduce the likelihood of a detector
click, and even if a click does occur, it will be inconsistent
with the anticipated counting statistics.

Despite  the  previous  belief  that  the  Dolinar  receiver  is 
experimentally  impractical  due  to  its  need  for  real-time  feed¬ 
back,  we  have  shown  that  it  is  rather  attractive  for  experi¬ 
mental  implementation.  Particularly,  quantum  efficiency 
scales  out  of  a  comparison  between  the  Dolinar  receiver  er¬ 
ror  and  the  Helstrom  bound,  while  this  is  not  the  case  for 
known  unitary  rotation  protocols.  These  results  strongly  sug¬ 
gest  that  real-time  feedback,  previously  cited  as  the  Dolinar 
receiver’s  primary  drawback,  in  fact  offers  substantial  ro¬ 
bustness  to  many  common  imperfections  that  would  be 
present  in  a  realistic  experimental  implementation.  Most  im¬ 
portantly,  simulations  under  these  realistic  conditions  suggest 
that  the  Dolinar  receiver  can  outperform  the  Kennedy  re¬ 
ceiver  with  currently  available  experimental  technology, 
making  it  a  viable  option  for  small-amplitude,  minimum- 
error  optical  communication. 
The number of atoms trapped within the mode of an optical cavity is determined in real time by
monitoring the transmission of a weak probe beam. Continuous observation of atom number is
accomplished in the strong coupling regime of cavity quantum electrodynamics and functions in
concert with a cooling scheme for radial atomic motion. The probe transmission exhibits sudden steps
from one plateau to the next in response to the time evolution of the intracavity atom number, from
$N \ge 3$ to $N = 2 \to 1 \to 0$ atoms, with some trapping events lasting over 1 s.

DOI: 10.1103/PhysRevLett.93.143601  PACS numbers: 42.50.Pq, 03.67.-a, 32.80.Pj


Cavity  quantum  electrodynamics  (QED)  provides  a 
setting  in  which  atoms  interact  predominantly  with  light 
in  a  single  mode  of  an  electromagnetic  resonator  [1,2]. 
Not  only  can  the  light  from  this  mode  be  collected  with 
high  efficiency  [3],  but  the  associated  rate  of  optical 
information  for  determining  atomic  position  can  greatly 
exceed  the  rate  of  free-space  fluorescent  decay  employed 
for  conventional  imaging  [4].  Moreover,  the  regime  of 
strong  coupling,  in  which  coherent  atom-cavity  interac¬ 
tions  dominate  dissipation,  offers  a  unique  setting  for  the 
study  of  open  quantum  systems  [5].  Dynamical  processes 
enabled  by  strong  coupling  in  cavity  QED  provide  power¬ 
ful  tools  in  the  emerging  field  of  quantum  information 
science  (QIS),  including  for  the  realization  of  quantum 
computation  [6]  and  distributed  quantum  networks  [7]. 

With  these  prospects  in  mind,  experiments  in  cavity 
QED  have  made  great  strides  in  trapping  single  atoms  in 
the  regime  of  strong  coupling  [4,8-10].  However,  many 
protocols  in  QIS  require  multiple  atoms  to  be  trapped 
within  the  same  cavity,  with  “quantum  wiring”  between 
internal  states  of  the  various  atoms  accomplished  by  way 
of  strong  coupling  to  the  cavity  field  [6,11-13].  Clearly, 
the  experimental  ability  to  determine  the  number  of 
trapped  atoms  coupled  to  a  cavity  is  a  critical  first  step 
toward  the  realization  of  diverse  goals  in  QIS. 
Experimental  efforts  to  combine  ion  trap  technology 
with  cavity  QED  are  promising  [14],  but  have  not  yet 
reached  the  regime  of  strong  coupling. 

In this Letter, we report measurements in which the
number of atoms trapped inside an optical cavity is
observed in real time. After initial loading of the intracavity
dipole trap with $N \approx 5$ atoms, the decay of atom
number $N \ge 3 \to 2 \to 1 \to 0$ is monitored via the transmission
of a near-resonant probe beam, with the transmitted
light exhibiting a cascade of “stair steps” as
successive atoms leave the trap. After the probabilistic
loading stage, the time required for the determination of a
particular atom number N = 1, 2, 3 is much shorter than
the mean interval over which the N atoms are trapped.
Hence, this scheme can be used to prepare a precise
number of trapped intracavity atoms for subsequent experiments
in QIS, for which the time scales $(g^{-1} \sim 10^{-8}\ \mathrm{s}) \ll (\tau \sim 3\ \mathrm{s})$,
where $\tau$ is the atomic trapping
time [9] and $\hbar g$ is the atom-field interaction energy. In
addition, it requires none of the imaging optics or shielding
needed for traditional fluorescence detection from
single ions and atoms [15,16], which would be complicated
by the presence of our short cavity.

As illustrated in Fig. 1, our experiment combines laser
cooling, state-insensitive trapping, and strong coupling in
cavity QED [9]. Cs atoms are released from a magneto-optical
trap (MOT) several mm above the cavity, which is
formed by the reflective surfaces of mirrors ($M_1$, $M_2$).
Several atoms are cooled and loaded into an intracavity
far-off-resonance trap (FORT) and are thereby strongly
coupled to a single mode of the cavity. The single-photon
Rabi frequency $2g_0$ for one atom at the peak of the
Gaussian standing wave is given by $g_0/2\pi = 24$ MHz,
and is based on the reduced dipole moment for the
$6S_{1/2}, F = 4 \to 6P_{3/2}, F' = 4'$ transition of the D2 line
in Cs at $\lambda_0 = 852.4$ nm. Decay rates for the $6P_{3/2}$ excited
states and the cavity mode at $\omega_0 = 2\pi c/\lambda_0$ are
$\gamma/2\pi = 2.6$ MHz and $\kappa/2\pi = 4.2$ MHz, respectively [17]. The
fact that $g_0 \gg (\kappa, \gamma)$ places our system in the strong
coupling regime of cavity QED, giving critical photon and
atom numbers $n_0 \equiv \gamma^2/(2g_0^2) \approx 0.0057$ and
$N_0 \equiv 2\kappa\gamma/g_0^2 \approx 0.037$ [1].

FIG. 1 (color online). Schematic of our experiment. Cs atoms
are loaded into an intracavity FORT ($\mathcal{E}_F$) by way of the
transverse cooling field $\Omega_3$ and the cavity probe field $\mathcal{E}_4$. The
transmitted $\mathcal{E}_4$ field is directed to a heterodyne detector (HD),
allowing real-time determination of intracavity atom number.

PHYSICAL REVIEW LETTERS, Volume 93, Number 14, 143601, week ending 1 October 2004.
0031-9007/04/93(14)/143601(4)$22.50 © 2004 The American Physical Society
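
The critical photon and atom numbers follow directly from the three rates quoted above. A minimal check, using the rounded values from the text (the common factor of 2π cancels in both ratios; variable names are illustrative):

```python
# Rates divided by 2*pi, in MHz, as quoted in the text.
g0, kappa, gamma = 24.0, 4.2, 2.6

n0 = gamma**2 / (2.0 * g0**2)       # critical photon number
N0 = 2.0 * kappa * gamma / g0**2    # critical atom number

print(f"n0 = {n0:.4f}, N0 = {N0:.3f}")
```

The results agree with the quoted 0.0057 and 0.037 to within a few percent; the residual difference presumably reflects rounding of the quoted rates.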

The cavity length $l_0 = 42.2\ \mu$m is tuned such that it
supports TEM$_{00}$ modes simultaneously resonant with
both the $F = 4 \to F' = 4'$ atomic transition at $\lambda_0$ and
our FORT laser at $\lambda_F = 935.6$ nm. A weak probe laser
$\mathcal{E}_4$ excites the cavity mode at $\lambda_0$ with the cavity output
directed to detector HD, while a much stronger trapping
laser $\mathcal{E}_F$ drives the mode at $\lambda_F$. In addition, the region
between the cavity mirrors is illuminated by two orthogonal
pairs of counter-propagating cooling beams in
the transverse plane (denoted $\Omega_3$), and for $\Omega_3 = 0$, no
atoms are detected. Atoms arriving in the region of the
cavity mode are exposed to the ($\mathcal{E}_4$, $\mathcal{E}_F$, $\Omega_3$) fields continuously,
with a fraction of the atoms cooled and loaded
into the FORT by the combined actions of the $\mathcal{E}_4$ and $\Omega_3$
fields [9]. For all measurements, the cavity detuning from
the $4 \to 4'$ atomic resonance is $\Delta_C = 0$. The detuning of
the $\mathcal{E}_4$ probe from the atom-cavity resonance is $\Delta_4 = +4$ MHz,
and its intensity is set such that the mean
intracavity photon number is $\bar{n} = 0.02$ with no atoms in
the cavity. The detuning of the $\Omega_3$ transverse cooling
field is $\Delta_3 = +25$ MHz from the $F = 3 \to F' = 3'$ resonance,
with intensity $I_3 \sim 4 \times 10^1$ mW/cm$^2$.

The field $\mathcal{E}_F$ that drives the standing-wave, intracavity
FORT is linearly polarized, so that all sublevels of the
$6S_{1/2}$ ground manifold experience nearly equal ac-Stark
shifts [18]. The peak value of the trapping potential is
$-U_0/h = -47$ MHz, giving a trap depth $U_0/k_B = 2.2$ mK [19].
Importantly, all the $6P_{3/2}$ excited states
likewise experience a trapping shift of about $-U_0$ (to
within $\sim \pm 15\%$) [9,20,21], enabling continuous monitoring
of trapped atoms in our cavity.
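
The conversion between the quoted ac-Stark shift and the trap depth in temperature units is a one-line calculation, sketched here with the exact SI values of the Planck and Boltzmann constants (the helper name is illustrative):

```python
H_PLANCK = 6.62607015e-34   # Planck constant, J s
K_BOLTZMANN = 1.380649e-23  # Boltzmann constant, J/K


def trap_depth_mK(shift_mhz):
    """Convert a frequency shift U0/h (in MHz) to a temperature U0/k_B (in mK)."""
    return H_PLANCK * shift_mhz * 1e6 / K_BOLTZMANN * 1e3


print(f"U0/k_B = {trap_depth_mK(47.0):.2f} mK")
```

The result is consistent with the quoted 2.2 mK at the level of the rounding in the quoted 47 MHz.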

Figure 2(a) displays a typical record of the heterodyne
current i(t) resulting from one instance of FORT loading.
Here, the current i(t) is referenced to the amplitude of the
intracavity field $|\langle \hat{a} \rangle|$ by way of the known propagation
and detection efficiencies. The initial sharp drop in
$|\langle \hat{a}(t) \rangle|$ shortly after t = 0 results from atoms that are
cooled and loaded into the FORT by the combined action
of the ($\mathcal{E}_4$, $\Omega_3$) fields [9]. Falling atoms are not exposed to
$\mathcal{E}_4$ until they reach the cavity mode, presumably leading
to efficient trap loading for atoms that arrive at a region of
overlap between the standing waves at ($\lambda_0$, $\lambda_F$) for the
($\mathcal{E}_4$, $\mathcal{E}_F$) fields. The mean atomic free-fall time from our
MOT is about $\tau_F = 0.03$ s, and trap loading occurs within
a ±10 ms window around $t = \tau_F$ [relative to t = 0 in
Fig. 2(a)], giving a maximum total loading time of about
40 ms [8,9].

Subsequent to this loading phase, a number of features
are apparent in the trace of Fig. 2(a), and are consistently
present in all the data. Most notably, the transmission vs
time consists of a series of flat “plateaus” in which the
field amplitude is stable on long time scales [15,16].
Additionally, these plateaus reappear at nearly the same
heights in all repeated trials of the experiment, as evidenced
by the histogram of Fig. 2(b). We hypothesize that
each of these plateaus represents a different number N of
trapped atoms coupled to the cavity mode (arrows in
Fig. 2). For detection bandwidth B, N can be measured
in time $t_m \sim (2\pi B)^{-1}$, and for our parameters, the uncertainty
$p_{\rm err}$ is dominated by the probability that an atom
leaves during the interval $t_m$ (for N = 1, $p_{\rm err} \sim 1\%$ for
B = 100 Hz).

FIG. 2 (color online). (a) A typical detection record in which
several (N > 4) atoms are loaded into the trap. Heterodyne
detection bandwidth B = 1 kHz. (b) Histogram of 500 such
traces, binned with respect to the heterodyne signal $|\langle \hat{a} \rangle|$. A
digital low-pass filter of bandwidth 100 Hz is applied to each
trace prior to the computation of the histogram. (c) Comparison
of the model prediction for $p_0^{(N)}(y = 0.5)$ (+) with the measured
positions of the histogram peaks in (b) (O). Also shown
(×) is $p_0^{(N)}(y = 0.1)$ to indicate the possibility to detect
specific atom numbers for larger N.
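
The quoted error estimate can be reproduced with a short calculation. The sketch below assumes that each atom leaves the trap independently at a constant rate, and borrows the single-atom decay rate of 8.5 per second that the text later reports for the fit in Fig. 4(b). Both the exponential-loss assumption and the function names are illustrative choices.

```python
import math


def measurement_time(bandwidth_hz):
    """Averaging time t_m ~ 1 / (2 pi B) for detection bandwidth B."""
    return 1.0 / (2.0 * math.pi * bandwidth_hz)


def loss_probability(n_atoms, bandwidth_hz, decay_rate=8.5):
    """Probability that at least one of n_atoms leaves during t_m (independent exponential loss)."""
    return 1.0 - math.exp(-n_atoms * decay_rate * measurement_time(bandwidth_hz))


print(f"t_m(B = 100 Hz) = {measurement_time(100.0) * 1e3:.2f} ms")
print(f"p_err(N = 1)    = {loss_probability(1, 100.0):.3%}")
```

Under these assumptions the single-atom error comes out at the percent level, in line with the estimate in the text, and it grows roughly in proportion to N for small N.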

Consider first the one-atom case, which exhibits relatively
large transmission and small variance compared to
prior work [1,4,8,22,23]. For fixed drive $\mathcal{E}_4$, the intracavity
field is a function of the coupling parameter
$g^{(i,f)}(\mathbf{r}) = g_0 G_{i,f} \sin(k_0 z) \exp(-2\rho^2/w_0^2)$, where $\rho$ is the
transverse distance from the cavity axis (z), $k_0 = 2\pi/\lambda_0$,
and $G_{i,f}$ relates to the Clebsch-Gordan coefficient
for particular initial and final states (i, f) within the F = 4,
F' = 4' manifolds. Variations in g as a function of the
atom’s position $\mathbf{r}$ and internal state might be expected to
lead to large variations of the intracavity field, both as a
function of time and from atom to atom.

However, one atom in the cavity produces a reasonably
well-defined intracavity intensity $I \propto |\langle \hat{a} \rangle|^2$ due to the
interplay of two effects. The first is that for small probe
detunings $\Delta_4$, the intracavity intensity $I_1$ for one atom is
suppressed by a factor f relative to the empty-cavity
intensity $I_0$, where for weak excitation, $f \approx 4C_1^2 \gg 1$
with $C_1 = g^2/2\kappa\gamma$. A persistent, strongly reduced transmission
thereby results, since the condition $[C_1(\mathbf{r})]^2 \gg 1$
is robust to large fluctuations in atomic position $\mathbf{r}$ and
internal state. The second effect is that the $F = 4 \leftrightarrow F' = 4'$
transition cannot be approximated by a closed, two-level
system, since the F' = 4' excited states decay to
both F = 3, 4 hyperfine ground levels [Fig. 1(b)]. An atom
thus spends a fraction q of its time in the cavity QED
manifold (4, 4'), and a fraction $p \approx 1 - q$ in the (3, 3')
manifold. In this latter case, the effective coupling is
negligible ($C_1^{\rm eff} \sim 4 \times 10^{-4}$), leading to an intensity $I \approx I_0$.
Hence, the intracavity intensity I(t) should approximate
a random telegraph signal [15] switching between
levels ($I_0$, $I_1$), with dwell times determined by ($\mathcal{E}_4$, $\Omega_3$),
which in turn set p [24]. Since ($\mathcal{E}_4$, $\Omega_3$) drive their respective
transitions near saturation, the time scale for
optical pumping from one manifold to another is $\tau_P \sim 1\ \mu$s.
This time scale represents a fundamental limit to
the averaging time for detection, which in our case is
$(1/2\pi B) \sim 160\ \mu$s $\gg \tau_P$. The fast modulation of I(t) due
to optical pumping processes thereby gives rise to an
average detected signal corresponding to intensity
$\bar{I}_1 \approx p I_0 + q I_1 \approx p I_0$.
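
A short numerical sketch makes the size of these effects concrete. It evaluates the single-atom cooperativity at the peak coupling quoted earlier (24 MHz, with cavity and atomic decay rates of 4.2 MHz and 2.6 MHz, all divided by 2π), the resulting weak-excitation suppression factor, and the time-averaged intensity for an assumed manifold fraction p. Using the peak coupling is a best-case assumption, and the value of p below is purely illustrative.

```python
g0, kappa, gamma = 24.0, 4.2, 2.6   # rates / 2 pi in MHz, as quoted in the text

C1 = g0**2 / (2.0 * kappa * gamma)  # single-atom cooperativity at peak coupling
f = 4.0 * C1**2                     # weak-excitation suppression factor, f ~ 4 C1^2


def averaged_intensity(p, i0=1.0):
    """Time average of a telegraph signal switching between I0 (fraction p) and I1 = I0 / f."""
    q = 1.0 - p
    return p * i0 + q * i0 / f


print(f"C1 = {C1:.1f}, f = {f:.0f}")
print(f"average intensity for p = 0.5: {averaged_intensity(0.5):.4f} (in units of I0)")
```

Because f is of order a few thousand, the second term contributes well below one percent, so the averaged intensity is essentially p times the empty-cavity value.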

This explanation for the N = 1 case can be extended to
N intracavity atoms to provide a simple model for the
stair steps of Fig. 2(a). For N atoms, the intracavity
intensity should again take the form of a random telegraph
signal, now switching between the levels ($I_0$, $I_k$),
with high transmission $I_0$ during intervals when all N
atoms happen to be pumped into the (3, 3') manifold, and
with low transmission $I_k \ll I_0$ anytime that $1 \le k \le N$
atoms reside in the (4, 4') manifold, where $I_k \sim I_1/k^2$ for
weak excitation with $\Delta_C = \Delta_4 = 0$. The intensities $\{I_k\}$
determine the transition rates $\{\gamma_{k \to k-1}\}$ between states
with k and k - 1 atoms in the (4, 4') manifold, while
$\Omega_3$ determines $\{\gamma_{k-1 \to k}\}$ for $k - 1 \to k$ via transitions
from the (3, 3') manifold. For the hierarchy of states k =
0, 1, ..., N with transition rates $\{\gamma_{k \to k-1}, \gamma_{k-1 \to k}\}$, the
steady-state populations $p_k^{(N)}$ can be obtained. With the
physically motivated assignments $\gamma_{k-1 \to k} = \gamma_{0 \to 1}$ independent
of k and $\gamma_{k \to k-1} = \gamma_{1 \to 0}/k^2$ corresponding to
$I_k \sim I_1/k^2$, detailed balance gives $p_k^{(N)} = (k!)^2 y^k p_0^{(N)}$, so that
$p_0^{(N)} = 1/\sum_{k=0}^{N} (k!)^2 y^k$, where $y \equiv \gamma_{0 \to 1}/\gamma_{1 \to 0}$.
Hence, for $I_k \ll I_0$, the prediction for the
average intensity is $\bar{I}_N \approx p_0^{(N)} I_0$, which leads to a sequence
of plateaus of increasing heights $\bar{I}_{N+1} \to \bar{I}_N \to \bar{I}_{N-1}$
as successive atoms leave the trap.
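
The steady state of this chain is easy to evaluate numerically. The sketch below assumes the rate assignments stated above (an upward rate independent of k, and a downward rate from k to k - 1 equal to the one-atom rate divided by k squared), so that detailed balance gives a population ratio of y k² between neighboring states, where y is the ratio of the upward to the one-atom downward rate. Function names are illustrative, and the closed form is the one that follows from these assumptions.

```python
import math


def steady_state(n_atoms, y):
    """Populations p_0..p_N of the chain from detailed balance: p_k / p_(k-1) = y * k**2."""
    weights = [1.0]
    for k in range(1, n_atoms + 1):
        weights.append(weights[-1] * y * k**2)
    total = sum(weights)
    return [w / total for w in weights]


def p0_closed_form(n_atoms, y):
    """Closed form implied by the recursion: 1 / sum_k (k!)^2 y^k."""
    return 1.0 / sum(math.factorial(k) ** 2 * y**k for k in range(n_atoms + 1))


for y in (0.5, 0.1):
    plateau_heights = [round(steady_state(n, y)[0], 3) for n in range(1, 8)]
    print(f"y = {y}: p0 for N = 1..7 -> {plateau_heights}")
```

In this model the fraction of time with no atom in the (4, 4') manifold falls monotonically with N, and a smaller y shifts the steep part of the curve toward larger atom numbers.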

Figure 2(c) compares the prediction of this simple
model with the measured values of peak positions in
Fig. 2(b). The only adjustable parameter is the value y =
0.5, giving reasonable correspondence between the model
and the measurements. Also shown are values $p_0^{(N)}$ for y =
0.1 to indicate that it might be possible to enhance the
resolution for a particular range of atom number by
framing a given few values $N_1, N_1 \pm 1$ in the transition
region $p_0^{(N)} \sim 0.5$, where $N_1 \sim 6$ in Fig. 2(c). The parameter
y would be adjusted via the strengths of the
($\mathcal{E}_4$, $\Omega_3$) fields.

Although our simple model accounts for the qualitative
features in Fig. 2, more detailed correspondence could be
obtained using the full master equation for N intracavity
atoms, including all Zeeman states and atomic motion
[24]. We have made initial progress for the N = 1 case
[25], and are working to extend the treatment to $N \ge 2$.

Figure 3 provides additional evidence for the correspondence
between the plateaus in Fig. 2 and atom number.
Here, the probe transmission data have been binned
not only with respect to the value of $|\langle \hat{a} \rangle|$ as in Fig. 2(b),
but also as a function of time. Definite plateaus for $|\langle \hat{a} \rangle|$
are again apparent, but now their characteristic time
evolution can be determined. Clearly, the plateaus lying
at higher values of $|\langle \hat{a} \rangle|$ correspond to times later in the
trapping interval, in agreement with the expectation that
N should always decrease with time after the trap-loading
window. Moreover, none of the 500 traces in the data set
includes a downward step in transmission after the initial
trap loading.

Next we consider each atom number individually by
integrating the plateau regions along the $|\langle \hat{a}(t) \rangle|$ axis for
each time t. The dashed horizontal lines in Fig. 3 indicate
the boundaries chosen to define the limits of integration
for each value of N. We thereby obtain time-dependent
“populations” $\Phi_N(t)$ for N = 0, 1, 2, and $\Phi_{\ge 3}(t) = \sum_{N=3}^{\infty} \Phi_N(t)$
[see Fig. 4(a)]. The qualitative behavior of
these populations is sensible, since almost all trials begin
with $N \ge 3$, eventually decaying to N = 2, 1, 0.

The quantities $\Phi_N(t)$ are approximately proportional to
the fraction of experimental trials in which N atoms were
trapped at time t, so long as the characteristic duration
$\Delta t_N$ of each plateau far exceeds the time resolution of the
detection. If the bandwidth is too low, transient steps no
longer represent a negligible fraction of the data, as is the
case for transitions between the shortest-lived levels (e.g.,
$N = 3 \to 2$). We estimate that this ambiguity causes uncertainties
in $\Phi_N$ at the 5%-10% level.

FIG. 3 (color online). Histogram of 500 traces such as the one
in Fig. 2(a), binned with respect to both signal strength $|\langle \hat{a} \rangle|$
and time t. The signals are filtered first as in Fig. 2(b).

FIG. 4 (color online). (a) Experimentally determined N-atom
populations $\Phi_N(t)$, normalized such that their sum is approximately
unity throughout the interval shown. Here, the t axis
begins at $t_0 = 0.034$ s. (b) The results of a simple model
calculation $P_N(t)$ are fit to the data $\Phi_N(t)$ with one free
parameter, the single-atom decay rate $\Gamma$.

Also shown [Fig. 4(b)] is the result of a simple birth-death
model for predicting the time evolution of
the populations, namely $\dot{P}_N(t) = -\Gamma [N P_N(t) - (N+1) P_{N+1}(t)]$,
where $P_N(t)$ represents the probability of N
atoms in the trap. We assume that there is one characteristic
trap-decay rate $\Gamma$, and that each atom leaves the trap
independently of all others. Initial conditions for N = 0,
1, and 2 are determined directly from the record of cavity
transmission immediately after trap loading, $\Phi_N(t_0)$.
Since the plateaus for higher values of N are not well
resolved, we use a Poisson distribution for $N \ge 3$. The
mean $\mu = 5.2$ is obtained by solving $\sum_{N=3}^{\infty} e^{-\mu} \mu^N / N! = \Phi_{\ge 3}(t_0)$.
The value $\Gamma = 8.5$ s$^{-1}$ used in Fig. 4(b) then
results from fitting the analytic solutions $\{P_N(t)\}$ to the
data $\{\Phi_N(t)\}$. Although there is reasonable correspondence
between Figs. 4(a) and 4(b), $\Phi_N(t)$ evolves more
rapidly than does $P_N(t)$ at early times, and yet the data
decay more slowly at long times. This suggests that there
might be more than one time scale involved, possibly due
to an inhomogeneity of decay rates from atom to atom or
to a dependence of the decay rate on N. Indeed, we have
observed nonexponential decay behavior in other measurements
of single-atom trap lifetimes.
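
For independent single-atom loss at a common rate, the model above has a simple analytic solution: each atom present at the start survives to time t with probability exp(-Γt), so the distribution at time t is a binomial thinning of the initial one. The sketch below implements that solution. As a simplifying assumption it starts from a pure Poisson distribution with the quoted mean of 5.2 for all N, whereas the text uses measured initial values for N = 0, 1, 2; the function names and the truncation at 40 atoms are illustrative.

```python
import math


def poisson_pmf(mean, n_max):
    """Poisson probabilities for N = 0..n_max (truncated; the tail is negligible for n_max >> mean)."""
    return [math.exp(-mean) * mean**n / math.factorial(n) for n in range(n_max + 1)]


def evolve(p_initial, decay_rate, t):
    """P_N(t) for the pure-death model, obtained by binomial thinning of the initial distribution."""
    survive = math.exp(-decay_rate * t)   # survival probability of a single atom
    n_max = len(p_initial) - 1
    return [
        sum(
            p_initial[m] * math.comb(m, n) * survive**n * (1.0 - survive) ** (m - n)
            for m in range(n, n_max + 1)
        )
        for n in range(n_max + 1)
    ]


p_start = poisson_pmf(5.2, 40)
for t in (0.0, 0.1, 0.3):
    p_t = evolve(p_start, 8.5, t)
    print(f"t = {t:.1f} s: P_0..P_2 = {[round(x, 3) for x in p_t[:3]]}, "
          f"P_(N>=3) = {sum(p_t[3:]):.3f}")
```

A useful property for checking such a calculation is that the mean atom number decays exactly exponentially, as the initial mean times exp(-Γt), whatever the initial distribution.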

Our experiment represents a new method for the real-time
determination of the number of atoms trapped and
strongly coupled to an optical cavity. We emphasize that
an exact number of atoms can be prepared in our cavity
within ~200 ms from the release of the MOT. Although
the trap loading is not deterministic, N can be measured
quickly compared to the subsequent trapping time $\tau \sim 3$ s [9].
These new techniques could assist in the realization
of various protocols in quantum information science,
including probabilistic schemes for entangling multiple
atoms in a cavity [11-13]. Although our current investigation
has centered on the case of small $N \le 3$, there are
reasonable prospects for an extension to higher values
$N \lesssim 10$ [see Fig. 2(c)]. Moreover, the rate at which information
about N is acquired can be substantially increased
from the current value $\kappa |\langle \hat{a} \rangle|^2 \sim 10^5$/s toward the rate
$g^2/\kappa \sim 10^8$/s $\gg \gamma$, with $\gamma$ the rate for fluorescent imaging [4].
Observation of the Vacuum Rabi Spectrum for One Trapped Atom

A. Boca, R. Miller, K. M. Birnbaum, A. D. Boozer, J. McKeever, and H. J. Kimble

The transmission spectrum for one atom strongly coupled to the field of a high finesse optical
resonator is observed to exhibit a clearly resolved vacuum Rabi splitting characteristic of the normal
modes in the eigenvalue spectrum of the atom-cavity system. A new Raman scheme for cooling atomic
motion along the cavity axis enables a complete spectrum to be recorded for an individual atom trapped
within the cavity mode, in contrast to all previous measurements in cavity QED that have required
averaging over $10^3$–$10^5$ atoms.

DOI: 10.1103/PhysRevLett.93.233603

A cornerstone of optical physics is the interaction of a
single atom with the electromagnetic field of a high
quality resonator. Of particular importance is the regime
of strong coupling, for which the frequency scale g associated
with reversible evolution for the atom-cavity system
exceeds the rates ($\gamma$, $\kappa$) for irreversible decay of atom
and cavity field, respectively [1]. In the domain of strong
coupling, a photon emitted by the atom into the cavity
mode is likely to be repeatedly absorbed and reemitted at
the single-quantum Rabi frequency 2g before being irreversibly
lost into the environment. This oscillatory exchange
of excitation between atom and cavity field results
from a normal-mode splitting in the eigenvalue spectrum
of the atom-cavity system [2-4], and has been dubbed the
vacuum Rabi splitting [3].

Strong  coupling  in  cavity  QED  as  evidenced  by  the 
vacuum  Rabi  splitting  provides  enabling  capabilities  for 
quantum information science, including for the implementation
of scalable quantum computation [5,6], for the
realization  of  distributed  quantum  networks  [7,8],  and 
more  generally,  for  the  study  of  open  quantum  systems 
[9].  Against  this  backdrop,  experiments  in  cavity  QED 
have  made  great  strides  over  the  past  two  decades  to 
achieve  strong  coupling  [10].  The  vacuum  Rabi  splitting 
for  single  intracavity  atoms  has  been  observed  with 
atomic  beams  in  both  the  optical  [11-13]  and  microwave 
regimes  [14].  The  combination  of  laser  cooled  atoms  and 
large  coherent  coupling  has  enabled  the  vacuum  Rabi 
spectrum  to  be  obtained  from  transit  signals  produced 
by  single  atoms  [15].  A  significant  advance  has  been  the 
trapping of individual atoms in a regime of strong coupling [16,17],
with the vacuum Rabi splitting first evidenced for single trapped atoms
in Ref. [16] and the entire
transmission  spectra  recorded  in  Ref.  [18]. 

PACS numbers: 42.50.Pq, 03.67.-a, 32.80.Pj

Without exception these prior single atom experiments related to the
vacuum Rabi splitting in cavity QED [11-18] have required averaging over
trials with many atoms to obtain quantitative spectral information, even
if individual trials involved only single atoms (e.g., $10^5$ atoms were
required to obtain a spectrum in Ref. [14] and $>10^3$ atoms were needed
in Ref. [18]). By contrast, the implementation of complex algorithms in
quantum information science requires the capability for repeated
manipulation and measurement of an individual quantum system, as has
been spectacularly demonstrated with trapped ions [19,20] and recently
with Cooper pair boxes [21].

With this goal in mind, in this Letter we report measurements of the
spectral response of single atoms that are trapped and strongly coupled
to the field of a high finesse optical resonator. By alternating
intervals of probe measurement and of atomic cooling, we record a
complete probe spectrum for one-and-the-same atom. The vacuum Rabi
splitting is thereby measured in a quantitative fashion for each atom by
way of a protocol that represents a first step towards more complex
tasks in quantum information science. An essential component of our
protocol is a new Raman scheme for cooling atomic motion along the
cavity axis that leads to inferred atomic localization
$\Delta z_{\rm axial} \simeq 33$ nm,
$\Delta\rho_{\rm transverse} \simeq 5.5$ μm.

FIG. 1 (color online). A single atom is trapped inside an optical cavity
in the regime of strong coupling by way of an intracavity FORT driven by
the field $\mathcal{E}_{\rm FORT}$. The transmission spectrum
$T_1(\omega_p)$ for the atom-cavity system is obtained by varying the
frequency $\omega_p$ of the probe beam $\mathcal{E}_p$ and recording the
output with single-photon detectors. Cooling of the radial atomic motion
is accomplished with the transverse fields $\Omega_4$, while axial
cooling results from Raman transitions driven by the fields
$\mathcal{E}_{\rm FORT}$, $\mathcal{E}_{\rm Raman}$. An additional
transverse field $\Omega_3$ acts as a repumper during probe intervals.

A simple schematic of our experiment is given in Fig. 1 [22]. After
release from a magneto-optical trap (MOT) located several mm above the
Fabry-Perot cavity formed by mirrors $(M_1, M_2)$, single Cesium atoms
are cooled and loaded into an intracavity far-off-resonance trap (FORT)
and are thereby strongly coupled to a single mode of the cavity. Our
experiment employs the $6S_{1/2}, F = 4 \to 6P_{3/2}, F' = 5'$
transition of the $D_2$ line in Cs at $\lambda_A = 852.4$ nm, for which
the maximum single-photon Rabi frequency is $2g_0/2\pi = 68$ MHz for
$(F = 4, m_F = \pm 4) \to (F' = 5', m'_F = \pm 5)$. The transverse decay
rate for the $6P_{3/2}$ atomic states is $\gamma/2\pi = 2.6$ MHz, while
the cavity field decays at rate $\kappa/2\pi = 4.1$ MHz. Our system is
in the strong coupling regime of cavity QED, $g_0 \gg (\gamma, \kappa)$
[1].
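
As a quick numerical check of the strong-coupling condition, the rates quoted above can be compared directly. This is only an illustrative sketch; the variable names are ours, and all rates are taken as frequencies divided by $2\pi$ in MHz.

```python
# Rates quoted in the text, each divided by 2*pi and expressed in MHz.
g0 = 68.0 / 2    # 2*g0/2pi = 68 MHz is the single-photon Rabi frequency
gamma = 2.6      # decay rate quoted for the 6P_3/2 atomic states
kappa = 4.1      # cavity field decay rate

ratio_gamma = g0 / gamma   # coherent coupling relative to atomic decay
ratio_kappa = g0 / kappa   # coherent coupling relative to cavity decay

print(f"g0/gamma = {ratio_gamma:.1f}, g0/kappa = {ratio_kappa:.1f}")
```

Both ratios come out well above unity, which is the sense in which $g_0 \gg (\gamma, \kappa)$ holds for these parameters.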

The intracavity FORT is driven by a linearly polarized input field
$\mathcal{E}_{\rm FORT}$ at $\lambda_F = 935.6$ nm [23], resulting in
nearly equal ac-Stark shifts for all Zeeman states in the
$6S_{1/2}, F = 3, 4$ manifold [24]. At an antinode of the field, the
peak value of the trapping potential for these states is
$U_0/h = -39$ MHz for all our measurements. Zeeman states of the
$6P_{3/2}, F' = 5'$ manifold likewise experience a trapping potential,
albeit with a weak dependence on $m'_F$ [17]. The cavity length is
independently stabilized to length $l_0 = 42.2$ μm such that a
TEM$_{00}$ mode at $\lambda_{C_1}$ is resonant with the free-space
atomic transition at $\lambda_A$ and another TEM$_{00}$ mode at
$\lambda_{C_2}$ is resonant at $\lambda_F$ [25].

As illustrated in Fig. 1, we record the transmission spectrum
$T_1(\omega_p)$ for a weak external probe $\mathcal{E}_p$ of variable
frequency $\omega_p$ incident upon the cavity containing one strongly
coupled atom [26]. Our protocol consists of an alternating sequence of
probe and cooling intervals. The probe beam is linearly polarized [27]
and is matched to the TEM$_{00}$ mode around $\lambda_{C_1}$.
$\mathcal{E}_p$ illuminates the cavity for
$\Delta t_{\rm probe} = 100$ μs, and the transmitted light is detected
by photon counting [28]. During this interval a repumping beam
$\Omega_3$, transverse to the cavity axis and resonant with
$6S_{1/2}, F = 3 \to 6P_{3/2}, F' = 4'$, also illuminates the atom. In
successive probe intervals, the frequency $\omega_p$ is linearly swept
from below to above the common atom-cavity resonance at
$\omega_A \simeq \omega_{C_1}$. The frequency sweep for the probe is
repeated eight times in $\Delta t_{\rm tot} = 1.2$ s, and then a new
loading cycle is initiated.

Following each probe interval, we apply light to cool both the radial
and axial motion for $\Delta t_{\rm cool} = 2.9$ ms. Radial cooling is
achieved by the $\Omega_4$ beams consisting of pairs of
counter-propagating fields in a $\sigma_\pm$ configuration perpendicular
to the cavity axis, as shown in Fig. 1. The $\Omega_4$ beams are detuned
$\Delta_4 \simeq 10$ MHz to the blue of the $4 \to 4'$ transition to
provide blue Sisyphus cooling [29] for motion transverse to the cavity
axis.

To cool the axial motion for single trapped atoms, we have developed a
new scheme that employs $\mathcal{E}_{\rm FORT}$ and an auxiliary field
$\mathcal{E}_{\rm Raman}$ that is frequency offset by
$\Delta_{\rm Raman} = \Delta_{\rm HF} + \delta$ and phase locked to
$\mathcal{E}_{\rm FORT}$. Here, $\Delta_{\rm HF} = 9.192\,632$ GHz is
the hyperfine splitting between $6S_{1/2}, F = 3, 4$.
$\mathcal{E}_{\rm FORT}$, $\mathcal{E}_{\rm Raman}$ drive Raman
transitions between the $F = 3, 4$ levels with effective Rabi frequency
$\Omega_E \sim 200$ kHz. By tuning $\delta$ near the $\Delta n = -2$
motional sideband (i.e., $-2\nu_0 \sim \delta = -1.0$ MHz, where $\nu_0$
is the axial vibrational frequency at an antinode of the FORT), we
implement sideband cooling via the $F = 3 \to 4$ transition, with
repumping provided by the $\Omega_4$ beams. The Raman process also acts
as a repumper for population pumped to the $F = 3$ level by the
$\Omega_4$ beams. Each cooling interval is initiated by turning on the
fields $\Omega_4$, $\mathcal{E}_{\rm Raman}$ during
$\Delta t_{\rm cool}$ and is terminated by gating these fields off
before the next probe interval $\Delta t_{\rm probe}$.

Figure 2 displays normalized transmission spectra $T_1$ [26] for
individual atoms acquired by alternating probe and cooling intervals.
Clearly evident in each trace is a two-peaked structure that represents
the vacuum Rabi splitting observed on an atom-by-atom basis. Also shown
is the predicted transmission spectrum obtained from the steady-state
solution to the master equation for one atom strongly coupled to the
cavity, as discussed below. The quantitative correspondence between
theory and experiment is evidently quite reasonable for each atom. Note
that $m_F$-dependent Stark shifts for $F' = 5'$ in conjunction with
optical pumping caused by $\mathcal{E}_p$ lead to the asymmetry of the
peaks in Fig. 2 via an effective population-dependent shift of the
atomic resonance frequency [30].

To obtain the data in Fig. 2, $N_{\rm load} = 61$ atoms were loaded into
the FORT in 500 attempts, with the probability that a given successful
attempt involved two or more atoms estimated to be
$P_{\rm load}(N \geq 2) \lesssim 0.06$. Of the $N_{\rm load}$ atoms,
$N_{\rm survive} = 28$ atoms remained trapped for the entire duration
$\Delta t_{\rm tot}$. The six spectra shown in Fig. 2 were selected by a
random drawing from this set of $N_{\rm survive}$ atoms. Our sole
selection criterion for presence of an atom makes no consideration of
the spectral structure of $T_1(\omega_p)$ except that there should be
large absorption on line center,
$T_1(\omega_p = \omega_{C_1}) < T_{\rm thresh} \approx 0.2$ [26,31].
Note that an atom trapped in the FORT in the absence of the cooling and
probing light has lifetime $\tau_0 \simeq 3$ s, which leads to a
survival probability $p(\Delta t_{\rm tot}) \simeq 0.7$.

FIG. 2 (color online). Transmission spectrum $T_1(\omega_p)$ for six
randomly drawn atoms [26]. In each case, $T_1(\omega_p)$ is acquired for
one-and-the-same atom, with the two peaks of the vacuum Rabi spectrum
clearly evident. The error bars reflect the statistical uncertainties in
the number of photocounts. The full curve is from the steady-state
solution to the master equation.
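
The loading and survival figures quoted above can be put side by side with a short calculation. This is a sketch only: it assumes simple exponential decay for the dark-trap lifetime, which the text does not state explicitly, and the variable names are ours.

```python
import math

# Numbers quoted in the text.
n_attempts = 500      # loading attempts
n_load = 61           # attempts that loaded an atom
n_survive = 28        # atoms still trapped after the full sequence
tau0 = 3.0            # dark-trap lifetime in seconds
dt_tot = 1.2          # total duration of the probe/cool sequence in seconds

# Assumption: exponential decay with lifetime tau0 in the absence of
# cooling and probing light.
p_dark = math.exp(-dt_tot / tau0)

load_fraction = n_load / n_attempts      # fraction of attempts that load
survive_fraction = n_survive / n_load    # measured survival with light on

print(f"dark-trap survival ~ {p_dark:.2f}")
print(f"loading fraction   ~ {load_fraction:.3f}")
print(f"measured survival  ~ {survive_fraction:.2f}")
```

Under this assumption the dark-trap estimate is about 0.67, consistent with the quoted value of roughly 0.7, while the measured fraction 28/61 is lower.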

In Fig. 3 we collect the results for $T_1(\omega_p)$ for all
$N_{\rm survive} = 28$ atoms, and display the average transmission
spectrum $\bar{T}_1(\omega_p)$, as well as a scatter plot from the
individual spectra. This comparison demonstrates that the vacuum Rabi
spectrum observed for any particular atom represents with reasonable
fidelity the spectrum that would be obtained from averaging over many
atoms, albeit with fluctuations due to Poisson counting and optical
pumping effects over the finite duration of the probe. The total
acquisition time associated with the probe beam for the spectrum of any
one atom is only 40 ms.
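
The 40 ms figure follows from the interval lengths given earlier (100 μs probe, 2.9 ms cooling, eight sweeps in 1.2 s). The sketch below reproduces it under the assumption that probe and cooling intervals alternate back to back with no dead time; the number of frequency points per sweep is our inference, not a value stated in the text.

```python
# Interval lengths quoted in the text, in seconds.
dt_probe = 100e-6   # one probe interval
dt_cool = 2.9e-3    # one cooling interval
dt_tot = 1.2        # total duration of the sequence
n_sweeps = 8        # number of probe frequency sweeps within dt_tot

# Assumption: probe and cooling intervals alternate with no dead time.
cycle = dt_probe + dt_cool
n_cycles = round(dt_tot / cycle)

# Inferred quantities (not stated in the text).
points_per_sweep = n_cycles // n_sweeps
total_probe_time = n_cycles * dt_probe

print(n_cycles, points_per_sweep, f"{total_probe_time * 1e3:.0f} ms")
```

With these assumptions there are 400 probe intervals in total, so the probe light is on for 400 × 100 μs = 40 ms per atom.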

We have also acquired transmission spectra $T_1(\omega_p)$ for operating
conditions other than those in Figs. 2 and 3, including intensities
$|\mathcal{E}_p|^2$ varied by several factors (including 2) and
atom-cavity detunings
$\Delta_{AC} = \omega_A - \omega_{C_1} = \pm 13$ MHz. We will describe
these results elsewhere.

The full curves in Figs. 2 and 3 are obtained from the steady-state
solution of the master equation including all transitions
$(F = 4, m_F) \leftrightarrow (F' = 5', m'_F)$ with their respective
coupling coefficients $g_0^{(m_F, m'_F)}$, as well as the two nearly
degenerate modes of our cavity [23,27]. For the comparison of theory and
experiment, the parameters
$(g_0^{(m_F, m'_F)}, \gamma, \kappa, \Delta_{AC}, \omega_p - \omega_A, |\mathcal{E}_p|^2, U_0)$
are known in absolute terms without adjustment. However, we have no a
priori knowledge of the particular FORT well into which the atom is
loaded along the cavity standing wave, nor of the energy of the atom.
The FORT shift and coherent coupling rate are both functions of atomic
position $\mathbf{r}$, with
$U(\mathbf{r}) = U_0 \sin^2(k_{C_2} z) \exp(-2\rho^2/w_{C_2}^2)$ and
$g^{(m_F, m'_F)}(\mathbf{r}) = g_0^{(m_F, m'_F)} \psi(\mathbf{r})$,
where $g_0^{(m_F, m'_F)} = g_0 G_{m_F, m'_F}$ with $G_{i,f}$ related to
the Clebsch-Gordan coefficient for the particular
$m_F \leftrightarrow m'_F$ transition,
$\psi(\mathbf{r}) = \cos(k_{C_1} z) \exp(-\rho^2/w_{C_1}^2)$, where
$\rho$ is the transverse distance from the cavity axis $z$, and
$k_{C_{1,2}} = 2\pi/\lambda_{C_{1,2}}$ [25].

FIG. 3 (color online). Transmission spectrum $\bar{T}_1(\omega_p)$
(thick trace) resulting from averaging all 28 individual spectra
$T_1(\omega_p)$ (dots) [26]. The thin trace is from the steady-state
solution to the master equation, and is identical to that in Fig. 2. The
only free parameters in the theory are the temperature and the range of
FORT antinodes; the vertical scale is absolute.
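
To make the position dependence concrete, the two mode functions can be evaluated directly. This is a minimal sketch using the wavelengths and waists of Ref. [25] and the quoted $U_0/h = -39$ MHz; the function and constant names are ours, and the Clebsch-Gordan factors $G_{m_F, m'_F}$ are left out.

```python
import math

# Parameters from the text and Ref. [25] (lengths in meters).
LAMBDA_C1 = 852.4e-9   # cavity QED mode wavelength
LAMBDA_C2 = 935.6e-9   # FORT mode wavelength
W_C1 = 23.4e-6         # waist of the cavity QED mode
W_C2 = 24.5e-6         # waist of the FORT mode
U0_MHZ = -39.0         # peak FORT shift U0/h in MHz


def psi(z, rho):
    """Spatial factor of the coupling, psi(r) = cos(k1 z) exp(-rho^2/w1^2)."""
    k1 = 2 * math.pi / LAMBDA_C1
    return math.cos(k1 * z) * math.exp(-rho**2 / W_C1**2)


def fort_shift(z, rho):
    """FORT shift U(r)/h in MHz, U0 sin^2(k2 z) exp(-2 rho^2/w2^2)."""
    k2 = 2 * math.pi / LAMBDA_C2
    return U0_MHZ * math.sin(k2 * z) ** 2 * math.exp(-2 * rho**2 / W_C2**2)


# The FORT antinode closest to the cavity center sits at z = lambda_C2/4.
z_antinode = LAMBDA_C2 / 4
print(fort_shift(z_antinode, 0.0))   # full trap depth on axis
print(psi(z_antinode, 0.0))          # coupling factor in this particular well
```

Because the two standing waves have different periods, the value of $|\psi|$ at a FORT antinode varies from well to well; the well nearest the cavity center, for instance, has a small coupling factor.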

As discussed in connection with Fig. 4 below, for the theoretical curves
shown in Figs. 2 and 3, we have chosen only the 30 out of 90 total FORT
wells for which $|\psi(\mathbf{r}_{\rm FORT})| \geq 0.87$, where
$\mathbf{r}_{\rm FORT}$ is such that $U(\mathbf{r}_{\rm FORT}) = U_0$.
Furthermore, for these wells we have averaged $T_1(\omega_p)$ over a
Gaussian distribution in position $\mathbf{r}$ consistent with a
temperature $k_B T = 0.1 U_0$ (~200 μK). Since all parameters are known
except for those that characterize atomic motion, the good agreement
between theory and experiment [32] allows us to infer that our cooling
protocol together with the selection criterion $T_{\rm thresh} = 0.2$
results in individual atoms that are strongly coupled in one of the
“best” FORT wells (i.e., $|\psi(\mathbf{r}_{\rm FORT})| \gtrsim 0.87$)
with “temperature” ~200 μK [33].

FIG. 4 (color online). Theoretical plots for $T_1(\omega_p)$ from the
steady-state solution of the master equation. (a) For zero temperature,
$T_1(\omega_p)$ is calculated from an average over various FORT
antinodes along the cavity axis, with the inset showing the associated
distribution of values for $|\psi(\mathbf{r}_{\rm FORT})|$. (b) For an
optimum FORT well (i.e., $|\psi(\mathbf{r}_{\rm FORT})| = 1$),
$T_1(\omega_p)$ is computed for various temperatures from an average
over atomic positions within the well.
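
Two of the numbers in this paragraph can be checked with a few lines. The sketch below converts $k_B T = 0.1\,U_0$ into a temperature using $|U_0|/h = 39$ MHz, and estimates how many of the 90 FORT wells satisfy $|\psi| \geq 0.87$. The well count rests on an assumption of ours: that the phase of the cavity QED standing wave at the FORT antinodes is spread uniformly, which is only approximately true for the actual pair of wavelengths.

```python
import math

H = 6.62607015e-34    # Planck constant in J s
KB = 1.380649e-23     # Boltzmann constant in J/K

# Temperature corresponding to k_B T = 0.1 |U0|, with |U0|/h = 39 MHz.
u0_over_h = 39e6
temperature = 0.1 * H * u0_over_h / KB   # in kelvin

# Assumption: the phase phi of cos(phi) at the FORT antinodes is uniformly
# distributed, so the fraction of wells with |cos(phi)| >= psi_min is
# 2*arccos(psi_min)/pi.
psi_min = 0.87
fraction = 2 * math.acos(psi_min) / math.pi
n_wells_est = 90 * fraction

print(f"T ~ {temperature * 1e6:.0f} uK")
print(f"wells with |psi| >= {psi_min}: about {n_wells_est:.1f} of 90")
```

The temperature comes out a little below 200 μK, and the uniform-phase estimate gives roughly 30 wells, in line with the 30 out of 90 quoted above.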

In support of these assertions, Fig. 4(a) explores the theoretical
dependence of $T_1(\omega_p)$ on the set of FORT wells selected, and
hence on the distribution of values for $|\psi(\mathbf{r}_{\rm FORT})|$
in the ideal case $T = 0$. Extending the average beyond the 30 “best”
FORT wells leads to spectra that are inconsistent with our observations
in Figs. 2 and 3. Figure 4(b) likewise investigates the theoretical
dependence of $T_1(\omega_p)$ on the temperature $T$ for an atom at an
antinode of the FORT with optimal coupling (i.e.,
$|\psi(\mathbf{r})| = 1$). For temperatures $T \gtrsim 200$ μK, the
calculated spectra are at variance with the data in Figs. 2 and 3, from
which we infer atomic localization $\Delta z \simeq 33$ nm in the axial
direction and $\Delta x = \Delta y \simeq 3.9$ μm in the plane
transverse to the cavity axis. Beyond these conclusions, a consistent
feature of our measurements is that reasonable correspondence between
theory and experiment is only obtained by restricting
$|\psi(\mathbf{r})| \gtrsim 0.8$.
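
One way to recover the quoted localization from $k_B T = 0.1\,U_0$ is a harmonic expansion of the FORT potential about an antinode combined with equipartition. This is a sketch under those assumptions, not necessarily the calculation used for the paper; the variable names are ours, and the FORT wavelength and waist are taken from the text and Ref. [25].

```python
import math

ratio = 0.1            # k_B T / |U0|, as quoted in the text
lambda_f = 935.6e-9    # FORT wavelength in meters
w_f = 24.5e-6          # FORT mode waist in meters

k = 2 * math.pi / lambda_f

# Near an antinode, the energy above the trap bottom is approximately
#   |U0| * (k z)^2  +  2 |U0| * (x^2 + y^2) / w^2 .
# Equipartition assigns k_B T / 2 to each quadratic term, which gives
#   <z^2> = ratio / (2 k^2)   and   <x^2> = ratio * w^2 / 4 .
dz = math.sqrt(ratio / 2) / k
dx = w_f * math.sqrt(ratio) / 2
drho = math.sqrt(2) * dx        # combined transverse spread from x and y

print(f"dz   ~ {dz * 1e9:.1f} nm")
print(f"dx   ~ {dx * 1e6:.2f} um")
print(f"drho ~ {drho * 1e6:.2f} um")
```

Under these assumptions the axial spread is about 33 nm and the transverse spread about 3.9 μm per axis, or about 5.5 μm for the radial coordinate, matching the values quoted in the text.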

Our experiment represents an important advance in the quest to obtain
single atoms trapped with optimal strong coupling to a single mode of
the electromagnetic field. The vacuum Rabi splitting is the hallmark of
strong coupling for single atoms and photons, and all measurements until
now have required averaging over many atoms for its observation. By
contrast, we are able to observe spectra $T_1(\omega_p)$ on an
atom-by-atom basis with clearly resolved normal-mode splittings. These
spectra contain detailed quantitative information about the coherent
coupling $g(\mathbf{r})$ and FORT shifts for each atom. This information
indicates that the coupling $g$ is in a narrow range of near-maximal
values. Our observations are made possible by the implementation of a
new scheme to cool both the radial and axial atomic motion. The
capabilities demonstrated in this Letter should provide the tools
necessary to implement diverse protocols in quantum information science
[5-9].

We  gratefully  acknowledge  the  contributions  of 
T.  Northup.  This  research  is  supported  by  the  Caltech 
MURI  Center  for  Quantum  Networks,  by  the  National 
Science  Foundation,  and  by  the  Advanced  Research  and 
Development  Activity  (ARDA). 


[1] H. J. Kimble, Phys. Scr. T76, 127 (1998).

[2] E. T. Jaynes and F. W. Cummings, Proc. IEEE 51, 89 (1963).

[3] J. J. Sanchez-Mondragon, N. B. Narozhny, and J. H. Eberly, Phys.
Rev. Lett. 51, 550 (1983).

[4] G. S. Agarwal, Phys. Rev. Lett. 53, 1732 (1984).

[5] T. Pellizzari, S. A. Gardiner, J. I. Cirac, and P. Zoller, Phys. Rev.
Lett. 75, 3788 (1995).

[6] L.-M. Duan and H. J. Kimble, Phys. Rev. Lett. 92, 127902 (2004).

[7] J. I. Cirac, P. Zoller, H. J. Kimble, and H. Mabuchi, Phys. Rev.
Lett. 78, 3221 (1997).

[8] H.-J. Briegel et al., in The Physics of Quantum Information, edited
by D. Bouwmeester, A. Ekert, and A. Zeilinger (Springer, Berlin, 2000),
p. 192.

[9] H. Mabuchi and A. C. Doherty, Science 298, 1372 (2002).

[10] Cavity Quantum Electrodynamics, edited by P. Berman (Academic Press,
San Diego, 1994).

[11] R. J. Thompson, G. Rempe, and H. J. Kimble, Phys. Rev. Lett. 68,
1132 (1992).

[12] J. J. Childs et al., Phys. Rev. Lett. 77, 2901 (1996).

[13] R. J. Thompson, Q. A. Turchette, O. Carnal, and H. J. Kimble, Phys.
Rev. A 57, 3084 (1998).

[14] M. Brune et al., Phys. Rev. Lett. 76, 1800 (1996).

[15] C. J. Hood, M. S. Chapman, T. W. Lynn, and H. J. Kimble, Phys. Rev.
Lett. 80, 4157 (1998).

[16] J. Ye, D. W. Vernooy, and H. J. Kimble, Phys. Rev. Lett. 83, 4987
(1999).

[17] J. McKeever et al., Phys. Rev. Lett. 90, 133602 (2003).

[18] P. Maunz et al., quant-ph/0406136.

[19] M. Riebe et al., Nature (London) 429, 734 (2004).

[20] M. D. Barrett et al., Nature (London) 429, 737 (2004).

[21] A. Wallraff et al., Nature (London) 431, 162 (2004); I. Chiorescu
et al., ibid. 431, 159 (2004).

[22] See J. McKeever, Ph.D. thesis, California Institute of Technology,
2004, for more technical details.

[23] Birefringence in the mirrors leads to two nondegenerate cavity modes
with orthogonal polarizations $\hat{l}_\pm$ and mode splitting
$\Delta\nu_{C_1} = 4.4 \pm 0.2$ MHz. $\mathcal{E}_{\rm FORT}$
($\mathcal{E}_{\rm Raman}$) is linearly polarized and aligned close to
$\hat{l}_+$ ($\hat{l}_-$) for the higher (lower) frequency mode.

[24] K. L. Corwin et al., Phys. Rev. Lett. 83, 1311 (1999).

[25] At the cavity center $z = 0$, the mode waists
$w_{C_{1,2}} = \{23.4, 24.5\}$ μm at
$\lambda_{C_{1,2}} = \{852.4, 935.6\}$ nm.

[26] $T_1(\omega_p)$ is proportional to the ratio of photon flux
transmitted by $M_2$ to the flux $|\mathcal{E}_p|^2$ incident upon $M_1$,
with normalization $T_0(\omega_p = \omega_{C_1}) = 1$ for the empty
cavity. Figures 2 and 3 also show the corresponding intracavity photon
number $\langle n(\omega_p)\rangle$.

[27] Relative to $\hat{l}_+$ [23], the linear polarization vector
$\hat{l}_p$ for the probe field $\mathcal{E}_p$ is aligned along a
direction $\hat{l}_p = \cos\theta\,\hat{l}_+ + \sin\theta\,\hat{l}_-$,
where $\theta = 13^\circ$ for Fig. 2; however, the theory is relatively
insensitive to $\theta$ for $\theta \lesssim 15^\circ$.

[28] The efficiency for photon escape from the cavity is
$\alpha_{e2} = 0.6 \pm 0.1$. The propagation efficiency from $M_2$ to
detectors $(D_1, D_2)$ is $\alpha_p = 0.41 \pm 0.03$, with then each
detector receiving half of the photons. The avalanche photodiodes
$(D_1, D_2)$ have quantum efficiencies $\alpha_P = 0.49 \pm 0.05$.

[29] D. Boiron et al., Phys. Rev. A 53, R3734 (1996).

[30] The ac-Stark shifts of the $(F' = 5', m'_F)$ states are given by
$\{m'_F, U_{m'_F}\} = \{\pm 5, 1.18U_0\}, \{\pm 4, 1.06U_0\},
\{\pm 3, 0.97U_0\}, \{\pm 2, 0.90U_0\}, \{\pm 1, 0.86U_0\},
\{0, 0.85U_0\}$.

[31] Transmission spectra $T_1(\omega_p)$, $\bar{T}_1(\omega_p)$ are
insensitive over a range of selection criteria
$0.02 < T_{\rm thresh} < 0.73$.

[32] In Figs. 2 and 3 the discrepancy between experiment and the
steady-state theory for $T_1(\omega_p)$ around $\omega_p \simeq 0$ can be
accounted for by a transient solution to the master equation which
includes optical pumping effects over the probe interval
$\Delta t_{\rm probe}$.

[33] Although the spectra are consistent with a thermal distribution, we
do not exclude a more complex model involving probe-dependent heating
and cooling effects.




arXiv:quant-ph/0401147v2 24 Jan 2004


Single-Photon  Generation  from  Stored  Excitation  in  an  Atomic  Ensemble 

C.  W.  Chou,  S.  V.  Polyakov,  A.  Kuzmich,*  and  H.  J.  Kimble 

Norman  Bridge  Laboratory  of  Physics  12-33 
California  Institute  of  Technology,  Pasadena,  CA  91125 
(Dated:  May  24,  2006) 

Single photons are generated from an ensemble of cold Cs atoms via the
protocol of Duan et al. [Nature 414, 413 (2001)]. Conditioned upon an
initial detection from field 1 at 852 nm, a photon in field 2 at 894 nm
is produced in a controlled fashion from excitation stored within the
atomic ensemble. The single-quantum character of the field 2 is
demonstrated by the violation of a Cauchy-Schwarz inequality, namely
$w(1_2, 1_2|1_1) = 0.24 \pm 0.05 \ngeq 1$, where $w(1_2, 1_2|1_1)$
describes detection of two events $(1_2, 1_2)$ conditioned upon an
initial detection $1_1$, with $w \to 0$ for single photons.


A critical capability for quantum computation and communication is the
controlled generation of single-photon pulses into well-defined spatial
and temporal modes of the electromagnetic field. Indeed, early work on
the realization of quantum computation utilized single-photon pulses as
quantum bits (flying qubits), with nonlinear interactions mediated by an
appropriate atomic medium [1, 2]. More recently, a scheme for quantum
computation by way of linear optics and photoelectric detection has been
developed that again relies upon single-photon pulses as qubits [3].
Protocols for the implementation of quantum cryptography [4] and of
distributed quantum networks also rely on this capability [5, 6], as do
some models for scalable quantum computation [7].

Efforts to generate single-photon wavepackets can be broadly divided
into techniques that provide photons “on demand” (e.g., quantum dots
coupled to microcavities [8, 9, 10]) and those that produce photons as a
result of conditional measurement on a correlated quantum system. For
conditional generation, the detection of one photon from a correlated
pair results in a one-photon state for the second photon, as was first
achieved using “twin” photons from atomic cascades [11, 12] and
parametric down conversion [13], with many modern extensions [14, 15,
16, 17]. Within the context of the collective enhancement of atom-photon
interactions in optically thick atomic samples [18, 19], a remarkable
protocol for scalable quantum networks [6] suggests a new avenue for
producing single photons via conditional measurement.

Inspired by the protocol of Ref. [6], in this Letter we report a
significant advance in the creation of single photons for diverse
applications in quantum information science, namely the generation and
storage of single quanta from an atomic ensemble. As illustrated in
Figure 1, an initial write pulse of (classical) light creates a state of
collective excitation in an ensemble of cold atoms as determined by
photoelectric detection for the generated field 1. Although this first
step is probabilistic, its success heralds the preparation of one
excitation stored within the atomic medium. After a programmable delay
$\delta t$, a read pulse converts the state of atomic excitation into a
field excitation, thereby generating one photon in a well-defined
spatial and temporal mode 2. The quantum character of the (1,2) fields
is demonstrated by the observed violation of a Cauchy-Schwarz inequality
for the ratio $R$ of cross correlations to auto-correlations [11],
namely $R = 53 \pm 2$, where $R \leq 1$ for any classical field [20, 21,
22].

FIG. 1: Schematic of experiment for conditional generation of single
photons. Write and read pulses sequentially propagate into a cloud of
cold Cs atoms (MOT), generating the correlated output fields (1, 2). A
detection event for field 1 at $D_{1A,1B}$ leads to an approximate
one-photon state for field 2, as confirmed with detectors $D_{2A,2B}$.
(P)BS: (polarization) beam splitter; SM: single-mode. The inset
illustrates the relevant atomic level scheme.

This greatly improved nonclassical correlation for photon pairs for the
(1,2) fields enables the conditional generation of single photons as in
Refs. [12, 13, 14, 15, 16, 17], but now with the photon 2 stored as an
excitation in the atomic ensemble [15]. Given a first photon 1 from the
write pulse, we trigger the emission of a second photon 2 with the read
pulse. To demonstrate the single-photon character of the field 2, we
measure the three-fold correlation function $w(1_2, 1_2|1_1)$ for
detection of two photons $(1_2, 1_2)$ from field 2 given a detection
event $1_1$ from field 1, where $w(1_2, 1_2|1_1) = 1$ for coherent
states and $w(1_2, 1_2|1_1) \geq 1$ for any classical field.
Experimentally, we find $w(1_2, 1_2|1_1) = 0.34 \pm 0.06$ for
$\delta t = 150$ ns, while $w(1_2, 1_2|1_1) = 0.24 \pm 0.05$ for
$\delta t = 0$, thereby taking an important step toward the creation of
ideal single photons for which $w(1_2, 1_2|1_1) \to 0$.

Figure 1 provides an overview of our experiment for producing correlated
photons from an optically thick sample of four-level atoms in a
magneto-optical trap (MOT) [20, 23]. The ground states
$\{|a\rangle; |b\rangle\}$ correspond to the $6S_{1/2}, F = \{4; 3\}$
levels in atomic Cs, while the excited states
$\{|e\rangle; |e'\rangle\}$ denote the
$\{6P_{3/2}, F = 4; 6P_{1/2}, F = 4\}$ levels of the $D_2$, $D_1$ lines
at {852; 894} nm, respectively. We start the protocol for single photon
generation by shutting off all light responsible for trapping and
cooling for 1 μs, with the trapping light turned off approximately
300 ns before the re-pumping light in order to empty the $F = 3$
hyperfine level in the Cs $6S_{1/2}$ ground state, thus preparing the
atoms in $|a\rangle$. During the “dark” period, the $j$th trial is
initiated at time $t_j^{(1)}$ when a rectangular pulse of laser light
from the write beam, 150 ns in duration (FWHM) and tuned 10 MHz below
the $|a\rangle \to |e\rangle$ transition, induces spontaneous Raman
scattering to level $|b\rangle$ via
$|a\rangle \to |e\rangle \to |b\rangle$. The write pulse is sufficiently
weak so that the probability to scatter one Raman photon into a forward
propagating wavepacket $\psi^{(1)}(\mathbf{r}, t_j^{(1)})$ is less than
unity for each pulse. Detection of one photon from field 1 results in a
“spin” excitation to level $|b\rangle$, with this excitation distributed
in a symmetrized, coherent manner throughout the sample of $N$ atoms
illuminated by the write beam.

Given this initial detection, the stored atomic excitation can be
converted into one quantum of light at a user controlled time
$t_j^{(2)} = t_j^{(1)} + \delta t$. To implement this conversion, a
rectangular pulse from the read beam, 120 ns in duration (FWHM) and
resonant with the $|b\rangle \to |e'\rangle$ transition, illuminates the
atomic sample. This pulse effects the transfer
$|b\rangle \to |e'\rangle \to |a\rangle$ with the accompanying emission
of a second Raman photon 2 on the $|e'\rangle \to |a\rangle$ transition
described by the wavepacket $\psi^{(2)}(\mathbf{r}, t_j^{(2)})$, where
the spatial and temporal structure of $\psi^{(1,2)}(\mathbf{r}, t)$ are
discussed in more detail in Ref. [24]. The trapping and re-pumping light
for the MOT are then turned back on to prepare the atoms for the next
trial $j + 1$, with the whole process repeated at 250 kHz.

The forward-scattered Raman light from the write, read pulses is
directed to two sets of single-photon detectors ($D_{1A,1B}$ for field 1
and $D_{2A,2B}$ for field 2) [26]. Light from the (write, read) pulses
is strongly attenuated (by $\sim 10^6$) by the filters shown in Fig. 1,
while the associated (1,2) photons from Raman scattering are transmitted
with high efficiency ($\approx 80\%$) [20]. Detection events from
$D_{1A,1B}$ within the intervals $[t_j^{(1)}, t_j^{(1)} + T]$ and from
$D_{2A,2B}$ within $[t_j^{(2)}, t_j^{(2)} + T]$ are time stamped (with a
resolution of 2 ns) and stored for later analysis. $T = 200$ ns for all
of our measurements.

For a particular set of operating conditions, we determine
the single $p_l$ and joint $p_{l,m}$ event probabilities from
the record of detection events at $D_{1A,1B}$, $D_{2A,2B}$, where
$(l,m) = 1$ or $2$. For example, the total singles probability
$p_1$ for events at $D_{1A}, D_{1B}$ due to field 1 is found from
the total number of detection events $n_{1A} + n_{1B}$ recorded
by $D_{1A}, D_{1B}$ during the intervals $[t_j^{(1)}, t_j^{(1)} + T]$ over
$M_{tot}$ repeated trials $\{j\}$, with then $p_1 = (n_{1A} + n_{1B})/M_{tot}$.
To determine $p_{1,1}$ for joint detections at $D_{1A}, D_{1B}$, we count
the total number of coincidences $N_{1A,1B}$ recorded by
$D_{1A}, D_{1B}$, with then $p_{1,1} = N_{1A,1B}/M_{tot}$. $p_{2,2}$ is found in
an analogous fashion using events from $D_{2A}, D_{2B}$. Joint
detections between the (1,2) fields are described by $p_{1,2}$,
which is determined by summing coincidence events between
the four pairs of detectors for the (1,2) fields (e.g.,
between pairs $D_{1A}, D_{2A}$).

FIG. 2: Left column (a)-(c): Normalized intensity correlation
functions $g_{1,1}, g_{2,2}, g_{1,2}$ versus observed detection
probabilities $p_1, p_2, \sqrt{p_1 p_2}$, respectively. Right column (d)-(f):
$q_{1,1}, q_{2,2}, q_{1,2}$ for joint detection versus $q_1, q_2, \sqrt{q_1 q_2}$ for single
detection, with $q_l, q_{l,m}$ referenced to the output of the MOT.
Statistical uncertainties are indicated by the error bars. The
full curves are from the model calculation described in the
text with $(\kappa_1, \kappa_2) = (0.17, 0.90)$ and $(|v_{1b}|^2, |v_{2b}|^2) = 0.006$.
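The counting procedure described above can be sketched in a few lines. This is a minimal illustration, not the authors' analysis code: the function name and the per-trial data layout (one pair of 0/1 flags for the two field-1 detectors in each gated window) are assumptions made for the example.

```python
from typing import List, Tuple


def estimate_field1_probabilities(trials: List[Tuple[int, int]]) -> Tuple[float, float]:
    """Estimate (p_1, p_11) from per-trial click records for D1A and D1B.

    trials[j] = (a, b), where a and b are 1 if D1A / D1B registered an
    event inside the gated window of trial j and 0 otherwise
    (hypothetical data layout).
    """
    m_tot = len(trials)                              # number of trials M_tot
    n_1a = sum(a for a, _ in trials)                 # singles at D1A
    n_1b = sum(b for _, b in trials)                 # singles at D1B
    n_1a1b = sum(1 for a, b in trials if a and b)    # coincidences D1A & D1B
    p_1 = (n_1a + n_1b) / m_tot
    p_11 = n_1a1b / m_tot
    return p_1, p_11


# Made-up record of eight trials, for illustration only.
records = [(1, 0), (0, 0), (1, 1), (0, 1), (0, 0), (0, 0), (0, 0), (0, 0)]
print(estimate_field1_probabilities(records))
```

The same bookkeeping applies to field 2, and the cross term $p_{1,2}$ sums coincidences over the four detector pairs.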

From $(p_l, p_{l,m})$ we derive estimates of the normalized
intensity correlation functions $g_{l,m}$, where $g_{l,m} = 1$ for
coherent states. For example, the auto-correlation function
$g_{1,1} = p_{1,1}/(p_{1A} p_{1B})$ for field 1, and similarly for the
functions $g_{2,2}, g_{1,2}$ for the auto-correlation of field 2 and
the cross correlation between fields (1,2). The first column
in Figure 2 displays $g_{1,1}$, $g_{2,2}$, and $g_{1,2}$ as functions
of $p_1$, $p_2$, and $\sqrt{p_1 p_2}$ [25]. A virtue of $g_{l,m}$ is its independence
from the propagation and detection efficiencies. In
the ideal case, the state for the fields (1,2) is [6, 22, 24]

$$|\Phi_{12}\rangle = |00\rangle + \sqrt{\chi}\,|11\rangle + \chi\,|22\rangle + O(\chi^{3/2}) , \qquad (1)$$

where $\sqrt{\chi}$ is the excitation amplitude for field 1 in each
trial of the experiment. For $\chi \ll 1$, $g_{1,1} = g_{2,2} = 2$
and $g_{1,2} = 1 + 1/\chi$. By contrast, for reasons that we
will shortly address, our measurements in Fig. 2 give
$g_{1,1} \simeq 1.7$ and $g_{2,2} \simeq 1.3$, with $g_{1,2}$ exhibiting a sharp rise
with decreasing $\sqrt{p_1 p_2}$, but with considerable scatter.
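As a small illustration of these definitions, the sketch below computes a normalized correlation from measured probabilities and the ideal-state prediction $g_{1,2} = 1 + 1/\chi$. The function names and the sample numbers are assumptions made for the example, not values from the experiment.

```python
def normalized_correlation(p_joint: float, p_a: float, p_b: float) -> float:
    """Joint probability divided by the product of the two singles probabilities."""
    return p_joint / (p_a * p_b)


def ideal_g12(chi: float) -> float:
    """Cross-correlation predicted by the ideal state of Eq. 1 for chi << 1."""
    return 1.0 + 1.0 / chi


# Uncorrelated events (p_joint = p_a * p_b) give the coherent-state value 1.
print(normalized_correlation(0.01 * 0.02, 0.01, 0.02))

# Smaller excitation probability gives stronger cross-correlation.
for chi in (0.2, 0.1, 0.05):
    print(chi, ideal_g12(chi))
```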

To provide a characterization of the field generation
that is independent of the efficiency of our particular
detection setup, we convert the photodetection probabilities
$(p_l, p_{l,m})$ to the quantities $(q_l, q_{l,m})$ for the field
mode collected by our imaging system at the output of
the MOT. Explicitly, for single events for fields (1,2),
we define $q_1 = p_1/\alpha_1$, $q_2 = p_2/\alpha_2$, while for joint events,
$q_{1,1} = p_{1,1}/\alpha_1^2$, $q_{2,2} = p_{2,2}/\alpha_2^2$, $q_{1,2} = p_{1,2}/\alpha_1\alpha_2$, where $\alpha_l$
gives the collection, propagation, and detection efficiency
[26]. The second column in Fig. 2 displays the measured
dependence of $q_{l,m}$ for joint events versus $q_1, q_2, \sqrt{q_1 q_2}$
for single events over a range of operating conditions. As
expected from Eq. 1, $q_{1,1}, q_{2,2}$ exhibit an approximately
quadratic dependence on $q_1, q_2$, while $q_{1,2}$ would be linear
for $\sqrt{q_1 q_2} \ll 1$ in the ideal case.
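The rescaling by the efficiencies can be checked numerically: dividing the singles by $\alpha_l$ and the joint probability by $\alpha_1\alpha_2$ leaves the normalized cross-correlation unchanged. The sketch below assumes hypothetical efficiency and probability values chosen only for illustration.

```python
from typing import Tuple


def to_mot_output(p_1: float, p_2: float, p_12: float,
                  alpha_1: float, alpha_2: float) -> Tuple[float, float, float]:
    """Convert detected probabilities to quantities referenced to the MOT output."""
    q_1 = p_1 / alpha_1
    q_2 = p_2 / alpha_2
    q_12 = p_12 / (alpha_1 * alpha_2)
    return q_1, q_2, q_12


# Hypothetical numbers, not taken from the experiment.
p_1, p_2, p_12 = 0.004, 0.006, 0.00012
q_1, q_2, q_12 = to_mot_output(p_1, p_2, p_12, alpha_1=0.05, alpha_2=0.08)

g12_from_p = p_12 / (p_1 * p_2)
g12_from_q = q_12 / (q_1 * q_2)
print(g12_from_p, g12_from_q)  # the two agree up to rounding
```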

In our experiment there are a number of imperfections
that lead to deviations from the ideal case expressed by
$|\Phi_{12}\rangle$ [6, 22, 24]. To capture the essential aspects, we have
developed a simple model that assumes the total fields
(1,2) at the output of the MOT consist of contributions
from $|\Phi_{12}\rangle$, together with background fields in coherent
states $|v_{1,2}\rangle$. Operationally, increases in $p_1, p_2$ are accomplished
by way of increases in the intensity of the write
beam, with only minor adjustments to the read beam.
Hence, we parameterize our model by taking $\chi = |v_w|^2$,
with $v_w$ as the (scaled) amplitude of the write beam.
Since important sources of noise are light scattering from
the write and read beams and background fluorescence
from uncorrelated atoms in the sample [24], we assume
that $v_{1,2} = \sqrt{\kappa_{1,2}}\, v_w$. We further allow for fixed incoherent
backgrounds $v_{1b}, v_{2b}$ to account for processes that do
not depend upon increases in the write intensity.

With this model, it is straightforward to compute the
quantities that appear in Figs. 2-4. The parameters
$(\kappa_1, \kappa_2) = (0.17, 0.90)$ and $(|v_{1b}|^2, |v_{2b}|^2) = 0.006$ are obtained
directly by optimizing the comparison between the
model results and our measurements of normalized correlation
functions (e.g., $g_{1,1}$ vs. $g_{1,2}$) without requiring
absolute efficiencies. $\kappa_1 = 0.17$ implies that the photon
number for "good" events associated with $|\Phi_{12}\rangle$ exceeds
that for "bad" (background) events from $|v_1\rangle$ by roughly
6-fold for detection at $D_{1A}, D_{1B}$. For the curves in Fig.
2, we must also obtain the efficiencies $\beta_l, \eta_l$ that convert
expectation values for normally ordered photon number
operators $\hat{n}_l$ for fields $l = (1,2)$ in the model into the
various $(p_l, p_{l,m})$ and $(q_l, q_{l,m})$ (e.g., $p_1 = \beta_1\langle \hat{n}_1\rangle$, $q_1 =
\eta_1\langle \hat{n}_1\rangle$, $q_{1,2} = \eta_1\eta_2\langle :\hat{n}_1\hat{n}_2: \rangle$). Ideally $\beta_l = \alpha_l$ and $\eta_l = 1$;
we find instead $(\beta_l, \eta_l) = (0.013, 0.15)$, where we take
$\beta_1 = \beta_2$ and $\eta_1 = \eta_2$ for simplicity. Among various candidates
under investigation, values $\beta_l < \alpha_l$, $\eta_l < 1$ can
arise from inherent mode mismatching for capturing collective
emission from the atomic ensemble [24].
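The "roughly 6-fold" figure can be recovered with one line of arithmetic. The estimate below is a sketch under two assumptions that the text does not spell out: the mean photon number of field 1 contributed by the ideal state of Eq. 1 is taken to leading order in $\chi$, and the fixed background $|v_{1b}|^2$ is neglected.

```latex
\frac{\langle \hat{n}_1 \rangle_{\text{good}}}{\langle \hat{n}_1 \rangle_{\text{bad}}}
  \approx \frac{\chi}{|v_1|^2}
  = \frac{|v_w|^2}{\kappa_1 |v_w|^2}
  = \frac{1}{\kappa_1}
  = \frac{1}{0.17}
  \approx 5.9
```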

Independent of the absolute efficiencies, we can utilize
the results from Fig. 2 to address directly the question
of the nonclassical character of the (1,2) fields by
following the pioneering work of Clauser [11]. The correlation
functions $g_{l,m}$ for fields for which the Glauber-Sudarshan
phase-space function $\varphi$ is well-behaved (i.e.,
classical fields) are constrained by the inequality $R \equiv
[g_{1,2}]^2/(g_{1,1} g_{2,2}) \le 1$ [11, 22]. In Fig. 3 we plot the experimentally
derived values for $R$ as a function of the
degree of cross-correlation $g_{1,2}$ [25]. As compared to previous
measurements for which $R = 1.84 \pm 0.06$ [20] and
$R = 1.34 \pm 0.05$ [21], we have now achieved $R \gg 1$, with
$R = 53 \pm 2$ for the largest value of $g_{1,2}$. In Figs. 2 and 3
as well as 4 to follow, all points are taken with $\delta t = 150$
ns, except the points at $g_{1,2} \simeq 10$, which have $\delta t = 0$.

FIG. 3: Ratio $R \equiv [g_{1,2}]^2/(g_{1,1} g_{2,2})$ versus the normalized cross
correlation $g_{1,2}$, where $R > 1$ for manifestly quantum (nonclassical)
fields. The points are from our experiment with
statistical uncertainties indicated by the error bars. The
full curve is from the model calculation with $(\kappa_1, \kappa_2)$ and
$(|v_{1b}|^2, |v_{2b}|^2)$ as in Fig. 2.
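The Cauchy-Schwarz ratio is simple to evaluate once the three correlation functions are known. The sketch below is illustrative only; the function name is an assumption, and the inputs are the ideal-state values quoted in the text ($g_{1,1} = g_{2,2} = 2$, $g_{1,2} = 1 + 1/\chi$) rather than measured data.

```python
def cauchy_schwarz_ratio(g11: float, g22: float, g12: float) -> float:
    """R = g12^2 / (g11 * g22); classical fields satisfy R <= 1."""
    return g12 ** 2 / (g11 * g22)


# Independent coherent fields sit exactly on the classical boundary.
print(cauchy_schwarz_ratio(1.0, 1.0, 1.0))

# Ideal state of Eq. 1 with chi = 0.1: g11 = g22 = 2 and g12 = 1 + 1/chi = 11.
print(cauchy_schwarz_ratio(2.0, 2.0, 11.0))
```

For the ideal state the ratio grows as $(1 + 1/\chi)^2/4$, so weak excitation pushes $R$ far above the classical bound.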

This large degree of quantum correlation between the
(1,2) fields suggests the possibility of producing a single
photon for field 2 by conditional detection of field 1.
To investigate this possibility, we consider the three-fold
correlation function $w(1_2, 1_2|1_1)$ for detection with the
setup shown in Fig. 1, namely

$$w(1_2, 1_2|1_1) \equiv \frac{p^{(c)}(1_2, 1_2|1_1)}{[p^{(c)}(1_2|1_1)]^2} , \qquad (2)$$

where $p^{(c)}(1_2, 1_2|1_1)$ is the conditional probability for detection
of two photons $(1_2, 1_2)$ from field 2 conditioned
upon the detection of an initial photon $1_1$ for field 1, and
$p^{(c)}(1_2|1_1)$ is the probability for detection of one photon
$1_2$ given a detection event $1_1$ for field 1. Bayes' theorem
allows the conditional probabilities in Eq. 2 to be written
in terms of single and joint probabilities $p^{(k)}$ for k-fold
detection, so that

$$w(1_2, 1_2|1_1) = \frac{p^{(1)}(1_1)\, p^{(3)}(1_1, 1_2, 1_2)}{[p^{(2)}(1_1, 1_2)]^2} . \qquad (3)$$

Fields with a positive-definite $\varphi$ must satisfy the Cauchy-Schwarz
inequality $w(1_2, 1_2|1_1) \ge 1$. Indeed, for independent
coherent states, $w = 1$, while for thermal beams,
$w = 2$. By contrast, for the state $|\Phi_{12}\rangle$ of Eq. 1,
$w = 4\chi \ll 1$ for small $\chi$, approaching the ideal case
$w \to 0$ for a "twin" Fock state $|1_1 1_2\rangle$.
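The step from the conditional form to the joint-probability form of the three-fold correlation function can be verified numerically. The sketch below uses hypothetical function names and made-up probabilities for independent fields, for which the classical value $w = 1$ is expected.

```python
def w_from_conditionals(p_c_two: float, p_c_one: float) -> float:
    """w as a ratio of conditional probabilities (Eq. 2)."""
    return p_c_two / p_c_one ** 2


def w_from_joint(p1: float, p12: float, p122: float) -> float:
    """w from single, two-fold and three-fold probabilities (Eq. 3)."""
    return p1 * p122 / p12 ** 2


# Independent fields (made-up numbers): joint probabilities factorize.
p1, p2 = 0.01, 0.02
p12 = p1 * p2
p122 = p1 * p2 * p2

# Bayes' theorem: condition on the field-1 detection by dividing by p1.
print(w_from_conditionals(p122 / p1, p12 / p1))
print(w_from_joint(p1, p12, p122))
```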

From the record of photo-detection events at
$D_{1A,1B}$, $D_{2A,2B}$, we calculate estimates of the various
probabilities appearing in Eq. 3, with the results of this
analysis shown in Fig. 4. Part (a) examines the quantity
$w_{i,j}(1_2, 1_2|1_1)$ obtained from events taken from different
trials $i \ne j$ for the (1,2) fields (i.e., detection $1_1$ in trial
$i$ for field 1 followed by two detections $(1_2, 1_2)$ in trial $j$
for field 2). In this case, the (1,2) fields should be statistically
independent [22], so that $w_{i,j}(1_2, 1_2|1_1) = g_{2,2}$.
Hence, we also superimpose $g_{2,2}$ from Fig. 2 and find
reasonable correspondence within the statistical uncertainties
(in particular, $w_{i,j}(1_2, 1_2|1_1) \ge 1$), thereby validating
our analysis techniques [25]. No corrections for
dark counts or other backgrounds have been applied to
the data in Fig. 4 (nor indeed to Figs. 2, 3).

FIG. 4: Three-fold correlation function $w(1_2, 1_2|1_1)$ for detection
event $1_1$ for field 1 followed by two events $(1_2, 1_2)$
for field 2 versus the normalized cross correlation $g_{1,2}$. (a)
$w_{i,j}(1_2, 1_2|1_1)$ for events $(1_1)_i$ and $(1_2, 1_2)_j$ from different trials
$i \ne j$ together with points for $g_{2,2}$. $w_{i,j} = g_{2,2}$ for statistically
independent trials. (b) $w_{i,i}(1_2, 1_2|1_1)$ for events from
the same trial $i$. $w_{i,i} < 1$ for sub-Poissonian fields in support
of the single-photon character of field 2. Statistical uncertainties
are indicated by the error bars. The full curves are from
the model calculation with $(\kappa_1, \kappa_2)$ and $(|v_{1b}|^2, |v_{2b}|^2)$ as in
Figs. 2, 3.

Fig. 4(b) displays $w_{i,i}(1_2, 1_2|1_1)$ for events from the
same experimental trial $i$ for the (1,2) fields. Significantly,
as the degree of cross-correlation expressed by
$g_{1,2}$ increases (i.e., decreasing $\chi$), $w_{i,i}(1_2, 1_2|1_1)$ drops
below the classical level of unity, indicative of the sub-Poissonian
character of the conditional state of field
2. For $\delta t = 150$ ns, $w_{i,i}(1_2, 1_2|1_1) = 0.34 \pm 0.06$ for
$g_{1,2} = 7.3$, while with $\delta t = 0$, $w_{i,i}(1_2, 1_2|1_1) = 0.24 \pm 0.05$
for $g_{1,2} = 10.2$. Beyond the comparison to our model
shown in the figure, empirically we find that $w_{i,i}(1_2, 1_2|1_1)$
is well approximated by $g_{1,1} g_{2,2}/g_{1,2}$, as in the ideal case
of Eq. 1. However, independent of such comparisons, we
stress that the observations reported in Fig. 4 represent
a sizable nonclassical effect in support of the conditional
generation of single photons for field 2.
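The empirical approximation $g_{1,1} g_{2,2}/g_{1,2}$ can be checked against the two quoted data points. This is only a rough consistency check: it assumes the representative values $g_{1,1} \simeq 1.7$ and $g_{2,2} \simeq 1.3$ quoted earlier in the text apply at both operating points, which the text does not state explicitly.

```python
def w_approximation(g11: float, g22: float, g12: float) -> float:
    """Empirical estimate of the same-trial three-fold correlation."""
    return g11 * g22 / g12


G11, G22 = 1.7, 1.3  # representative auto-correlations quoted for Fig. 2

# (g12, measured w, quoted uncertainty) for delta-t = 150 ns and delta-t = 0.
for g12, measured, sigma in ((7.3, 0.34, 0.06), (10.2, 0.24, 0.05)):
    estimate = w_approximation(G11, G22, g12)
    print(f"g12={g12}: estimate={estimate:.2f}, measured={measured} +/- {sigma}")
```

Under these assumptions both estimates fall within the quoted statistical uncertainty of the measured values.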

In conclusion, our experiment represents an important
step in the creation of an efficient source of single photons
stored within an atomic ensemble, and thereby towards
enabling diverse protocols in quantum information science
[3, 4, 6, 7]. Our model supports the hypothesis
that the inherent limiting behavior of $w_{i,i}(1_2, 1_2|1_1)$ below
unity is set by the efficiency $\eta_l$, which leads to prohibitively
long times for data acquisition for $\chi < 0.04$,
corresponding to the smallest value of $w_{i,i}$ in Fig. 4. We
are pursuing improvements to push $\eta_l \simeq 0.15 \to 1$, including
in the intrinsic collection efficiency following the
analysis of Ref. [24]. Dephasing due to Larmor precession
in the quadrupole field of the MOT limits $\delta t < 300$
ns, which can be extended to several seconds in optical
dipole or magnetic traps [23].

Real-Time Quantum Feedback Control of Atomic Spin-Squeezing

JM Geremia,* John K. Stockton, Hideo Mabuchi

Real-time feedback performed during a quantum nondemolition measurement
of atomic spin-angular momentum allowed us to influence the quantum
statistics of the measurement outcome. We showed that it is possible to harness
measurement backaction as a form of actuation in quantum control, and thus
we describe a valuable tool for quantum information science. Our feedback-mediated
procedure generates spin-squeezing, for which the reduction in quantum
uncertainty and resulting atomic entanglement are not conditioned on the
measurement outcome.


Quantum systems evolve deterministically
when no one is looking. Free from observation,
knowledge of a quantum state at one point in
time is in principle sufficient to predict its entire
evolution. However, when a measurement is
performed, quantum mechanics postulates that
the observer will obtain a random postmeasurement
outcome. Conveniently, measurement can
produce states that are difficult to obtain by
other means, such as Hamiltonian evolution,
and thus provides a powerful tool for quantum
state preparation. But standard quantum mechanics
does not predict the outcomes of individual
experiments, only their likelihood. Measurement-based
state preparation is hindered by
nondeterminism, and desirable (for example,
entangled) quantum states often correspond to
highly unlikely measurement outcomes. Here
we demonstrate that quantum indeterminism
can be reduced by suitable intrameasurement
feedback, engineered to steer the outcome of an
otherwise random quantum process toward a
deterministic outcome.

The quantum system in our experiment
was provided by a cloud of $N$ atoms each
with intrinsic angular momentum, $\hbar f$, because
of a combination of nuclear spin, valence
electron spin, and orbital angular momentum.
The atoms were initially polarized such that
their individual momenta were oriented along
a common longitudinal direction, which we
chose to be the x axis. The resulting atomic
state displayed a net magnetization, $\mathbf{F}$, along
x with magnitude $|\mathbf{F}| = \hbar\sqrt{F(F+1)}$, where
$F = Nf$. The cartesian components of $\mathbf{F}$ are
associated with noncommuting quantum operators,
$F_x$, $F_y$ and $F_z$, that obey the Heisenberg
uncertainty relation

$$\Delta F_y\, \Delta F_z \ge \tfrac{1}{2}\hbar\, |\langle F_x \rangle| \qquad (1)$$

This inequality has the interpretation that an
ensemble of measurements (for similarly prepared
atomic samples) performed on either $F_y$
or $F_z$ will yield a distribution of random
shot-to-shot outcomes. For a large magnetization,
the $F_z$ (for example) measurement
distribution is essentially Gaussian with mean
$\langle F_z \rangle$ and variance $\Delta F_z^2 = \langle F_z^2 \rangle - \langle F_z \rangle^2$. The
fully polarized atomic state has (in units of $\hbar$) $\langle F_x \rangle = F$ and
$\Delta F_y = \Delta F_z = \sqrt{F/2}$, and is referred to as a
coherent spin state (Fig. 1A).
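The coherent-state variance $F/2$ can be reproduced with a simple counting argument. The sketch below assumes, purely for illustration, $N$ independent spin-1/2 atoms (so $F = N/2$) polarized along x; a measurement of each atom along z then gives $\pm 1/2$ with equal probability, and the total projection follows a binomial distribution. The atoms in the experiment carry a larger spin $f$, so this is a simplified stand-in rather than a model of the apparatus.

```python
from math import comb


def spin_projection_variance(n_atoms: int) -> float:
    """Variance of F_z (units of hbar) for n_atoms spin-1/2 atoms polarized along x.

    With k atoms found 'up', F_z = k - n_atoms/2 and P(k) = C(n, k) / 2**n.
    The mean is zero by symmetry, so the variance equals the second moment.
    Integer arithmetic is used on (2k - n)^2 = 4 F_z^2 to avoid rounding.
    """
    weighted = sum((2 * k - n_atoms) ** 2 * comb(n_atoms, k)
                   for k in range(n_atoms + 1))
    return weighted / (4 * 2 ** n_atoms)


n = 100
print(spin_projection_variance(n), (n / 2) / 2)  # variance and F/2 with F = N/2
```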

It is possible to reduce the measurement
variance in one of the transverse components
below the coherent state value of $F/2$ at the
expense of increased uncertainty in the orthogonal
component, provided that Eq. 1 remains
satisfied (Fig. 1A). Polarized states
with this property are referred to as spin-squeezed
states (1) and have received much
attention for their potential to improve the
sensitivity of spin-resonance measurements,
including magnetometry (2, 3) and atomic
clocks (4, 5). Spin-squeezing below the coherent
state level is also of fundamental interest
in quantum information science for
achieving many-particle entanglement (6).

Although several different mechanisms
have been explored for the preparation of
squeezed atomic states (7, 8), interest has
focused on using quantum nondemolition
(QND) measurements (9-11), in which the
atomic system interacts coherently with an
off-resonant optical probe. As a result, the z
component of the atomic magnetization, $F_z$,
becomes weakly entangled with the probe
polarization, and photodetection gradually
(9) provides the observer with information
about the spin state (12-14). Continuous observation
conditionally reduces $\Delta F_z^2$ with respect
to the QND measurement outcome in
any individual measurement trajectory; however,
the ensemble averaged uncertainty is
not reduced, because the $F_z$ measurement
outcomes over many QND trajectories are
distributed with a variance equal to $\Delta F_z^2$ of
the initial state.

Our experiment used real-time continuous
quantum feedback (15) to steer the value of
$F_z$ toward a fiducial value in each QND
measurement trajectory (3, 12), and we were
thus able to demonstrate unconditional spin-squeezing.
Figure 1B provides a schematic of
our experimental apparatus (16), which consisted
of a spin-polarized cold atom cloud
(prepared by laser cooling) and a Faraday
polarimeter. As in previous experiments on
atomic spin-squeezing, we used optical
pumping initially to prepare approximate coherent
spin states. The polarimeter photocurrent,
which was proportional to $F_z$ via Faraday
rotation, provided our continuous QND
measurement of atomic spin. With feedback
enabled, the QND photocurrent modulated a
y-axis magnetic field via proportional feedback
to actuate the orientation of $\mathbf{F}$ by y-axis
Larmor precession.

Figure 2A1 shows the DC-coupled polarimeter
photocurrent for one open-loop
(feedback disabled) QND measurement trajectory.
Time t = 0 corresponds to the
beginning of the measurement record after
the atom sample preparation (16). The
QND measurement was initiated at t = 10
μs by opening the probe laser shutter, and
the value of $F_z$ for each trajectory manifests
itself as an offset in the photocurrent. We
computed the QND value of $F_z$ from the
mean photocurrent, $y_1 = \overline{y(t)}$, for the window
50 < t < 60 μs. At t = 60 μs, the
QND measurement was disabled for 10 μs
by closing the probe shutter but was reenabled
for the final 30 μs of the trajectory.
This latter probe phase constitutes a second
QND measurement, where $F_z$ was determined
from $y_2 = \overline{y(t)}$, for 70 < t < 80 μs.

The measurement trajectory in Fig. 2A1
illustrates conditional spin-squeezing, as the
value of $y_2$ is seen to be correlated with $y_1$.
The conditional uncertainty in $F_z$ may be
identified with the statistical uncertainty (due
to optical noise) in determining the mean
value $y_1$. With sufficient filtering (time averaging),
this uncertainty is significantly reduced
and spin-squeezing is obtained (2).
However, on a shot-to-shot basis, $y_1$ randomly
assumes different Gaussian-distributed
values with variance $\Delta F_z^2 \approx F/2$. Figure 3A
shows a histogram of $P(y_1)$, the probability of
observing a given value of $y_1$ (generated from
500 trajectories). The measured variance of
this distribution is given by $\Delta F_z^2$ for the
initial atomic state.

The outcome of the second QND measurement,
$y_2$, can only be predicted with improved
precision provided that the value of $y_1$
is already known: The squeezing is conditioned
on the value of $y_1$. This can be seen
from the probability distribution $P(y_2 - y_1)$
(Fig. 3B), which displays one-tenth the variance
of the initial spin projection. However,
the average atomic state over many trajectories
is not squeezed with respect to any fiducial
value of $F_z$, such as zero, as indicated by
$P(y_2)$ (dashed line in Fig. 3B). In Fig. 2A2, a
precision-pulsed longitudinal magnetic field
is applied during the 10-μs dark period to
produce a 90° rotation around the x axis. The
induced Larmor precession exchanges the
transverse uncertainties $\Delta F_y$ and $\Delta F_z$, making
$F_z$ the antisqueezed component. Consequently,
$y_2$ is distributed with greater variance than
$y_1$, as seen in the histogram in Fig. 3C.

Fig. 1. (A) Graphical representation of atomic magnetization, including transverse uncertainty due
to the Heisenberg uncertainty principle. (B) Schematic of our experimental apparatus. PD, photodetector;
PBS, polarizing beam splitter; $\lambda/2$, half waveplate; $B_y(t)$, feedback control field; DAQ, data
acquisition and control.

Fig. 2. Single-shot photocurrents for both open- (A) and closed- (B) loop QND measurements. The
value of $F_z$ for each trajectory was obtained by time-averaging the photocurrent over the indicated
windows. TTL, computer-controlled digital trigger. Horizontal axes: Trajectory Time (μs).
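The distinction between conditional and unconditional variance in the open-loop case can be illustrated with a classical toy model. The sketch below is not a quantum simulation: it assumes that each trajectory has a fixed Gaussian-distributed spin projection and that the two window averages $y_1$ and $y_2$ differ from it only by independent optical noise. The function name and all noise levels are illustrative assumptions.

```python
import random
from statistics import pvariance
from typing import Tuple


def simulate_open_loop(n_traj: int = 5000, sigma_spin: float = 1.0,
                       sigma_noise: float = 0.2, seed: int = 0) -> Tuple[float, float]:
    """Return (variance of y2, variance of y2 - y1) over many toy trajectories."""
    rng = random.Random(seed)
    y2_values, differences = [], []
    for _ in range(n_traj):
        fz = rng.gauss(0.0, sigma_spin)          # random spin projection per shot
        y1 = fz + rng.gauss(0.0, sigma_noise)    # first window average
        y2 = fz + rng.gauss(0.0, sigma_noise)    # second window average
        y2_values.append(y2)
        differences.append(y2 - y1)
    return pvariance(y2_values), pvariance(differences)


var_y2, var_diff = simulate_open_loop()
print(f"unconditional var(y2) = {var_y2:.3f}")
print(f"conditional var(y2 - y1) = {var_diff:.3f}")
```

In this toy picture the spread of $y_2$ alone reflects the full initial projection noise, while the spread of $y_2 - y_1$ reflects only the residual measurement noise; the ratio between them depends entirely on the assumed noise level.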

In sharp contrast to the open-loop procedure,
Fig. 2B1 demonstrates our deterministic
ability to steer the value of $F_z$ toward $y_1 = 0$,
using intrameasurement feedback. Here the
QND measurement reveals an initial offset in
the photocurrent that corresponds to the random
value of $F_z$ for that trajectory. However,
the photocurrent drives a y-axis feedback
control field to rotate the atomic magnetization
until $F_z = 0$. Feedback locks the
orientation of the atomic magnetization
onto the x axis despite the initial offset
(illustrated by the squeezing ellipse centered
on the origin in Fig. 2B1).


The QND measurement of $y_2$, after a 10-μs
dark interval, demonstrates that the spin
state is unconditionally squeezed: The feedback-stabilized
value of $y_2 = 0$ can be predicted
more precisely than the initial spin
projection variance as confirmed by the $P(y_2)$
histogram in Fig. 3D. The variance of this
distribution is approximately an order of
magnitude smaller than the initial-state variance
and reflects significant unconditional
squeezing. Application of a 90° x-axis Larmor
pulse between the two QND measurements
again reveals the antisqueezed component
of the atomic magnetization (Fig. 2B2).
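The effect of closing the loop can be caricatured with a discrete-time, purely classical proportional-feedback model. This is a sketch under strong assumptions and not a description of the experiment: the spin projection is treated as a classical number, each photocurrent sample is that number plus Gaussian noise, and each feedback step subtracts a fixed gain times the sample. Measurement backaction and the quantum filtering theory are not modeled, and every parameter value below is hypothetical.

```python
import random
from statistics import pvariance


def final_projection_variance(feedback_gain: float, n_traj: int = 1000,
                              n_steps: int = 200, sigma_spin: float = 1.0,
                              sigma_noise: float = 3.0, seed: int = 1) -> float:
    """Variance of the final toy spin projection over many trajectories."""
    rng = random.Random(seed)
    finals = []
    for _ in range(n_traj):
        fz = rng.gauss(0.0, sigma_spin)              # random initial offset
        for _step in range(n_steps):
            y = fz + rng.gauss(0.0, sigma_noise)     # noisy photocurrent sample
            fz -= feedback_gain * y                  # proportional correction
        finals.append(fz)
    return pvariance(finals)


open_loop = final_projection_variance(feedback_gain=0.0)
closed_loop = final_projection_variance(feedback_gain=0.05)
print(f"open loop:   {open_loop:.3f}")
print(f"closed loop: {closed_loop:.3f}")
```

With zero gain the final spread equals the initial one; with a modest gain the offset is driven toward zero in every trajectory and the residual spread is set by the noise fed back through the loop, which is why a small gain is used here.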

Figure 3E displays the $P(y_2)$ variances of
atomic states for different x-axis dark-period
rotation angles. Although the initial spin-projection
noise (red crosses) is independent of
angle, the atomic states subject to QND measurement
display the expected sinusoidal
form that is characteristic of an elliptical
noise distribution (17). The open-loop unconditioned
variance, $P(y_2)$ (green squares), saturates
to that of the optically pumped state,
whereas the conditioned (blue triangles) and
feedback (magenta circles) data display significant
squeezing. Dotted horizontal lines
depict the average uncertainty products (the
square root of the minimum value times the
maximum value) of their corresponding sinusoids;
these uncertainty products respect Eq.
1 in that they lie above the measured variance
of our approximate coherent spin state.
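The sinusoidal dependence on rotation angle follows from projecting an elliptical Gaussian noise distribution onto the measurement axis. The sketch below assumes the standard form $V(\theta) = V_{sq}\cos^2\theta + V_{anti}\sin^2\theta$ for an ellipse whose principal axes coincide with the squeezed and antisqueezed components; the function names and the numerical variances are illustrative, not measured values.

```python
import math


def projected_variance(theta_deg: float, var_squeezed: float,
                       var_antisqueezed: float) -> float:
    """Variance seen along the measurement axis after rotating by theta."""
    theta = math.radians(theta_deg)
    return (var_squeezed * math.cos(theta) ** 2
            + var_antisqueezed * math.sin(theta) ** 2)


def uncertainty_product(var_min: float, var_max: float) -> float:
    """Square root of the minimum variance times the maximum variance."""
    return math.sqrt(var_min * var_max)


# Hypothetical ellipse: squeezed by 10x and antisqueezed by 10x relative to 1.
for angle in (0, 30, 60, 90):
    print(angle, round(projected_variance(angle, 0.1, 10.0), 3))
print(uncertainty_product(0.1, 10.0))
```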

Although $\Delta F_z^2$ can only be determined by
statistical analysis of many measurement trajectories,
the QND nature of our procedure is
manifest in individual trajectories of the experimental
data. This is a key feature of our
unmodulated, broadband, and continuous-time
approach; it makes the principles of
feedback control evident from basic features
of the trajectory data. Figure 2 portrays a
direct relation among the quantum spin state,
the real-time measurement record, and the
effects of feedback. Although spin-projection
noise of the optically pumped atomic state
gives rise to a random offset in the photocurrent
of each trial, the conditional variance of
the evolving spin state has no further influence
on the measurement record (in terms of
additional "spin noise"). The photocurrent
noise is purely optical, and the decrease in
statistical uncertainty accomplished by filtering
rigorously corresponds to quantum-mechanical
squeezing (2).

Care was taken to characterize possible
systematic errors that could exaggerate the
observed spin-squeezing in our experimental
data. Stray magnetic fields were nulled (within
a 100-μs line-locked window) with external
computer-controlled Helmholtz coils. Imperfect
polarimeter detector balancing was
characterized by measuring the shot-to-shot
photocurrent variance with no atoms in the
probe beam path. This optical noise floor was
found to lie well below the detected squeezing
minima (Figs. 3E and 4B). Atomic depolarization
due to probe-induced decoherence
and ambient magnetic noise was characterized
by observing the magnetization free-induction
decay (Fig. 4A). Spin relaxation
was estimated to be less than 6% over our
100-μs trajectories, as is also evident from
the absence of any apparent "droop" in the
open-loop optical signals (16).

The variance of the QND measurement,
$y_1$, was measured for varying probe intensity
and atom number to verify correct quantum
noise-limited scaling. Figure 4B demonstrates
good linearity between the probe optical
power and the open-loop measurement
uncertainty, $\sigma = \mathrm{std}[P(y_1)]$. In Fig. 4C, $\sigma$ is
plotted on a logarithmic scale versus the maximum
amplitude $A_L$ of the atomic free-induction
generated by y-axis Larmor precession.
Because $A_L$ is proportional to $N$, the slope-1/2
fit of $\sigma$ versus $A_L$ (red data points in Fig. 4C)
indicates correct $\sqrt{N}$ scaling for $\Delta F_z$. As a
further verification, the Larmor amplitude
was compared to an independent determination
of atom number via fluorescence imaging
with excellent linear scaling (green data
points in Fig. 4C). The solid line in Fig. 4C
indicates our predicted value for $\sigma$ computed
as $\Delta F_z \propto \sqrt{N}$ using the value of $N$ determined
from fluorescence imaging. We estimate
our fluorescence data to be accurate
to 10 to 20%, verified for varied imaging
powers and detunings and by comparison to
absorption data (Fig. 4C, inset). Combined
with the visibility of the squeezing data in
Fig. 3E, these calibrations strongly suggest
quantum-limited performance.

Fig. 3. QND measurement outcome distributions for (A) a coherent state, (B) conditional
squeezing, (C) the antisqueezing, (D) deterministic squeezing via quantum feedback, and (E)
rotation of the squeezing ellipse by Larmor precession.

Fig. 4. (A) Induced atomic Larmor precession, (B) measurement uncertainty $\sigma = \mathrm{std}[y_1]$ as a
function of optical probe power, and (C) square root dependence of $\sigma$ on atom number. Std Dev,
sample standard deviation computed from 500 QND measurement trajectories.
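The slope-1/2 check described above amounts to a straight-line fit on logarithmic axes. The sketch below shows one way to extract such a slope with an ordinary least-squares fit; the function name is an assumption and the data are synthetic values generated to follow an exact square-root law, not the measured points of Fig. 4C.

```python
import math
from typing import Sequence


def loglog_slope(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Least-squares slope of log(y) versus log(x)."""
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    mean_x = sum(log_x) / len(log_x)
    mean_y = sum(log_y) / len(log_y)
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(log_x, log_y))
    spread = sum((a - mean_x) ** 2 for a in log_x)
    return covariance / spread


# Synthetic data: uncertainty proportional to the square root of the amplitude.
amplitudes = [1.0, 2.0, 4.0, 8.0, 16.0]
uncertainties = [3.0 * math.sqrt(a) for a in amplitudes]
print(loglog_slope(amplitudes, uncertainties))
```

A fitted slope near 1/2 is what projection-noise scaling predicts, whereas technical noise that scales with the signal itself would give a slope near 1.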

It may be noted that an alternative procedure
for deterministic spin-squeezing would
be to produce a conditionally squeezed state,
with a random offset, by open-loop QND
measurement, and then to apply a pulsed
y-axis magnetic field to recover the desired
value of $F_z$. This procedure resembles a discrete
form of feedback, but unless further
verification measurements and correction
steps were performed, the absence of true
"loop closure" would render the procedure
highly sensitive to technical uncertainties.
The precise relation between the photocurrent
and the orientation of the magnetization vector
depends on a number of parameters, such
as the exact number of atoms, that will not be
known perfectly in practice. Hence, as noted
previously (3, 12), a single-pulse correction
scheme will be substantially (even prohibitively)
nonrobust.

Our results demonstrate the plausibility of
emerging methodologies in quantum physics
that use real-time feedback to influence the
stochastic dynamics of quantum state reduction
in the manner of directed diffusion.
Through carefully engineered measurements
and control laws, it may become possible to
implement dynamics that are inherently unlikely
or otherwise technically difficult to
achieve. This approach offers great promise
for optimizing applications, such as quantum
noise-limited metrology, by incorporating
techniques from mathematical filtering and
control theories.

In coarse-grained polycrystalline metals, plasticity
is carried by dislocations generated by sources
within the grains. Such dislocations propagate and
interact with preexisting structures and also with
each other, and can partly annihilate each other. At
a given deformation level, all the dislocation segments
that have not annihilated make up the final
microstructure of the deformed state and are
stored as extrinsic grain boundary (GB) dislocations,
dislocation walls and subgrain boundaries,
or individual dislocation segments (1).

There is a critical length scale below which
a dislocation source can no longer operate (2)
because the stress to bow out a dislocation
approaches the theoretical shear stress. For
face-centered cubic (fcc) metals the critical
grain size is believed to be between 20 and 40
nm, depending on the nature of the dislocations
considered. Whether plasticity in nanocrystalline
metals with mean grain sizes below 100 nm
and an intrinsic grain size distribution is still
carried by dislocations, and whether these dislocations
are of the same nature as those observed
in coarse-grained metals, are questions
that remain unresolved (3). Postdeformation
analysis in compressed or indented nanocrystalline
Ni (nc-Ni) material does not indicate
major dislocation debris (4), whereas deformation
twinning is observed in ground nc-Al (5).
In situ deformation in a transmission electron
microscope (TEM) demonstrates some dislocation
activity, often at the crack tip; however, the
samples investigated are thinned for electron
transparency, and the observed dislocation activity
might therefore be a result of other dislocation
sources such as surface defects (4, 6-8).
On the other hand, atomistic simulations suggest
that GBs in the nanocrystalline regime
promote sliding and act as both a source and
sink for lattice dislocations that extend throughout
the entire grain, leaving behind a stacking
fault defect (9-15).

X-ray diffraction profile analysis is a well-known
technique for microstructural analysis,
where broadening of the peaks occurs for two
reasons: (i) limitations in the spatial extent of the
coherent scattering volumes (in our case the grain
size), and (ii) the presence of inhomogeneous
strain. The first type of broadening is independent
of diffraction order, whereas the second is order
dependent (16). Possible sources for inhomogeneous
strain are lattice dislocations, but other
sources can be of equal importance, especially in
nanocrystalline structures. These sources may in-


Plastic  Deformation  with 
Reversible  Peak  Broadening  in 
Nanocrystalline  Nickel 

Zeljka  Budrovic,  Helena  Van  Swygenhoven,  Peter  M.  Derlet, 
Steven  Van  Petegem,  Bernd  Schmitt 

Plastic  deformation  in  coarse-grained  metals  is  governed  by  dislocation- 
mediated  processes.  These  processes  lead  to  the  accumulation  of  a  residual  dislocation 
network,  producing  inhomogeneous  strain  and  an  irreversible  broadening  of  the  Bragg 
peaks  in  x-ray  diffraction.  We  show  that  during  plastic  deformation  of  electrodeposited 
nanocrystalline  nickel,  the  peak  broadening  is  reversible  upon  unloading;  hence,  the 
deformation  process  does  not  build  up  a  residual  dislocation  network.  The 
results  were  obtained  during  in  situ  peak  profile  analysis  using  the  Swiss 
Light  Source.  This  in  situ  technique,  based  on  well-known  peak  profile 
analysis  methods,  can  be  used  to  address  the  relationship  between  micro¬ 
structure  and  mechanical  properties  in  nanostructured  materials. 


www.sciencemag.org  SCIENCE  VOL  304  9  APRIL  2004 


273 


arXiv:quant-ph/0406050 v1 8 Jun 2004


Temporal  Dynamics  of  Photon  Pairs  Generated  by  an  Atomic  Ensemble 

S.  V.  Polyakov,  C.  W.  Chou,  D.  Felinto  and  H.  J.  Kimble 

Norman  Bridge  Laboratory  of  Physics  12-33 
California  Institute  of  Technology,  Pasadena,  CA  91125 

The time dependence of nonclassical correlations is investigated for two fields (1,2) generated by
an ensemble of cold Cesium atoms via the protocol of Duan et al. [Nature 414, 413 (2001)]. The
correlation function $R(t_1,t_2)$ for the ratio of cross to auto-correlations for the (1,2) fields at times
$(t_1,t_2)$ is found to have a maximum value $R^{\max} = 292 \pm 57$, which significantly violates the
Cauchy-Schwarz inequality $R \le 1$ for classical fields. Decoherence of quantum correlations is observed over
$\tau_d \simeq 175$ ns, and is described by our model, as is a new scheme to mitigate this effect.


In recent years quantum measurement combined with
conditional quantum evolution has emerged as a powerful
paradigm for accomplishing diverse tasks in quantum information
science [1, 2, 3, 4]. For example, Duan, Lukin,
Cirac and Zoller (DLCZ) [4] have proposed a scheme for
the realization of scalable quantum communication networks
that relies upon entanglement created probabilistically
between remotely located atomic ensembles. By utilizing
successful measurements to condition subsequent
steps in their protocol, DLCZ have developed a scheme
that has built-in quantum memory, entanglement purification
and resilience to realistic sources of noise, thereby
enabling a quantum repeater architecture to overcome
photon attenuation [5, 6].

Central to the DLCZ protocol is the ability to write
and read collective spin excitations into and out of an
atomic ensemble, with efficient conversion of discrete spin
excitations to single-photon wavepackets. Observations
of the resulting non-classical correlations between the optical
fields generated from writing and reading such spin
excitations have recently been reported by several groups,
both at the single-photon level [7, 8, 9] as appropriate to
the protocol of DLCZ and in a regime of large photon
number $n \sim 10^3 - 10^7$ [10]. Generation and detection
efficiencies have now been improved so that excitation
stored within an atomic ensemble can be employed as a
controllable source for single photons [8].

A critical aspect of such single-photon wavepackets is
that they are emitted into well-defined spatio-temporal
modes to enable quantum interference between emissions
from separate ensembles (e.g., for entanglement based
quantum cryptography [4]). However, with the exception
of the verification of the time-delay implicit for
the Raman processes employed [10], experiments to date
have investigated neither the time nor the spatial dependence
of quantum correlations for the emitted fields from the
atomic ensemble. The high efficiencies achieved in Ref.
[8] now enable such an investigation for the temporal
properties of nonclassical correlations between emitted
photon pairs, which we report in this Letter.

Specifically,  we  study  the  time  dependence  of  quantum 
correlations  for  photons  emitted  from  an  ensemble  of  cold 
Cesium  atoms,  with  photon  pairs  created  sequentially  by 
classically controlled write and read pulses. Large violations
of the Cauchy-Schwarz inequality $R \le 1$ for the
ratio of cross to auto-correlations are observed for the
generated fields, with $R^{\max} = 292 \pm 57 \gg 1$. By contrast,
previous measurements have reported violations $R > 1$
only for detection events integrated over the entire durations
of the write and read pulses ($R = 1.84 \pm 0.06$ in Ref.
[7], $R = 1.34 \pm 0.05$ in Ref. [9], and $R = 53 \pm 2$ in Ref.
[8]). We also map the decay of quantum correlations by
varying the time delay between the write and read pulses,
and find a decoherence time $\tau_d \simeq 175$ ns. We have developed
a model to describe the decoherence and find good
correspondence with our measurements. This model is
utilized to analyze a new proposal that should extend
the correlation times to beyond 10 μs, which would allow
for entanglement between atomic ensembles on the scale
of several kilometers.

FIG. 1: (a) Schematic of experiment. Write and read pulses
propagate into a cloud of cold Cs atoms (MOT) at times $T_j$
and $T_j + \Delta t$ respectively, and generate the correlated output
fields 1 and 2. Quantum correlations for these fields at times
$(t_1,t_2)$ are investigated by way of photoelectric detection. (b)
The relevant atomic level scheme.
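The kilometer scale quoted for a 10 μs correlation time can be checked with a one-line estimate of how far light travels during the storage time. The sketch below is only an order-of-magnitude illustration; the function name and the fiber refractive index of 1.5 are assumptions, not values from the experiment.

```python
# Rough scale check: distance covered by light during a given storage time.
C_VACUUM_M_PER_S = 299_792_458.0  # speed of light in vacuum (exact SI value)


def reach_km(storage_time_s, refractive_index=1.0):
    """Distance in km that light covers in storage_time_s in a medium of the given index."""
    return C_VACUUM_M_PER_S / refractive_index * storage_time_s / 1e3


if __name__ == "__main__":
    # 175 ns is the measured decoherence time; 10 us is the proposed target.
    for label, tau in (("175 ns", 175e-9), ("10 us", 10e-6)):
        print(f"{label}: {reach_km(tau):.3f} km in vacuum, "
              f"{reach_km(tau, 1.5):.3f} km for an assumed fiber index of 1.5")
```

With these assumptions, 175 ns corresponds to roughly 50 m, whereas 10 μs corresponds to 2 to 3 km.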

Our experimental procedure is illustrated in Figure 1.
An initial write pulse at 852 nm creates a state of collective
excitation in an ensemble of cold Cs atoms, as heralded
by a photoelectric detection event from the Raman
field 1. After a user-programmable delay $\Delta t$, a read pulse
at 894 nm converts this atomic excitation into a field excitation
of the Raman field 2. The (write, read) pulses are
approximate coherent states with mean photon numbers
$(10^3, 10^5)$, respectively, and are focussed into the MOT
as TEM$_{00}$ Gaussian beams with orthogonal polarizations
and beam waist $w_0 = 30$ μm. This scheme is implemented
in an optically thick sample of four-level atoms,
cooled and trapped in a magneto-optical trap (MOT)
[11]. In particular, we utilize the ground hyperfine levels
$6S_{1/2}, F = \{4; 3\}$ of atomic Cs (labelled $\{|a\rangle; |b\rangle\}$), and
excited levels $\{6P_{3/2}, F = 4;\ 6P_{1/2}, F = 4\}$ of the $D_2$, $D_1$
lines at $\{852; 894\}$ nm (labelled $\{|e\rangle; |e'\rangle\}$).

Each attempt to generate a correlated pair of photons
in the (1,2) fields is preceded by shutting off the trapping
light for 700 ns. The re-pumping light is left on for an
additional 300 ns in order to empty the $|b\rangle$ state, thus
preparing the atoms in $|a\rangle$. The $j$th trial of a protocol is
initiated at time $T_j$ when a rectangular pulse from the
write laser beam, 150 ns in duration (FWHM) and tuned
10 MHz below the $|a\rangle \to |e\rangle$ transition, induces spontaneous
Raman scattering to level $|b\rangle$ via $|a\rangle \to |e\rangle \to |b\rangle$.
The write pulse is sufficiently weak so that the probability
to scatter one Raman photon into a forward propagating
wavepacket is much less than unity for each pulse.
Detection of one photon from field 1 results in a “spin”
excitation to level $|b\rangle$, with this excitation distributed in
a symmetrized, coherent manner throughout the sample
of $N$ atoms illuminated by the write beam [4]. Regardless
of successful detection of a photon in field 1, we next
address the atomic ensemble with a read pulse at a time
$T_j + \Delta t$, where $\Delta t$ is controlled by the user. The read
light is a rectangular pulse, 120 ns in duration, tuned to
resonance with the $|b\rangle \to |e'\rangle$ transition.

To investigate the photon statistics, we use four
avalanche photodetectors, a pair for each field $i$, labelled
as $D_{iA,iB}$, which are activated at $(T_j, T_j + \Delta t)$ with
$i = (1,2)$, respectively, for 200 ns for all experiments.
The quantity $p_\tau(t_l,t_m)$ is defined as the joint probability
for photoelectric detection from field $l$ in the interval
$[T_j + t_l, T_j + t_l + \tau]$ and for an event from field $m$ in the
interval $[T_j + t_m, T_j + t_m + \tau]$, where $l$ and $m$ equal 1 or 2.
$p_\tau(t_l,t_m)$ is determined from the record of time-stamped
detection events at $D_{1A,1B}$, $D_{2A,2B}$. In a similar fashion,
$q_\tau(t_l,t_m)$ gives the joint probability for detection for
fields $(l,m)$ in the intervals $([T_j + t_l, T_j + t_l + \tau], [T_k + t_m, T_k + t_m + \tau])$
for two trials $k \ne j$.
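As a minimal sketch of how $p_\tau$ and $q_\tau$ for the (1,2) fields could be estimated from a record of time-stamped detection events, the following bins the detection times of each trial and counts coincidences within the same trial ($p_\tau$) and between a trial and its successor ($q_\tau$). The data layout, the function names, and the choice $k = j + 1$ for the independent trial are assumptions made for illustration; they are not taken from the experiment's acquisition software.

```python
from collections import Counter


def bin_index(t_ns, tau_ns):
    """Index of the time bin [k*tau, (k+1)*tau) that contains t_ns."""
    return int(t_ns // tau_ns)


def joint_probabilities(trials, tau_ns):
    """Estimate p_tau(t1, t2) and q_tau(t1, t2) for the (1,2) fields.

    trials: list of (field1_times, field2_times), one entry per trial j,
            with detection times in ns measured from T_j (hypothetical layout).
    Returns (p, q): dicts mapping (bin1, bin2) -> estimated probability.
    """
    n = len(trials)
    same_trial = Counter()
    other_trial = Counter()
    for j, (f1, f2) in enumerate(trials):
        bins1 = {bin_index(t, tau_ns) for t in f1}
        bins2 = {bin_index(t, tau_ns) for t in f2}
        # Coincidences within trial j contribute to p_tau.
        same_trial.update((a, b) for a in bins1 for b in bins2)
        # Field 1 from trial j with field 2 from trial j+1 contributes to q_tau.
        if j + 1 < n:
            next_bins2 = {bin_index(t, tau_ns) for t in trials[j + 1][1]}
            other_trial.update((a, b) for a in bins1 for b in next_bins2)
    p = {key: count / n for key, count in same_trial.items()}
    q = {key: count / (n - 1) for key, count in other_trial.items()} if n > 1 else {}
    return p, q
```

Pairing adjacent trials is only one way to realize $k \ne j$; averaging over many trial pairs would reduce the statistical error of $q_\tau$.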

Following Refs. [7, 8], we introduce the time-dependent
ratio $R_\tau(t_1,t_2)$ of cross-correlation to auto-correlation
for the (1,2) fields, where

$$R_\tau(t_1,t_2) = \frac{[p_\tau(t_1,t_2)]^2}{p_\tau(t_1,t_1)\,p_\tau(t_2,t_2)}. \qquad (1)$$

This ratio is constrained by the inequality $R_\tau(t_1,t_2) \le 1$
for all fields for which the Glauber-Sudarshan phase-space
function is well-behaved (i.e., classical fields)
[7, 12]. Beyond enabling a characterization of the quantum
character of the (1,2) fields in a model independent
fashion, measurements of $R_\tau(t_1,t_2)$ also allow inferences
of the quantum state for collective excitations of single
spins within the atomic ensemble.

FIG. 2: Probability for joint detection from the fields (1,2) at
times $(t_1,t_2)$ with origin at the beginning of the write pulse.
(a) $p_\tau(t_1,t_2)$ and (b) $q_\tau(t_1,t_2)$ for overlapped write and read
pulses, $\Delta t = 50$ ns, with the solid line corresponding to $t_2 = t_1$.
(c) $p_\tau(t_1,t_2)$ and (d) $q_\tau(t_1,t_2)$ for consecutive write and
read pulses, $\Delta t = 200$ ns. In all cases, the bin size $\tau = 4$ ns,
and the joint probabilities $p_\tau$ and $q_\tau$ have been scaled by $10^9$.
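To make Eq. (1) concrete, the sketch below evaluates the ratio from three joint probabilities and tests it against the classical bound. The function names are illustrative, and the uncertainty estimate assumes uncorrelated errors with first-order propagation, which is a simplification rather than the error analysis used for the measurements.

```python
import math


def cauchy_schwarz_ratio(p12, p11, p22):
    """R = p12^2 / (p11 * p22) as in Eq. (1); R <= 1 for classical fields."""
    if p11 <= 0 or p22 <= 0:
        raise ValueError("auto-correlation probabilities must be positive")
    return p12 ** 2 / (p11 * p22)


def violates_classical_bound(p12, p11, p22):
    """True if the measured ratio exceeds the classical limit R <= 1."""
    return cauchy_schwarz_ratio(p12, p11, p22) > 1.0


def ratio_uncertainty(p12, p11, p22, s12, s11, s22):
    """First-order uncertainty of R, assuming uncorrelated errors s12, s11, s22."""
    r = cauchy_schwarz_ratio(p12, p11, p22)
    relative = math.sqrt((2 * s12 / p12) ** 2 + (s11 / p11) ** 2 + (s22 / p22) ** 2)
    return r * relative
```

Because the auto-correlation terms enter the denominator, their relative errors pass directly into the relative error of the ratio.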

The first step in the determination of $R_\tau(t_1,t_2)$ is the
measurement of the joint probability $p_\tau(t_1,t_2)$ for the
(1,2) fields, and for comparison, $q_\tau(t_1,t_2)$ for independent
trials. In our experiment, we focus on two cases:
(I) nearly simultaneous application of write and read
pulses with offset $\Delta t = 50$ ns less than the duration of
either pulse, and (II) consecutive application of write
and read pulses with $\Delta t = 200$ ns longer than the write,
read durations. Results for $p_\tau(t_1,t_2)$ and $q_\tau(t_1,t_2)$ are
presented in Fig. 2 as functions of the detection times
$(t_1,t_2)$ for the fields (1,2). For both $\Delta t = 50$ and 200
ns, $p_\tau(t_1,t_2) \gg q_\tau(t_1,t_2)$, indicating the strong correlation
between fields 1 and 2, with the maximal ratio
$g^{\tau}_{1,2}(t_1,t_2) = p_\tau(t_1,t_2)/q_\tau(t_1,t_2) > 30$, which is much
greater than reported previously [7, 8, 9]. In Fig. 2,
$\tau = 4$ ns, leading to statistical errors of about 8% for the
largest values shown.

In case (I) for nearly simultaneous irradiation with
write and read pulses, Fig. 2(a) shows that $p_\tau(t_1,t_2)$
peaks along the line $t_2 - t_1 = \delta t_{12} \simeq 50$ ns with a width
$\Delta t_{12} \simeq 60$ ns, in correspondence to the delay $\delta t_{12}$ and
duration $\Delta t_{12}$ for read-out associated with the transition
$|b\rangle \to |e'\rangle \to |a\rangle$ given an initial transition $|a\rangle \to |e\rangle \to |b\rangle$
[10]. Apparently, the qualitative features of $p_\tau(t_1,t_2)$ depend
only upon the time difference between photon detections
in fields 1 and 2 (i.e., $p_\tau(t_1,t_2) \simeq F(t_2 - t_1)$).
In case (II) with the read pulse launched 200 ns after
the write pulse, excitation is “stored” in the atomic ensemble
until the readout. The production of correlated
photon pairs should now be distributed along $t_2 \simeq \delta t_{12}$
with width $\simeq \Delta t_{12}$. Instead, as shown in Fig. 2(c),
$p_\tau(t_1,t_2)$ peaks towards the end of the write pulse (i.e.,
$t_1 > 100$ ns), and near the beginning of the read pulse
(i.e., $200 < t_2 < 300$ ns). Early events for field 1 lead
to fewer correlated events for field 2, as $p_\tau(t_1,t_2)$ decays
rapidly beyond the line $t_2 - t_1 = \tau_d \simeq 175$ ns. The marked
contrast between $p_\tau(t_1,t_2)$ for $\Delta t = 50$ and 200 ns results
in a diminished ability for the conditional generation of
single photons from excitation stored within the atomic
ensemble [8] and, more generally, for the implementation
of the DLCZ protocol for increasing $\Delta t$. The underlying
mechanism is decoherence within the ensemble, as will
be discussed.

FIG. 3: Probability of joint detection $p_\tau(t_1,t_1)$ for field 1
(squares) and $p_\tau(t_2,t_2)$ for field 2 (diamonds) as functions of
respective detection times $t_1$ and $t_2$. Bin size $\tau = 30$ ns. (a)
Overlapping write and read pulses, $\Delta t = 50$ ns; (b) Consecutive
write and read pulses, $\Delta t = 200$ ns.

FIG. 4: The experimentally derived ratio $R_\tau(t_1,t_2)$ [Eq. 1] as
a function of detection times $(t_1,t_2)$ for the (1,2) fields, with
$R_\tau \le 1$ for classical fields. The left column gives $R_\tau(t_1,t_2)$ for
$\Delta t = 50$ ns (top) and $\Delta t = 200$ ns (bottom), while the right
column gives the associated statistical uncertainties. Bin size
$\tau = 30$ ns.

Fig. 2(b, d) displays $q_\tau(t_1,t_2)$ for independent trials
$j \ne k$. $q_\tau(t_1,t_2)$ is expected to be proportional to the
product of intensities of the fields 1 and 2, in reasonable
correspondence to the form shown in Fig. 2(b, d) for our
roughly rectangular write, read pulses, but distinctively
different from $p_\tau(t_1,t_2)$ in (a, c).


To deduce $R_\tau(t_1,t_2)$ from Eq. 1, we next determine
the joint detection probabilities $p_\tau(t_1,t_1)$ for field 1 and
$p_\tau(t_2,t_2)$ for field 2 from the same record of photoelectric
events as for Fig. 2 (a, c). Since the rate of coincidences
for auto-correlations is roughly $10^2$ times smaller than for
cross-correlations for the (1,2) fields, we increase the bin
size $\tau$ to 30 ns to accumulate enough events to reduce the
statistical errors to acceptable levels. Fig. 3 shows the
resulting time dependencies of $p_\tau(t_1,t_1)$ and $p_\tau(t_2,t_2)$ for
cases (I, II). While the shape of $p_\tau(t_1,t_1)$ associated with
the write pulse does not change with $\Delta t$, the profile of
$p_\tau(t_2,t_2)$ from the read pulse is affected and exhibits a
rise time that is $\simeq 3$ times shorter for $\Delta t = 200$ ns than
for $\Delta t = 50$ ns. This prompt rise in (b) is consistent
with the observation that stored excitation is efficiently
addressed at the beginning of the read pulse for non-overlapping
write, read pulses, while the longer rise time
in (a) results from continuous excitation and retrieval of
atoms from the state $|b\rangle$ for the overlapping case.

We employ the data in Figs. 2, 3 together with Eq.
1 to construct the ratio $R_\tau(t_1,t_2)$, with the result presented
in Fig. 4 [13]. Not unexpectedly, the trends
for $R_\tau(t_1,t_2)$ closely resemble those of the joint probability
$p_\tau(t_1,t_2)$ for correlated pair generation previously
discussed. As for the violation of the Cauchy-Schwarz
inequality $R_\tau(t_1,t_2) \le 1$ for classical fields [7, 12], we
observe maximal violations with $R^{\max}_\tau = 292 \pm 57$ for
$\Delta t = 50$ ns and $R^{\max}_\tau = 202 \pm 60$ for $\Delta t = 200$ ns
($R_\tau = 198 \pm 33$ in the neighboring bin). The relatively
large errors in $R_\tau(t_1,t_2)$ arise predominantly from the
uncertainties in $p_\tau(t_1,t_1)$ and $p_\tau(t_2,t_2)$ (Fig. 3) [14].

FIG. 5: Coherence time assessment. Experimentally acquired
$g_{1,2}$ (black dots), theoretical description of the current experiment
with $K = 1.1$ MHz (solid line), and the theoretical
prediction for a spin polarized $m_F = 0$ MOT (dotted line).

The forms for $p_\tau(t_1,t_2)$ and $R_\tau(t_1,t_2)$ for the cases
$\Delta t = 50$ and 200 ns imply a decoherence process operative
on a time scale $\tau_d \simeq 175$ ns. To investigate this
decay, we have performed a separate set of experiments
with the delay $\Delta t$ varied $0 < \Delta t < 400$ ns. For each
$\Delta t$ we determine the normalized correlation function $g_{1,2}$
from the ratio of integrated coincidence counts to singles
counts over the entire detection window (i.e., $\tau = 200$
ns), with the results presented in Fig. 5.
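A common way to form such a normalized correlation from integrated counts is $g_{1,2} = p_{12}/(p_1 p_2)$, with each probability estimated as counts per trial. The sketch below assumes this normalization; the function name and argument names are hypothetical, and the text does not spell out the exact estimator used.

```python
def normalized_cross_correlation(coincidences, singles_1, singles_2, trials):
    """g_12 = p_12 / (p_1 * p_2), with probabilities estimated as counts per trial.

    coincidences: trials with a detection in both fields 1 and 2
    singles_1, singles_2: trials with a detection in field 1 (respectively field 2)
    trials: total number of write/read trials
    """
    if singles_1 <= 0 or singles_2 <= 0 or trials <= 0:
        raise ValueError("counts must be positive")
    return coincidences * trials / (singles_1 * singles_2)
```

For statistically independent fields the expected number of coincidences is `singles_1 * singles_2 / trials`, which gives $g_{1,2} = 1$.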

In Fig. 5, the initial growth of $g_{1,2}$ for small $\Delta t$ is
due to the finite time required to produce sequentially
photons in the (1,2) fields, which is already evident in
Fig. 2. More troublesome is the rapid decay of $g_{1,2}$.
A likely candidate responsible for this decay is Larmor
precession among the various Zeeman states of the $F = 3, 4$
hyperfine levels of the $6S_{1/2}$ ground level.

To investigate this possibility, we have extended the
treatment of Ref. [15] to include the process involving
the read beam as well as the full set of Zeeman states for
the $F = 3, 4$ hyperfine levels. The sample of Cs atoms is
assumed to be initially unpolarized and distributed over
the same range of magnetic fields as for the MOT. With
write and read pulses that approximate those used in
our experiment, we obtain an expression for the probability
$p_\tau(t_1,t_2)$ to generate a pair of photons at times
$(t_1,t_2)$ for fields 1 and 2 as a function of the offset $\Delta t$.
By summing contributions for all $(t_1,t_2)$ over the detection
windows, we arrive at the joint probability $p_{1,2}(\Delta t)$
that we compare to the measured $g_{1,2}(\Delta t)$ by way of a
single overall scaling parameter for all $\Delta t$, as the rate
of single counts in fields (1,2) is measured not to depend
on $\Delta t$ (to within 20%). The result is the solid curve
in Fig. 5 that evidently adequately describes the impact
of Larmor precession on our experiment. The form
of $p_{1,2}(\Delta t)$ strongly depends upon the inhomogeneity of
Zeeman splitting across the MOT, which is described by
the parameter $K = \mu_B g_{F_g} L b / h$, where $L$ is the MOT
diameter, $b$ is the gradient of the magnetic field for the
MOT, and $g_{F_g}$ is the Landé factor. The curve in Fig. 5 is
the theoretical result for an initially unpolarized sample
with $K = 1.1$ MHz as for our experiment (i.e., $L \simeq 3.6$
mm and $b \simeq 8.4$ G/cm).
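The quoted value $K = 1.1$ MHz can be reproduced from $K = \mu_B g_{F_g} L b / h$ with the stated $L$ and $b$. The sketch below assumes a Landé factor of magnitude 1/4, the standard value for the Cs $6S_{1/2}$ hyperfine ground states; the text does not state the factor explicitly, and the function name is illustrative.

```python
# Bohr magneton divided by Planck's constant, in MHz per gauss.
MU_B_OVER_H_MHZ_PER_G = 1.39962449


def zeeman_inhomogeneity_mhz(g_factor, diameter_cm, gradient_g_per_cm):
    """K = mu_B * |g| * L * b / h in MHz, for L in cm and b in G/cm."""
    return MU_B_OVER_H_MHZ_PER_G * abs(g_factor) * diameter_cm * gradient_g_per_cm


if __name__ == "__main__":
    # L = 3.6 mm = 0.36 cm and b = 8.4 G/cm from the text; |g| = 1/4 is assumed.
    k = zeeman_inhomogeneity_mhz(0.25, 0.36, 8.4)
    print(f"K = {k:.2f} MHz")
```

Under these assumptions the result is close to 1.06 MHz, consistent with the rounded 1.1 MHz.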

An obvious remedy for this dephasing is to eliminate
the magnetic field altogether, as by transferring the sample
to a dipole-force trap [11]. Alternatively, we have
developed a scheme that should allow for long coherence
times even in the presence of the quadrupole field of the
MOT by utilizing only magnetic-field insensitive states.
The write, read beams are polarized $\sigma_\pm$ and are aligned
along the $z$-axis of the MOT, which provides the quantization
axis. Atoms within the approximately cylindrical
volume illuminated by these beams are initially spin polarized
into $F = 3, m_F = 0$ [16]. The (1,2) fields are
selected to be $\sigma_\pm$, which results in spin excitation stored
in $F = 4, m_F = 0$. The prediction of our model for
this new protocol for the same experimental conditions
as before but now with an initially spin polarized sample
is shown as the dashed curve in Figure 5, resulting
in an increase of more than $3\times$ in $g_{1,2}$, and significantly
extending the decoherence time to more than $\simeq 10$ μs.

In conclusion, we have reported the first observations
of the temporal dependence of the joint probability
$p_\tau(t_1,t_2)$ for the generation of correlated photon pairs
from an atomic ensemble, which is critical for the protocol
of Ref. [4]. Our measurements of $p_\tau(t_1,t_2)$ are
an initial attempt to determine the structure of the underlying
two-photon wavepacket [17]. The nonclassical
character of the emitted (1,2) fields has been tracked
by way of time dependence of the ratio $R_\tau(t_1,t_2)$, with
$R^{\max} = 292 \pm 57 \gg 1$. Decoherence due to Larmor precession
has been characterized and identified as a principal
limitation of the current experiment. A new scheme for
effectively eliminating this decay process has been proposed
and analyzed, and could be important for the experimental
realization of scalable quantum networks [4]
as well as for an improved source for single photons [8].

This  work  is  supported  by  ARDA,  the  Caltech  MURI 
Center  for  Quantum  Networks,  and  the  NSF. 
I. INTRODUCTION

Symmetry  reduces  complexity.  In  physical  systems  real¬ 
izing  quantum  computers,  the  highest  degree  of  symmetry  is 
therefore  not  the  most  desirable.  A  quantum  computer  needs 
to  be  sufficiently  simple  and  robust  to  be  controllable  in  an 
experiment  yet  complex  enough  to  be  universal.  One  may 
therefore  ask  the  question,  “how  much  symmetry  does  a 
quantum  computer  allow  for?”  In  fact,  a  number  of  physical 
systems  considered  for  the  realization  of  a  quantum  com¬ 
puter  such  as  optical  lattices  [1]  or  arrays  of  microlenses  [2] 
are  translation  invariant,  and  the  above  question  acquires  a 
practical  aspect. 

Quite surprisingly, it turns out that universal quantum
computation can tolerate a fair amount of symmetry. Recently,
a scheme of quantum computation using the rotation-invariant
measurement of the total “spin” of two qubits as
the only gate has been devised [3]. Furthermore, a
translation-invariant computation scheme has been described [4].

The  computational  power  of  translation-invariant  or 
nearly  translation  invariant  quantum  systems  was  revealed  in 
Lloyd’s  [5]  and  Watrous’  [6]  work  on  quantum  cellular  au¬ 
tomata  (QCA).  In  Ref.  [6]  it  was  shown  that  a  one¬ 
dimensional  QCA  can  simulate  any  quantum  Turing  ma¬ 
chine.  Translation  invariance  is  broken  only  by  the  initial 
state  which  encodes  the  program.  The  schemes  in  Refs.  [5] 
and  [7]  allow  one  to  simulate  quantum  circuits  using  a  chain 
of  qubits  with  a  generic  translation-invariant  interaction. 
They  require  different  species  of  qubits  in  a  periodic  arrange¬ 
ment  and  local  addressability  at  one  end-point  of  the  chain. 
In  the  scheme  in  Ref.  [8]  such  individual  addressing  is  only 
required  in  the  initialization.  The  method  proposed  in  Ref. 
[4]  is  completely  translation  invariant  in  space.  It  requires 
homogeneous  one-  and  two-local  operations  on  five-level 
systems. 

Here  I  describe  a  scheme  for  universal  quantum  compu¬ 
tation  via  translation-invariant  operations  on  a  chain  of  qu¬ 
bits.  No  individual  addressability  is  required.  The  scheme 
uses  an  Ising-type  interaction  and  spatially  uniform  one- 
qubit  gates.  The  qubits  are  all  of  the  same  species.  Cold 
atoms  in  optical  lattices  [1],  where  the  requirement  of  local 
control  adds  to  the  overall  technological  challenge,  are  a  can¬ 
didate  for  the  realization  of  the  presented  scheme. 


II.  CONSTRUCTIVE  ELEMENTS,  UNIVERSALITY,  AND 
SCALABILITY 

I consider a one-dimensional chain of $N$ qubits initialized
in the state $|00\cdots0\rangle$ which is repetitively updated according
to the transition function

$$T = \left(\bigotimes_{i=1}^{N-1} \Lambda(Z)_{i,i+1}\right)\left(\bigotimes_{i=1}^{N} H_i\right). \qquad (1)$$

That  is,  in  each  elementary  step  of  the  evolution  first  a  Had- 
amard  gate  is  applied  to  each  qubit  and  second,  conditional 
phase  gates  are  simultaneously  applied  to  all  pairs  of  neigh¬ 
boring  qubits.  This  QCA  transition  function  has  previously 
been  discussed  in  Ref.  [9]. 

Between the transitions one may apply translation-invariant
unitary transformations of the form

$$U_A(\alpha) = \bigotimes_{i=1}^{N} \exp\left(i\,\frac{\alpha}{2}\,A_i\right), \qquad (2)$$

with $A \in \{X, Y, Z\}$. (Note that the subscript “i” labels the site.
The same operation is applied to each qubit.) These requirements
are equivalent to bang-bang pulses of the form (2) and
a permanent Ising-type interaction $W = \sum_{i=1}^{N-1} |11\rangle_{i,i+1}\langle 11|$.

Let  us  first  observe  that 

$$T^{N+1} = R, \qquad (3)$$

where  R  is  the  reflection  operator  that  sends  the  state  of  the 
qubit  chain  into  its  mirror  image.  Thus,  despite  the  fact  that 
the qubits at the end points need not be addressed, it is relevant
that the chain has ends. (For adaptation of the scheme to
a ring of qutrits, see remark 1 in Sec. III.) A proof of Eq. (3)
is given in the Appendix.
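Relation (3) can be checked numerically for short chains by following single-site Pauli operators under conjugation by $T$; if every $X_i$ and $Z_i$ is mapped to its mirror image with sign $+1$, then $T^{N+1}$ agrees with $R$ up to a global phase. The sketch below is an illustration and not the proof given in the Appendix. The encoding of a Pauli operator as a sign plus binary vectors and all function names are choices made here.

```python
def apply_T(pauli):
    """Conjugate a Pauli operator by one step T = (CZ on all neighbors)(H on all qubits).

    pauli = (sign, x, z) encodes sign * prod_i X_i^{x[i]} Z_i^{z[i]} with sign = +1 or -1.
    """
    sign, x, z = pauli
    x, z = list(x), list(z)
    n = len(x)
    # Hadamard on every site: X <-> Z, and X Z -> Z X = -X Z.
    for i in range(n):
        if x[i] and z[i]:
            sign = -sign
        x[i], z[i] = z[i], x[i]
    # Controlled-Z on neighbors (a, a+1): X_a -> X_a Z_{a+1}, X_{a+1} -> Z_a X_{a+1}.
    for a in range(n - 1):
        if x[a] and x[a + 1]:
            sign = -sign  # reordering Z X = -X Z on site a+1
        z[a] ^= x[a + 1]
        z[a + 1] ^= x[a]
    return sign, tuple(x), tuple(z)


def single_site(kind, site, n):
    """The operator X_site or Z_site (0-based site index) on a chain of n qubits."""
    x, z = [0] * n, [0] * n
    (x if kind == "X" else z)[site] = 1
    return 1, tuple(x), tuple(z)


def reflects_chain(n):
    """True if T^(n+1) maps every X_i and Z_i to its mirror image with sign +1."""
    for kind in "XZ":
        for site in range(n):
            p = single_site(kind, site, n)
            for _ in range(n + 1):
                p = apply_T(p)
            if p != single_site(kind, n - 1 - site, n):
                return False
    return True


if __name__ == "__main__":
    print(all(reflects_chain(n) for n in range(1, 9)))
```

Tracking Pauli operators rather than state vectors keeps the cost polynomial in $N$, which is the same advantage that the stabilizer formalism provides.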

Apart  from  its  use  in  the  computational  scheme  described 
here, the bit reversal operation R is interesting in its own right.
Recently,  proposals  for  both  approximate  and  perfect  bit  re¬ 
versal  in  qubit  chains  with  a  Heisenberg  and  XY  interaction 
have  been  made;  see  Refs.  [10-12],  and  references  therein. 
For  perfect  mirror  reflection  in  an  XY  chain  the  coupling 
strength  needs  to  vary  with  position  [11],  but  only  mildly 
[12]. 

Relation  (3)  represents  a  method  to  achieve  spatial  reflec¬ 
tion  in  systems  with  an  Ising-type  interaction.  Here,  the  in¬ 
teraction  strength  is  independent  of  position,  but  additional 
stroboscopic  pulses  (2)  are  required  to  realize  the  uniform 
Hadamard  transformations. 
fore, qubit $i$ cannot be addressed separately from its mirror
image at location $\bar{i} = N+1-i$. To cope with this constraint,
the  same  network  is  simulated  twice  on  the  chain,  once  on 
the  left  side  and  once — as  a  mirror  image — on  the  right.  This 
doubles  the  required  length  of  the  chain  and  also  influences 
the  readout  process,  as  will  be  discussed  in  Sec.  II B. 

To  probe  the  available  spatial  control,  consider  the 
sequence 

$$U_Z(c,\alpha) = \left[T Y^{c_N} T Y^{c_{N-1}} T \cdots T Y^{c_1} T Y^{c_0} U_Z(-\alpha/2)\right] \times \left[T Y^{c_N} T Y^{c_{N-1}} T \cdots T Y^{c_1} T Y^{c_0} U_Z(\alpha/2)\right], \qquad (5)$$

where $c = (c_0, \ldots, c_N)^T$ is a binary vector. To analyze the effect
of this sequence, the Pauli operators $Y^{c_i}$ are propagated
backwards in time until they reach the $U_Z$ gates. There they
accumulate to $\bar{Y}$,

$$\bar{Y} = (T^{\dagger N} Y^{c_N} T^{N}) \cdots (T^{\dagger} Y^{c_1} T)\, Y^{c_0}. \qquad (6)$$

Then,

$$U_Z(c,\alpha) = R \bar{Y} U_Z(-\alpha/2) R \bar{Y} U_Z(\alpha/2) = \bigotimes_{i=1}^{N} \exp\left[i\, s_i(c)\, (\alpha/2)\, Z_i\right], \qquad (7)$$

where we have used that $R \bar{Y} R = \bar{Y}$ and

FIG. 1. (Color online) The evolution of a local Pauli observable
$Z_3$ for a qubit chain of length $N=8$. The color-coded boxes denote
Pauli operators $X_i$ and $Z_i$, respectively, and each row of boxes represents
a tensor product of such Pauli operators. Within the cycle
that leads to reflection of the chain, each local observable undergoes
the phases of expansion, transmission, and contraction. When expanding
or contracting, the operators pick up a sign factor of -1
under conjugation by a $Y$ pulse.

A.  Universal  set  of  gates 

The key to the construction of a universal set of gates for
the described automaton is displayed in Fig. 1. The
$(N+1)$-fold repetition of the elementary transition function
constitutes half a clock cycle of the automaton. Within this
period each local Pauli observable goes through the phases
of expansion, transmission, and contraction to the mirror image
of the initial position. During expansion and contraction,
the propagated observable is susceptible to a global $Y$ pulse

$$Y := U_Y(\pi) = \bigotimes_{i=1}^{N} Y_i. \qquad (4)$$

Namely, it picks up a sign factor under conjugation. By contrast,
during the phase of transmission where the observables
behave as left or right movers [9] a $Y$ pulse has no effect.
The durations of the expansion, transmission, and contraction
phases depend on the initial position of the local Pauli observable.
Therefore, each local Pauli observable shows a characteristic
response to sequences of $Y$ pulses within the half
cycle. In this way, temporal control can be translated into
spatial control. In the construction described below, suitably
tailored sign flips of Pauli observables are used to reverse
rotation angles. Depending on whether a rotation angle is
reversed or not, two matching rotations will either cancel or
amplify each other's effect.

The  degree  of  spatial  control  obtained  suffices  to  simulate 
a  quantum  logic  network.  One  constraint  arises:  every  opera¬ 
tion  applied  to  the  qubit  chain  is  reflection  symmetric.  There- 


$$s_i(c) = \begin{cases} 0 & \text{if } [\bar{Y}, Z_i] = 0, \\ 1 & \text{if } \{\bar{Y}, Z_i\} = 0. \end{cases} \qquad (8)$$

In this way, temporal control has been converted into spatial
control, provided that, for suitable choices of $c$, the binary
variables $s_i(c)$ do indeed vary with $i$.

The $s_i$ are easily computed in the stabilizer formalism
[13-15]. Following Ref. [15], we write Pauli operators $A$ in
the form $i^{\delta}(-1)^{\epsilon}\tau_a$, where $a = \binom{z}{x}$ is a $2N$-component binary
vector, $z=(z_1, \ldots, z_N)^T$, $x=(x_1, \ldots, x_N)^T$; $\delta, \epsilon \in \mathbb{Z}_2$, and $\tau_a$ is
the tensor product of Pauli operators labelled by $a$.
The evolution of $A$ under conjugation by our
Clifford unitary $T$, $A \to T(A) = TAT^{\dagger}$, may then be followed in
terms of $\delta$, $\epsilon$, and $a$. The scalars $\delta$ and $\epsilon$ have no influence on
the sign factor $(-1)^{s_i}$ (8), and we need to consider the update
of $a$ only, $a(t) \to a(t+1) = Ca(t)$. Therein, $C$ is a $2N \times 2N$
binary symplectic matrix which takes the form


T  is  the  adjacency  matrix  of  the  interaction  graph  (the  line 
graph).  Further,  denote  by  F  the  2N X  27V-matrix  and 

observe  that  FC~l  =  CF.  Now,  the  vector  s  =  (sx , . . .  ,sN)T  car¬ 
rying  the  information  about  the  sign  flips  under  conjugation 

by  7  is  related  to  the  vector  c  describing  the  temporal  se¬ 
quence  of  7  pulses,  via  s=Mzc.  The  matrix  Mz  encodes  how 
temporal  control  is  converted  to  spatial  control.  Its  elements 
are  given  by 
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Mz(i,t)  =  2ltzFC~%  =  a  £  C*ay.  (10) 

The  binary  quantities  Mz(i,t )  can  be  defined  for  all 
t(i=l...N).  But  the  matrix  Mz  is  the  collection  of  Mz(i,t ) 
only  within  the  half-cycle  0  ^  ^  N. 

For the interval $t \in [-1,N]$ (including $[0,N]$) the $M_z(i,t)$ take the values

$$M_z(i,t) = \theta(i-t-1) + \theta(t+i-N-1). \qquad (11)$$

Therein, the addition is modulo 2 and the step function $\theta$ is equal to one for all non-negative arguments, and zero otherwise.

To prove Eq. (11) one may, in addition to $M_z$, define a matrix $M_x$ that encodes the sign factor acquired by the $t$-steps-backward-propagated observable $X_i$ under conjugation with $Y$, $M_x(i,t) := a_{X_i}^T C^{t} F a_Y$. Then, because of $T X_i T^{-1} = Z_i$ and $T Z_i T^{-1} = X_i \bigotimes_{j:\,|i-j|=1} Z_j$, the $M_z(i,t)$ and $M_x(i,t)$ obey the recursion relations

$$M_z(i,t+1) = M_z(i,t-1) + \sum_{j:\,|i-j|=1} M_z(j,t),$$
$$M_x(i,t+1) = M_z(i,t). \qquad (12)$$

The solution of the recursion relation for $M_z(i,t)$ is unique once the boundary conditions on two consecutive time slices are specified. In the discussed case, the boundary conditions are $M_z(i,-1) = M_z(i,0) = 1$, for all $i = 1 \ldots N$. The expression on the right-hand side (RHS) of Eq. (11) obeys the recursion relation (12) in the interval $0 \le t \le N-1$ and the boundary conditions, and is thus the correct expression for $M_z$.

To perform a $z$ rotation on qubit $i$, one may apply a $Y$ pulse at times $i-1$ and $i$. For $c_t = \delta(t,i-1) + \delta(t,i)$, with (11), one obtains $s_j = \delta(j,i) + \delta(j,N+1-i)$. Therefore, the pulse sequence for a $z$ rotation of qubit $i$ is

$$e^{i(\alpha/2)Z_i} e^{i(\alpha/2)Z_{\bar i}} = [T^{N+1-i}\, Y T Y\, T^{i-1}\, U_z(-\alpha/2)] \times [T^{N+1-i}\, Y T Y\, T^{i-1}\, U_z(\alpha/2)], \qquad (13)$$

with $\bar i = N+1-i$ the mirror site of $i$. In this way, two out of $N$ $z$ rotations can be selected. A further discrimination between two sites $i$ and $\bar i$ is not possible because of the reflection symmetry of every sequence, $U(c,\alpha) = R\,U(c,\alpha)\,R^{-1}$. As a result of this symmetry, each quantum algorithm is run on the qubit chain in two copies, one being the mirror image of the other. This accounts for a factor of two in spatial overhead.
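The conversion of temporal into spatial control can be checked numerically. The following is a minimal sketch, assuming the reading of Eq. (11) as $M_z(i,t) = \theta(i-t-1) + \theta(t+i-N-1)$ (mod 2) and of recursion (12) on the line graph; the function names are illustrative and not taken from the paper. It iterates the recursion from the boundary conditions, compares with the closed form, and confirms that $Y$ pulses at times $i-1$ and $i$ flip the sign only at site $i$ and its mirror site. An even chain length is used, as for $N = 4n+2$.

```python
def theta(x):
    """Step function of Eq. (11): 1 for non-negative arguments, else 0."""
    return 1 if x >= 0 else 0


def mz_closed_form(i, t, N):
    """Closed form of Eq. (11); the addition is modulo 2."""
    return (theta(i - t - 1) + theta(t + i - N - 1)) % 2


def mz_by_recursion(N):
    """Iterate recursion (12) from M_z(i,-1) = M_z(i,0) = 1 on a line graph."""
    prev = {i: 1 for i in range(1, N + 1)}  # time slice t = -1
    cur = {i: 1 for i in range(1, N + 1)}   # time slice t = 0
    table = {-1: prev, 0: cur}
    for t in range(0, N):                   # recursion used for 0 <= t <= N-1
        nxt = {}
        for i in range(1, N + 1):
            nbrs = [j for j in (i - 1, i + 1) if 1 <= j <= N]
            nxt[i] = (prev[i] + sum(cur[j] for j in nbrs)) % 2
        table[t + 1] = nxt
        prev, cur = cur, nxt
    return table


def sign_flips(pulse_times, N):
    """s = M_z c (mod 2) for Y pulses applied at the given times in 0..N."""
    return [sum(mz_closed_form(j, t, N) for t in pulse_times) % 2
            for j in range(1, N + 1)]


N = 6  # even chain length (illustrative choice)
table = mz_by_recursion(N)
recursion_matches = all(table[t][i] == mz_closed_form(i, t, N)
                        for t in range(-1, N + 1) for i in range(1, N + 1))
# Sites whose sign is flipped by pulses at times i-1 and i
selected = {i: [j for j, s in enumerate(sign_flips([i - 1, i], N), start=1) if s]
            for i in range(1, N + 1)}
print(recursion_matches, selected)
```

For an odd chain length the central site is its own mirror image and the two contributions cancel modulo 2, which is why the sketch uses an even $N$.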

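From Eq. (13) the remaining universal gates can be deduced easily. Relation (13) is conjugated by $T^{-1}$. Using $T^{-1} Z_i T = X_i$, one finds

$$e^{i(\alpha/2)X_i} e^{i(\alpha/2)X_{\bar i}} = [T^{N-i}\, Y T Y\, T^{i}\, U_x(-\alpha/2)] \times [T^{N-i}\, Y T Y\, T^{i}\, U_x(\alpha/2)]. \qquad (14)$$

Conjugating relation (14) again by $T^{-1}$ and noting that $T^{-1} X_i T = Z_i \bigotimes_{j:\,|i-j|=1} X_j =: K_i$ one obtains

$$e^{i(\alpha/2)K_i} e^{i(\alpha/2)K_{\bar i}} = [T^{N-1-i}\, Y T Y\, T^{i}\, U_x(-\alpha/2)\, T] \times [T^{N-1-i}\, Y T Y\, T^{i}\, U_x(\alpha/2)\, T]. \qquad (15)$$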

Now, the length of the chain is doubled a second time. Specifically, the logical qubits in state $|\psi\rangle$ are interlaced by ancilla qubits which remain in the state $|0\rangle$ throughout the computation

$$|\psi\rangle_{1,2,3,\ldots,n} \to |\psi'\rangle = |\psi\rangle_{1,3,5,\ldots,2n-1} \otimes |0 \cdots 0\rangle_{2,4,\ldots,2n-2}. \qquad (16)$$

At this point it is suitable to introduce a logical coordinate $[j]$ that is related to the physical location within the qubit chain via $[j] = 2j-1$, for $j = 1 \ldots n$, such that, e.g., $|\psi'\rangle = |\psi\rangle_{[1],[2],\ldots,[n]} \otimes |0 \cdots 0\rangle_{2,4,\ldots,2n-2}$. Then, the action of the $K_{2j}$ gate is equivalent to a two-qubit next-neighbor entangling gate

$$\exp\left(i\frac{\alpha}{2} K_{2j}\right)|\psi'\rangle = \exp\left(i\frac{\alpha}{2} X_{[j]} X_{[j+1]}\right)|\psi'\rangle. \qquad (17)$$

The gates (13)-(15) form a gate set that can be easily converted into the standard universal set [16].


B.  Readout 

As an example for global readout a measurement of the $z$-component of the total spin is considered. The measured observable is

$$S_z = \sum_i Z_i. \qquad (18)$$

This is a model for readout, e.g., of atomic qubits via resonance fluorescence spectroscopy, if the atoms are well within the Lamb-Dicke limit (their separation and fluctuation of position are much smaller than the optical wavelength). In this case the underlying physics prohibits the extraction of local information. The readout method described below equally works for a scenario where a local readout could in principle be performed but is not pursued due to technological limitations. For specification of such a measurement model, see remark 3 in Sec. III.

With the capability to perform arbitrary unitary evolution on a quantum register the total spin observable $S_z$ acting on that register may be conjugated into any desired observable, and the standard network readout of the individual $Z_i$ should be feasible. However, for the setting described here the readout procedure is, in the case of probabilistic algorithms, slightly complicated by the fact that two copies of the algorithm are run simultaneously and the readout measurement couples them.

For the readout each of the two circuits on the chain requires one additional logical qubit that is in the state $|0\rangle$ until the readout starts. The location of this qubit within the logical quantum register is denoted as $[0]$, which shall correspond to the physical position $2n+1$ in the qubit chain [see Fig. 2(a)]. The readout consists of three steps. First, $S_z$ is measured and the outcome $m$ is recorded. Second, for all logical qubits $j = 1 \ldots n$ the following procedure is performed. (2.1) Apply controlled-NOT (CNOT) gates $\Lambda(X)_{[j],[0]} \otimes \Lambda(X)_{[\bar j],[\bar 0]}$ ($[0]$, $[\bar 0]$ are the target qubits). (2.2) Measure the observable $S_z$ and record the outcome $m(j)$. (2.3) Apply the CNOT gates $\Lambda(X)_{[j],[0]} \otimes \Lambda(X)_{[\bar j],[\bar 0]}$ again. Third, denote by $J = \{j_1,\ldots,j_{|J|}\}$ those $j \le n$ for which $m - m(j) = 2$. If $|J| < 2$ the readout is finished. Otherwise, for all $k = 2 \ldots |J|$ perform the following procedure. (3.1) Apply the Toffoli gates $\Lambda_2(X)_{[j_1],[j_k];[0]} \otimes \Lambda_2(X)_{[\bar j_1],[\bar j_k];[\bar 0]}$ ($[0]$, $[\bar 0]$ are the target qubits). (3.2) Measure $S_z$ and record the measurement outcome $m(j_1,j_k)$. (3.3) Apply the Toffoli gates $\Lambda_2(X)_{[j_1],[j_k];[0]} \otimes \Lambda_2(X)_{[\bar j_1],[\bar j_k];[\bar 0]}$ again.

FIG. 2. (Color online) Arrangement of logical qubits within the chain (a) and pulse sequences for gates (b), for $n = 3$. (a) Only every second physical qubit in the chain (•) is used as a logical qubit. The qubits in between (○) are ancillas. Physical positions 1 to 14 carry the logical qubits $[1], [2], [3], [0]$ and their mirror images $[\bar 0], [\bar 3], [\bar 2], [\bar 1]$; the qubits $[0]$ and $[\bar 0]$ are required in the readout. (b) Pulse sequences for the implementation of the universal gates ($Z$ rotation on qubit [2], $X$ rotation on qubit [2], $XX$ gate on qubits [1], [2]), during the first half of a clock cycle. The second half-cycle is analogous, with $\alpha \to -\alpha$. The symbol ■ denotes application of the elementary transition function $T$. The vertical bars denote pulses (2).

The conditional phase and Toffoli gates are their own inverse such that the above protocol amounts to a sequence of measurements where the first one is of $S_z$ and the following ones are of conjugated observables. These observables mutually commute because they are all diagonal in the computational basis, and the final state is a simultaneous eigenstate of them. The observables measured in step 2 are $Z_{[0]}Z_{[j]} + Z_{[\bar 0]}Z_{[\bar j]} + \sum_{i \neq [0],[\bar 0]} Z_i$. Since the qubits at locations $[0]$ and $[\bar 0]$ are individually in the state $|0\rangle$, and the observable $S_z = \sum_i Z_i$ already has been measured in step 1, step 2 effectively acts as the measurement of the observables $Z_{[j]} + Z_{[\bar j]}$.

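Similarly, step 3 amounts to the measurement of the observables $|11\rangle_{[j_1],[j_k]}\langle 11| + |11\rangle_{[\bar j_1],[\bar j_k]}\langle 11|$, $\forall k$. This measurement assigns a sharp value to $r_{[j_1]}r_{[j_k]} + r_{[\bar j_1]}r_{[\bar j_k]}$. Its purpose is, in combination with the information gathered in step 2, to discriminate between the two cases $\{r_{[j_1]} = r_{[j_k]},\ r_{[\bar j_1]} = r_{[\bar j_k]}\}$ and $\{r_{[j_1]} \neq r_{[j_k]},\ r_{[\bar j_1]} \neq r_{[\bar j_k]}\}$, for each $k$.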

FIG. 3. (Color online) The solution for the recursion relations (A3) in free space. Each vertex in the lattice denotes a space-time point $(i,t)$. The symbol • indicates $v_i(t) = 1$ and + indicates $v_i(t) = 0$. To obtain the solution for the case with boundary from the free space solution, the lattice is “folded back” to the space interval $[0,N+1]$.

The state of the two quantum registers after the measurement, leaving out all the ancillas in the state $|0\rangle$, may be written as a superposition $\sum_a c_a |r^{(a)}\rangle|\bar r^{(a)}\rangle$ of computational basis states $|r^{(a)}\rangle|\bar r^{(a)}\rangle$, where $r = (r_{[1]},\ldots,r_{[n]})^T$, $\bar r = (r_{[\bar 1]},\ldots,r_{[\bar n]})^T$. The measurements then impose constraints upon which basis states are admissible, i.e., occur with nonzero coefficients $c_a$. The constraints imposed by the measurements in step 2 read

$$r_{[j]} = r_{[\bar j]} = 0 \quad \text{for } m - m(j) = 0,$$
$$r_{[j]} + r_{[\bar j]} \bmod 2 = 1 \quad \text{for } m - m(j) = 2,$$
$$r_{[j]} = r_{[\bar j]} = 1 \quad \text{for } m - m(j) = 4, \qquad (19)$$

for all $j = 1 \ldots n$. The constraints that arise from the measurements in step 3 are

$$r_{[j_1]} + r_{[j_k]} \bmod 2 = 0 \quad \text{for } m - m(j_1,j_k) = 2,$$
$$r_{[j_1]} + r_{[j_k]} \bmod 2 = 1 \quad \text{for } m - m(j_1,j_k) = 0, \qquad (20)$$

for $j_1, j_k \in J$, $k = 2 \ldots |J|$. The system (19) and (20) of linear equations has a unique solution if $J = \emptyset$ and two solutions otherwise. In the latter case, the two solutions $r^{(0)}$ and $r^{(1)}$ correspond to two states $|r^{(0)}\rangle|\bar r^{(0)}\rangle$ and $|r^{(1)}\rangle|\bar r^{(1)}\rangle$, and the state of the quantum register after readout is their reflection-symmetric linear combination. The solutions $r^{(0)}$ and $r^{(1)}$ are, modulo interchange, uniquely specified by the measurement outcomes $m$, $\{m(j)\}$ and $\{m(j_1,j_k)\}$. For a probabilistic algorithm, the automaton produces two potential solutions in each run.
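The classical post-processing implied by constraints (19) and (20) can be sketched as follows. This is only an illustration for a computational basis state $|r\rangle|\bar r\rangle$, assuming that each qubit in $|0\rangle$ contributes $+1$ to $S_z$, that there are $2n$ ancillas plus the two readout qubits, and that the conjugated observables of steps 2 and 3 take the form described in the text. The helper names `outcomes` and `decode` are hypothetical.

```python
def outcomes(r, rbar):
    """Outcomes m, m(j), m(j1, jk) for a basis state |r>|rbar>.

    Assumes all ancillas and both readout qubits are in |0> (eigenvalue +1).
    """
    n = len(r)
    sign = lambda bit: 1 - 2 * bit        # bit 0 -> +1, bit 1 -> -1
    ancillas = 2 * n                      # interlaced ancillas of both copies
    m = sum(map(sign, r)) + sum(map(sign, rbar)) + 2 + ancillas
    # Step 2: Z_[0] is replaced by Z_[0] Z_[j] (and likewise for the mirror copy)
    m_j = [m - 2 + sign(r[j]) + sign(rbar[j]) for j in range(n)]
    J = [j for j in range(n) if m - m_j[j] == 2]
    # Step 3: Z_[0] picks up the sign (-1)^(r_j1 * r_jk)
    m_jk = {k: m - 2 + sign(r[J[0]] & r[k]) + sign(rbar[J[0]] & rbar[k])
            for k in J[1:]}
    return m, m_j, m_jk


def decode(m, m_j, m_jk):
    """Solve constraints (19) and (20); returns one or two (r, rbar) pairs."""
    n = len(m_j)
    J = [j for j in range(n) if m - m_j[j] == 2]
    solutions = []
    for b in ((0, 1) if J else (0,)):     # free choice of r_[j1] if J is nonempty
        r, rbar = [0] * n, [0] * n
        for j in range(n):
            if m - m_j[j] == 4:           # third line of Eq. (19)
                r[j] = rbar[j] = 1
        for j in J:
            same = (j == J[0]) or (m - m_jk[j] == 2)   # Eq. (20)
            r[j] = b if same else 1 - b
            rbar[j] = 1 - r[j]            # second line of Eq. (19)
        solutions.append((tuple(r), tuple(rbar)))
    return solutions


print(decode(*outcomes((1, 0, 1), (0, 0, 0))))
```

The two returned pairs are mirror images of each other, in line with the statement that the automaton produces two potential solutions per run.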
C. Overhead

For the described method of universal computation via translation-invariant interaction it is relevant that the elementary transition function $T$ repeated a sufficient number of times reduces to the identity operation. Such an elementary “clock cycle” takes $2(N+1)$ time steps where $N$ is the length of the chain. But as described so far, one still ends up with a set of local and next-neighbor gates and thus pays the price of next-neighbor slowdown twice. This is not necessary. Before the resources are counted, a long-distance entangling gate is described. As compared to a logically equivalent composition of next-neighbor and local gates, it cuts the required operational resources by a factor of order $n$.

The natural gate set for the described scheme of quantum computation does not only contain local and next-neighbor gates, but all gates of the form

$$U_X(\alpha,L_1,L_2) = \exp\left(i\frac{\alpha}{2} \bigotimes_{j=L_1}^{L_2} X_{[j]}\right). \qquad (21)$$

As before, the logical position $[j]$ corresponds to the physical location $2j-1$ in the chain. The sequence for implementing $U_X(\alpha,L_1,L_2)$ is

$$U_X(\alpha,L_1,L_2) = [T^{t_3}\, Y T Y\, T^{t_2}\, U_x(-\alpha/2)\, T^{t_1}] \times [T^{t_3}\, Y T Y\, T^{t_2}\, U_x(\alpha/2)\, T^{t_1}], \qquad (22)$$

with $t_1 = L_2 - L_1$, $t_2 = L_1 + L_2 - 2$ and $t_3 = N - 2L_2 + 2$, such that $t_1 + t_2 + 1 + t_3 = N+1$. Now note that, with $|-\rangle := -X|-\rangle$,

$$e^{i\pi |{-}{-}\rangle_{l_1,l_2}\langle{-}{-}|} = U_X(\pi/2,l_1,l_2)\, U_X(-\pi/2,l_1+1,l_2)\, U_X(\pi/2,l_1+1,l_2-1)\, U_X(-\pi/2,l_1,l_2-1). \qquad (23)$$

Thus, a long-distance x-controlled spin flip can be implemented with four elementary gates (22). These four operations can be grouped into two pairs, with constant value of $L_2$ in each pair. Either pair of operations can be implemented in one clock cycle because the $Y$ pulse sequences of the two operations coincide. An x-controlled spin flip thus takes two clock cycles irrespective of the distance between control and target qubit.
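Relation (23) can be checked directly, since all four factors are diagonal in the $X$ eigenbasis. The sketch below assumes the reading $U_X(\alpha,L_1,L_2) = \exp(i\frac{\alpha}{2}\bigotimes_{j=L_1}^{L_2} X_{[j]})$ for Eq. (21); the function names are illustrative. It compares the product of the four eigenvalues with the eigenvalue of $e^{i\pi|{-}{-}\rangle\langle{-}{-}|}$, which is $-1$ when both end qubits are in $|-\rangle$ and $+1$ otherwise.

```python
import cmath
from itertools import product


def u_x_eigenvalue(alpha, L1, L2, x):
    """Eigenvalue of U_X(alpha, L1, L2) on an X-basis product state.

    x maps a logical position to its X eigenvalue (+1 or -1).
    An empty range (L1 > L2) gives the global phase exp(i*alpha/2).
    """
    parity = 1
    for j in range(L1, L2 + 1):
        parity *= x[j]
    return cmath.exp(1j * alpha / 2 * parity)


def check_eq_23(l1, l2):
    """Compare both sides of Eq. (23) on every X-basis state of sites l1..l2."""
    half_pi = cmath.pi / 2
    for values in product((+1, -1), repeat=l2 - l1 + 1):
        x = dict(zip(range(l1, l2 + 1), values))
        lhs = -1 if (x[l1] == -1 and x[l2] == -1) else 1
        rhs = (u_x_eigenvalue(half_pi, l1, l2, x)
               * u_x_eigenvalue(-half_pi, l1 + 1, l2, x)
               * u_x_eigenvalue(half_pi, l1 + 1, l2 - 1, x)
               * u_x_eigenvalue(-half_pi, l1, l2 - 1, x))
        if abs(lhs - rhs) > 1e-12:
            return False
    return True


print(check_eq_23(1, 4), check_eq_23(2, 3))
```

The result does not depend on the qubits between $l_1$ and $l_2$, which is the reason the gate acts at long distance.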

Now counting the resources, a quantum computation on $n$ qubits in the described scheme requires a qubit chain of length $N = 4n+2$. Each clock cycle takes $8n+6$ elementary time steps (applications of $T$), and each of the universal gates $\exp(i\alpha/2\, Z_l)$, $\exp(i\alpha/2\, X_l)$, and $e^{i\pi |{-}{-}\rangle_{l_1,l_2}\langle{-}{-}|}$ takes at most two clock cycles. As compared to the network model, there arises a constant overhead of 4 in the spatial and a linear overhead of $16n$ in temporal resources.
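As a small worked example of this resource count, the following sketch tabulates the chain length, the clock-cycle length $2(N+1)$, and the worst-case gate time of two clock cycles. The function name `resources` is illustrative.

```python
def resources(n):
    """Chain length, steps per clock cycle, and worst-case steps per gate."""
    chain_length = 4 * n + 2                   # N = 4n + 2
    steps_per_cycle = 2 * (chain_length + 1)   # 2(N + 1) = 8n + 6
    steps_per_gate = 2 * steps_per_cycle       # at most two clock cycles
    return chain_length, steps_per_cycle, steps_per_gate


print(resources(3))
```

For $n = 3$ this gives the 14-qubit chain shown in Fig. 2.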

III.  CONCLUDING  REMARKS 

Universal quantum computation can be performed by translation-invariant operations on a chain of qubits initialized in the state $|00\cdots0\rangle$. The described method requires translation-invariant interaction of Ising type and spatially uniform local rotations. As compared to the network model, there occurs a constant spatial and a linear temporal overhead.


Three remarks are in order. (1) In the described setting translation invariance is broken by the finite extension of the qubit chain. To create a setting with perfect translational invariance, one may replace the qubits by qutrits, where the additional level $|2\rangle$ does not take part in any interaction, and consider an $(N+1)$-qubit ring geometry instead of a chain. Set $|\varphi(i)\rangle = |0\cdots0\rangle_{1\ldots i-1}\,|2\rangle_i\,|0\cdots0\rangle_{i+1\ldots N+1}$. With either the translation-invariant superposition $\frac{1}{\sqrt{N+1}}\sum_{i=1}^{N+1}|\varphi(i)\rangle$ or mixture $\frac{1}{N+1}\sum_{i=1}^{N+1}|\varphi(i)\rangle\langle\varphi(i)|$ as the initial state the scheme works as before.

(2) It may occur that a qubit chain capable of the required interaction has been created but its length $N$ is unknown. It can be found out easily by repetitions of the following protocol: Initialize the chain in the state $|00\cdots0\rangle$, apply the transition function $T$ $t$ times and subsequently measure the spin observable $S_z = \sum_{i=1}^{N} Z_i$. The received signal $\langle S_z(t)\rangle$ carries a characteristic signature of $N$, namely $\langle S_z(t)\rangle = N\,\delta(t \bmod N+1)$.

(3) Deviating from the setting discussed in Sec. II B, the available measurement may be such that local information could in principle be retrieved but is not retrieved, due to technological limitations. As an example, consider cold atoms in an optical lattice which are read out via resonance fluorescence spectroscopy and whose separation is larger than the wavelength of the probing laser. In this case, the readout can be modeled by the map

$$\rho \to \rho'(m) \sim \sum_{\{m_i\}\,|\,\sum_i m_i = m} P(\{m_i\})\,\rho\,P(\{m_i\}), \qquad (24)$$

with the projectors $P(\{m_i\}) = \bigotimes_{i=1}^{N} (1 + m_i Z_i)/2$, and $m_i = \pm 1$ $\forall\, i = 1 \ldots N$. Every atom is individually measured but only the global signal $m = \sum_i m_i$ is recorded. For measurements of this type the readout procedure is the same as discussed in Sec. II and has the same efficiency.

APPENDIX: BIT REVERSAL

Sufficiently many applications of the elementary transition function $T$ reverse the order of qubits within the chain. As stated in Eq. (3), with $R$ the spatial reflection,

$$T^{N+1} = R.$$

This phenomenon is related to the transmission of one-qubit states through chains described in Ref. [17]. Relation (3) is now proved. What needs to be shown is $T^{N+1}(X_p) = X_{\bar p}$ and $T^{N+1}(Z_p) = Z_{\bar p}$, for all $p = 1,\ldots,N$, with $\bar p = N+1-p$. For this purpose, the vector space formulation [13-15] of the stabilizer formalism is used. In particular, I use the conventions and results of Ref. [15]. The evolution of the Pauli observables $X_p$, $Z_p$ is followed in terms of the binary quantities $a$ and $\epsilon$, introduced in Sec. II below Eq. (8).

First, the recursion relation $a(t+1) = Ca(t)$ is translated into a recursion relation for the $z$ part of $a$ alone and a relation that expresses $x$ in terms of $z$. From Eq. (9), taking into account that $\Gamma$ represents a line graph, the following recursion relations are obtained for $z = (v_1,\ldots,v_N)^T$ and $x = (w_1,\ldots,w_N)^T$:

$$v_1(t+1) = v_2(t) + v_1(t-1),$$
$$v_i(t+1) = v_{i-1}(t) + v_{i+1}(t) + v_i(t-1), \quad 1 < i < N,$$
$$v_N(t+1) = v_{N-1}(t) + v_N(t-1),$$
$$w_i(t+1) = v_i(t), \quad 1 \le i \le N. \qquad (A1)$$

We seek the solution of these recursion relations for the time $t = N+1$, with boundary conditions

$$v_i(-1) = 0, \quad v_i(0) = \delta(p-i), \quad \forall\, 1 \le i \le N. \qquad (A2)$$

The translation invariance of the recursion relations (A1) is broken by the finite extension of the chain. The problem of solving these recursion relations is now reduced to the translation-invariant case of the infinite chain. Note that if a configuration $\{\tilde v_i(t)\}$ obeys the recursion relations

$$\tilde v_i(t+1) = \tilde v_{i-1}(t) + \tilde v_{i+1}(t) + \tilde v_i(t-1), \qquad (A3)$$

then

$$v_i(t) = \sum_{l=-\infty}^{\infty} \left[\tilde v_{i+2l(N+1)}(t) + \tilde v_{-i+2l(N+1)}(t)\right] \qquad (A4)$$

obeys the recursion relation (A1). The reduction of the case with spatial boundary to the case without boundary is illustrated in Fig. 3.

It is easily checked that

$$\tilde v_i(t) = [\theta(p-i+t) + \theta(p-i-t-1)]\,\delta(p-i+t \bmod 2) \qquad (A5)$$

is the solution of Eq. (A3) with the boundary conditions $\tilde v_i(-1) = 0$, $\tilde v_i(0) = \delta(p-i)$, $\forall i$. If this expression is inserted into the RHS of Eq. (A4) one obtains a solution for $v_i(t)$ with boundary conditions (A2). It is observed that

$$v_i(t) \sim \delta(p-i+t \bmod 2),$$
$$w_i(t) \sim \delta(p-i+t+1 \bmod 2). \qquad (A6)$$


Next, the solution $v(t)$, Eqs. (A4) and (A5), is evaluated for $t = N+1$. One obtains

$$v_i(N+1) = \delta[p-(N+1-i)]. \qquad (A7)$$

This implies $T^{N+1}(Z_p) = \pm Z_{\bar p}$, with $\bar p = N+1-p$.

Next, the sign factor $(-1)^{\epsilon}$ in the above relation is worked out. Using the result of Ref. [15], the recursion relation for $\epsilon$ reads $\epsilon(t+1) = \epsilon(t) + z(t)^T \Gamma_L z(t) + x(t)^T z(t)$, with $\Gamma_L$ the lower triangular part of $\Gamma$. With Eq. (A6), $\epsilon(t) = \epsilon(0) = 0$ such that $T^{N+1}(Z_p) = Z_{\bar p}$. Finally, with $T(X_p) = Z_p$, $T^{N+1}(X_p) = T^{-1} T^{N+1} T(X_p) = X_{\bar p}$.
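The bit-reversal property can also be observed by iterating the recursion relations (A1) directly. The sketch below is an illustration, not part of the proof: it tracks only the binary vectors $z$ and $x$ (signs are ignored), uses fixed zeros at the fictitious sites $0$ and $N+1$ to reproduce the boundary lines of (A1), and the function name `propagate_z` is hypothetical.

```python
def propagate_z(p, N):
    """Iterate (A1) for N+1 steps, starting from Z_p, i.e. boundary conditions (A2).

    Returns the z part v(N+1) and the x part w(N+1) = v(N) as lists over sites 1..N.
    """
    v_prev = [0] * (N + 2)   # v(-1); entries 0 and N+1 stay zero (chain ends)
    v_cur = [0] * (N + 2)
    v_cur[p] = 1             # v(0) = delta(p - i)
    for _ in range(N + 1):
        v_next = [0] * (N + 2)
        for i in range(1, N + 1):
            v_next[i] = (v_cur[i - 1] + v_cur[i + 1] + v_prev[i]) % 2
        v_prev, v_cur = v_cur, v_next
    return v_cur[1:N + 1], v_prev[1:N + 1]


N = 6
for p in range(1, N + 1):
    z_part, x_part = propagate_z(p, N)
    print(p, z_part, x_part)
```

For every starting site $p$ the $z$ part ends up as a single entry at the mirror site $N+1-p$ and the $x$ part vanishes, in agreement with Eq. (A7).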
We describe a phase transition for long-range entanglement in a three-dimensional cluster state affected by noise. The partially decohered state is modeled by the thermal state of a short-range translation-invariant Hamiltonian. We find that the temperature at which the entanglement length changes from infinite to finite is nonzero. We give an upper and lower bound to this transition temperature.

DOI: 10.1103/PhysRevA.71.062313  PACS number(s): 03.67.Mn


I.  INTRODUCTION 

Nonlocality is an essential feature of quantum mechanics, put to the test by the famous Bell inequalities [1] and verified in a series of experiments (see, e.g., [2]). Entanglement [3] is an embodiment of this nonlocality which has become a central notion in quantum-information theory.

In realistic physical systems, decoherence represents a formidable but surmountable obstacle to the creation of entanglement among far distant particles. Devices such as quantum repeaters [4] and fault-tolerant quantum computers are being envisioned in which the entanglement length [5,6] is infinite, provided the noise is below a critical level. Here we are interested in the question of whether an infinite entanglement length can also be found in spin chains with a short-range interaction that are subjected to noise. A prerequisite for our investigation is the existence of systems with infinite entanglement length at zero temperature. An example of such behavior has been discovered by Verstraete, Martín-Delgado, and Cirac [7] with spin-1 chains in the Affleck-Kennedy-Lieb-Tasaki model [8], and by Pachos and Plenio with cluster Hamiltonians [9]; see also [10]. In this paper, we study the case of finite temperature. We present a short-range, translation-invariant Hamiltonian for which the entanglement length remains infinite until a critical temperature $T_c$ is reached. The system we consider is a thermal cluster state in three dimensions. We show that the transition from infinite to finite entanglement length occurs in the interval $0.30\Delta \le T_c \le 1.15\Delta$, with $\Delta$ being the energy gap of the Hamiltonian.

We consider a simple three-dimensional (3D) cubic lattice $C$ with one spin-1/2 particle (qubit) living at each vertex of the lattice. Let $X_u$, $Y_u$, and $Z_u$ be the Pauli operators acting on the spin at a vertex $u \in C$. The model Hamiltonian is

$$H = -\frac{\Delta}{2}\sum_{u \in C} K_u, \quad K_u = X_u \prod_{v \in N(u)} Z_v. \qquad (1)$$

Here $N(u)$ is a set of nearest neighbors of vertex $u$. The ground state of $H$ obeys eigenvalue equations $K_u|\phi\rangle_C = |\phi\rangle_C$ and coincides with a cluster state [11]. We define a thermal cluster state at a temperature $T$ as

$$\rho_{CS} = \frac{1}{Z}\exp(-\beta H), \qquad (2)$$

where $Z = \mathrm{Tr}\, e^{-\beta H}$ is a partition function and $\beta = T^{-1}$. Since all terms in $H$ commute, one can easily get

$$\rho_{CS} = \frac{1}{2^{|C|}} \prod_{u \in C} \left[I + \tanh(\beta\Delta/2)\, K_u\right]. \qquad (3)$$

Let $A,B \subset C$ be two distant regions on the lattice. Our goal is to create as much entanglement between $A$ and $B$ as possible by doing local measurements on all spins not belonging to $A \cup B$. Denote $a$ as the list of all outcomes obtained in these measurements and $\rho_a^{AB}$ as the state of $A$ and $B$ conditioned on the outcomes $a$. Let $E[\rho]$ be some measure of bipartite entanglement. Following [5] we define the localizable entanglement between $A$ and $B$ as

$$E(A,B) = \max \sum_a p_a E[\rho_a^{AB}], \qquad (4)$$

where $p_a$ is a probability to observe the outcome $a$ and the maximum is taken over all possible patterns of local measurements. To specify the entanglement measure $E[\rho]$ it is useful to regard $\rho_a^{AB}$ as an encoded two-qubit state with the first logical qubit residing in $A$ and the second in $B$. We choose $E[\rho]$ as the maximum amount of two-qubit entanglement (as measured by entanglement of formation) contained in $\rho$. Thus $0 \le E(A,B) \le 1$ and an equality $E(A,B) = 1$ implies that a perfect Bell pair can be created between $A$ and $B$. Conversely, $E(A,B) = 0$ implies that any choice of a measurement pattern produces a separable state.

In this paper we consider a finite 3D cluster

$$C = \{u = (u_1,u_2,u_3) : 1 \le u_1, u_2+1 \le l;\ 1 \le u_3 \le d\}$$

and choose a pair of opposite 2D faces as $A$ and $B$:

$$A = \{u \in C : u_3 = 1\}, \quad B = \{u \in C : u_3 = d\},$$

so that the separation between the two regions is $d-1$. In Sec. II we show that$^1$

$$\lim_{l,d \to \infty} E(A,B) = 1 \quad \text{for } T < 0.30\Delta.$$

Further, we show in Sec. III that if $T > 1.15\Delta$ then $E(A,B) = 0$ for $d \ge 2$ and arbitrarily large $l$.

$^1$References [15,16] consider a lattice with proportions of a cube, corresponding to $l = d$. However, numerical simulations indicate that $\lim_{l,d \to \infty} E(A,B) = 1$ even if $l = C \ln(d)$; see remarks to Sec. II.

II. LOWER BOUND

We relate the lower bound on the transition temperature to quantum error correction. From Eq. (3) it follows that $\rho_{CS}$ can be prepared from the perfect cluster state $|\phi\rangle_C$ by applying the Pauli operator $Z_u$ to each spin $u \in C$ with a probability

$$p = \frac{1}{1 + \exp(\beta\Delta)}. \qquad (5)$$

Thus, thermal fluctuations are equivalent to independent local $Z$ errors with an error rate $p$.
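The correspondence between temperature and error rate is easy to tabulate. The sketch below assumes the reading $p = 1/(1+\exp(\beta\Delta))$ with $\beta = 1/T$ and measures temperature in units of the gap; the function names are illustrative. The relation $1 - 2p = \tanh(\beta\Delta/2)$ connects it to the product form of the thermal state in Eq. (3).

```python
import math


def error_rate(T, gap=1.0):
    """Local Z error rate p = 1 / (1 + exp(gap / T)) of the thermal cluster state."""
    return 1.0 / (1.0 + math.exp(gap / T))


def temperature(p, gap=1.0):
    """Inverse relation: temperature at which the local error rate equals p."""
    return gap / math.log((1.0 - p) / p)


p = error_rate(0.30)   # error rate at T = 0.30 * gap
print(p, temperature(p))
```

At $T = 0.30\Delta$ the error rate is a few percent, and the rate approaches $1/2$ as $T \to \infty$.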

We use a single copy of $\rho_{CS}$ and apply a specific pattern of local measurements which creates an encoded Bell state among sets of particles in $A$ and $B$. For encoding we use the planar code, which belongs to the family of surface codes introduced by Kitaev. The 3D cluster state has, as opposed to its 1D counterpart [11], an intrinsic error correction capability which we use in the measurement pattern described below. Therein, the measurement outcomes are individually random but not independent; parity constraints exist among them. The violation of any of these indicates an error. Given sufficiently many such constraints, the measurement outcomes specify a syndrome from which typical errors can be reliably identified. The optimal error correction given this syndrome breaks down at a certain error rate (temperature), and the Bell correlations can no longer be mediated. This temperature is a lower bound to $T_c$, because in principle there may exist a more effective measurement pattern.

To describe the measurement pattern we use, let us introduce two cubic sublattices $T_e, T_o \subset C$ with a double spacing. Each qubit $u \in C$ becomes either a vertex or an edge in one of the sublattices $T_e$ and $T_o$. The sets of vertices $V(T_e)$ and $V(T_o)$ are defined as

$$V(T_e) = \{u = (e,e,e) \in C\},$$
$$V(T_o) = \{u = (o,o,o) \in C\},$$

where $e$ and $o$ stand for even and odd coordinates. The sets of edges $E(T_e)$ and $E(T_o)$ are defined as

$$E(T_e) = \{u = (e,e,o),(e,o,e),(o,e,e) \in C\},$$
$$E(T_o) = \{u = (o,o,e),(o,e,o),(e,o,o) \in C\}.$$

The lattices $T_e$, $T_o$ play an important role in the identification of error correction on the cluster state with a $Z_2$ gauge model [12]. They are displayed in Fig. 2 below.

Let us assume that the lengths $l$ and $d$ are odd.$^2$ The Bell pair to be created between $A$ and $B$ will be encoded into subsets of qubits

$$L = \{u = (o,e,1),(e,o,1) \in C\} \subset A,$$
$$R = \{u = (o,e,d),(e,o,d) \in C\} \subset B.$$

$^2$There is no loss of generality here since one can decrease the size of the lattice by measuring all of the qubits on some of the 2D faces in the $Z$ basis.

FIG. 1. (Color online) (a) Measurement pattern on the first and last slice of $C$, for $l = 5$. The resulting state is in the code space of the planar code. (The unmeasured qubits are displayed as shaded circles.) (b) Lattice for the planar code. (c) and (d) Measurement pattern for even and odd inner slices.


Each qubit $u \in C$ is measured either in the $Z$ or $X$ basis unless it belongs to $L$ or $R$. Denoting by $M_X$ and $M_Z$ local $X$ and $Z$ measurements, we can now present the measurement pattern:

$$M_Z: \ \forall\, u \in V(T_e) \cup V(T_o),$$
$$M_X: \ \forall\, u \in E(T_e) \cup E(T_o) \setminus (L \cup R). \qquad (6)$$

We denote the measurement outcome $\pm 1$ at vertex $u$ by $z_u$ or $x_u$, respectively. A graphic illustration of the measurement patterns for the individual slices is given in Fig. 1.
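The assignment of measurement bases in Eq. (6) depends only on the parities of the three coordinates. A minimal sketch, with a hypothetical helper name and with qubits in $L$ or $R$ assumed to be excluded by the caller:

```python
def measurement_basis(u):
    """Return 'Z' for vertices of T_e or T_o and 'X' for their edges, as in Eq. (6).

    u = (u1, u2, u3) is a cluster coordinate; qubits in L or R are not measured
    and must be filtered out before calling this function.
    """
    parities = tuple(k % 2 for k in u)
    if parities in ((0, 0, 0), (1, 1, 1)):
        return "Z"   # (e,e,e) or (o,o,o): a sublattice vertex
    return "X"       # the six mixed-parity patterns are sublattice edges


# Count the bases over one 2 x 2 x 2 block of coordinates
block = [(a, b, c) for a in (1, 2) for b in (1, 2) for c in (1, 2)]
print(sorted(measurement_basis(u) for u in block))
```

Two of the eight parity classes are vertices and six are edges, so every cluster qubit is covered exactly once.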

Before we consider errors, let us discuss the effect of this measurement pattern on a perfect cluster state. Consider some fixed outcomes $\{x_u\}$, $\{z_u\}$ of local measurements and let $|\psi\rangle_{LR}$ be the reduced state of the unmeasured qubits $L$ and $R$. We will now show that $|\psi\rangle_{LR}$ is, modulo local unitaries, an encoded Bell pair, with each qubit encoded by the planar code [13], the planar counterpart of the toric code [14]. The initial cluster state obeys eigenvalue equations $K_u|\phi\rangle_C = |\phi\rangle_C$. This implies for the reduced state

$$Z_{P,u}|\psi\rangle_{LR} = \lambda_{P,u}|\psi\rangle_{LR} \quad \forall\, u = (e,e,1), \qquad (7)$$

where $Z_{P,u} = \bigotimes_{v \in N(u) \cap L} Z_v$ is a plaquette (z-type) stabilizer operator for the planar code [13]. The eigenvalue $\lambda_{P,u}$ depends upon the measurement outcomes as $\lambda_{P,u} = x_u z_{(u_1,u_2,2)}$. Note that in the planar code the qubits live on the edges of a lattice rather than on its vertices. The planar code lattice is distinct from the cluster lattice $C$; see Figs. 1 and 2.

FIG. 2. (Color online) The measurement pattern on the cluster $C$. The sublattices $T_e$ and $T_o$ are displayed (thick lines). For reference, the cluster lattice is also shown (thin lines) and the axis labeling shows the cluster coordinates. Cluster qubits measured in the $Z$ basis (on the sites of $T_o$ and $T_e$) are displayed in black, and qubits measured in the $X$ basis (on the edges of $T_o$ and $T_e$) are displayed in gray (red). The large circles to the left and to the right denote the unmeasured qubits which form the encoded Bell pair. The measurement pattern has a bcc symmetry.

From the equation $\prod_{v \in N(u)} K_v|\phi\rangle_C = |\phi\rangle_C$, for $u = (o,o,1)$, we obtain

$$X_{S,u}|\psi\rangle_{LR} = \lambda_{S,u}|\psi\rangle_{LR} \quad \forall\, u = (o,o,1), \qquad (8)$$

where $X_{S,u} = \bigotimes_{v \in N(u) \cap L} X_v$ coincides with a site (x-type) stabilizer operator for the planar code [13], and $\lambda_{S,u} = x_{(u_1,u_2,2)}\, z_u \prod_{v \in N_o(u)} z_v$, where $N_o$ refers to a neighborhood relation on the sublattice $T_o$. The code stabilizer operators in Eqs. (7) and (8) are algebraically independent. There are $(l^2-1)/2$ code stabilizer generators for $(l^2+1)/2$ unmeasured qubits, such that there exists one encoded qubit on $L$. By direct analogy, there is also one encoded qubit located on $R$.
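The counting of qubits and stabilizer generators on the face $u_3 = 1$ can be reproduced by enumeration. The sketch below assumes the coordinate ranges $1 \le u_1 \le l$ and $0 \le u_2 \le l-1$, which is one reading of the definition of $C$ ($1 \le u_1, u_2+1 \le l$); the function name `count_code` is illustrative.

```python
def count_code(l):
    """Count unmeasured qubits in L and stabilizer generators on the face u3 = 1.

    Assumed coordinate ranges: 1 <= u1 <= l and 0 <= u2 <= l - 1, with l odd.
    """
    us1 = range(1, l + 1)
    us2 = range(0, l)
    odd = lambda k: k % 2 == 1
    # L: sites (o,e,1) and (e,o,1)
    qubits = [(a, b) for a in us1 for b in us2 if odd(a) != odd(b)]
    # plaquette stabilizers sit at (e,e,1), site stabilizers at (o,o,1)
    plaquettes = [(a, b) for a in us1 for b in us2 if not odd(a) and not odd(b)]
    sites = [(a, b) for a in us1 for b in us2 if odd(a) and odd(b)]
    return len(qubits), len(plaquettes) + len(sites)


for l in (3, 5, 7):
    n_qubits, n_generators = count_code(l)
    print(l, n_qubits, n_generators, n_qubits - n_generators)
```

Under this reading the difference between the two counts is one for every odd $l$, i.e., a single encoded qubit.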

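Next, we show that $|\psi\rangle_{LR}$ is an eigenstate of $\bar X_L \bar X_R$ and $\bar Z_L \bar Z_R$, where $\bar X$ and $\bar Z$ are the encoded Pauli operators $X$ and $Z$, respectively, i.e., $|\psi\rangle_{LR}$ is an encoded Bell pair. The encoded Pauli operators [13] on $L$ and $R$ are $\bar X_{L[R]} = \bigotimes_{u_1\ \mathrm{odd}} X_{(u_1,u_2,1[d])}$ for any even $u_2$, and $\bar Z_{L[R]} = \bigotimes_{u_2\ \mathrm{even}} Z_{(u_1,u_2,1[d])}$ for any odd $u_1$. To derive the Bell correlations of $|\psi\rangle_{LR}$ let us introduce 2D slices

$$T_{XX}^{(u_2)} = \{u = (o,u_2,o) \in C\} \subset T_o,$$
$$T_{ZZ}^{(u_1)} = \{u = (u_1,e,e) \in C\} \subset T_e.$$

The eigenvalue equation $\prod_{v \in T_{XX}^{(u_2)}} K_v|\phi\rangle_C = |\phi\rangle_C$ with even $u_2$ implies for the reduced state

$$\bar X_L \bar X_R |\psi\rangle_{LR} = \lambda_{XX} |\psi\rangle_{LR}, \qquad (9)$$

with $\lambda_{XX} = \prod_{v \in T_{XX}^{(u_2+1)} \cup T_{XX}^{(u_2-1)}} z_v \prod_{v \in T_{XX}^{(u_2)} \setminus (L \cup R)} x_v$. Here and hereafter it is understood that $x_u = z_u = 1$ for all $u \notin C$. Similarly, from $|\phi\rangle_C = \prod_{v \in T_{ZZ}^{(u_1)}} K_v|\phi\rangle_C$, for $u_1$ odd, we obtain for the reduced state

$$\bar Z_L \bar Z_R |\psi\rangle_{LR} = \lambda_{ZZ} |\psi\rangle_{LR}, \qquad (10)$$

with $\lambda_{ZZ} = \prod_{v \in T_{ZZ}^{(u_1+1)} \cup T_{ZZ}^{(u_1-1)}} z_v \prod_{v \in T_{ZZ}^{(u_1)}} x_v$. Thus the eigenvalue equations (7)-(10) show that the measurement pattern of Eq. (6) projects the initial perfect cluster state into a state equivalent under local unitaries to the Bell pair, with each qubit encoded by the planar code.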

It is crucial that the measurement outcomes $\{z_u\}$ and $\{x_v\}$
are not completely independent. Indeed, for any vertex $u \in T_o$
with $1 < u_3 < d$ the eigenvalue equation
$\prod_{v \in N(u)} K_v |\phi\rangle_C = |\phi\rangle_C$ implies the constraint

$\prod_{v \in N(u)} x_v \prod_{w \in N_o(u)} z_w = 1$. (11)

Analogously, for any vertex $u \in T_e$ one has a constraint

$\prod_{v \in N(u)} x_v \prod_{w \in N_e(u)} z_w = 1$, (12)

where $N_e$ refers to a neighborhood relation on the lattice $T_e$.
Thus there exists one syndrome bit for each vertex of $T_e$ and
$T_o$ (with an exception for the vertices of $T_o$ with $u_3 = 1$ or
$u_3 = d$).

What are the errors detected by these syndrome bits?
Since we have only $Z$ errors (for generalization, see Remark
1), only the $X$ measurements are affected by them. Each
$X$-measured qubit is on an edge of either $T_o$ or $T_e$. Thus, we can
identify the locations of the elementary errors with $E(T_o)$ and
$E(T_e)$. From Eqs. (11) and (12), each error located on an
edge creates a syndrome at its end vertices.

Let us briefly compare with [12]. Therein, independent
local $X$ and $Z$ errors were considered for storage whose correction
runs completely independently. The $X$ errors in this
model correspond to our $Z$ errors on qubits in $E(T_e)$, and the
$Z$ storage errors to our $Z$ errors on qubits in $E(T_o)$, if the $X$
and $Z$ error correction phases in [12] are pictured as alternating
in time.

The syndrome information provided by Eqs. (11) and (12)
is not yet complete. There are two important issues to be
addressed: (i) There are no syndrome bits at the vertices of
$T_o$ with $u_3 = 1$ or $u_3 = d$; (ii) edges of $T_e$ with $u_3 = 1$ or $u_3 = d$
have only one end vertex, so errors that occur on these edges
create only one syndrome bit. Concerning (i), to get the
missing syndrome bits we will measure eigenvalues $\lambda_{P,u}$ and
$\lambda_{S,u}$ for the plaquette and the site stabilizer operators living
on the faces $A$ and $B$ [see Eqs. (7) and (8)]. Such measurements
are local operations within $A$ or within $B$, so they
cannot increase entanglement between $A$ and $B$. For any
$u = (o,o,1)$ or $u = (o,o,d)$ it follows from Eq. (8) that

$\lambda_{S,u}\, x_{(u_1,u_2,2)}\, z_u \prod_{v \in N_o(u)} z_v = 1$ for $u_3 = 1$,

$\lambda_{S,u}\, x_{(u_1,u_2,d-1)}\, z_u \prod_{v \in N_o(u)} z_v = 1$ for $u_3 = d$. (13)

For any vertex $u = (o,o,1)$ or $u = (o,o,d)$ there are several
edges of the lattice $T_o$ incident on $u$. It is easy to see that a
single $Z$ error that occurs on any of these edges changes a
sign in Eqs. (13). Thus, these two constraints yield the syndrome
bits living at the vertices $u = (o,o,1)$ and $u = (o,o,d)$,
so the issue (i) is addressed. Concerning (ii), we make use of
Eq. (7) and obtain

$\lambda_{P,u}\, x_u\, z_{(u_1,u_2,2)} = 1$ for any $u = (e,e,1)$,

$\lambda_{P,u}\, x_u\, z_{(u_1,u_2,d-1)} = 1$ for any $u = (e,e,d)$. (14)

Since we have only $Z$ errors, the eigenvalues $\lambda_{P,u}$ and the
outcomes $z_{(u_1,u_2,2)}$, $z_{(u_1,u_2,d-1)}$ are not affected by errors. Thus
the syndrome bits Eqs. (14) are equal to $-1$ if and only if an
error has occurred on the edge $u = (e,e,1)$ or $u = (e,e,d)$ of
the lattice $T_e$. Since each of these errors shows itself in a
corresponding syndrome bit which is not affected by any
other error, we can reliably identify these errors. This is
equivalent to actively correcting them with unit success
probability. We can therefore assume in the subsequent
analysis that no errors occur on the edges $(e,e,1)$ and
$(e,e,d)$, which concludes the discussion of the issue (ii).

As in [14], we define an error chain $\mathcal{E}$ as a collection of
edges where an elementary error has occurred. Each of the
two lattices $T_e$ and $T_o$ has its own error chain. An error chain
$\mathcal{E}$ shows a syndrome only at its boundary $\partial(\mathcal{E})$, and errors
with the same boundary thus have the same syndrome. One
may identify an error $\mathcal{E}$ only modulo a cycle $D$, $\mathcal{E}' = \mathcal{E} + D$,
with $\partial(D) = 0$.
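
The relation between error chains, boundaries, and cycles can be illustrated with a small sketch. It is not taken from the paper: the vertex coordinates, the helper names (`edge`, `boundary`, `add_chains`), and the choice of a single plaquette loop as the cycle are assumptions made only for illustration. A chain is stored as a set of edges, chains are added modulo 2, and the syndrome is the set of vertices touched an odd number of times.

```python
from collections import Counter

def edge(u, v):
    """Undirected edge, stored as a frozenset so that (u, v) equals (v, u)."""
    return frozenset((u, v))

def boundary(chain):
    """Vertices touched an odd number of times by the edges of the chain."""
    counts = Counter(vertex for e in chain for vertex in e)
    return {vertex for vertex, n in counts.items() if n % 2 == 1}

def add_chains(a, b):
    """Sum of two chains modulo 2 (symmetric difference of the edge sets)."""
    return a ^ b

# An open error chain of two edges (hypothetical coordinates).
E = {edge((0, 0, 0), (1, 0, 0)), edge((1, 0, 0), (2, 0, 0))}

# A closed loop around one plaquette; it shares one edge with E.
D = {
    edge((0, 0, 0), (1, 0, 0)),
    edge((1, 0, 0), (1, 1, 0)),
    edge((1, 1, 0), (0, 1, 0)),
    edge((0, 1, 0), (0, 0, 0)),
}

print(boundary(E))                                # end points of the open chain
print(boundary(D))                                # a cycle has an empty boundary
print(boundary(add_chains(E, D)) == boundary(E))  # same syndrome for E and E + D
```

Because the syndrome cannot distinguish `E` from `E + D`, any decoder can at best recover the error up to such a cycle, which is why only the homology class of the cycle matters.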

There are homologically trivial and nontrivial cycles. A
cycle $D$ is trivial if it is a closed loop in $T_o$ ($T_e$), and
homologically nontrivial if it stretches from one rough face in $T_o$
($T_e$) to another. A rough face here is the 2D analog of a rough
edge on a planar code [13]. The rough faces of $T_o$ are on the
upper and lower sides of $C$, and the rough faces of $T_e$ are on
the front and back of $C$ (recall that no errors occur on the left
and right rough faces of $T_e$).

Let us now study the effect of error cycles on the identification
of the state $|\psi\rangle_{LR}$ from the measurement outcomes.
We only discuss the error chains on $T_o$ here, which potentially
affect the eigenvalue Eq. (9). The discussion of the
error chains in $T_e$, which disturb the $\bar Z_L \bar Z_R$ correlations, is
analogous. An individual qubit error on $v \in C$ will modify the
$\bar X_L \bar X_R$ correlation of $|\psi\rangle_{LR}$ if it affects either $\bar X_L$, $\bar X_R$, or $\lambda_{XX}$.
That happens if $v \in T_o^{XX}$. Now, the vertices in $T_o^{XX}$ correspond
to edges in $T_o$. If an error cycle $D$ in $T_o$ is homologically
trivial, it intersects $T_o^{XX}$ in an even number of vertices;
see Fig. 3. This has no effect on the eigenvalue Eq. (9).
However, if the cycle is homologically nontrivial, i.e., if it
stretches between the upper and lower face of $C$, then it
intersects $T_o^{XX}$ in an odd number of vertices. This does
modify the eigenvalue Eq. (9) by a sign factor of $(-1)$ on the
left-hand side, which leads to a logical error. Therefore, for
large system size, we require the probability of misinterpreting
the syndrome by a nontrivial cycle to be negligible [12]:

$\sum_{\mathcal{E}} \mathrm{prob}(\mathcal{E}) \sum_{D\ \mathrm{nontrivial}} \mathrm{prob}(\mathcal{E} + D \mid \mathcal{E}) \approx 0$. (15)

We have now traced back the problem of reconstructing an
encoded Bell pair $|\psi\rangle_{LR}$ to the same setting that was found in
[12] to describe fault-tolerant data storage with the toric
code. Via the measurement pattern Eq. (6), we may introduce
two lattices $T_o$, $T_e$ such that (1) syndrome bits are located on
the vertices of these lattices, (2) independent errors live on
the edges and show a syndrome on their boundary, and (3)
only the homologically nontrivial cycles give rise to a logical
error. This error model can be mapped onto a random
plaquette $Z_2$-gauge field theory in three dimensions [12,15]
which undergoes a phase transition between an ordered
low-temperature and a disordered high-temperature phase. In the
limit of $l, d \to \infty$, full error correction is possible in the
low-temperature phase.

FIG. 3. (Color online) A homologically nontrivial and a
homologically trivial error cycle on the lattice $T_o$. The nontrivial error
cycle stretches from one rough face to the opposite one while the
trivial error has both ends on the same face. Only the qubits belonging
to $T_o$ are shown and the qubits important for establishing the
$\bar X_L \bar X_R$ correlation are displayed enlarged.

In our setting, the error probabilities for all edges are
equal to $p$. For this case the critical error probability has been
computed numerically in a lattice simulation [16],
$p_c = 0.033 \pm 0.001$. This value corresponds, via Eq. (5), to
$T_c = (0.296 \pm 0.003)\,\Delta$.
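
The conversion from the critical error probability to a temperature can be checked numerically. Eq. (5) is not reproduced in this section, so the sketch below assumes that it has the form $p = 1/(1 + e^{\Delta/T})$; this form is an assumption, adopted here because it reproduces the quoted numbers. The function names are illustrative.

```python
import math

def error_probability(T, gap=1.0):
    """Z-error probability at temperature T (assumed form of Eq. (5))."""
    return 1.0 / (1.0 + math.exp(gap / T))

def temperature(p, gap=1.0):
    """Inverse of error_probability: temperature for error probability p."""
    return gap / math.log((1.0 - p) / p)

p_c, dp = 0.033, 0.001
T_c = temperature(p_c)          # central value, in units of the gap
T_hi = temperature(p_c + dp)    # larger error probability, higher temperature
T_lo = temperature(p_c - dp)    # smaller error probability, lower temperature

print(f"T_c = {T_c:.3f} gap, range [{T_lo:.3f}, {T_hi:.3f}]")
```

Under the assumed form of Eq. (5), the spread of $\pm 0.001$ in $p_c$ translates into roughly $\pm 0.003$ in $T_c/\Delta$.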

Remark 1. The error model equivalent to Eq. (3), i.e., $Z$
errors only, is very restricted. We have a physical motivation
for this model, but we would like to point out that the very
strong assumptions we have made about the noise are not
crucial to our result of the threshold error rate being nonzero.
One may, for example, generalize the error model from a
dephasing channel to a depolarizing channel, with
$p_x = p_y = p_z = p'/3$. Then, two changes need to be addressed, those in
the bulk and those on the faces $L$ and $R$. Concerning the
faces, the errors on the rough faces to the left and right of $T_e$
can no longer be unambiguously identified by measurements
of the code stabilizer (14), which raises the question of
whether, for depolarizing errors, it may be these surface
errors that set the threshold for long-range entanglement.
This is not the case. To see this, note that two slices of 2D
cluster states may be attached to the left and right of $C$, at
$u_3 = 0, -1$ and $u_3 = d+1, d+2$. The required operations are
assumed to be perfect. They do not change the localizable
entanglement between the left and right sides of the cluster $C$
because they act locally on the slices $-1, \ldots, 1$ and
$d, \ldots, d+2$, respectively. The subsets $A$ and $B$ of spins are relocated
to the slices $-1$ and $d+2$, with the corresponding changes in
the measurement pattern. The effect of this procedure is that
the leftmost and rightmost slices of the enlarged cluster are
error free (see footnote 3), and only the bulk errors matter.

FIG. 4. This figure plots data for simulations of error correction
on an $L \times L \times d_{\mathrm{toric}}$ lattice, with periodic boundary conditions in the
first two directions, for various $L$ and $d_{\mathrm{toric}}$ ($d = 2d_{\mathrm{toric}} + 1$, $l = 2L$).
The error rate is $p = 0.01$. The logarithms are base $e$. Two
standard deviations above and below the computed values (as given
by statistical noise due to the sample sizes) are shown by the error
bars. The solid lines each have slope 1, and they are spaced
equally apart. This lends good support to the model of fidelity
$F \sim \exp[-d k_1 \exp(-l k_2)]$ for error rates below threshold.

Concerning the bulk, note that the cluster qubits measured
in the $Z$ basis serve no purpose and may be left out from the
beginning. Then, the considered lattice for the initial cluster
state has a bcc symmetry and double spacing. The lattices
$T_o$, $T_e$ remain unchanged. Further, $X$ errors are absorbed in
the $X$ measurements and $Y$ errors act like $Z$ errors, such that
we still map to the original $Z_2$ gauge model [12] at the
Nishimori line. The threshold for local depolarizing channels
applied to this configuration is thus $p'_c = \tfrac{3}{2} p_c = 4.9\%$. In
addition, numerical simulations performed for the initial simple
cubic cluster and depolarizing channel yield an estimate of
the critical error probability of $p''_c = 1.4\%$.

Footnote 3. The following operations are required to attach a slice: (I) $\Lambda(Z)$
gates within the slice, (II) $\Lambda(Z)$ gates between the slice and its next
neighboring slice, and (III) $X$ and $Z$ measurements within the slice
(see Fig. 1). All these operations are assumed to be perfect, and the
errors on slices 1 and $d$ are not propagated to slices $-1$ and $d+2$ by
the $\Lambda(Z)$ gates (II).

FIG. 5. (Color online) (a) Correspondence between physical and
virtual qubits. Domains are shown by dashed lines. (b) A bipartite
cut of a cubic lattice. The regions $A$ and $B$ are highlighted.

Remark 2. Finite-size effects. We carried out numerical
simulations of error correction on an $l \times l \times d$ lattice with
periodic boundary conditions (as opposed to the open boundary
conditions of the planar codes within the cluster state).
For differing error rates below the threshold value of 2.9%
[15], we found good agreement for the fidelity $F$ between the
perfect and the error-corrected encoded Bell state with the
model $F \sim \exp[-d k_1 \exp(-l k_2)]$. Some data are shown in Fig.
4 corresponding to a $Z$ error rate of 1.0%. Provided that
planar codes and toric codes have similar behavior away
from threshold, our simulations suggest that, in order to
achieve constant fidelity, the length $l$ specifying the surface
code need only scale logarithmically with the distance $d$.

Remark 3. For even $d$, the construction presented above
can be used to mediate an encoded conditional $Z$ gate on
distant encoded qubits located on slices 1 and $d$.
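
The logarithmic scaling stated in Remark 2 can be read off from the fidelity model $F \sim \exp[-d k_1 \exp(-l k_2)]$ by solving for $l$ at fixed $F$, which gives $l = \ln[d k_1 / (-\ln F)] / k_2$. The following sketch evaluates this expression; the fit constants `k1` and `k2` and the target fidelity are hypothetical values chosen only to show the trend, not values from the simulations.

```python
import math

def required_code_length(d, target_fidelity, k1, k2):
    """Length l at which exp(-d*k1*exp(-l*k2)) equals target_fidelity."""
    # Solve d * k1 * exp(-l * k2) = -ln(F) for l.
    return math.log(d * k1 / (-math.log(target_fidelity))) / k2

def model_fidelity(d, l, k1, k2):
    """Fidelity model F = exp(-d * k1 * exp(-l * k2))."""
    return math.exp(-d * k1 * math.exp(-l * k2))

# Hypothetical fit constants and target, for illustration only.
k1, k2, target = 0.1, 0.5, 0.99

for d in (10, 100, 1000, 10000):
    l = required_code_length(d, target, k1, k2)
    print(d, round(l, 2), round(model_fidelity(d, l, k1, k2), 4))
```

Each tenfold increase in $d$ raises the required $l$ by the same amount, $\ln(10)/k_2$, which is the logarithmic dependence referred to in the remark.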

III.  UPPER  BOUND 

In this section we analyze the high-temperature behavior
of thermal cluster states and find an upper bound on the
critical temperature $T_c$. Our analysis is based on the isomorphism
between cluster states and the so-called valence bond
solids (VBSs) pointed out by Verstraete and Cirac in [17],
which can easily be generalized to a finite temperature.

With each physical qubit $u \in C$ we associate a domain $u.*$
of $d(u)$ virtual qubits, where $d(u) = |N(u)|$ is the number of
nearest neighbors of $u$ [see Fig. 5(a)]. Let us label virtual
qubits from a domain $u.*$ as $u.v$, $v \in N(u)$. Let $E$ denote
the set of edges of the lattice $C$ and define a thermal VBS
state $\rho_{\mathrm{VBS}}$ as

$\rho_{\mathrm{VBS}} = \prod_{e=(u,v) \in E} \frac{1}{4} (I + \omega_e X_{u.v} Z_{v.u})(I + \omega_e Z_{u.v} X_{v.u})$. (16)

Here $\{\omega_e\}$ are arbitrary weights such that $0 \le \omega_e \le 1$. It
should be emphasized that $\rho_{\mathrm{VBS}}$ is a state of virtual qubits
rather than physical ones. Our goal is to convert $\rho_{\mathrm{VBS}}$ into $\rho_{\mathrm{CS}}$
by local transformations mapping a domain $u.*$ into a single
qubit $u \in C$. The following theorem is a straightforward
generalization of the Verstraete and Cirac construction (here we
put $\Delta/2 = 1$).

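Theorem 1. Let $\rho_{\mathrm{CS}}$ be a thermal cluster state on the 3D
cubic lattice $C$ at a temperature $T = \beta^{-1}$. Consider a thermal
VBS state $\rho_{\mathrm{VBS}}$ as in Eq. (16) such that the weights $\omega_e$ satisfy

$\prod_{v \in N(u)} \omega_{(u,v)} \ge \tanh(\beta)$ for each $u \in C$. (17)

Then $\rho_{\mathrm{VBS}}$ can be converted into $\rho_{\mathrm{CS}}$ by applying a
completely positive transformation $\mathcal{W}_u$ to each domain $u.*$,

$\rho_{\mathrm{CS}} = \mathcal{W}(\rho_{\mathrm{VBS}})$, $\mathcal{W} = \bigotimes_{u \in C} \mathcal{W}_u$. (18)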

Let us first discuss the consequences of this theorem. Note
that each edge $e \in E$ of $\rho_{\mathrm{VBS}}$ carries a two-qubit state

$\rho_e = \frac{1}{4} (I + \omega_e X_1 Z_2)(I + \omega_e Z_1 X_2)$. (19)

The Peres-Horodecki partial transpose criterion [18,19] tells
us that $\rho_e$ is separable if and only if $\omega_e \le \sqrt{2} - 1$. Consider a
bipartite cut of the lattice by a hyperplane of codimension 1
[see Fig. 5(b)]. We can satisfy Eq. (17) by setting
$\omega_e = \tanh(\beta)$ for all edges crossing the cut and setting $\omega_e = 1$ for
all other edges. Clearly, the state $\rho_{\mathrm{VBS}}$ is biseparable whenever
$\tanh(\beta) \le \sqrt{2} - 1$. But biseparability of $\rho_{\mathrm{VBS}}$ implies
biseparability of $\rho_{\mathrm{CS}}$. We conclude that the localizable
entanglement between the regions $A$ and $B$ is zero whenever
$\tanh(\beta) \le \sqrt{2} - 1$, which yields the upper bound on $T_c$
presented earlier.

Remarks. We can also satisfy Eq. (17) by setting $\omega_e = \omega$
for all $e \in E$, with $\omega^6 = \tanh(\beta)$. This choice demonstrates that
$\rho_{\mathrm{CS}}$ is completely separable for $\tanh(\beta) < (\sqrt{2} - 1)^6$ (that is,
$T \approx 200$). It reproduces the upper bound [20] of Dür and
Briegel on the separability threshold error rate for cluster
states.
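
The separability threshold for the edge state of Eq. (19) and the two temperature scales quoted above can be checked numerically. The sketch below is only an illustration and relies on NumPy; the helper names are not from the paper. It builds the two-qubit edge state, takes the partial transpose on the second qubit, and looks at the sign of the smallest eigenvalue. Temperatures are computed in the convention $\Delta/2 = 1$ used in this section, so that $T = 1/\beta$.

```python
import numpy as np

I2 = np.eye(2)
X = np.array([[0.0, 1.0], [1.0, 0.0]])
Z = np.array([[1.0, 0.0], [0.0, -1.0]])

def rho_edge(w):
    """Two-qubit edge state of Eq. (19) for weight w."""
    I4 = np.kron(I2, I2)
    return (I4 + w * np.kron(X, Z)) @ (I4 + w * np.kron(Z, X)) / 4.0

def min_eig_partial_transpose(rho):
    """Smallest eigenvalue of rho after transposing the second qubit."""
    r = rho.reshape(2, 2, 2, 2)         # indices (a, b, a', b')
    r_pt = r.transpose(0, 3, 2, 1)      # exchange b and b'
    return np.linalg.eigvalsh(r_pt.reshape(4, 4)).min()

w_sep = np.sqrt(2.0) - 1.0
for w in (w_sep - 0.01, w_sep, w_sep + 0.01):
    print(f"w = {w:.4f}, min eigenvalue = {min_eig_partial_transpose(rho_edge(w)):+.5f}")

# Temperatures with Delta/2 = 1, i.e. T = 1/beta.
T_bisep = 1.0 / np.arctanh(w_sep)          # tanh(beta) = sqrt(2) - 1
T_fullsep = 1.0 / np.arctanh(w_sep ** 6)   # tanh(beta) = (sqrt(2) - 1)^6
print(f"T_bisep = {T_bisep:.3f} (= {T_bisep / 2:.3f} Delta), T_fullsep = {T_fullsep:.1f}")
```

The smallest eigenvalue changes sign at $\omega_e = \sqrt{2} - 1$, as the partial transpose criterion requires. In units of $\Delta$ the first temperature is about $1.13$, consistent with the upper bound on $T_c$ quoted in the conclusion, and the second is close to the value $T \approx 200$ given in the remark.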

In the remainder of this section we prove Theorem 1.
Consider an algebra $\mathcal{A}_u$ of operators acting on some particular
domain $u.*$. It is generated by the Pauli operators $Z_{u.v}$
and $X_{u.v}$ with $v \in N(u)$. The transformation $\mathcal{W}_u$ maps $\mathcal{A}_u$ into
the one-qubit algebra generated by the Pauli operators $Z_u$ and
$X_u$. First, we choose

$\mathcal{W}_u(\eta) = W_u^\dagger \eta W_u$, $W_u = |0^{\otimes d(u)}\rangle\langle 0| + |1^{\otimes d(u)}\rangle\langle 1|$.

One can easily check that

$W_u^\dagger Z_{u.v} = Z_u W_u^\dagger$ and $Z_{u.v} W_u = W_u Z_u$, (20)

for any $v \in N(u)$. As for commutation relations between $W_u$
and $X_{u.v}$ one has

$W_u^\dagger \Bigl( \prod_{v \in N(u)} X_{u.v} \Bigr) W_u = X_u$,

$W_u^\dagger \Bigl( \prod_{v \in S} X_{u.v} \Bigr) W_u = 0$, (21)

for any nonempty proper subset $S \subset N(u)$. Taking
$\mathcal{W} = \bigotimes_{u \in C} \mathcal{W}_u$ and using Eqs. (20) and (21) one can easily get

$\mathcal{W}(\rho_{\mathrm{VBS}}) \propto \prod_{u \in C} (I + \eta_u K_u)$, $\eta_u = \prod_{v \in N(u)} \omega_{(u,v)}$. (22)

We can regard the state in Eq. (22) as a thermal cluster state
with a local temperature $\tanh(\beta_u) = \eta_u$ depending upon $u$.
The inequality of Eq. (17) implies that $\beta_u \ge \beta$ for all $u$. To
achieve a uniform temperature distribution $\beta_u = \beta$ one can
intentionally apply local $Z$ errors with properly chosen
probabilities.
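
The operator identities used in the proof can be verified directly for a small domain. The sketch below is an illustration only: it uses NumPy, takes a domain of three virtual qubits instead of the six of a bulk site, and the helper names are hypothetical. It represents $W_u = |0\ldots0\rangle\langle 0| + |1\ldots1\rangle\langle 1|$ as a rectangular matrix and checks the second relation of Eq. (20) and both relations of Eq. (21).

```python
from functools import reduce

import numpy as np

I2 = np.eye(2)
X = np.array([[0.0, 1.0], [1.0, 0.0]])
Z = np.array([[1.0, 0.0], [0.0, -1.0]])

def domain_operator(d):
    """W = |0...0><0| + |1...1><1| as a (2**d x 2) matrix."""
    W = np.zeros((2 ** d, 2))
    W[0, 0] = 1.0      # |00...0><0|
    W[-1, 1] = 1.0     # |11...1><1|
    return W

def on_virtual(op, subset, d):
    """Apply `op` to the virtual qubits in `subset`, identity on the others."""
    return reduce(np.kron, [op if k in subset else I2 for k in range(d)])

d = 3                      # small domain, chosen only to keep matrices tiny
W = domain_operator(d)     # W is real, so W.T is its adjoint

all_x = W.T @ on_virtual(X, {0, 1, 2}, d) @ W   # X on every virtual qubit
some_x = W.T @ on_virtual(X, {0, 2}, d) @ W     # X on a proper subset
z_rel = on_virtual(Z, {1}, d) @ W - W @ Z       # Z_{u.v} W - W Z_u

print(np.allclose(all_x, X), np.allclose(some_x, 0.0), np.allclose(z_rel, 0.0))
```

Only the product of $X$ over the whole domain survives the projection, which is why the expansion of $\rho_{\mathrm{VBS}}$ collapses to products of the cluster stabilizers $K_u$ weighted by $\eta_u$.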


IV.  CONCLUSION 

Thermal cluster states in three dimensions exhibit a transition
from infinite to finite entanglement length at a nonzero
transition temperature $T_c$. We have given a lower and an
upper bound to $T_c$, $0.3\,\Delta \le T_c \le 1.15\,\Delta$ ($\Delta$ is the energy gap of
the Hamiltonian). The reason for $T_c$ being nonzero is an
intrinsic error-correction capability of 3D cluster states. We
have devised an explicit measurement pattern that establishes
a connection between cluster states and surface codes. Using
this, we have described how to create a Bell state of far
separated encoded qubits in the low-temperature regime
$T < 0.3\,\Delta$, making the entanglement contained in the initial
thermal state accessible for quantum communication and
computation.
Many proposals for quantum information processing require precise control over the
motion of neutral atoms, as in the manipulation of coherent matter waves or the
confinement and localization of individual atoms. Patterns of micron-sized wires, fabricated
lithographically on a flat substrate, can conveniently produce large magnetic-field
gradients and curvatures to trap cold atoms and to facilitate the production of Bose-Einstein
condensates. The intent of this paper is to provide the researcher who has access to a
standard clean-room enough information to design and fabricate such devices.

Keywords :  atom  optics,  quantum  computation,  magnetic  microtraps 
Communicated  by:  D  Wineland  &  E  Polzik 


1  Introduction 

Cold samples of neutral atoms and Bose-Einstein condensates have become readily available
using the techniques of laser cooling and trapping [1], and it has been widely recognized that
cold atoms are a rich resource for experiments in quantum information science. For many
proposals, however, quantum control of the atomic motional degrees of freedom is essential. For
example, quantum computation in a cavity QED setting or through controlled cold collisions
requires the ability to trap and control single atoms in the Lamb-Dicke regime [2, 3, 4]. In
1995, Weinstein and Libbrecht noted that micron-sized wires, fabricated on a substrate, are
capable of producing the large magnetic field gradients and curvatures required for trapping
atoms in the Lamb-Dicke regime [5]. Westervelt et al., in 1998, succeeded in fabricating the
wire patterns used in the trap designs of Weinstein and Libbrecht [6]. These microwire
devices, now commonly known as atom chips [7], have been used to great success in atom optics
and in the production of Bose-Einstein condensates (BEC), and are promising tools not just
for quantum computation, but for atom interferometry, cavity QED, and the study of cold
collisions as well [8, 7, 9, 10]. In this paper we describe techniques, which have been adapted
from the standard lore of microfabrication, for fabricating this increasingly important tool for
atomic physics and quantum optics.

Atom optical elements, such as mirrors, waveguides, splitters, traps, and conveyor belts,
have been demonstrated using atom chips [11, 12, 13, 14, 15, 16, 17]. Cesium cold collisions in
the presence of light have been studied using a magnetic microtrap [18], and the use of fiber
gap [19, 20] and microsphere cavities [20] for on-chip atom detection is being explored. Ion
trap experiments are now using substrates with microfabricated electric pads for the purpose
of controlling ion position [21, 22].

On-chip production of a BEC has been one of the most successful uses of the atom chip
thus far [23, 24, 25]. Ioffe traps formed from microwires can produce extremely large trap
compressions that enhance the efficiency of evaporative cooling. Consequently, condensate
production time can be reduced from one minute to approximately ten seconds [23], and
MOT loading can occur from a thermal vapor in a glass cell with a vacuum of only a few
$10^{-10}$ Torr. All of the required magnetic fields can be produced on-chip [26, 27], removing
the necessity of large, high power external coils. The atom chip greatly miniaturizes BEC
production and will enable the integration of matter waves with chip-based atom optics and
photonics.

Another exciting avenue of research involves the use of an atom chip to trap, in the Lamb-Dicke
regime, one or more atoms in the mode of a high finesse cavity. The combination
of  magnetic  microtraps  and  photonic  bandgap  (PBG)  cavities  would  be  an  excellent  cavity 
QED  system  for  the  implementation  of  scalable  quantum  computation,  or  for  the  study  of 
continuous  measurement  and  quantum-limited  feedback.  One  technical  proposal  involves  the 
integration  of  a  PBG  cavity  with  an  Ioffe  trap  formed  from  microwires  patterned  on  the  same 
surface  [28].  The  combination  of  small  mode  volume  and  modest  optical  quality  factor  that 
should  be  obtainable  with  PBG  structures  would  enable  strong  atom-cavity  coupling.  This 
would  be  an  interesting  alternative  to  present  experiments  that  utilize  a  Far  Off  Resonance 
Trap  (FORT)  to  confine  atoms  inside  optical  Fabry-Perot  cavities  [29].  Several  PBG  cavities, 
each  with  an  independent  microwire  trap,  could  be  fabricated  on  the  same  substrate  and 
coupled  together  with  a  network  of  line-defect  optical  waveguides. 

Atom chips exploit the interaction potential, $V = -\vec{\mu} \cdot \vec{B}$, between an atom's magnetic
moment, $\vec{\mu}$, and a wire's magnetic field, $\vec{B}$, to trap or guide weak-field seeking states of a neutral
atom. In general, the field's magnitude, gradient, and curvature scale as $I/r$, $I/r^2$, and $I/r^3$,
respectively, where $I$ is the wire's current and $r$ is its characteristic dimension. Microscopic
wire patterns maximize field gradients and curvatures while keeping power dissipation to a
minimum. Experiments involve ultra-high vacuum chambers wherein atoms are trapped and
cooled in the vicinity of the atom chip's confining magnetic potentials.
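
The scaling of field, gradient, and curvature with current and distance can be made concrete with the simplest geometry, an infinitely long thin straight wire. This geometry is an assumption made for illustration; the actual chip patterns are more complicated, so the numbers below indicate orders of magnitude only.

```python
import math

MU_0 = 4.0e-7 * math.pi   # vacuum permeability, T*m/A

def wire_field(current, r):
    """Field magnitude (T) at distance r (m) from a long thin wire."""
    return MU_0 * current / (2.0 * math.pi * r)

def wire_gradient(current, r):
    """Magnitude of dB/dr (T/m) at distance r."""
    return MU_0 * current / (2.0 * math.pi * r ** 2)

def wire_curvature(current, r):
    """Second derivative d^2B/dr^2 (T/m^2) at distance r."""
    return MU_0 * current / (math.pi * r ** 3)

current = 1.0  # A
for r in (1e-3, 1e-4, 1e-5):   # 1 mm, 100 um, 10 um
    field_G = wire_field(current, r) * 1e4          # T -> G
    grad_G_cm = wire_gradient(current, r) * 1e2     # T/m -> G/cm
    curv_G_cm2 = wire_curvature(current, r)         # T/m^2 equals G/cm^2
    print(f"r = {r:.0e} m: B = {field_G:.1f} G, "
          f"B' = {grad_G_cm:.2e} G/cm, B'' = {curv_G_cm2:.2e} G/cm^2")
```

Reducing the distance by a factor of ten at fixed current raises the field by 10, the gradient by 100, and the curvature by 1000, which is why micron-scale wires reach curvatures that macroscopic coils cannot.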

2  Fabrication  Challenges  and  Constraints 

Fabrication of atom chips poses several challenges in addition to those encountered in standard
photolithography [30]. Many applications require the wires to be a couple of microns wide by a
few microns tall and spaced only a few microns from one another. One micron resolution is
near the limit of standard photolithography, and much care must be taken to accurately
produce these micron-sized wires. Wires with widths much less than a micron, though perhaps
important for realizing potentials with sub-micron scale features, are of limited usefulness
for creating large magnetic field gradients and curvatures since they become limited to the
same maximum current density as micron-sized wires [31]. Further fabrication complications
arise from the need to trap the atoms near the substrate's surface, and the need to connect
the microwires to macroscopic leads without blocking optical access. A common technique
for trapping atoms near the substrate surface, the mirror magneto-optical trap (MMOT),
requires that this surface be an optical mirror as well as the support surface for the microwires
(see figure 1) [8]. The substrate surface needs to be larger than 5 to 10 cm$^2$ to accommodate
the reflected trapping beams as well as to allow the pads for macroscopic wire contacts to
be outside of the mirror area and not blocking the optical access needed for the trapping,
imaging, and pumping beams. Consequently, the wire pattern must be flawless over an
exceptionally large surface area: during fabrication one must be extremely careful that no dust
or surface defects break or short the wires.


Fig.  1.  Diagram  of  the  mirror  MOT  experimental  set-up.  A  quadrupole  field  and  two  45°  laser 
beams  and  one  retroreflected  grazing  beam  (perpendicular  to  the  page  and  not  shown)  form  a 
MOT  1.5  to  4  mm  above  the  substrate. 

The major fabrication challenge lies in increasing the height of the wires to a few microns.
Even the smallest wires need to support up to an amp of current, and consequently, the
cross-sectional area of the wire must be maximized. This reduces wire resistance and limits the
heating that causes wire breakdown. Moreover, attention must be paid to the thermal
conductivity of the substrate and mounting system to ensure sufficient power dissipation. Sapphire
or polished aluminum nitride (AlN) substrates provide sufficient thermal conductivity, but
are slightly trickier to use for fabrication than more standard substrates.

The use of microwires to create an Ioffe trap illustrates these challenges. The wire pattern
shown in figures 2(a) and (b) creates a 3D harmonic trap when combined with a perpendicular
homogeneous bias field [5]. Unlike a quadrupole trap, the Ioffe trap has a non-zero field at
the trap center and thus does not suffer from Majorana spin-flip losses. An atom is confined
within the Lamb-Dicke regime when its recoil energy is less than the trap's vibrational level
spacing ($\eta = (E_{\mathrm{recoil}}/E_{\mathrm{vib}})^{1/2} < 1$), and for a cesium atom this occurs when the trap curvature
exceeds $2 \times 10^6$ G/cm$^2$. To achieve this extremely large field curvature in all three dimensions,
the radius of the wire pattern in figure 2(a) must be smaller than ~30 μm. For a trap of
inner radius 10 μm, outer radius 15 μm, and wire current $I = 1$ A, the curvature and
Lamb-Dicke parameter, $\eta$, at the center of the trap in the axis perpendicular (plane parallel) to the
substrate are $2 \times 10^8$ G/cm$^2$ ($2 \times 10^{10}$ G/cm$^2$) and $\eta = 0.38$ ($\eta = 0.11$). The closely spaced
wires can only be a few microns wide, and even if fabricated to a height of 2 to 4 microns, the
wires would need to support the large current density of ~$10^{11}$ A/m$^2$. The accommodation
of laser beams for atom cooling, loading, and imaging constrains and complicates the atom
chip's design. The trap minimum is only 7 μm from the substrate's surface, and the mirror
patterned on the surface for use with the MMOT must neither short the Ioffe wires nor extend
more than ~5 μm from the surface. The following sections describe the necessary fabrication
tools and the techniques we use to overcome these challenges.
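
The Lamb-Dicke parameters and the current density quoted above can be estimated with a short calculation. The sketch rests on several assumptions that are not stated in the text: the atom's magnetic moment is taken as one Bohr magneton, the potential is $\mu B_0 + \tfrac{1}{2}\mu B'' x^2$ so that the trap frequency is $\sqrt{\mu B''/m}$, the recoil energy is evaluated for the cesium D2 line near 852 nm, and the wire cross section of 3 μm by 3 μm is a representative value within the ranges mentioned.

```python
import math

HBAR = 1.054571817e-34       # J*s
MU_B = 9.2740100783e-24      # Bohr magneton, J/T (assumed magnetic moment)
AMU = 1.66053906660e-27      # kg
M_CS = 132.905 * AMU         # mass of cesium-133, kg
LAMBDA_D2 = 852.35e-9        # cesium D2 wavelength, m (assumed transition)

def lamb_dicke(curvature_G_per_cm2, moment=MU_B):
    """eta = sqrt(E_recoil / E_vib) for a harmonic magnetic trap."""
    curvature = curvature_G_per_cm2            # 1 G/cm^2 equals 1 T/m^2
    omega_trap = math.sqrt(moment * curvature / M_CS)
    k = 2.0 * math.pi / LAMBDA_D2
    omega_recoil = HBAR * k ** 2 / (2.0 * M_CS)
    return math.sqrt(omega_recoil / omega_trap)

def current_density(current, width, height):
    """Current density (A/m^2) for a rectangular wire cross section."""
    return current / (width * height)

for curvature in (2e6, 2e8, 2e10):             # G/cm^2
    print(f"B'' = {curvature:.0e} G/cm^2 -> eta = {lamb_dicke(curvature):.2f}")

# Assumed cross section: 3 um wide by 3 um tall, carrying 1 A.
print(f"J = {current_density(1.0, 3e-6, 3e-6):.2e} A/m^2")
```

Under these assumptions the estimate gives $\eta$ close to the quoted 0.38 and 0.11 for the two curvatures of the example trap, $\eta$ of order one at $2 \times 10^6$ G/cm$^2$, and a current density near $10^{11}$ A/m$^2$.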


Fig. 2. The planar Weinstein and Libbrecht-style Ioffe trap. (a) When combined with an opposing
bias field, this wire pattern produces a 3D harmonic potential above the substrate with a non-zero
field at the trap center [5]. (b) A planar Ioffe trap with an on-chip bias coil fabricated with gold on
sapphire using the lift-off method. In the sample shown here, the wire height is 1.5 μm and the
minimum wire width is 10 μm. The gold between the wires forms a mirror for creating a mirror
MOT.


3  The  Elements  of  Atom  Chip  Fabrication 

Microfabrication  is  a  labor  intensive  process,  often  involving  several  weeks  of  trial  and  error 
to  perfect  the  fabrication  recipe.  However,  once  the  process  works,  five  to  ten  atom  chips 
can  be  produced  over  a  span  of  two  to  three  days.  The  intent  of  this  paper  is  to  provide 
the  researcher  who  has  access  to  a  standard  clean-room  enough  information  to  design  and 
fabricate  an  atom  chip.  We  will  describe  the  use  of  fabrication  instruments  and  techniques 
only  insofar  as  they  are  relevant  to  atom  chips.  Fabrication  is  not  an  exact  science,  and  the 
techniques  described  here  may  not  be  optimal,  but  nevertheless  have  proven  successful  for  the 
chips  we  have  fabricated. 

In photolithography, UV light shone through a photomask casts shadows onto photoresist,
a light sensitive polymer, which is coated on the surface of the substrate. Either positive or
negative photoresist may be used, with the primary difference being that exposed areas of
positive photoresist are removed after developing whereas exposed areas remain in a process
using negative photoresist. The various fabrication techniques differ in how the wire metal
and photoresist are used to create the wire patterns. For instance, the wire metal may be
either thermally evaporated into the trenches created in the photoresist, or grown upward
through the trenches by electroplating onto a seed metallic layer underneath the photoresist.
The photoresist and unwanted metal are removed leaving only the desired wire pattern.
Generally, chip fabrication consists of six steps: creating a photomask containing the desired wire
pattern, using photolithography to transfer the wire pattern to photoresist on a substrate,
thermally evaporating wire material, increasing the wire height, preparing the surface
mirror, and making contacts to macroscopic wires. The details and exact order of these steps
vary depending on the specific requirements of the microwire pattern to be fabricated. For
instance, wires wider than 30 μm or less than one micron in height may be fabricated with a
much simpler technique than thinner or taller wires. This section discusses the steps common
to all techniques. Procedures required to increase the wire's thickness pertain to individual
fabrication techniques and will be discussed in the next section.

3.1  The  photomask 

The  photomask  is  typically  a  10  cm  square  piece  of  glass  or  transparent  plastic  on  which  is 
printed  a  positive  or  negative  1:1  image  of  the  wire  pattern.  Wire  patterns  with  widths  or 
spacings less than ~ 30 µm require a professionally made chrome mask: one in which the
pattern  is  written  with  chromium  on  a  glass  plate.  We  have  used  the  company  Photronics, 
Inc.  (telephone  619-992-8467)  to  make  photomasks  from  AutoCAD  drawings.  Much  care 
must  be  taken  in  producing  the  AutoCAD  files  since  not  all  functions  are  properly  converted 
to  the  company’s  file  format.  These  masks  are  quite  expensive,  costing  between  $600  and 
$800,  but  have  sub-micron  resolution  and  are  typically  shipped  within  a  week.  It  is  possible 
to purchase a laser writer to produce in-house photomasks with resolution down to 0.8 µm.
This  can  be  a  cost  effective  alternative  to  purchasing  individual  masks  from  companies. 

Many  commercial  printing  shops  are  capable  of  printing  overhead  transparencies  with 
high  enough  resolution  to  serve  as  photomasks  for  wire  patterns  with  features  larger  than 
~ 30 µm. The line edges are granular on a scale of a few microns, and the UV exposure
time  must  be  adjusted  to  account  for  the  ink  not  being  perfectly  opaque.  However,  the  one 
day  turn-around,  low  cost  of  ~  $20,  and  ease  of  file  preparation — only  an  .eps  file  is  typically 
needed — make  the  transparency  photomask  quite  an  attractive  alternative  for  large  features. 

3.2 The substrate

As  mentioned  earlier,  the  substrate  material  for  the  atom  chip  should  be  carefully  chosen: 
it  must  be  electrically  insulating,  highly  polished,  insusceptible  to  fractures  upon  localized 
heating, and an excellent thermal conductor. We have found that both sapphire and AlN
substrates satisfy these requirements. Sapphire substrates 0.5 mm to 2 mm thick with surface
areas of several cm² may be purchased from companies such as Meller Optics, Inc. (telephone
800-821-0180) for $30 to $40 apiece. A surface quality of 80-50 scratch-dig is sufficient for
fabrication. The thermal conductivity of AlN, ~ 170-180 W m⁻¹ K⁻¹ at 20 °C, is ~ 4.5
times higher than that of sapphire [31]. We measured that the maximum current density supported
by microwires on AlN, ~ 2 x 10¹¹ A/m², is a factor of two greater than for microwires
patterned on sapphire. This was measured using electroplated gold wires of varying
cross-sections patterned exactly the same way on both AlN and sapphire substrates. Specifically,
we used several 3 µm and 20 µm wide wires whose heights ranged from one to three microns.
The  substrates  were  glued  to  room  temperature  copper  blocks  using  EPO-TEK  H77  (Epoxy 
Technology,  telephone  978-667-3805),  a  thermally  conductive  epoxy. 
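
As a rough illustration of what these current densities imply, the sketch below converts a wire cross-section into a maximum current. It assumes a rectangular wire and a current-density limit that does not depend on the cross-section, which is a simplification. The AlN limit is the measured value quoted above, while the sapphire limit is inferred from the stated factor of two. The function name is hypothetical.

```python
# Approximate current-density limits in A/m^2.
# AlN: value quoted in the text. Sapphire: inferred from the factor of two.
J_MAX_A_PER_M2 = {"AlN": 2e11, "sapphire": 1e11}


def max_wire_current_a(width_um, height_um, substrate="AlN"):
    """Estimate the maximum current (in A) for a rectangular microwire."""
    if width_um <= 0 or height_um <= 0:
        raise ValueError("wire dimensions must be positive")
    area_m2 = (width_um * 1e-6) * (height_um * 1e-6)
    return J_MAX_A_PER_M2[substrate] * area_m2


# A 3 um wide, 1 um tall wire on AlN: about 0.6 A.
print(max_wire_current_a(3, 1, "AlN"))
# A 20 um wide, 3 um tall wire on sapphire: about 6 A.
print(max_wire_current_a(20, 3, "sapphire"))
```

These numbers are upper bounds for wires heat-sunk as in the measurement described above; a practical design would presumably stay well below them.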

Sapphire  substrates  are  easier  to  use  for  fabrication  because  their  transparency  allows 
one  to  detect  and  avoid  defects  and  dust  during  the  photolithography  process.  Polished 
AlN substrates may be purchased in bulk for less than ~ $75, and unlike sapphire, AlN
substrates can be cleaved with a diamond scorer to any shape desired. The polished AlN still
has a considerable amount of surface roughness (one micron wide plateaus a few hundred
nanometers tall are typical), but we found that it is nevertheless possible to fabricate perfect
three micron wide wires on this surface, spaced less than three microns from one another. The
surface  bumps  simply  map  directly  onto  the  upper  surface  of  the  wires. 

3.3 Substrate cleaning

Before  the  photolithography  process  may  begin,  the  surface  of  the  substrate  must  be  cleaned 
to  remove  all  organic  material  and  dust.  Although  some  of  the  following  steps  may  seem 
unnecessary  and  “overkill,”  investing  the  time  to  thoroughly  clean  minimizes  the  chance  that 
after  many  hours  of  work,  one  discovers  that  a  piece  of  dirt  has  broken  or  shorted  a  wire. 
The  first  step  is  to  immerse  the  substrate  in  a  beaker  of  “piranha  etch,”  sulfuric  acid  and 
hydrogen  peroxide  in  a  10:1  volume  ratio  brought  to  100° C  on  a  hot  plate  for  ~  5  min. 
Teflon  coated,  flat  tipped  tweezers  are  ideal  for  manipulating  substrates.  After  the  etch,  the 
substrate should be placed in a beaker of acetone, heated again to 100 °C for a few minutes,
and finally inserted into an ultrasound cleaner for a few more minutes. In extreme cases of
substrate  grime,  a  cotton  tipped  dowel  can  be  used  to  manually  wipe  away  the  dirt.  Acetone 
leaves  a  thin  film — and  sometimes  even  particulate — when  allowed  to  dry  on  a  substrate’s 
surface.  It  is  imperative  that  one  spray  isopropanol  (IPA)  onto  the  substrate  as  it  is  removed 
from  the  acetone  bath.  This  rinses  the  surface  of  acetone  and  wets  it  with  IPA  which  does  not 
quickly  dry.  The  substrate  must  then  be  rinsed  with  methanol,  which  is  relatively  clean  and 
does not leave a film, and quickly blown dry with an air or nitrogen gun. It is crucial that the
air jet be aimed almost parallel to the surface so that the methanol is blown off rather than
dried  on  the  substrate.  When  done  correctly,  the  only  remaining  dirt  particles  will  be  along 
the  edge  of  the  substrate  that  is  downwind  of  the  air  jet,  and  not  in  the  center  fabrication 
region.  If  the  substrate  is  reasonably  clean  after  the  piranha  etch,  then  the  acetone  step 
(which  may  actually  add  some  dirt  particulate)  may  be  skipped,  and  the  substrate  should 
instead  be  immersed  in  IPA  and  placed  inside  an  ultrasound  cleaner. 

3.4  Thermal  evaporation 

Certain  fabrication  techniques,  to  be  discussed  below,  require  that  a  100  nm  metal  layer  be 
thermally  evaporated  before  coating  the  surface  with  photoresist.  We  take  this  opportunity 
to  discuss  the  thermal  evaporation  process.  We  use  gold  for  the  wires  because  of  its  high 
electrical  conductivity,  resistance  to  corrosion,  and  ease  of  evaporation,  electroplating,  and 
wet  etching.  To  successfully  deposit  gold  on  a  substrate’s  surface,  one  must  first  evaporate  a 
50 Å metallic layer that promotes adhesion between the gold and the sapphire or AlN. We
typically  use  chromium,  but  titanium  may  also  be  used.  The  magnetic  effects  from  the  thin 
layer  of  chromium  are  negligible.  In  a  thermal  evaporator,  the  substrate  is  mounted  in  a 
vacuum  chamber  facing  a  tungsten  crucible  positioned  a  few  tens  of  centimeters  below.  The 
crucible, known as a boat, can hold 10 to 20 pieces of ~ 2 mm long and 0.5 mm diameter
gold  wire.  Current  flows  through  the  boat,  melting  the  gold  and  spewing  it  upwards  toward 
the  substrate.  A  calibrated  crystal  monitor  measures  the  deposition  rate.  One  to  two  boats 
are  sufficient  to  deposit  100  to  200  nm  of  gold,  and  this  costs  $10  to  $15  per  boat.  There  are 
typically  only  four  sets  of  electrical  feedthroughs  in  the  evaporator’s  vacuum  chamber,  and 
to  deposit  more  gold,  one  needs  to  bring  the  chamber  up  to  atmosphere,  reload  the  boats 
with gold, and pump back down to base pressure, a process that takes about an hour. The
substrate mounting area allows several substrates to be coated at once. Evaporating less than
1 µm of gold is reasonable, but depositing more than 1 µm becomes too expensive and time
consuming,  and  the  quality  of  the  gold  surface  begins  to  diminish.  Moreover,  the  vacuum 
chamber  eventually  becomes  hot  which  may  result  in  the  failure  of  the  crystal  monitor  or  the 
burning  of  photoresist. 
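
The cost and time scaling described above can be made concrete with a small planning helper. This is only a sketch under stated assumptions: roughly 100 nm of gold per boat (taken from "one to two boats" for 100 to 200 nm), one boat per set of feedthroughs, all four sets used for gold, and one hour per vent-and-reload cycle. The function and parameter names are hypothetical.

```python
import math


def plan_evaporation(thickness_nm, nm_per_boat=100.0, boats_per_load=4,
                     cost_per_boat_usd=(10.0, 15.0), reload_hours=1.0):
    """Estimate boats, chamber reloads, and gold cost for a target thickness."""
    if thickness_nm <= 0:
        raise ValueError("thickness must be positive")
    boats = math.ceil(thickness_nm / nm_per_boat)
    # Each load holds at most boats_per_load boats. Reserving one set of
    # feedthroughs for the chromium adhesion layer would reduce this to 3.
    loads = math.ceil(boats / boats_per_load)
    reloads = loads - 1
    low, high = cost_per_boat_usd
    return {
        "boats": boats,
        "reloads": reloads,
        "reload_time_h": reloads * reload_hours,
        "cost_usd": (boats * low, boats * high),
    }


# A 150 nm seed layer fits in one load; 1 um of gold needs two reloads.
print(plan_evaporation(150))
print(plan_evaporation(1000))
```

Under these assumptions a 1 µm layer needs ten boats and two reload cycles, which is consistent with the remark that thicker evaporated layers quickly become impractical.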

3.5  Photoresist  spinning  and  baking 

Photoresist does not always adhere well to the substrate’s surface. Before coating with photoresist,
the substrate should be baked on a hot plate at ~ 150 °C for a few minutes to remove
surface moisture. Hexamethyldisilazane (HMDS) should be used with sapphire substrates to
promote adhesion (this is unnecessary for AlN). Only a few monolayers of HMDS are required:
after  baking,  place  the  sapphire  in  a  dish  next  to  several  drops  of  HMDS  and  cover  for  a  few 
minutes.  Note  that  both  HMDS  and  photoresist  are  carcinogenic  and  should  be  handled  with 
care. 

Spinning photoresist onto a substrate is a relatively straightforward process. The substrate,
with beads of photoresist dripped onto its surface, is spun by a vacuum chuck to a
few thousand rpm for several tens of seconds. A faster rotation results in a thinner film of
photoresist. Typically, a film thickness of a few microns is possible with standard photoresists,
and there exist special resists that are four to twenty microns thick. These thick resists
are  often  important  for  making  tall  wire  structures.  The  thickness  of  a  photoresist  may  be 
increased  beyond  its  specification  by  dripping  resist  onto  its  surface  during  rotation.  After 
spin-coating,  the  photoresist  needs  to  be  baked  on  a  hot  plate  to  prepare  the  polymer  for 
UV  exposure.  The  exact  temperature  and  bake  duration  are  often  crucial  to  the  success  of 
the  fabrication.  We  would  like  to  note  that  it  is  possible  to  layer  microwire  patterns  on  top 
of  one  another  by  fabricating  each  new  wire  layer  on  top  of  a  spin-coated  insulator  such  as 
polyimide  [32]. 
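
The dependence of film thickness on spin speed can be estimated from a single calibration point. The sketch below assumes the common rule of thumb that thickness scales as the inverse square root of the spin speed. That scaling is an assumption, not a result from this paper, and the calibration values in the example are placeholders to be replaced with numbers from the resist data sheet or a profilometer measurement.

```python
def spin_thickness_um(rpm, ref_rpm, ref_thickness_um, exponent=0.5):
    """Scale a measured film thickness to a new spin speed.

    Assumes thickness is proportional to rpm**(-exponent); exponent=0.5 is a
    rule of thumb and should be checked for the resist actually used.
    """
    if rpm <= 0 or ref_rpm <= 0:
        raise ValueError("spin speeds must be positive")
    return ref_thickness_um * (ref_rpm / rpm) ** exponent


# Placeholder calibration: 1.4 um measured at 4000 rpm.
for rpm in (2000, 4000, 5000):
    print(rpm, round(spin_thickness_um(rpm, 4000, 1.4), 2))
```

A calibration of this kind is only a starting point, since bake conditions and resist age also affect the final thickness.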

3.6  UV  exposure 

The  central  step  in  photolithography  is  the  UV  exposure  of  the  photoresist.  An  instrument 
known as a mask aligner allows one to accurately position the photomask flush to the substrate’s
photoresist-coated surface, and a built-in UV lamp exposes the photoresist for a
specified  amount  of  time.  Essential  for  photomask  and  substrate  registration  is  an  optical 
microscope  mounted  on  the  mask  aligner.  This  enables  one  to  simultaneously  view  the  wire 
patterns  on  the  mask  and  the  underlying  substrate.  Dust  particles  or  scratches  often  remain 
on  the  substrate  even  after  a  thorough  cleaning.  If  these  defects  are  sparse,  then  the  substrate 
may  be  translated  such  that  the  wires  avoid  all  defects.  Aligning  the  chip’s  wire  pads  along 
one  or  more  edges  of  the  substrate  further  constrains  the  relative  position  of  the  photomask  to 
the  substrate.  It  should  be  noted  that  it  is  difficult  to  properly  develop  the  pads  (or  other  wire 
features)  less  than  a  millimeter  from  the  edge  due  to  photoresist  beading.  Certain  fabrication 
recipes  require  the  photoresist  to  be  baked  and  exposed  again  before  developing. 

It  is  good  practice  to  clean  the  chrome  photomasks  after  every  use.  Photoresist  can  stick 
to the surface, and if left for days, will produce hard-to-remove specks that can block the UV
light,  creating  unwanted  features  or  breaks  in  the  patterned  wires.  Immersing  in  a  dish  of 
acetone  and  rinsing  with  IPA  and  methanol  is  sufficient  for  routine  cleaning.  Some  chrome 
masks  can  withstand  ultrasound  cleaning  as  well  as  being  wiped  with  a  soft,  lint-free  cloth, 
and  this  seems  to  be  the  only  way  to  remove  encrusted  grime  or  particulate. 

3.7 Developing

To remove the photoresist regions defined by the UV exposure, the substrate must be immersed
and slightly agitated in a beaker of developer for a few tens of seconds, followed by a
water rinse. The exact developing time depends on the previous fabrication steps, but it is
generally possible, especially with the transparent sapphire substrates, to see a characteristic
change in opacity of the photoresist as it becomes fully developed. For instance, when using
a positive process, one first sees the exposed photoresist turn hazy, revealing the wire pattern.
After a few seconds, the hazy region sloughs off, exposing the bare substrate and leaving
darker, patterned regions of photoresist. If a mistake is made at any point in the photolithography
process, the substrate can be reused by removing the photoresist in a beaker of acetone
and  cleaning  the  substrate  as  mentioned  above,  starting  with  the  ultrasound. 

3.8  Ozone  dry  stripping 

Certain  fabrication  processes  require  the  substrate  surface  to  be  etched  in  an  ozone  dry 
stripper.  This  uses  UV  light,  ozone,  and  heat  to  remove  thin  films  of  unwanted  organic 
material,  photoresist,  or  HMDS  that  may  prevent  the  deposition  of  thermally  evaporated  or 
electroplated  gold. 

3.9 Wire contacts

Wire  bonding  and  ultrasonic  fluxless  soldering  are  useful  methods  for  attaching  macroscopic 
wires  to  the  substrate’s  contact  pads.  Wire  bonding  is  the  standard  method  for  making 
contacts  to  micro-  or  nanofabricated  devices.  The  wire  bonder  attaches  each  end  of  a  thin 
thread  of  gold  wire  to  a  pad  using  a  heated,  ultrasonically  vibrating  tip.  The  thin  wire  may  be 
stretched  over  several  millimeters  between  the  pad  on  the  substrate  and  a  pad  on  the  substrate 
support  structure.  The  pads  on  the  support  structure  may  then  be  connected  to  standard  wire 
contact  pins.  Because  the  wire  threads  are  prone  to  break  and  cannot  individually  support 
more  than  a  few  hundred  mA  of  current,  it  is  necessary  to  make  several  redundant  bonds  per 
pad.  This  process  can  be  quite  time  consuming.  As  an  alternative,  ultrasonic  soldering  irons 
are  capable  of  attaching  regular  wires  to  sapphire  or  AIN  using  fluxless  solder.  Attaching 
wires  is  nearly  as  simple  as  standard  soldering,  and  the  fluxless  solder  is  vacuum  compatible 
to at least 10⁻⁹ Torr. Unfortunately, the solder material forms mounds on the substrate’s
surface  that  can  limit  optical  access. 
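
The number of redundant wire bonds per pad follows from the current the microwire must carry. The sketch below assumes a per-bond limit of 0.2 A as a stand-in for "a few hundred mA" and adds one spare bond to cover breakage. Both numbers, and the function name, are illustrative assumptions.

```python
import math


def bonds_per_pad(wire_current_a, max_current_per_bond_a=0.2, spare_bonds=1):
    """Return the number of wire bonds to place on one contact pad."""
    if wire_current_a < 0 or max_current_per_bond_a <= 0:
        raise ValueError("currents must be non-negative and the limit positive")
    # The small offset guards against floating-point round-up, e.g. 3.0000001.
    needed = math.ceil(wire_current_a / max_current_per_bond_a - 1e-9)
    return max(1, needed) + spare_bonds


# A wire carrying 1 A with bonds rated at 0.25 A each: four bonds plus a spare.
print(bonds_per_pad(1.0, max_current_per_bond_a=0.25))
```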

3.10  The  mirror 

Finally,  we  would  like  to  discuss  methods  for  making  the  atom  chip’s  surface  mirror-like. 
The  most  straightforward  method  involves  simply  patterning  gold  on  the  entire  chip’s  surface 
except for thin, > 10 µm wide gaps around the actual wires [7]. This technique does not
add  any  additional  steps  to  the  fabrication  procedure,  but  it  does  increase  the  likelihood  that 
surface  defects  will  short  the  wires  through  contacts  to  the  large  mirrored  areas.  The  mirror 
gaps  that  define  the  wires  imprint  defects  onto  the  reflected  mirror  MOT  beams,  but  we  have 
nevertheless  been  able  to  trap  more  than  a  million  cesium  atoms  with  this  less  than  perfect 
mirror.  Another  technique  involves  coating  the  chip’s  surface  with  an  insulator  and  then 
applying  a  mirror  coating.  For  example,  several  layers  of  polymethyl  methacrylate  (PMMA) 
can  be  spun  onto  the  substrate.  Swabbing  with  acetone  removes  the  PMMA  covering  the 
wire  pads  near  the  substrate’s  edge,  and  the  mirror  is  created  by  using  a  mask  to  thermally 
evaporate gold only onto the PMMA-coated region. Epoxying a silver mirror (with EPO-TEK
353)  to  the  surface  also  forms  a  good  mirror,  and  it  eliminates  any  corrugations  on  the  mirror 
surface  caused  by  the  underlying  wires  [11].  Unfortunately,  the  minimum  distance  between 
the  atoms  and  the  wires  is  set  by  the  mirror  and  epoxy  thickness.  An  improved  mirror  can 
be made by epoxying a dielectric mirror onto the surface. Vacuums of 2 x 10⁻¹⁰ Torr, in a
chamber baked to 150 °C, have been achieved despite using this glue and dielectric coating.

4 Specific fabrication techniques: wet etching, ion milling, lift-off method, and electroplating

The minimum required wire dimensions vary significantly depending on the atom chip’s
application,  and  an  optimal  fabrication  technique  should  be  chosen  accordingly.  This  section 
describes  the  recipe  and  relative  merit  of  each  fabrication  method. 
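
The selection rules spelled out in the subsections of this section can be condensed into a small decision helper. The sketch below is only a summary under stated assumptions: the function name, the hard thresholds (1 µm height, 10 µm and 30 µm widths), and the preference for ion milling over lift-off whenever a mill is available are simplifications of ranges that the text gives only approximately.

```python
def choose_technique(width_um, height_um, ion_mill_available=False):
    """Suggest a (technique, mask type) pair for the smallest wire on a chip."""
    if width_um <= 0 or height_um <= 0:
        raise ValueError("wire dimensions must be positive")
    # Transparency masks only resolve features of roughly 30 um and larger.
    mask = "transparency" if width_um >= 30 else "chrome"
    if height_um > 1.0:
        # Thermal evaporation alone becomes impractical above about 1 um.
        return ("electroplating", mask)
    if width_um >= 10:
        # Isotropic undercut is negligible for wide, thin wires.
        return ("wet etch", mask)
    if ion_mill_available:
        return ("ion milling", mask)
    return ("lift-off", mask)


print(choose_technique(300, 1))  # wide, thin wire
print(choose_technique(3, 1))    # narrow, thin wire
print(choose_technique(3, 4))    # narrow, tall wire
```

The boundaries are soft in practice. For example, lift-off has been pushed to 1.5 µm tall wires at the cost of several hours of evaporation.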

4.1 Wet etching and ion milling

The simplest chip to fabricate has wire widths no smaller than 30-40 µm and wire heights less
than 1 µm. A transparency mask should be used for the photolithography (see Section 3.1).
The  wire  height  is  set  by  a  thermally  evaporated  gold  layer  and  the  photoresist  masks  the 
gold  intended  for  wires  from  the  wet  etch  solution  (see  figure  3(a)).  To  begin  the  procedure, 
the  cleaned  substrate  should  be  placed  in  the  ozone  dry  stripper  for  five  minutes  at  65 °C 
to  ensure  that  no  organic  material  will  prevent  the  adhesion  of  chromium  and  gold.  The 
thermal evaporation step follows, with the thickness of the gold layer determined by the chip’s
current  density  requirements.  Because  the  photoresist  adheres  well  to  gold,  only  a  5  min  bake 
at  180° C  is  necessary  for  adhesion.  Wet  etching  removes  exposed  gold,  and  the  photoresist 
should be patterned such that it covers the areas intended for wires, i.e. the photoresist should
be  a  positive  image  of  the  wire  pattern.  A  photomask  on  which  the  wires  are  opaque,  used 
in  conjunction  with  positive  photoresist,  will  produce  a  positive  image  of  the  wire  pattern. 
We  use  the  photoresist  AZ5214  (Clariant),  which  can  serve  as  both  a  negative  and  positive 
photoresist  depending  on  the  bake  and  exposure  procedure.  The  positive  process  recipe  is 
as  follows:  spin  coat  at  5000  rpm  for  50  s,  bake  at  95° C  for  2  min,  expose  for  10  to  20  s, 
and  develop  in  AZ327  MIF  (or  some  similar  developer)  for  30  s.  All  of  the  above  times  are 
approximate  and  will  vary  depending  on  the  UV  light  intensity  of  the  specific  mask  aligner 
and  on  various  environmental  conditions.  It  may  be  necessary  to  try  various  exposure  and 
bake times to find the optimal recipe. These exposure times are based on the 16 mW/cm²
UV  intensity  of  our  mask  aligner.  To  remove  the  gold  not  covered  by  photoresist,  submerge 
the  substrate  in  gold  etch  solution  (Gold  Etchant  TFA,  Transene  Company,  Inc.,  telephone 
978-777-7860)  for  a  few  tens  of  seconds  until  only  the  dull  gray  of  the  chromium  layer  remains. 
Finally,  remove  the  chromium  layer  with  chrome  etchant  (CR-7S,  Cyantek,  Co.,  telephone 
510-651-3341).  Figures  4(a)  and  (b)  show  a  substrate  patterned  in  this  manner.  The  wet  etch 
dissolves  the  gold  isotropically,  and  the  decrease  in  wire  width  is  insignificant  for  wires  larger 
than 10 to 20 µm. Of course, transparency masks cannot be used for features smaller than a
few  tens  of  microns. 
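
Because the quoted exposure times refer to a 16 mW/cm² lamp, a first guess for a different mask aligner can be obtained by keeping the exposure dose (intensity times time) constant. The sketch below assumes that the resist response depends only on dose, which is a simplification. The result should be treated as a starting point for the trial exposures mentioned above. The function name is hypothetical.

```python
REF_INTENSITY_MW_PER_CM2 = 16.0  # lamp intensity quoted in the text


def scaled_exposure_time_s(ref_time_s, intensity_mw_per_cm2,
                           ref_intensity_mw_per_cm2=REF_INTENSITY_MW_PER_CM2):
    """Return the exposure time that reproduces the reference dose."""
    if ref_time_s <= 0 or intensity_mw_per_cm2 <= 0:
        raise ValueError("time and intensity must be positive")
    dose_mj_per_cm2 = ref_time_s * ref_intensity_mw_per_cm2
    return dose_mj_per_cm2 / intensity_mw_per_cm2


# The 10 to 20 s window at 16 mW/cm^2 corresponds to 160 to 320 mJ/cm^2.
# On an aligner delivering 8 mW/cm^2 the same doses take 20 to 40 s.
print(scaled_exposure_time_s(10, 8.0), scaled_exposure_time_s(20, 8.0))
```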

Ion milling can be a useful alternative to wet etching. Instead of removing the unwanted
gold with an etch solution, argon ions anisotropically bombard the surface, removing the gold
not covered by photoresist (see figure 3(b)). This method can produce very narrow features,
limited only by photoresist resolution, with heights determined by the thermally evaporated
gold layer. The photoresist is also milled, but this is of no consequence as long as it is thicker
than the gold layer. The substrate may become quite hot during the ion etching, and one
needs to be careful that the substrate does not overheat, causing the photoresist to become
hard and difficult to remove. We have used ion milling to make atom chips as well as to etch
a common hard drive for use as a magnetic atom mirror [17].

Fig. 3. Fabrication techniques. (a) Patterned positive photoresist masks the gold layer from the
gold and chromium wet etch. (b) The argon ions mill away the gold not covered by positive
photoresist. (c) Gold is thermally evaporated into the trenches patterned in the negative photoresist.
The undercut allows the photoresist and unwanted gold to separate from the substrate without
peeling away the gold in the trenches. (d) Wires are defined by gaps in the positive photoresist,
and the walls of the photoresist guide the wires as they are electroplated. After electroplating,
acetone removes the photoresist and gold and chromium etches remove the seed layer.

Fig. 4. Gold wire patterned using the wet etch technique. (a) This atom chip contains a quadrupole
trap in the U configuration. The gold wire, patterned on sapphire and surrounded by a gold mirror,
is 300 µm wide and 1 µm tall. (b) Close-up of the wire region. The gold appears darker than the
uncovered sapphire substrate.

4.2 The lift-off method

The  quick  and  easy  wet  etch  technique  is  unfortunately  not  suitable  for  wire  widths  smaller 
than 10 µm, and ion milling machines are not readily available. The lift-off method should be
used when the wires need not be taller than 1 µm but must be less than 10 µm wide.
In  contrast  to  the  wet  etch  technique,  the  photoresist  in  this  method  is  used  as  a  mask  for 
the  deposition  of  thermally  evaporated  gold.  Trenches  are  created  in  a  negative  photoresist 
using  a  photomask  with  opaque  wires,  and  evaporated  gold  deposits  both  into  the  trenches, 
adhering  to  the  substrate,  and  onto  the  surface  of  the  photoresist  (see  figure  3(c)).  If  done 
properly,  the  walls  of  the  trenches  have  an  overhang — which  looks  like  an  undercut  when 
viewed  from  above — that  prevents  the  unwanted  gold  on  the  photoresist  from  connecting  to 
the  gold  in  the  trenches.  An  acetone  bath  dissolves  the  photoresist,  allowing  the  unwanted 
gold to lift off, leaving the wire pattern formed from the gold in the trenches.

After  cleaning  the  substrate,  the  AZ5214  is  spun  on  the  substrate  for  45  s  at  5000  rpm.  The 
maximum  height  of  the  thermally  evaporated  wires  is  set  by  the  thickness  of  the  photoresist 
since  lift-off  will  not  work  once  the  top  of  the  gold  connects  with  the  gold  on  the  overhang. 
We have been able to achieve lift-off with wires 1.5 µm tall by spinning the photoresist on at
2000  rpm  and  thermally  evaporating  many  boats  of  gold  over  a  period  of  three  to  four  hours. 
The  photoresist  should  then  be  baked  for  45  s  at  100° C,  UV  exposed  with  the  photomask  for 
10  s,  baked  again  for  45  s  at  123° C,  UV  exposed  with  no  mask  for  2.1  min,  and  developed 
for  25  to  35  s.  Developing  is  finished  when  one  can  see  the  wire  pattern  in  the  photoresist.  A 
successful undercut can be seen in a microscope as a bright outline of the edges of the trenches.
Before  thermal  evaporation,  the  substrate  should  be  placed  in  the  ozone  dry  stripper  at  65 °C 
for  5  minutes.  This  removes  unwanted  material  that  could  prevent  gold  adhesion,  and  does 
not  seem  to  hamper  photoresist  removal  as  in  the  electroplating  process  described  below. 
To  promote  lift-off,  the  acetone  bath  should  be  heated  on  a  hot  plate,  and  the  substrate, 
while  inside  the  beaker,  should  be  sprayed  with  an  acetone  squirt  bottle.  It  is  very  important 
that all of the gold-coated photoresist be peeled away before the substrate is removed from
the  acetone.  Otherwise,  once  dried,  the  unwanted  gold  flakes  become  extremely  difficult  to 
separate  from  the  surface.  Difficulty  in  achieving  lift-off  may  be  overcome  by  briefly  exposing 
the substrate to ultrasound. This is risky, however, since the gold wires might be stripped off
as  well.  Figure  2(b)  shows  an  atom  chip  fabricated  with  the  lift-off  method. 

4.3 Electroplating

The  above  methods  rely  on  thermal  evaporation  to  achieve  the  required  wire  thickness.  This 
limits the wire heights to ~ 1 µm. Electroplating the wires can increase the wire height
considerably: for example, we have made 3 µm wide wires, 4 µm tall. Thick photoresist
spun and patterned on a thin gold seed layer provides a template for the growth of the wires.
The  walls  of  the  photoresist  maintain  a  constant  wire  width  as  the  wire  height  increases  (see 
figure  3(d)).  An  acetone  wash  followed  by  a  brief  wet  etch  removes  the  photoresist  and  gold 
seed  layer.  Electroplating  is  a  tricky  process  that  does  not  always  produce  reliable  results. 
We  provide  here  a  general  guideline  for  the  process,  and  with  this  process  we  have  typically 
been able to achieve a 75% yield with a wire height accuracy of ±0.5 µm.

Fabrication  begins  with  cleaning  and  ozone  dry  stripping  the  substrate,  followed  by  the 
thermal evaporation of a 100 to 150 nm seed layer of gold along with a 50 Å chromium
adhesion layer. For proper wire guiding, the photoresist must always be taller than the electroplated
wires, and a photoresist thicker than the one used in the aforementioned techniques
is necessary. Clariant’s AZ9200 series photoresists are 4 to 24 microns thick, and can achieve
aspect ratios of 5 to 7 with resolutions of < 1 µm to 3.5 µm depending on the resist thickness.
After  spin  coating,  the  photoresist  should  be  UV  exposed  for  60  s  (or  longer  depending  on 
the  photoresist  thickness)  using  a  photomask  with  transparent  wire  patterns.  The  resist  is 
developed in a 1:4 solution of AZ400K and water for a minute or more: the exposed photoresist
will turn hazy before dissolving away. The gold seed layer also acts as the cathode in the
electroplating process, and some of the photoresist must be wiped away with acetone (or a
blank spot designed into the photoresist) to serve as a contact for the cathode lead. An ozone
dry etch is then used to remove any layers of HMDS, photoresist, or organics that might mask
regions of the gold from the electroplating solution. The time and temperature of this process
are crucial: too long an exposure at too high a temperature will make the photoresist
difficult to remove between closely spaced wires, and too short an exposure will not remove
enough  unwanted  masking  material.  For  example,  we  found  that  an  18  s  room-temperature 
ozone  dry  etch  was  optimal  for  removing  unwanted  material  while  also  enabling  the  removal 
of photoresist between wires spaced by 3 µm.

We  use  a  sodium  gold  sulfite  solution  (TG-25E,  Technic,  Inc.  telephone  714-632-0200)  for 
the  electroplating.  The  solution  is  temperature  controlled  on  a  hot  plate  to  60 °C  and  agitated 
with  a  magnetic  stirrer.  The  anode  is  a  platinum  foil,  and  the  substrate  is  connected  to  the 
power supply with a standard mini alligator clip. This clip can be dipped into the bath to
enable  the  complete  submersion  of  the  substrate.  We  usually  use  a  current  of  0.1  to  0.2  mA 
to  electroplate.  Higher  currents  seem  to  produce  rougher  wire  surfaces.  The  solution  should 
remain  clear  to  slightly  yellowish  during  the  process,  and  something  is  wrong  if  the  solution 
starts  to  turn  brown.  The  substrate  should  be  gently  agitated  while  electroplating  to  promote 
even plating and suppress the formation of ~ 5 µm tall towers of gold. Typically, it takes 10
to  30  minutes  to  electroplate  several  microns  of  gold  at  this  current  setting. 
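
The plating time can be cross-checked against Faraday's law of electrolysis. The sketch below assumes a one-electron reduction of gold, as expected for a gold(I) sulfite bath, and 100% current efficiency. The plated area in the example is a hypothetical value, since the exposed seed-layer area depends on the wire pattern and is not stated in the text. Real baths deposit somewhat less than this ideal estimate, so the result is a lower bound on the time.

```python
FARADAY_C_PER_MOL = 96485.0
GOLD_MOLAR_MASS_G_PER_MOL = 196.97
GOLD_DENSITY_G_PER_CM3 = 19.3


def plating_time_s(height_um, area_mm2, current_ma,
                   electrons_per_atom=1, efficiency=1.0):
    """Ideal time (in s) to plate gold of a given height over a given area."""
    if min(height_um, area_mm2, current_ma) <= 0 or not 0 < efficiency <= 1:
        raise ValueError("inputs must be positive and efficiency in (0, 1]")
    volume_cm3 = (area_mm2 * 1e-2) * (height_um * 1e-4)
    moles = volume_cm3 * GOLD_DENSITY_G_PER_CM3 / GOLD_MOLAR_MASS_G_PER_MOL
    charge_c = moles * electrons_per_atom * FARADAY_C_PER_MOL / efficiency
    return charge_c / (current_ma * 1e-3)


# Hypothetical pattern: 6 mm^2 of exposed seed layer plated to 3 um at 0.15 mA.
minutes = plating_time_s(3.0, 6.0, 0.15) / 60.0
print(round(minutes, 1))  # falls inside the 10 to 30 minute range quoted above
```

An estimate of this kind is mainly useful for choosing a starting point. The final height still has to be checked after plating.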

After  electroplating,  the  photoresist  should  be  removed  in  a  room-temperature  acetone 
bath.  Sometimes  it  is  difficult  to  remove  the  photoresist  between  wires  spaced  only  several 
microns from one another, and in these cases the substrate, while still in the acetone, should be
placed in an ultrasound cleaner for a few minutes. The gold should not peel away since it is attached
to the entire substrate surface. After rinsing the acetone away with IPA and methanol, the
gold seed layer is removed with a ~ 15 s wet etch. The chromium adhesion layer should also
be wet etched away. Occasionally, the air jet does not remove all of the methanol from the
substrate, and tiny drops of methanol can sometimes dry on the leeward side of the wires. This
dried methanol acts as a mask for the gold etch, leaving small puddles of the seed layer that
can short adjacent wires. These puddles can be removed by rinsing with methanol, blow-drying
from a different angle, and briefly wet etching a second time. The surface reflectance
of  the  gold  is  typically  diminished  after  the  wet  etch,  and  a  mirror  fabricated  with  this  gold 
may  not  be  ideal. 

A  surface  profilometer,  commonly  known  as  an  alpha  step  machine,  is  quite  useful  for 
quickly measuring the height of the wires. Inevitably, a few substrates must be spent optimizing
the electroplating process for a specific wire height. Figures 5(a) and (b) show an atom
chip-based BEC interferometer that we fabricated by electroplating on an AlN substrate [33].
The smallest features are five 1 mm long wires that are each 3 μm wide, 4 μm tall, and
spaced less than 3 μm from one another.

5  Conclusion 

The  techniques  described  in  this  paper  provide  a  basic  starting  point  for  the  design  and 
fabrication  of  these  atom  chips.  The  precise  control  of  atomic  position  enabled  by  these 
chips  is  quite  crucial  to  many  areas  of  research.  Moreover,  these  devices  allow  an  incredible 
miniaturization  of  experiments  involving  cold  atoms.  From  constructing  atom  optical  elements 
to  studies  of  BECs  and  cavity  QED,  atom  chips  are  proving  invaluable  to  the  fields  of  atomic 
physics,  quantum  optics,  and  quantum  computation. 
Fig. 5. An atom chip-based BEC interferometer fabricated by electroplating onto an AlN substrate.
a) The chip will produce a BEC and transport it to the center region where b) five wires 3 μm
wide, 4 μm tall, and spaced by 3 μm will split the BEC in a double well potential.
We describe a fault-tolerant one-way quantum computer on cluster states in three
dimensions.  The  presented  scheme  uses  methods  of  topological  error  correction  resulting 
from  a  link  between  cluster  states  and  surface  codes.  The  error  threshold  is  1.4%  for 
local  depolarizing  error  and  0.11%  for  each  source  in  an  error  model  with  preparation-, 
gate-,  storage-  and  measurement  errors. 

1  Introduction 

A quantum computer as a physical device has to cope with imperfections of its hardware. Fortunately,
it turns out that arbitrarily large quantum computations can be performed with arbitrary
accuracy, provided the error level of the elementary components of the quantum computer is below
a certain threshold. This is the content of the threshold theorem for quantum computation
[1, 2, 3, 4]. The threshold theorem also provides lower bounds to the error threshold which are in
the range between $10^{-10}$ and $10^{-4}$, depending on the error model. It thus appears that there is a
gap  between  the  required  and  the  currently  available  accuracy  of  quantum  operations,  and  it  invites 
narrowing  from  both  the  experimental  and  the  theoretical  side.  In  this  context,  significant  progress 
has  been  made  in  [5]  where  a  threshold  estimate  in  the  percent  range  has  been  demonstrated.  For 
experimentally  viable  quantum  computation  there  is  a  further  desideratum  besides  a  high  error 
threshold.  With  the  exception  of  certain  schemes  for  topological  quantum  computation  [6,  7,  8],  the 
price  for  fault-tolerance  is  an  overhead  in  quantum  resources.  This  overhead  should  be  moderate. 

Here we describe a fault-tolerant version of the one-way quantum computer (QCc). The QCc is
a  scheme  for  universal  quantum  computation  by  one-qubit  measurements  on  cluster  states  [9]. 
Cluster  states  [10]  consist  of  qubits  arranged  on  a  two-  or  three-dimensional  lattice  and  may  be 
created  by  a  nearest-neighbor  Ising  interaction.  Thus,  for  the  QCc,  only  nearest-neighboring  qubits 
need  interact  and  furthermore,  only  once  at  the  beginning  of  the  computation.  For  this  scenario  in 
three  dimensions  we  present  methods  of  error  correction  and  a  threshold  value. 

The  existence  of  an  error  threshold  for  the  QCc  has  previously  been  established  [11,  12,  13]  and 
threshold  estimates  have  been  obtained  [12,  13],  by  mapping  to  the  circuit  model.  Here  we  take 
a  different  path.  We  make  use  of  topological  error  correction  capabilities  that  the  cluster  states 
naturally  provide  [14]  and  which  can  be  linked  to  surface  codes  [6,  15].  The  main  design  tool  upon 
which  we  base  our  construction  are  engineered  lattice  defects  which  are  topologically  entangled. 

The  picture  is  the  following:  quantum  computation  is  performed  on  a  three-dimensional  cluster 
state  via  a  temporal  sequence  of  one-qubit  measurements.  The  cluster  lattice  is  subdivided  into  three 
regions,  V,  D  and  S.  The  set  S  comprises  the  ‘singular’  qubits  which  are  measured  in  an  adaptive 
basis.  The  quantum  computation  happens  essentially  there.  The  sets  V  and  D  are  to  distribute 
the  correct  quantum  correlations  among  the  qubits  of  S.  V  stands  for  ‘vacuum’,  the  quantum 
correlations  can  propagate  and  spread  freely  in  V.  D  stands  for  ‘defect’.  Quantum  correlations 
cannot penetrate the defect regions. They either end in them or wrap around them. In both cases,
the defects guide the quantum correlations. V and D are distinguished by the bases in which the
respective cluster qubits are measured. The region V fills most of the cluster. Embedded in V are
the defects (D), most of which take the shape of loops. These loops are topologically entangled with
one another. Further, there are defects in the shape of ear clips which each hold an S-qubit in their
opening. Such a defect and its associated S-qubit again form a loop. These are the only locations
where the S-qubits occur. See Fig. 1.

Figure 1: Topological error correction for the QCc. The three-dimensional cluster state, shown as a
block, is structured by lattice defects which exist in two kinds: primal and dual. The defects wind
around one another. Some of them hold singular qubits (red) which realize the non-Clifford part of the
quantum computation.

There are two codes upon which we base our construction, the planar code [15] and the concatenated
[15,3,1] quantum Reed-Muller code [16, 17]. Both codes individually have their strengths
and weaknesses, but they can be advantageously combined. The planar code has a relatively high
error threshold of about 11% [18], and the symmetries of its stabilizer fit well with cluster states.
Moreover, fault-tolerant data storage with the planar code and the creation of long-range entanglement
among planar code qubits are easily accomplished in three-dimensional cluster states, via a
bcc-symmetric pattern of one-qubit measurements. It is therefore natural to base fault-tolerant
cluster state computation on this code. However, the planar code is not well suited to non-Clifford
operations, which are essential for universal quantum computation.

Now, in cluster- and graph state quantum computation, the non-Clifford part of the circuit
is implemented by destructive measurements of the observables $(X \pm Y)/\sqrt{2}$. (The Clifford part
is implemented by X-, Y- and Z-measurements.) The Reed-Muller quantum code is well suited
for fault-tolerant quantum computation via local measurements because the measurements of the
encoded observables $(\overline{X} \pm \overline{Y})/\sqrt{2}$ (and of $\overline{X}$, $\overline{Y}$, $\overline{Z}$ besides) are accomplished fault-tolerantly by the
respective measurements on the bare level, and bare-level measurements are what we are allowed
to do in the QCc. If we could assume that we were given a graph state as algorithmic resource
where each cluster qubit was encoded with the concatenated Reed-Muller code and where noise
acted locally on the bare level, then fault-tolerant quantum computation would be trivial to achieve.
However, it is not obvious how to create such an encoded graph state affected by local noise only. But
this is just what the topological error correction in cluster states can do.
This paper is organized as follows. In Section 2, we introduce the ingredients required for the
error  correction  mechanisms  we  use,  namely  cluster  states,  the  planar  code  and  the  15-qubit  Reed- 
Muller  quantum  code.  In  Section  3  the  measurement  pattern  used  for  fault-tolerant  cluster  state 
quantum  computation  is  described,  and  in  Sections  5  -  6  it  is  explained.  Specifically,  in  Section  5 
we  describe  the  physical  objects  relevant  for  the  discussed  scheme — defects,  cluster  state  quantum 
correlations,  errors  and  syndrome  bits — in  the  language  of  homology.  In  Section  6  we  introduce  the 
techniques  for  structuring  quantum  correlations  via  topological  entanglement  of  lattice  defects.  Our 
error  models  are  stated  in  Section  7.1  and  the  fault-tolerance  threshold  is  derived  in  Section  7.2. 
The  overhead  is  estimated  in  Section  8.  We  discuss  our  results  in  Section  9. 


2  Cluster  states  and  quantum  codes 

This section is a brief review of the ingredients for the described fault-tolerant QCc.


Cluster states. A cluster state is a stabilizer state of qubits, where each qubit occupies a site
on a d-dimensional lattice $\mathcal{C}$. Each site $a \in \mathcal{C}$ has a neighborhood $N(a)$ which consists of the
lattice sites with the closest spatial distance to $a$. Then, the cluster state $|\phi\rangle_{\mathcal{C}}$ is, up to a global
phase, uniquely defined via the generators $K_a$ of its stabilizer

$$K_a := X_a \bigotimes_{b \in N(a)} Z_b, \quad \forall a \in \mathcal{C}, \qquad (1)$$

i.e., $|\phi\rangle_{\mathcal{C}} = K_a |\phi\rangle_{\mathcal{C}}$. Here, $X_a$ and $Z_b$ are a shorthand for the Pauli operators $\sigma_x^{(a)}$ and $\sigma_z^{(b)}$ that
we use throughout the paper. We refer to the generators $K_a$ of the cluster state stabilizer as the
elementary cluster state quantum correlations.
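
The defining property of the generators in Eq. (1) can be checked mechanically. The following is a minimal sketch, assuming a hypothetical 3 x 3 square lattice with nearest-neighbor bonds as the cluster (the lattice size and the helper names `cluster_generators`, `commute` and `neighbors` are illustrative choices, not part of the scheme). Each generator is stored as a pair of supports, and the check confirms that all generators commute, as a stabilizer requires.

```python
from itertools import product

def cluster_generators(sites, neighbors):
    """Return {a: (x_support, z_support)} for K_a = X_a (x) Z_b, b in N(a)."""
    return {a: (frozenset([a]), frozenset(neighbors(a))) for a in sites}

def commute(p, q):
    """Products of X's and Z's commute iff the X-support of one meets the
    Z-support of the other an even number of times in total."""
    (px, pz), (qx, qz) = p, q
    return (len(px & qz) + len(pz & qx)) % 2 == 0

# Hypothetical example cluster: 3 x 3 square lattice, nearest-neighbor bonds.
SIZE = 3
sites = list(product(range(SIZE), repeat=2))

def neighbors(a):
    x, y = a
    candidates = [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]
    return [b for b in candidates if b in sites]

K = cluster_generators(sites, neighbors)
all_commute = all(commute(K[a], K[b]) for a in sites for b in sites)
print(len(K), all_commute)
```

Two neighboring generators overlap in exactly two places (the X of one on a Z of the other, and vice versa), so the two sign changes cancel.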

In this paper, we will use as the lattice underlying the cluster state a bcc-symmetric lattice in
three dimensions. That is, the location of cluster qubits is given by lattice vectors

$$\begin{array}{ll} \{(o[\mathrm{dd}], e[\mathrm{ven}], e),\ (e, o, e),\ (e, e, o)\}, & \text{odd qubits}, \\ \{(e, o, o),\ (o, e, o),\ (o, o, e)\}, & \text{even qubits}. \end{array} \qquad (2)$$

We  sub-divide  the  set  of  qubits  into  two  subsets,  the  even  and  the  odd  qubits.  For  even  (odd)  qubits 
the  sum  of  the  coordinates  of  their  respective  lattice  site  is  even  (odd). 
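
As a small illustration of this parity convention, the sketch below classifies integer lattice sites by the number of odd coordinates. The function name `classify` and the choice of a 2 x 2 x 2 block as the counting region are assumptions made for the example only.

```python
from itertools import product

def classify(site):
    """Classify an integer lattice site by the parity pattern of its coordinates.

    Returns 'odd qubit', 'even qubit', or None for sites of type (e,e,e) or
    (o,o,o), which carry no cluster qubit.
    """
    n_odd = sum(c % 2 for c in site)
    if n_odd == 1:
        return "odd qubit"    # (o,e,e), (e,o,e), (e,e,o): coordinate sum is odd
    if n_odd == 2:
        return "even qubit"   # (e,o,o), (o,e,o), (o,o,e): coordinate sum is even
    return None

# Count the site types inside one 2 x 2 x 2 block of cluster units.
counts = {}
for site in product(range(2), repeat=3):
    kind = classify(site)
    counts[kind] = counts.get(kind, 0) + 1
print(counts)
```

Each such block holds three odd and three even qubits, and two empty sites of type (e,e,e) and (o,o,o).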

Note that instead of a bcc-symmetric lattice we could have equivalently started with a
cluster state on an sc-symmetric lattice, because a cluster state on the latter is mapped to a cluster
state on the former by Z-measurements on the qubits (e,e,e) and (o,o,o); see [9].


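The [15, 3, 1] quantum Reed-Muller code. By this we denote a 15 qubit CSS-code based on
the (classical) punctured Reed-Muller code $\mathcal{R}(1,4)^*$ [19]. Its stabilizer generator matrix has the
form

$$G_{RM} = \begin{pmatrix} G_X & 0 \\ 0 & G_Z \end{pmatrix}, \qquad (3)$$

where

[Eq. (4): the binary matrix $G_X$; its entries are not recoverable from the source]

and $G_Z$ is given by $G_X^{\perp} = G_Z$.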
Figure 2: Elementary cell of the bcc lattice. Qubits live on the faces and edges of the elementary
cell.  Syndrome  bits  are  located  in  the  cube  and  on  the  sites.  Each  elementary  cell  has  a  volume  of 
2  x  2  x  2  in  cluster  units.  Black:  elementary  cell  of  the  bcc  lattice,  blue:  edges  of  the  cluster  graph. 


This code has the fairly rare property that the encoded non-Clifford gate $U_z(\pi/4) = \exp(-i\frac{\pi}{8} Z)$
is local [16, 17], i.e.,

$$\exp\left(-i\frac{\pi}{8}\overline{Z}\right) \cong \bigotimes_{i=1}^{15} \exp\left(i\frac{\pi}{8} Z_i\right). \qquad (5)$$

This property has been used in magic state distillation [17]. In the computational scheme described
here we use it to fault-tolerantly measure the encoded observables $(\overline{X} \pm \overline{Y})/\sqrt{2}$ via local measurements of
the observables $(X_i \mp Y_i)/\sqrt{2}$.
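
The locality of the encoded rotation can be illustrated numerically. The sketch below rests on two assumptions that are not taken from the text: the X-type generator matrix is the common 4 x 15 form whose column j is the binary expansion of j, and the encoded |0> is the equal superposition of the span of its rows (the encoded |1> being the bitwise complements). Under these assumptions it checks that rotating every bare qubit by exp(+i pi/8 Z) multiplies each basis component of the encoded |0> by exp(-i pi/8) and each component of the encoded |1> by exp(+i pi/8), which is the action of the encoded exp(-i pi/8 Z).

```python
import cmath
from itertools import product

N = 15
# Assumed form of G_X: column j (1..15) is the binary expansion of j.
G_X = [[(j >> k) & 1 for j in range(1, N + 1)] for k in range(4)]

def span(rows):
    """All GF(2) linear combinations of the given rows."""
    words = []
    for coeffs in product((0, 1), repeat=len(rows)):
        word = [0] * N
        for c, row in zip(coeffs, rows):
            if c:
                word = [(w + r) % 2 for w, r in zip(word, row)]
        words.append(tuple(word))
    return words

zero_L = span(G_X)                                   # components of encoded |0>
one_L = [tuple(1 - b for b in w) for w in zero_L]    # components of encoded |1>

def transversal_phase(word, theta):
    """Phase applied to |word> by the product over i of exp(i*theta*Z_i)."""
    return cmath.exp(1j * theta * (N - 2 * sum(word)))

theta = cmath.pi / 8
dev_0 = max(abs(transversal_phase(w, theta) - cmath.exp(-1j * theta)) for w in zero_L)
dev_1 = max(abs(transversal_phase(w, theta) - cmath.exp(+1j * theta)) for w in one_L)
print(sorted({sum(w) for w in zero_L}), dev_0 < 1e-9, dev_1 < 1e-9)
```

The check works because every nonzero combination of the assumed rows has weight 8, so the accumulated phase depends only on which encoded basis state the component belongs to.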


Surface  codes.  For  the  surface  codes  [6,  15]  physical  qubits  live  on  the  edges  of  a  two-dimensional 
lattice.  The  support  of  a  physical  error  must  stretch  across  a  constant  fraction  (typically  1/2)  of 
the  lattice  to  cause  a  logical  error.  The  protection  against  errors  is  topological. 

The stabilizer generators of the code are associated with the faces $f$ and the vertices $v$ of the
lattice,

$$S_X(v) = \bigotimes_{e \,|\, v \in \{\partial e\}} X_e, \qquad S_Z(f) = \bigotimes_{e \in \{\partial f\}} Z_e. \qquad (6)$$

Therein, $\partial$ is the boundary operator. The number of qubits that can be stored depends on the
boundary  conditions  of  the  code  lattice.  The  code  resulting  from  periodic  boundary  conditions,  the 
‘toric  code’  [6],  can  store  two  qubits. 
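
The qubit count for periodic boundary conditions can be reproduced with a short calculation. The sketch below is an illustration under assumed conventions (edges labelled `("h", x, y)` and `("v", x, y)` on an L x L torus, helper names `toric_code`, `gf2_rank` and `logical_qubits` are hypothetical): it builds the vertex and face generators of Eq. (6), checks that they commute, and subtracts the number of independent generators from the number of edge qubits.

```python
def gf2_rank(vectors):
    """Rank over GF(2) of bit vectors stored as Python ints."""
    basis = {}  # leading-bit position -> reduced vector
    for v in vectors:
        while v:
            lead = v.bit_length() - 1
            if lead not in basis:
                basis[lead] = v
                break
            v ^= basis[lead]
    return len(basis)

def toric_code(L):
    """Return (number of edge qubits, vertex generators, face generators)."""
    edge_index = {}
    for x in range(L):
        for y in range(L):
            for d in "hv":
                edge_index[(d, x, y)] = len(edge_index)

    def mask(edges):
        m = 0
        for e in edges:
            m ^= 1 << edge_index[e]
        return m

    stars, plaquettes = [], []
    for x in range(L):
        for y in range(L):
            # edges touching vertex (x, y)
            stars.append(mask([("h", x, y), ("h", (x - 1) % L, y),
                               ("v", x, y), ("v", x, (y - 1) % L)]))
            # edges bounding the face with lower-left corner (x, y)
            plaquettes.append(mask([("h", x, y), ("h", x, (y + 1) % L),
                                    ("v", x, y), ("v", (x + 1) % L, y)]))
    return len(edge_index), stars, plaquettes

def logical_qubits(L):
    n, stars, plaquettes = toric_code(L)
    # X-type and Z-type generators commute iff their supports overlap evenly
    assert all(bin(s & p).count("1") % 2 == 0 for s in stars for p in plaquettes)
    return n - gf2_rank(stars) - gf2_rank(plaquettes)

print(logical_qubits(4))
```

On the torus the product of all vertex generators and the product of all face generators are both the identity, which removes one independent generator of each type and leaves two encoded qubits.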

As  an  example  we  would  briefly  like  to  discuss  the  planar  code  [15]  which  encodes  one  qubit;  see 
Fig.  3a.  This  example  exhibits  many  features  of  our  subsequent  constructions  one  dimension  higher 
up:  Errors  are  identified  with  1-chains  and  show  a  syndrome  only  at  their  end  points.  Homologically 
equivalent  chains  correspond  to  physically  equivalent  errors.  Error  chains  can  end  in  the  system 
boundary  without  leaving  a  syndrome. 

Specifically, Pauli operators $Z_i$ live on the edges of the primal (= shown) lattice, and Pauli
operators $X_j$ live on edges of the dual lattice. The encoded Z-operator is a tensor product of
individual $Z_i$ operators corresponding to a primal 1-chain stretching from left to right across the
code lattice. The encoded X-operator corresponds to a 1-chain of the dual lattice that stretches
from top to bottom.

The  code  stabilizer  is  modified  at  the  system  boundary.  For  example,  a  face  to  the  left  or  right 
of the lattice has only three elementary 1-chains in its boundary, instead of four. Such a boundary is
called a ‘rough edge’. Where no modification of the faces occurs the system boundary is a ‘smooth
edge’.  ‘Smooth’  on  the  primal  lattice  is  ‘rough’  on  the  dual,  and  vice  versa.  Error  chains  can  end 
in  a  rough  edge  of  their  respective  lattice  without  leaving  a  syndrome,  but  not  in  a  smooth  edge. 
Figure 3: Codes. a) Planar code. The encoded Pauli observables X and Z, two errors and the
different boundary types ‘smooth’ and ‘rough’ are shown. The errors $E_X$, $E_Z$ end in their respective
rough boundary. $E_X$ corresponds to a trivial 1-cycle on the dual lattice and has no effect on the
encoded state. $E_Z$ corresponds to a non-trivial 1-cycle on the primal lattice. It leaves no syndrome
but causes a logical Z-error. b) Bi-colorable graph state locally unitary equivalent to the $|+\rangle$-state
of the 15-qubit quantum Reed-Muller code.


The  surface  codes  will  occur  rather  implicitly  in  our  constructions.  The  reason  is  that  here  we 
do  not  use  such  codes  to  encode  logical  qubits.  Instead,  we  use  them  to  appropriately  “wire”  a 
subset of the cluster qubits, the S-qubits. The link between surface codes and cluster states has
been  established  in  [14],  for  the  purpose  of  creating  long  range  entanglement  in  noisy  3D  cluster 
states  via  local  measurements.  It  has  been  found  that  the  error  correction  implemented  by  the  local 
measurements  is  described — like  fault-tolerant  data  storage  with  the  toric  code — by  the  so-called 
random plaquette $Z_2$-gauge model in three dimensions (RPGM) [18]. The three-dimensional cluster
state  is  like  a  surface  code,  one  dimension  higher  up.  The  third  dimension,  which  is  temporal  in  data 
storage  with  the  toric  code,  is  spatial  for  the  cluster  state.  The  extra  spatial  dimension  can  be  used 
to  fault-tolerantly  mediate  interaction  among  qubits.  The  creation  of  an  encoded  Bell  state  over 
large  distances  [14]  is  the  simplest  example.  The  long-range  quantum  correlations  are  engineered 
by  the  suitable  choice  of  boundary  conditions. 

Why are the above two codes chosen? For the fault-tolerant scheme of quantum computation
described in this paper we require a quantum code with the following three properties: 1) The code
is of CSS-type, 2) The code satisfies Eq. (5), and 3) The code fits with cluster states, i.e., for some
arrangement of qubits on a translation-invariant two-dimensional lattice, the code has a translation-invariant
set of stabilizer generators and these generators each have a small support on the lattice.

The  Reed-Muller  code  has  properties  1  and  2  but  not  3.  The  surface  codes  have  properties  1  and 
3  but  not  2.  Thus,  neither  of  the  codes  alone  suffices.  But  their  combination  does,  as  is  described 
in  the  subsequent  sections. 

3  The  measurement  pattern 

As  pointed  out  in  the  introduction,  we  subdivide  the  cluster  C  into  the  three  disjoint  subsets  V, 
D and S. S is the set of qubits where the non-Clifford part of the quantum computation is
performed,  and  V  and  D  are  to  connect  the  qubits  of  S  in  the  proper  way.  We  have  not  yet  explained 
what  the  defects  are,  and  we  will  do  so  only  in  the  next  section.  For  the  moment  it  suffices  to  note 
that the defects are located on the subset D of the cluster, and that D is the union of the two
disjoint subsets $D_1$ and $D_2$. The measurement pattern on D, S and V is given by

$$\begin{array}{ll}
\text{Defect qubits } a \in D: & \text{Measurement of } \begin{cases} X_a, & \text{if } a \in D_2, \\ Z_a, & \text{if } a \in D_1, \end{cases} \\
\text{Singular qubits } a \in S: & \text{Measurement of } \dfrac{X_a \pm Y_a}{\sqrt{2}}, \\
\text{Vacuum qubits } a \in V: & \text{Measurement of } X_a.
\end{array} \qquad (7)$$

Now  we  have  to  explain  why  we  choose  this  measurement  pattern,  which  is  best  done  using  the 
language  of  homology. 


4  Involving  the  Reed-Muller  quantum  code 


In  this  section  we  explain  the  role  of  the  Reed-Muller  code  for  the  described  computational  scheme. 
Consider a cluster state $|\phi\rangle_{\mathcal{C}_2}$ on a two-dimensional cluster $\mathcal{C}_2$. It is a resource for universal quantum
computation by measurements of the local observables $X$, $Z$ and $(X \pm Y)/\sqrt{2}$ [9]. Denote by $Q \subset \mathcal{C}_2$ the
set of qubits which are measured in the eigenbasis of $(X \pm Y)/\sqrt{2}$. These measurements implement the
non-trivial part of a quantum circuit. The measurements of X and Z on the qubits $\mathcal{C}_2 \setminus Q$ implement
the Clifford part. They are performed simultaneously in the first round of measurements. $|\Psi_{\mathrm{algo}}\rangle_Q$
is the state of the unmeasured qubits after the first measurement round. It is an algorithm-specific
stabilizer state, hence the subscript “algo”. Since it is a stabilizer state, it is easy to create and
one may start with this state as an algorithm-specific resource instead of the universal cluster state.
Quantum computation with this state proceeds by measuring local observables $(X \pm Y)/\sqrt{2}$.

Now suppose an encoded version of this state, $|\overline{\Psi}_{\mathrm{algo}}\rangle_S$, were given, and suppose the state were not perfect
but affected only by local noise on the bare level. Of course, a question that arises immediately is
how such a state is obtained. The main part of work in this paper goes into answering this question;
see subsequent sections. Now, with $|\overline{\Psi}_{\mathrm{algo}}\rangle_S$ given, one could perform fault-tolerant quantum
computation by fault-tolerant measurement of the encoded observables $(\overline{X} \pm \overline{Y})/\sqrt{2}$. This is not what
we have in mind, because we are seeking a scheme of fault-tolerant quantum computation by local
measurements. Here lies the reason for involving the (concatenated) Reed-Muller code: For this
code, the fault-tolerant measurement of the observables $(\overline{X} \pm \overline{Y})/\sqrt{2}$ proceeds by local measurements of
observables $(X_j \mp Y_j)/\sqrt{2}$. The reason for this is property (5). If $J$ is a set such that $g_X(J) := \bigotimes_{j \in J} X_j$ is
in the RM code stabilizer, then also

$$g_{\pm}(J) := \bigotimes_{j \in J} \frac{X_j \pm Y_j}{\sqrt{2}} \in \text{RM code stabilizer}. \qquad (8)$$

The relevant encoded observables are given by

$$\frac{\overline{X} \pm \overline{Y}}{\sqrt{2}} \cong \bigotimes_{j=1}^{15} \frac{X_j \mp Y_j}{\sqrt{2}}. \qquad (9)$$

The “∓” is for the total number of concatenation levels being odd. If the number is even, replace
“∓” by “±”. Therefore, if all the bare qubits belonging to an encoded qubit are individually
measured in the eigenbasis of $(X_j \mp Y_j)/\sqrt{2}$, then the eigenvalue found in a measurement of the encoded
observable $(\overline{X} \pm \overline{Y})/\sqrt{2}$ and the eigenvalues of the stabilizer generators $g_{\mp}(J)$ can be deduced from the
individual measurement outcomes. This is all that is needed for fault-tolerant measurement of the
encoded observable $(\overline{X} \pm \overline{Y})/\sqrt{2}$.
The Z-part of the code stabilizer is lost in the local measurement, but it is not needed for the
fault-tolerant measurement of $(\overline{X} \pm \overline{Y})/\sqrt{2}$. This can be seen as follows. For simplicity assume a local
depolarizing error $\frac{p}{3}\left([X_j] + [Y_j] + [Z_j]\right) = \frac{p}{3}\left(\left[\frac{X_j + Y_j}{\sqrt{2}}\right] + \left[\frac{X_j - Y_j}{\sqrt{2}}\right] + [Z_j]\right)$ for all qubits $j$.
The brackets “[·]” indicate a super-operator. W.l.o.g. assume that the local measurements are in
the eigenbasis of $(X_j + Y_j)/\sqrt{2}$. Then, the error $(X_j + Y_j)/\sqrt{2}$ is absorbed in the measurement and has no effect.
The second error $\frac{X_j - Y_j}{\sqrt{2}} \cong \frac{X_j + Y_j}{\sqrt{2}} Z_j \cong Z_j$, so all remaining errors are equivalent to Z-errors. Such
errors are identified by the stabilizer elements $\{g_+(J)\}$.


To  summarize,  the  Reed-Muller  code  is  involved  to  perform  the  fault-tolerant  measurement  of 
the encoded observables locally on the bare level. The eigenvalues corresponding to the encoded observable
and to the relevant stabilizer generators are simultaneously inferred from the measurement
outcomes.  Error  correction  proceeds  by  classical  post-processing  of  these  quantities. 
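
The classical post-processing can be sketched for a single level of the code. The example below makes several assumptions that go beyond the text: the supports J of the four X-type generators are taken to be the sets of qubit labels j (1 to 15) with a given bit set, the ideal value of every generator is +1, the encoded eigenvalue is the product of all 15 local outcomes, and at most one local outcome is flipped. The function name `decode` is hypothetical.

```python
N = 15
# Assumed supports J_k of the generators: qubit j (1..15) is in J_k
# iff bit k of j is set.
SUPPORTS = [[j for j in range(1, N + 1) if (j >> k) & 1] for k in range(4)]

def decode(outcomes):
    """outcomes: dict mapping qubit label (1..15) to a local result +1 or -1.

    Returns (encoded eigenvalue, label of the corrected qubit or None).
    Corrects at most one flipped local outcome.
    """
    syndrome = 0
    for k, support in enumerate(SUPPORTS):
        parity = 1
        for j in support:
            parity *= outcomes[j]
        if parity == -1:              # generator on J_k reports a violation
            syndrome |= 1 << k
    corrected = dict(outcomes)
    if syndrome:
        corrected[syndrome] *= -1     # the violated generators spell out the label
    encoded = 1
    for j in range(1, N + 1):
        encoded *= corrected[j]
    return encoded, (syndrome or None)

ideal = {j: -1 for j in range(1, N + 1)}   # consistent outcomes, encoded value -1
noisy = dict(ideal)
noisy[11] *= -1                            # one outcome flipped by a Z-type error
print(decode(ideal), decode(noisy))
```

With the assumed supports the four parities form the binary expansion of the flipped label, so a single error is located without any further quantum operation.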


5  Involving  a  topological  quantum  code  -  Homology 

The remaining question is how we actually create the state $|\overline{\Psi}_{\mathrm{algo}}\rangle_S$ with only local error from a
three-dimensional  cluster  state.  To  accomplish  this  task  we  involve  topological  error  correction. 


5.1  Errors  and  correlations  as  chains 

The  physical  objects  of  our  discussion — cluster  state  correlations  and  error  operators — may  be 
identified  with  faces  and  edges  of  an  underlying  lattice.  Compositions  of  such  faces  or  edges  are 
called  2-chains  and  1-chains,  respectively.  For  the  chains  homology  provides  an  equivalence  relation; 
namely,  two  chains  are  homologically  equivalent  if  they  differ  by  the  boundary  of  a  third  chain  one 
dimension  higher  up  [20,  21,  22].  Homology  plays  a  role  in  our  constructions  because  homological 
equivalence  of  the  underlying  chains  implies  physical  equivalence  of  the  associated  physical  objects. 
First we introduce the two simple cubic sub-lattices $\mathcal{L}$ and $\overline{\mathcal{L}}$ whose vertices are at locations

$$\mathcal{L}: \{(e,e,e)\}, \qquad \overline{\mathcal{L}}: \{(o,o,o)\}. \qquad (10)$$

One lattice can be obtained from the other via translation by a vector $(\pm 1, \pm 1, \pm 1)$. $\mathcal{L}$ and $\overline{\mathcal{L}}$ are
dual to each other in the sense that the faces of $\mathcal{L}$ are the edges of $\overline{\mathcal{L}}$, the cubes of $\mathcal{L}$ are the vertices
of $\overline{\mathcal{L}}$, and vice versa. We denote by $*$ the duality transformation that maps primal edges into the
corresponding dual faces ($*e = \overline{f}$), and so forth.

We denote by $B(C_0) := \{v_k\}$ the set of vertices in $\mathcal{L}$, by $B(C_1) = \{e_l\}$ the set of edges in $\mathcal{L}$, by
$B(C_2) = \{f_m\}$ the set of faces in $\mathcal{L}$, and by $B(C_3) = \{q_n\}$ the set of elementary cells [or cubes] of
$\mathcal{L}$. We may now define chains in $\mathcal{L}$ [20]. $B(C_0)$ forms a basis for the set $C_0$ of so-called 0-chains $c_0$,
$B(C_1)$ forms a basis for the set $C_1$ of 1-chains $c_1$, and so forth. Specifically, the chains are given by

$$c_0 = \sum_k z_k v_k, \quad c_1 = \sum_l z_l e_l, \quad c_2 = \sum_m z_m f_m, \quad c_3 = \sum_n z_n q_n, \qquad (11)$$

where $z_k, z_l, z_m, z_n \in \mathbb{Z}_2$. The sets $C_0$, $C_1$, $C_2$ and $C_3$ are, in fact, abelian groups under component-wise
addition, e.g. $c_1 + c_1' = \sum_l z_l e_l + \sum_l z_l' e_l = \sum_l (z_l + z_l') e_l$. For each $i = 1..3$, there exists a
homomorphism $\partial_i$ mapping $C_i$ to $C_{i-1}$, with the composition $\partial_{i-1} \circ \partial_i = 0$. Then,

$$C = \{C_3, C_2, C_1, C_0\} \qquad (12)$$

is called a chain complex, and $\partial$ is called boundary operator. It maps an $i$-chain $c_i$ to its boundary,
which is an $(i-1)$-chain. In the same way, $\overline{C}$ can be defined as a dual chain complex,
$\overline{C} = \{\overline{C}_3, \overline{C}_2, \overline{C}_1, \overline{C}_0\}$, with chains $\overline{c}_3$, $\overline{c}_2$, $\overline{c}_1$ and $\overline{c}_0$.
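
The property that the boundary of a boundary vanishes can be verified directly on a small lattice. The sketch below is illustrative and assumes a periodic cubic lattice of linear size 3, with each cell stored as a pair (base vertex, spanned axes) and each chain stored as a Python set, so that addition over $\mathbb{Z}_2$ becomes the symmetric difference. The names `cells`, `boundary_of_cell` and `boundary` are hypothetical.

```python
from itertools import combinations, product

L = 3  # linear size of a periodic cubic lattice (hypothetical choice)

def cells(dim):
    """All cells of dimension dim: 0 = vertices, 1 = edges, 2 = faces, 3 = cubes."""
    return [(base, axes)
            for base in product(range(L), repeat=3)
            for axes in combinations(range(3), dim)]

def boundary_of_cell(cell):
    """Boundary of one cell as a set of cells one dimension lower."""
    base, axes = cell
    result = set()
    for a in axes:
        rest = tuple(x for x in axes if x != a)
        shifted = tuple((c + (i == a)) % L for i, c in enumerate(base))
        result ^= {(base, rest)}
        result ^= {(shifted, rest)}
    return result

def boundary(chain):
    """Z2-linear extension: a chain is a set of cells, addition is XOR."""
    result = set()
    for cell in chain:
        result ^= boundary_of_cell(cell)
    return result

dd_vanishes = all(len(boundary(boundary({c}))) == 0
                  for dim in (2, 3) for c in cells(dim))
print(len(cells(1)), len(cells(2)), len(cells(3)), dd_vanishes)
```

Storing chains as sets keeps the arithmetic modulo 2 implicit: a cell that would appear twice in a sum simply drops out.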
Now, considering a space $C$, two chains $c_n, c_n' \in C_n(C)$ are homologically equivalent if $c_n' =
c_n + \partial c_{n+1}$ for some $c_{n+1} \in C_{n+1}(C)$ [20]. Of interest for topological error correction is the notion
of relative homology. Consider a pair of spaces $(C, D)$ with $D \subset C$. Then, two chains $c_n$, $c_n'$ are
called equivalent w.r.t. relative homology, $c_n' \cong_r c_n$, if $c_n' = c_n + \partial c_{n+1} + \gamma_n$ for some $c_{n+1} \in C_{n+1}(C)$,
$\gamma_n \in C_n(D)$; see [21]. Relative cycles may end in $D$.

Below we describe how the cluster state quantum correlations may be identified with the 2-chains,
the errors with the 1-chains and the syndrome with the 0-chains of $\mathcal{L}$ and $\overline{\mathcal{L}}$. All these
objects appear in two kinds, ‘primal’ and ‘dual’, depending on whether they are defined with
respect to $\mathcal{L}$ or $\overline{\mathcal{L}}$.

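Cluster state correlations. We define primal such correlations, $K(c_2)$, and dual ones, $K(\overline{c}_2)$,
which can be identified with 2-chains in $\mathcal{L}$ and $\overline{\mathcal{L}}$, respectively, by

$$K(c_2) := \prod_{f \in \{c_2\}} K_f, \qquad K(\overline{c}_2) := \prod_{\overline{f} \in \{\overline{c}_2\}} K_{\overline{f}}. \qquad (13)$$

Therein, e.g. the set $\{c_2\}$ is defined via a mapping $c_2 = \sum_m z_m f_m \longrightarrow \{c_2\} = \{f_m \,|\, z_m = 1\}$.
Further, we introduce the notation $O(c) := \bigotimes_{a \in \{c\}} O_a$ for all $c \in C$ and $O \in \{X, Z\}$. It is now easily
verified that

$$K(c_2) = X(c_2) Z(\partial c_2), \qquad K(\overline{c}_2) = X(\overline{c}_2) Z(\partial \overline{c}_2). \qquad (14)$$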

Errors. We will mainly discuss (correlated) probabilistic noise. Then, it is sufficient to restrict
the attention to Pauli phase flips Z, because $X_a \cong \bigotimes_{b \in N(a)} Z_b$ etc. We combine Z-errors on odd
(even) qubits to primal (dual) error chains $E(c_1)$ ($E(\overline{c}_1)$),

$$E(c_1) := Z(c_1), \qquad E(\overline{c}_1) := Z(\overline{c}_1). \qquad (15)$$

Syndrome. The first type of correlations we discuss are those for error correction in V. They are
characterized by the property that the corresponding 2-chains have no boundary. I.e., we consider
$K(c_2)$, $K(\overline{c}_2)$ with $\{c_2\}, \{\overline{c}_2\} \subset V$ and $\partial c_2 = \partial \overline{c}_2 = 0$. With (13), these correlations take the form
$K(c_2) = X(c_2)$, $K(\overline{c}_2) = X(\overline{c}_2)$. They are measured by the X-measurements in V, see (7), and are
used to identify errors occurring on the qubits $\{c_2\}$, $\{\overline{c}_2\}$.

The group of 2-chains in the kernel of $\partial$ we denote by $Z_2(\mathcal{L})$. Since $\partial\partial = 0$, a subgroup of those,
$B_2(\mathcal{L})$, is formed by the 2-chains which are themselves the boundary of a 3-chain. Denote by $q$ ($\overline{q}$) a
3-chain from the basis $B(C_3)$ ($B(\overline{C}_3)$). It represents an individual cell [or cube] of the primal lattice
$\mathcal{L}$ (dual lattice $\overline{\mathcal{L}}$). The associated quantum correlations are

$$K_q := K(\partial q) = X(\partial q), \qquad K_{\overline{q}} := K(\partial \overline{q}) = X(\partial \overline{q}). \qquad (16)$$

When being measured, each of these correlations yields a syndrome bit $Sy(q)$, $Sy(\overline{q})$ which, we say,
is located at $q$ or $\overline{q}$, respectively. Because the lattices $\mathcal{L}$ and $\overline{\mathcal{L}}$ are dual to each other, we may identify
the cell $q$ in $\mathcal{L}$ with a vertex $\overline{v}$ in $\overline{\mathcal{L}}$, and vice versa. In this way, the syndrome bits become located
at vertices of the lattices $\mathcal{L}$ and $\overline{\mathcal{L}}$. The syndrome bits resulting from the quantum correlations (16) are
those which enable topological error correction [18].

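Syndrome and errors. Let $E(c_1)$ denote a primal error chain, $\overline{q} \in B(\overline{C}_3)$ a cell in the dual
lattice $\overline{\mathcal{L}}$ and $v \in B(C_0)$, $v = *\overline{q}$. $K(\overline{q})$ detects the error $E(c_1)$ if $|\{c_1\} \cap \{\partial \overline{q}\}|$ is odd. Equivalently,
$K(\overline{q})$ detects $E(c_1)$ if $v \in \{\partial c_1\}$. Thus, error chains show a syndrome only at their ends.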
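
A short sketch makes the last statement concrete. It assumes, for illustration only, that a primal error chain is given as a list of edges, each edge being a pair of neighboring primal vertices of type (e,e,e); the syndrome is then the set of vertices touched an odd number of times. The function name `syndrome` is hypothetical.

```python
def syndrome(error_chain):
    """Vertices in the boundary of a 1-chain given as a list of edges (u, v)."""
    ends = set()
    for u, v in error_chain:
        ends ^= {u}   # a vertex visited twice cancels modulo 2
        ends ^= {v}
    return ends

# An open chain of three edges between primal vertices of type (e,e,e)
open_chain = [((0, 0, 0), (2, 0, 0)),
              ((2, 0, 0), (2, 2, 0)),
              ((2, 2, 0), (2, 2, 2))]
# A closed loop around one face of the primal lattice
closed_loop = [((0, 0, 0), (2, 0, 0)),
               ((2, 0, 0), (2, 2, 0)),
               ((2, 2, 0), (0, 2, 0)),
               ((0, 2, 0), (0, 0, 0))]
print(syndrome(open_chain), syndrome(closed_loop))
```

The open chain leaves a syndrome only at its two end vertices, while the closed loop leaves none.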
Correlations and errors. Primal cluster state correlations are affected by dual error chains and
dual correlations are affected by primal error chains. Primal correlations are not affected by primal
error chains, and dual correlations are not affected by dual error chains.

To see this, note that a primal correlation $K(c_2)$ consists of Pauli operators X on even qubits
and Pauli operators Z on odd qubits. A dual error chain $E(\overline{c}_1)$ consists of operators Z on even
qubits. Then, $E(\overline{c}_1)K(c_2) = (-1)^{|\{\overline{c}_1\} \cap \{c_2\}|} K(c_2)E(\overline{c}_1)$. If $|\{\overline{c}_1\} \cap \{c_2\}|$ is odd, the correlation $K(c_2)$
is conjugated to $-K(c_2)$ by the error. If it is even, then the correlation remains unchanged. This
situation has a geometric interpretation. $|\{\overline{c}_1\} \cap \{c_2\}|$ is the number of intersection points between
the primal 2-chain $c_2$ and the dual 1-chain $\overline{c}_1$. If the number of intersections is odd (even) then the
correlation is (is not) affected by the error. For dual correlations and primal errors the situation
is the same. Further, a primal error chain consists of Pauli operators Z on odd qubits. Thus,
$[K(c_2), E(c_1)] = 0$ always. Similarly, $[K(\overline{c}_2), E(\overline{c}_1)] = 0$, $\forall\, \overline{c}_1, \overline{c}_2$.
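
The intersection rule can be tried out on a pair of neighboring faces. The sketch below is an illustration under an assumed geometry: the primal faces lie in a plane of constant even z, so that each face qubit sits at a site (odd, odd, even) and its four boundary edges sit at the neighboring sites in that plane. The helper names `primal_correlation` and `anticommutes_with_z_error` are hypothetical.

```python
def primal_correlation(faces):
    """K(c_2) = X(c_2) Z(boundary of c_2) for faces given by their qubit sites.

    Assumes every face lies in a plane z = const, i.e. sites (odd, odd, even).
    """
    x_support = set(faces)
    z_support = set()
    for (x, y, z) in faces:
        # shared boundary edges cancel modulo 2
        z_support ^= {(x + 1, y, z), (x - 1, y, z), (x, y + 1, z), (x, y - 1, z)}
    return x_support, z_support

def anticommutes_with_z_error(correlation, error_sites):
    """A chain of Z errors flips the correlation iff it hits the X-support
    an odd number of times."""
    x_support, _ = correlation
    return len(x_support & set(error_sites)) % 2 == 1

K = primal_correlation([(1, 1, 0), (3, 1, 0)])
dual_once = [(1, 1, 0)]                 # dual chain piercing the 2-chain once
dual_twice = [(1, 1, 0), (3, 1, 0)]     # dual chain piercing it twice
primal_error = [(2, 1, 0), (0, 1, 0)]   # Z errors on odd qubits only
print(anticommutes_with_z_error(K, dual_once),
      anticommutes_with_z_error(K, dual_twice),
      anticommutes_with_z_error(K, primal_error))
```

Only the X-part of the correlation matters here, because the errors considered are products of Z operators.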

Defects. The purpose of defects is to structure the space underlying the pair of lattices $\mathcal{L}$, $\overline{\mathcal{L}}$.
Practically, a defect can be thought of as a set of qubits that are removed from the initial cluster
$\mathcal{C}$ before the remaining qubits are entangled. For the chain complexes $C$, $\overline{C}$, a defect is a set $d$ of
missing edges. What defines a defect as an entity is that its edges are connected. Like all
the other objects, defects are either primal or dual,

$$d \subset B(C_1), \qquad \overline{d} \subset B(\overline{C}_1). \qquad (17)$$

The  sets  d,  d  of  defect  qubits  are  not  arbitrary.  Seen  from  afar  they  take  the  shape  of  doughnuts. 
These doughnut-shaped defects will be topologically entangled with one another, and the way they
are  entangled  encodes  the  quantum  algorithm  to  be  performed.  From  the  viewpoint  of  quantum 
logic,  what  matters  about  the  doughnuts  is  that  they  are  loops.  Their  ‘thickness’  is  required  for 
fault-tolerance. 

We now briefly explain how the above definition of a defect as a set of missing cluster qubits fits
with the measurement pattern (7). Formally, each defect $d$ will be assigned a set $D(d)$ of locations
on the cluster. This set is subdivided into a set of edge qubits and a set of face qubits, $D_1(d)$ and $D_2(d)$.
Here, the notions of ‘edge’ and ‘face’ are in reference to the lattice the defect belongs to. If the
defect is primal (dual) then the edges and faces are taken with respect to the primal (dual) lattice.
For primal defects, the sets $D_1(d)$ and $D_2(d)$ are defined as

$$D_1(d) := d, \qquad D_2(d) := \{ f \in B(C_2) \,|\, \{\partial f\} \cap d = \{\partial f\} \}. \qquad (18)$$

For dual defects, replace $f$ by $\overline{f}$ and $C_2$ by $\overline{C}_2$ in the above definition. The whole defect region $D$
splits into an edge part $D_1$ and a face part $D_2$, $D = D_1 \cup D_2$, where

$$D_1 = \bigcup_d D_1(d), \qquad D_2 = \bigcup_d D_2(d). \qquad (19)$$

Now the measurement pattern (7) becomes understandable: the edge qubits in the defects are measured in the Z-basis, which effectively removes them from the cluster [10]. In this way, the quantum state on the exterior of the defect becomes disentangled from the state with support on the interior of the defect. Thereby, a defect is created in the cluster lattice. Note that the qubits on faces whose entire boundary is in $D_1(d)$ become disentangled individually. If no errors were present we could leave these qubits alone. However, their measurement in the X-basis provides additional syndrome and so it is advantageous to measure them.
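As a minimal sketch of the face part of a defect, the following assumes the lattice is given as a hypothetical dictionary `face_boundaries` mapping each face to the set of edges in its boundary. The condition that the whole boundary of a face lies in $d$ is then a subset test.

```python
def face_qubits_of_defect(defect_edges, face_boundaries):
    """Faces whose entire boundary consists of defect edges (the set D_2(d))."""
    d = set(defect_edges)
    return {f for f, boundary in face_boundaries.items() if set(boundary) <= d}


# Hypothetical toy data: two faces sharing the edge "e3".
face_boundaries = {
    "f1": {"e1", "e2", "e3", "e4"},
    "f2": {"e3", "e5", "e6", "e7"},
}
defect = {"e1", "e2", "e3", "e4", "e5"}

D1 = set(defect)                                     # edge qubits, measured in the Z-basis
D2 = face_qubits_of_defect(defect, face_boundaries)  # face qubits, measured in the X-basis
print(sorted(D2))  # ['f1']
```

Face `f2` is not part of the defect because only two of its four boundary edges are missing.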

Correlations  and  defects.  In  the  proximity  of  a  primal  defect,  edges  in  the  boundary  of  a  primal 
2-chain  are  removed,  see  (17).  Therefore,  primal  correlations  can  end  in  primal  defects.  Dual  defects 
do  not  remove  primal  edges,  and  thus  primal  correlations  cannot  end  in  dual  defects.  Analogously, 
dual  correlations  can  end  in  dual  defects,  but  not  in  primal  defects. 
Columns: "This object does ..."; rows: "... to this one".

to this one         | dual corr. | primal defect | dual defect | primal err. cy. | dual err. cy.
primal correlation  | nothing    | bound         | repel       | nothing         | affect
dual correlation    |            | repel         | bound       | affect          | nothing
primal defect       |            |               | encircle    | pairwise end    | encircle
dual defect         |            |               |             | encircle        | pairwise end
primal err. cy.     |            |               |             |                 | nothing

Table 1: This table shows who does what to whom. 'A bounds B' is synonymous with 'B ends in A'. The displayed objects do not interact with themselves.

Syndrome and defects. In the presence of a primal defect $d$ the correlations $\{K_{\bar{q}} = K(\partial \bar{q}) \mid \bar{q} \in B(\bar{C}_3) \wedge \{\partial \bar{q}\} \cap d \neq \emptyset\}$ do not commute with the measurements (7), such that the syndrome at the locations

$$D_0 = \{ v \in \partial e \mid e \in d \} \qquad (20)$$

is lost. Note, however, that for each defect $d$ there will be one syndrome bit associated with the defect as a whole. There exists a 2-cycle $\bar{c}_2(d)$, $\{\bar{c}_2(d)\} \subset V$, that wraps around $d$, and $K(\bar{c}_2(d)) = X(\bar{c}_2(d))$. When the qubits in $V$ are measured in the X-basis, this correlation yields an additional syndrome bit. Dual defects act analogously on the dual lattice.

Errors and defects. Because the local syndrome is lost at the surface of a defect, primal error chains can potentially end in primal defects. However, there is a dual correlation $K(\bar{c}_2(d)) = X(\bar{c}_2(d))$ wrapping around a primal defect, and this correlation detects a primal error chain $E(c_1)$ if the number of intersection points between $\bar{c}_2(d)$ and $c_1$ is odd. Thus, primal error chains can pairwise end in primal defects.

Primal error chains cannot end in dual defects, because dual defects do not remove primal syndrome. Similarly, dual error chains can pairwise end in dual defects, and they cannot end in primal defects.

The  relations  among  cluster  state  correlations,  errors  and  defects  are  summarized  in  Tab.  1. 

5.2  Homological  and  physical  equivalence 

We  have  so  far  identified  physical  objects — correlations  and  errors — with  chains  of  a  chain  complex. 
In  this  section  we  point  out  that  it  is  the  homology  class  of  the  chain  rather  than  the  chain  itself 
which  characterizes  the  respective  physical  object.  The  equivalence  of  two  chains  under  relative 
homology  implies  the  physical  equivalence  of  the  corresponding  physical  operators. 

1. Cluster state correlations. We regard two cluster state correlations $K(c_2)$, $K(c_2')$ as physically equivalent if they yield the same stabilizer element for the state $|\Psi_{\text{algo}}\rangle_S$ after the measurement of the qubits in $V$ and $D$. This requires two things. First, $K(c_2)$ and $K(c_2')$ need to be simultaneously measurable. With $O_a$ the locally measured observables (7) we require $[K(c_2), O_a] = 0\ \forall a \in V \cup D \Longleftrightarrow [K(c_2'), O_a] = 0\ \forall a \in V \cup D$. Second, the two operators must agree on $S$, $K(c_2)|_S = K(c_2')|_S$. Then, the following statement holds: If $c_2' \cong_r c_2$ w.r.t. $(V \cup D, D)$ then $K(c_2') \cong K(c_2)$.

Proof: There exist $c_3 \in C_3$ with $\{\partial c_3\} \subset V \cup D_2$ and $\gamma_2 \in C_2$ with $\{\gamma_2\} \subset D_2$ such that $c_2' = c_2 + \partial c_3 + \gamma_2$.

1. Simultaneous measurability: $K(\gamma_2) = \left(\bigotimes_{a \in D_2(\gamma_2)} X_a\right)\left(\bigotimes_{b \in D_1(\gamma_2)} Z_b\right)$, where $D_1(\gamma_2) \subset D_1$ and $D_2(\gamma_2) \subset D_2$. Therefore, with (7), $[K(\gamma_2), O_a] = 0\ \forall a \in V \cup D$ (*). Similarly, $K(\partial c_3) = X(\partial c_3)$ such that, with (7), $[K(\partial c_3), O_a] = 0\ \forall a \in V \cup D$ (**). Since $K(c_2') = K(c_2)K(\partial c_3)K(\gamma_2)$, (*) and (**) imply simultaneous measurability of $K(c_2)$ and $K(c_2')$ on $V \cup D$.

2. Same restriction to $S$: $K(\partial c_3)$ and $K(\gamma_2)$ do not act on $S$, hence $K(c_2)|_S = K(c_2')|_S$. □
2. Errors. Two errors $E(c_1)$ and $E(c_1')$ are physically equivalent if they cause the same damage to the computation. That is, they have the same logical effect and leave the same syndrome. Then, the following statement holds: If $c_1' \cong_r c_1$ w.r.t. $(V \cup D, D)$ then $E(c_1') \cong E(c_1)$.

Proof: There exist $c_2 \in C_2$, $\gamma_1 \in C_1$, with $\{c_2\} \subset V \cup D_2$, $\{\gamma_1\} \subset D_1$, such that $c_1' = c_1 + \partial c_2 + \gamma_1$. Now, a Pauli spin flip error X is absorbed in a subsequent X-measurement and has no effect on the computation, $X \cong I$. Thus, with (7), $X_a \cong I_a$ for all $a \in V \cup D_2$. Similarly, $Z_b \cong I_b$ for all $b \in D_1$. Then, $E(c_1 + \partial c_2 + \gamma_1) = E(c_1)Z(\partial c_2)Z(\gamma_1) = E(c_1)K(c_2)X(c_2)Z(\gamma_1) \cong E(c_1)$. □

6  Constructive  techniques 

The purpose of the measurements in $V$ and $D$ is to create on $S$ the Reed-Muller-encoded algorithm-specific resource $|\Psi_{\text{algo}}\rangle_S$ described in Section 4. In Section 6.1, we specify the location of S-qubits with respect to the lattice defects and then, in Section 6.2, we give a construction for a topologically protected circuit providing $|\Psi_{\text{algo}}\rangle_S$.

6.1 Location of the S-qubits

The S-qubits have very particular locations within the cluster. Besides the doughnut-shaped defects that we have already introduced, the cluster also supports defects shaped like ear clips. The opening of these ear clip defects is only one cluster qubit wide. If this one cluster qubit were a defect qubit ($q \in D$) too, the ear clips would become doughnuts. But the particular cluster qubit is not in $D$; it is an S-qubit. The situation is displayed graphically in Fig. 4a.

The appropriate stabilizer generators among the S-qubits are induced from the cluster state correlations associated with relative 2-cycles, by measurement of the V- and D-qubits. Like everything else in this computational scheme, the S-qubits occur in the two kinds 'primal' and 'dual'. We call an S-qubit $q$ primal, $q \in S_p$, if it lives on a face of the primal lattice, and we call it dual, $q \in S_d$, if it lives on a face of the dual lattice.

We now discuss how primal and dual correlations affect the S-qubits. Consider, for example, the correlation $K(c_2)$ corresponding to a primal relative 2-chain $c_2$. A primal S-qubit at location $q$ may lie within $c_2$, but never in its boundary, $\{\partial c_2\} \cap q = \emptyset$. A dual S-qubit $q'$ may lie in the boundary of a primal 2-chain $c_2$ but never in $c_2$ itself, $\{c_2\} \cap q' = \emptyset$. We therefore conclude that

A primal correlation $K(c_2)$ acts on a primal S-qubit by one of the two Pauli operators $X$, $I$ and on a dual S-qubit by one of the two operators $Z$, $I$. A dual correlation $K(\bar{c}_2)$ acts on a primal S-qubit by one of the two Pauli operators $Z$, $I$ and on a dual S-qubit by one of the two operators $X$, $I$. (21)

For finding the extended relative 2-cycles on the primal lattice, we thus regard $S_p$ as part of $V$ and $S_d$ as part of $D$. Analogously, for finding the extended relative 2-cycles on the dual lattice, we regard $S_p$ as part of $D$ and $S_d$ as part of $V$. In this way, the problem of finding the extended primal 2-cycles on a cluster with S-qubits is reduced to the same problem without S-qubits.

6.2 Creating $|\Psi_{\text{algo}}\rangle$ among the S-qubits

The construction of a topologically protected circuit providing $|\Psi_{\text{algo}}\rangle_S$ proceeds in three steps. First we show that $|\Psi_{\text{algo}}\rangle_S$ is local unitarily equivalent to a bi-colorable graph state, by local Hadamard-transformations. Second, we show how to create a bi-colorable graph state. Third, we take care of the Hadamard-transformations.

Figure 4: a) Location of a singular qubit (dual, held by a primal defect). b,c) The correlations $\bar{K}$, $K$ affect the S-qubit by a Pauli-operator X in (b), and by Z in (c).

Equivalence of $|\Psi_{\text{algo}}\rangle$ to a bi-colorable graph state. Denote by $|\Psi\rangle_{C_2}$ the state after the first measurement round (X- and Z-measurements) in a $QC_{\mathcal{C}}$-computation on a cluster state $|\phi\rangle_{C_2}$, cf. Section 4. Denote by $C_X$, $C_Z$ the subsets of $C_2 \setminus Q$ whose qubits are measured in the X- or Z-basis, respectively. Further, denote by $C_{\text{even}}$, $C_{\text{odd}}$ the sets of even and odd qubits in $C_2$ (checkerboard pattern). The state $|\Psi\rangle_{C_2}$ is given by


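$$|\Psi\rangle_{C_2} \sim \left(\bigotimes_{a \in C_X} \frac{I \pm X_a}{2}\right) \left(\bigotimes_{b \in C_Z} \frac{I \pm Z_b}{2}\right) |\phi\rangle_{C_2} \qquad (22)$$

and has the form $|\Psi\rangle_{C_2} = |\Psi_{\text{algo}}\rangle_Q \otimes |\text{rest}\rangle_{C_2 \setminus Q}$. Further, $|\phi\rangle_{C_2}$ is local unitary equivalent to some CSS-state, $|\phi\rangle_{C_2} = \left(\bigotimes_{i \in C_{\text{odd}}} H_i\right) |CSS\rangle_{C_2}$. Then, with (22), $C'_X := (C_X \cap C_{\text{even}}) \cup (C_Z \cap C_{\text{odd}})$ and $C'_Z := (C_Z \cap C_{\text{even}}) \cup (C_X \cap C_{\text{odd}})$,

$$|\Psi\rangle_{C_2} \sim \left(\bigotimes_{i \in C_{\text{odd}}} H_i\right) \left(\bigotimes_{a \in C'_X} \frac{I \pm X_a}{2}\right) \left(\bigotimes_{b \in C'_Z} \frac{I \pm Z_b}{2}\right) |CSS\rangle_{C_2} = \left(\bigotimes_{i \in C_{\text{odd}}} H_i\right) |CSS'\rangle_{C_2}.$$

Thus, also the state $|\Psi_{\text{algo}}\rangle$ is l.u. equivalent to a CSS-state,

$$|\Psi_{\text{algo}}\rangle_Q = \left(\bigotimes_{i \in C_{\text{odd}} \cap Q} H_i\right) |CSS''\rangle_Q. \qquad (23)$$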


Now, we consider the concatenated-Reed-Muller-encoded resource $|\Psi_{\text{algo}}\rangle_S$, which may be obtained from the bare state $|\Psi_{\text{algo}}\rangle_Q$ via encoding, $|\Psi_{\text{algo}}\rangle_S = \left(\bigotimes_{q \in Q} \mathrm{Enc}_q\right) |\Psi_{\text{algo}}\rangle_Q$. The encoding procedure Enc takes every qubit $q \in Q$ to a set $S(q)$ of qubits, $\bigcup_{q \in Q} S(q) = S$. It has the property that $\mathrm{Enc}_q \circ H_q = \left(\bigotimes_{i \in S(q)} H_i\right) \circ \overline{\mathrm{Enc}}_q$. Therein, $\overline{\mathrm{Enc}}$ is an encoding procedure for the code conjugated to the Reed-Muller code, i.e., for the code with the X- and the Z-block of the stabilizer interchanged. This code is of CSS type, like the Reed-Muller code itself. The encoding procedure changes when passing through the Hadamard-gate because the encoded Hadamard-gate is not local for the Reed-Muller quantum code.

At any rate, the state $|\Psi_{\text{algo}}\rangle_S$ is l.u. equivalent to a CSS-state encoded with CSS-codes, i.e., to a larger CSS-state, $|\Psi_{\text{algo}}\rangle_S = \left(\bigotimes_{q \in C_{\text{odd}} \cap Q} \bigotimes_{i \in S(q)} H_i\right) |CSS'''\rangle_S$. Every CSS-state is l.u. equivalent to a bi-colorable graph state [23], by a set of local Hadamard-transformations. Thus, we finally obtain

$$|\Psi_{\text{algo}}\rangle_S = \bigotimes_{i \in S_H} H_i\, |\Gamma\rangle_S. \qquad (24)$$

Therein, $S_H$ is some subset of $S$ and $|\Gamma\rangle_S$ is a bi-colorable graph state with adjacency matrix of the corresponding graph

$$\Gamma = \begin{pmatrix} 0 & G^T \\ G & 0 \end{pmatrix}. \qquad (25)$$


Figure 5: Creation of an arbitrary bi-colorable graph state. The singular qubits are displayed as red circles. $G$ denotes the sub-matrix of the adjacency matrix $\Gamma$ which encodes the edges between primal and dual vertices.


Circuit for a bi-colorable graph state. The circuit layout for the topologically protected creation of an arbitrary bi-colorable graph state is shown in Fig. 5. The circuit consists of a set of horizontal primal and a set of vertical dual defects. Each primal defect comes close to each dual defect once, and the two defects may be linked in that region. The form of this junction is decided by the corresponding element of the graph state adjacency matrix $G$. If $G_{ij} = 1$ then the defects are linked and otherwise they are not. In addition, each of the loop defects is linked with an ear-clip shaped defect which holds an S-qubit. The graph state in question is formed among these qubits after the remaining qubits have been measured.

An explanation of the circuit in Fig. 5, for the specific example of a line graph, is given in Fig. 6. From this example it should be clear how the circuit works in general. For the line graph we have the adjacency sub-matrix

$$G_{\text{line}} = \begin{array}{c|ccc} & a & c & e \\ \hline b & 1 & 1 & 0 \\ d & 0 & 1 & 1 \end{array}. \qquad (26)$$

This implies, for example, that in the circuit of Fig. 6a the dual defect winding around the dual S-qubit $a$ will be linked with the primal defect winding around the primal S-qubit $b$. Now we explain how the stabilizer element $K_b = Z_a X_b Z_c$ for the graph state $|\Gamma_{\text{line}}\rangle$ emerges. The other stabilizer generators emerge in the same way.
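The rule "link the defects where the sub-matrix has a 1" and the resulting graph-state stabilizers can be sketched as follows. This is an illustration only: the function name and the list-of-lists encoding of $G$ are assumptions, with rows labelling primal S-qubits and columns labelling dual S-qubits as in the line-graph example.

```python
def links_and_stabilizers(G, primal, dual):
    """Derive defect links and graph-state stabilizer generators from G.

    G[i][j] == 1 means primal defect i is linked with dual defect j, i.e. the
    graph has an edge between primal vertex primal[i] and dual vertex dual[j].
    """
    links = [(p, d) for i, p in enumerate(primal)
             for j, d in enumerate(dual) if G[i][j] == 1]
    neighbors = {v: [] for v in primal + dual}
    for p, d in links:
        neighbors[p].append(d)
        neighbors[d].append(p)
    stabilizers = {}
    for v in primal + dual:
        paulis = {v: "X"}                               # X on the vertex itself
        paulis.update({n: "Z" for n in neighbors[v]})   # Z on each neighbor
        stabilizers[v] = " ".join(f"{paulis[q]}_{q}" for q in sorted(paulis))
    return links, stabilizers


# Line graph a - b - c - d - e: rows are (b, d), columns are (a, c, e).
G_line = [[1, 1, 0],
          [0, 1, 1]]
links, stabs = links_and_stabilizers(G_line, primal=["b", "d"], dual=["a", "c", "e"])
print(links)       # [('b', 'a'), ('b', 'c'), ('d', 'c'), ('d', 'e')]
print(stabs["b"])  # Z_a X_b Z_c
```

Each entry of `links` corresponds to one junction in the circuit where a primal and a dual defect loop are linked.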

Consider the relative 2-cycle $c_2(b)$ and imagine it being built up step by step. We start around the S-qubit $b$. Because $c_2(b)$ is primal and $b$ is primal, $K(c_2(b))$ affects qubit $b$ by a Pauli-operator X, see Eq. (21). Also, $c_2(b)$ is bounded by the primal defect encircling $b$.

We move further to the left. At some point, $c_2(b)$ approaches a dual defect. Primal correlations cannot end in dual defects. Therefore, $c_2(b)$ bulges out and forms a tube wrapping around the dual defect, leading downwards. It ends in the primal defect holding the dual S-qubit $c$. $K(c_2(b))$ affects qubit $c$ by a Pauli-operator Z. Back at the junction, $c_2(b)$ continues to expand to the left. It approaches a second dual defect where it forms another tube. As a result, the dual S-qubit $a$ is affected by a Pauli-operator Z. Further to the left, the primal defect closes up and bounds $K(c_2(b))$. $K(c_2(b))$ takes the form $X_b Z_a Z_c \bigotimes_{i \in V(b)} X_i$ for some set $V(b) \subset V$. All qubits in $V$ are measured in the X-basis (7) such that after these measurements the correlation $K_b = \pm Z_a X_b Z_c$ remains. This is a stabilizer generator for the graph state in Fig. 6b since (a, b) and (c, b) are the only edges of the line graph ending in the vertex b. For every edge in the graph there is a link among defect loops in the circuit.

Figure 6: Explanation of the circuit in Fig. 5 for a particular example, the line graph. a) The relative 2-cycle $c_2(b)$ which gives rise to the graph-state stabilizer $K_b = Z_a X_b Z_c$. b) Graph corresponding to the created graph state.

The proof for the programmable circuit of Fig. 5 realizing a general bi-colorable graph state is similar. As an outline, each stabilizer generator is associated with a doughnut-shaped defect in the circuit. Such a defect bounds a correlation, and this correlation affects one S-qubit by X. Further, because the considered defect is linked with other defects of the opposite kind, the correlation surface forms tubes. These tubes affect one other S-qubit each (the neighbors of the first) by Pauli-operators Z.


Implementing the local Hadamard-transformations. The equivalence between the graph state $|\Gamma\rangle_S$ and the encoded algorithm-specific resource $|\Psi_{\text{algo}}\rangle_S$ is by Hadamard-transformations on some subset $S_H$ of S-qubits. Wherever such a Hadamard-transformation needs to be applied, attach an extra loop to the circuit in Fig. 5,

[Eq. (27): diagram of the extra defect loop; graphic not recoverable]


7  Error  sources,  error  correction  and  fault-tolerance  threshold 

7.1  Two  error  models 

Below  we  describe  two  error  models,  a  simple  one  and  a  more  realistic  one. 

Error Model 1. The cluster state is created only on those cluster qubits which are needed for the computation, i.e., on $V \cup S$. The defect qubits of $D$ are left out.

1. The noise is described by independent partially depolarizing channels acting on each cluster qubit. The noisy state $\rho_{\mathcal{C}}$ is given by $\rho_{\mathcal{C}} = \left(\bigotimes_{a \in \mathcal{C}} T_a^{(1)}(p_1)\right) |\phi\rangle_{\mathcal{C}}\langle\phi|$, with

$$T_a^{(1)}(p_1) = (1 - p_1)[I_a] + \frac{p_1}{3}\left([X_a] + [Y_a] + [Z_a]\right). \qquad (28)$$

2. The classical computation for syndrome processing is instantaneous.
The reason for considering this error model first is its simplicity. We would like to separate the intricacies inherent in the presented error-correction scheme from additional difficulties incurred by a realistic error model. The basic justification for such an approach is this: The error correction used here is topological. Therefore, a threshold should exist regardless of whether independent errors are strictly local or only local in the sense of having a support of bounded size.

The most straightforward method to create a cluster state is from a product state $\bigotimes_a |+\rangle_a$ via a (constant depth) sequence of $\Lambda(Z)$-gates [10]. If these gates are erroneous, then Error Model 1 does not apply in general.¹ Specifically, one may raise the following objections to Error Model 1:

•  No correlated errors are included in Error Model 1. Creating the cluster state via a sequence of gates will, however, lead to correlated errors in the output cluster state.

•  Storage errors accumulate in time. There is temporal order among the measurements such that the computation takes a certain time $t_{\text{comp}}$ which cannot be bounded by a constant for all possible computations. As a consequence, for the qubits measured in the final round the local noise rate increases monotonically with $t_{\text{comp}}$ and eventually exceeds the error threshold.

•  To leave the D-qubits out is a deviation from the originally envisioned setting: the cluster state on $S \cup V$ is algorithm-specific.²
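The second objection can be made quantitative with a minimal sketch. It assumes that storage noise is a depolarizing channel of the form $(1-p)[I] + \frac{p}{3}([X]+[Y]+[Z])$ applied once per time step; the step rate and threshold used below are hypothetical numbers chosen only for illustration.

```python
def accumulated_depolarizing_rate(p_step, t):
    """Effective depolarizing probability after t identical storage steps.

    One step shrinks the Bloch vector by (1 - 4p/3), so t steps compose to a
    depolarizing channel whose probability is returned here.
    """
    return 0.75 * (1.0 - (1.0 - 4.0 * p_step / 3.0) ** t)


def steps_until_exceeds(p_step, threshold, t_max=10**6):
    """Smallest number of waiting steps after which the rate exceeds threshold."""
    for t in range(1, t_max + 1):
        if accumulated_depolarizing_rate(p_step, t) > threshold:
            return t
    return None  # not reached within t_max steps


p_s = 1e-3        # hypothetical storage error per time step
threshold = 1e-2  # hypothetical tolerable local error rate
print(steps_until_exceeds(p_s, threshold))
```

The accumulated rate grows monotonically with the waiting time and saturates at 3/4, so any fixed threshold below that value is crossed after finitely many steps.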

To  account  for  these  inadequacies,  we  consider  a  second  error  model. 

Error Model 2. A cluster state on a bcc-symmetric lattice is created in four steps of nearest-neighbor $\Lambda(Z)$-gates. The gate sequence is as shown in Fig. 7. Errors occur due to the erroneous preparation of the initial $|+\rangle$-qubits, erroneous $\Lambda(Z)$-gates in the process of creating the cluster state, storage and measurement.

1. The computation is split up into steps which are performed on sub-clusters $C_k$. In each step, unmeasured qubits remaining from the previous step (the hand-over qubits) are loaded into a cluster state on a sub-cluster. Subsequently, all but a few cluster qubits (the new hand-over qubits) are measured. The steps have their temporal depth adjusted such that each qubit, after being locally prepared and entangled, waits at most a constant number $t_0$ of time steps until its measurement occurs, $t_0 \geq 1$. Error in storage is described by partially depolarizing noise with error probability $p_S$ per time step.

2. The erroneous preparation of initial $|+\rangle$-qubits is modeled by the perfect procedure followed by local depolarizing noise (28), with probability $p_P$. Measurement is described by perfect measurement preceded by partially depolarizing noise with error probability $p_M$. The erroneous $\Lambda(Z)$-gates are modeled by the perfect gate followed by a 2-qubit depolarizing channel

$$T_{e,f}^{(2)}(p_2) = (1 - p_2)[I_{e,f}] + \frac{p_2}{15}\left([I_e \otimes X_f] + \ldots + [Z_e \otimes Z_f]\right). \qquad (29)$$

3. Classical syndrome processing is instantaneous.

1 There may be situations in which Error Model 1 is in fact a good approximation. For example, consider a scenario in which the cluster state is purified before being measured for computation. Of course, the gates in a purification protocol would be erroneous, too, such that the purified state is not perfect. In effect, the errors of the initial state are replaced by the errors of the purification protocol. There exist purification protocols [24] in which the gates act transversally on two copies of the cluster state (one of which is subsequently measured). As a result, the errors introduced by the purification are approximately local, as in Error Model 1. The purification protocol [24] in its current form has a problem of its own, though; due to the exponentially decreasing efficiency of post-selection, it is not scalable in the size of the state. But chances are that this can be repaired.

2 For the creation of the cluster state this makes little or no difference: fewer gate operations are needed than for the creation of $|\phi\rangle_{\mathcal{C}}$. For parallelized procedures that make use of the translation invariance of $\mathcal{C}$, it should not be too difficult to remove the superfluous D-qubits from the lattice before the remaining qubits are entangled.
Figure 7: Steps of $\Lambda(Z)$-gates for the creation of a cluster state on a bcc-symmetric lattice, for the sub-clusters $C_k$. For $k$ odd, the sequence is 1 → 2 → 3 → 4, and for $k$ even it is 3 → 4 → 1 → 2.


In the subsequent sections we compute a fault-tolerance threshold for both error models. In Error Model 2, storage error will cause the minimum damage for the smallest possible value of $t_0$, which is $t_0 = 1$. In this case, each sub-cluster carries a subset of S-qubits with no mutual temporal dependence. All qubits in $V$ and $D$, except the hand-over qubits (see Appendix A), are measured immediately after being entangled. The measurement of the S-qubits has to wait one time step. Henceforth we set $t_0 = 1$.

7.2  Methods  for  error  correction  and  the  threshold  value 

There are different methods of error correction associated with the different regions $V$, $S$ and $D$ of the cluster. In $D$, there are as many inequivalent errors as there are syndrome bits, such that the error correction is trivial. The error correction in $V$ is based on the random plaquette $Z_2$-gauge model in three dimensions [18]. The error correction in $S$ is carried out using the (concatenated) quantum Reed-Muller code.

Error correction in D. In the domain $D(d)$ of the defect only Z-errors matter for face qubits and only X-errors matter for edge qubits. Any other errors may be absorbed into the subsequent measurements (7). The X-errors on edges may be relocated to Z-errors on the neighboring face qubits via $X_e \cong K_e X_e$, such that we need to consider Z-errors on face qubits only. For each face in $d$ we learn one syndrome bit, yielding a unique syndrome for each error configuration.

However, if an error $X_e$ on an edge qubit $e$ in the surface of defect $d$ is relocated, the equivalent error $K_e X_e$ may partially be outside $D(d)$. After error correction in $D(d)$, an individual Z-error on a face qubit in $V$ is left next to $D(d)$. This affects the error correction in $V$ near $D$; see below.

Error correction in V. First consider a scenario where the entire cluster consists of the region $V$ (i.e., there are no defects and no singular qubits). Error correction on the primal lattice $\mathcal{L}$ and the dual lattice $\bar{\mathcal{L}}$ run separately. Here we consider error correction on the primal lattice only; error correction on the dual lattice is analogous.

The error chains live on the edges of the lattice $\mathcal{L}$ and leave a syndrome at the end points, which are vertices of $\mathcal{L}$. This is exactly the scenario which has been considered for topological quantum memory in [18], and the results of [18] that we need in the present context are briefly summarized below. The connection between topological error correction and cluster states has been made in [14] for the purpose of creating long-range entanglement in the presence of noise.

Given a particular syndrome and an error chain $E(c_1)$ compatible with this syndrome, we are interested in the total probability $P(c_1)$ of the homology class of $c_1$,

$$P(c_1) = \sum_{z_1 \in Z_1} p(c_1 + z_1), \qquad (30)$$

where $p(c_1')$ is the probability of an individual error chain $E(c_1')$, and the sum is over all 1-cycles.
For error correction we infer that the physical error which occurred was from the homology class with the largest probability.

If the errors on the lattice edges occur independently with a probability $q$ then the problem of computing $P(c_1)$ for a given chain $c_1$ can be mapped onto a problem from statistical mechanics, namely the random plaquette $Z_2$-gauge model in three dimensions [18]. The crossover from high fidelity error correction at small error rates to low fidelity error correction at high error rates corresponds to a phase transition in this model. A numerical estimate of the critical error rate is $q_c = 0.033 \pm 0.001$ [25].

As far as is known, the classical operational resources required to find the most likely error homology class consistent with a given syndrome scale exponentially in the number of error locations. The assumption of the classical processing being instantaneous cannot be justified under these conditions. However, it is possible to trade threshold value for efficiency in the error correction procedure. A reasonable approximation to the maximum probability for a homology class of errors is the probability of the lowest weight admissible chain. The minimum-weight perfect matching algorithm [26, 27] computes this chain using only polynomial operational resources. A numerical estimate of the threshold with this algorithm for error correction is $q_c' = 0.0293 \pm 0.0002$ [28].
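To make the notion of a lowest-weight admissible chain concrete, here is a small sketch that pairs up syndrome locations so that the total connecting length is minimal. It is deliberately an exhaustive search with exponential cost, used as a stand-in for the polynomial-time matching algorithm of [26, 27]; the Manhattan-distance weight, the absence of boundaries and defects, and the sample coordinates are all simplifying assumptions.

```python
def min_weight_pairing(points):
    """Pair up syndrome points, minimizing the total Manhattan distance.

    Exhaustive recursion: match the first point with every possible partner
    and solve the remainder. Only suitable for a handful of points.
    """
    if len(points) % 2:
        raise ValueError("syndrome points must come in pairs")
    if not points:
        return 0, []

    def dist(a, b):
        return sum(abs(x - y) for x, y in zip(a, b))

    first, rest = points[0], points[1:]
    best = None
    for i, partner in enumerate(rest):
        weight, pairs = min_weight_pairing(rest[:i] + rest[i + 1:])
        weight += dist(first, partner)
        if best is None or weight < best[0]:
            best = (weight, [(first, partner)] + pairs)
    return best


# Hypothetical syndrome: two well-separated pairs of flipped vertices.
syndrome = [(0, 0, 0), (0, 0, 1), (3, 3, 3), (3, 4, 3)]
weight, pairs = min_weight_pairing(syndrome)
print(weight)  # 2
print(pairs)   # [((0, 0, 0), (0, 0, 1)), ((3, 3, 3), (3, 4, 3))]
```

The returned pairs indicate which syndrome locations are joined by a correction chain; the chain itself is any shortest lattice path between the two points of a pair.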

Remark:  The  topological  error  threshold  is  estimated  in  numerical  simulations  of  finite-size 
systems.  For  this  purpose,  the  probability  of  logical  error  is  plotted  vs.  the  physical  error  parameter 
for  various  system  sizes.  For  sufficiently  large  lattices  (such  that  finite-size  effects  are  small),  we 
expect  these  curves  to  follow  a  universal  scaling  ansatz  near  the  threshold  such  that  they  share  a 
common  intersection  point  and  their  slopes  are  proportional  to  a  common  power  of  the  lattice  size. 
As  the  system  size  is  increased  to  infinity,  we  then  expect  the  curves  to  approach  a  step  function 
which  transitions  at  the  threshold  value  of  the  physical  error  parameter. 


The above quoted threshold value is for independent errors on the edges of $\mathcal{L}$. Do the models for the physical error sources of Section 7.1 lead to such independent errors? The answer is 'yes' for Error Model 1 and 'no' for Error Model 2. For the latter, we need to consider a modified random plaquette gauge model (RPGM) with correlated errors among next-to-nearest neighbors. Specifically, for Error Model 1 the relation between the local rate $p_1$ of the physical depolarizing error and the error parameter $q$ that shows up in the RPGM is

$$q = \frac{2}{3}\, p_1, \quad \text{for Error Model 1.} \qquad (31)$$

Given a threshold of $q_c = 2.93\%$ [28] for error correction via the minimum weight perfect matching algorithm in the bulk, the corresponding depolarizing error rate that can be tolerated is

$$p_{1,c} = 4.4 \times 10^{-2} \quad (\text{in } V). \qquad (32)$$
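The conversion between the two rates can be checked with a few lines. The sketch assumes the depolarizing channel assigns probability $p_1/3$ to each of X, Y and Z, and that only Y and Z flip the outcome of an X-basis measurement, which gives $q = 2p_1/3$; the function names are illustrative.

```python
def rpgm_error_rate(p1):
    """Edge error probability q seen by the decoder for depolarizing rate p1.

    Of the three equally likely Paulis, only Y and Z change an X-measurement
    outcome, hence the factor 2/3.
    """
    return 2.0 * p1 / 3.0


def tolerable_depolarizing_rate(q_c):
    """Invert q = 2 p1 / 3 to obtain the depolarizing rate at threshold."""
    return 3.0 * q_c / 2.0


q_c = 0.0293  # matching-based threshold quoted in the text [28]
print(f"{tolerable_depolarizing_rate(q_c):.1e}")  # 4.4e-02
```

The result reproduces the tolerable depolarizing rate of $4.4 \times 10^{-2}$ stated for the bulk region.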


For Error Model 2, first consider the case where only the $\Lambda(Z)$-gates are erroneous, $p_P = p_S = p_M = 0$. Then, in addition to local errors with a rate $q_1$ there exist correlated errors with error rate $q_2$ for each pair of opposite edges in all faces of $\mathcal{L}$. That is, with a probability $q_2$ simultaneous errors are introduced on opposite edges of the faces in $\mathcal{L}$. The local noise specified by $q_1$ and the two-local noise specified by $q_2$ are independent processes. The relations between the error parameter $p_2$ of the $\Lambda(Z)$-gates and the parameters $q_1$, $q_2$ of the RPGM with correlated errors are

$$q_1 = \frac{32}{15}\, p_2 \left(1 - \frac{8}{15} p_2\right) \left(\frac{64}{225}\, p_2^2 + \left(1 - \frac{8}{15} p_2\right)^2\right),$$
$$q_2 = \frac{1}{2}\left[1 - \left(1 - \frac{4}{15} p_2\right)^2\right] = \frac{4}{15}\, p_2 + O(p_2^2). \qquad (33)$$

The correlation of errors on sites separated by a distance of two arises through error propagation in the creation of the cluster state. Correlations among errors on next-neighboring sites play no role because such errors live on different lattices ($\mathcal{L}$ and $\bar{\mathcal{L}}$) and are corrected independently.
Figure 8: Threshold estimation in lattices of finite size, for periodic boundary conditions. Here, with $\Lambda(Z)$-gates as the only error source, we find a threshold of $p_{2,c} = 9.6 \times 10^{-3}$. In the inset, best fits to the universal scaling ansatz are drawn. Error bars denote two standard deviations due to finite sampling size.


The only effect of $p_P, p_S, p_M > 0$ is an enhanced local error rate $q_1$. We give the relations to leading order only; they read

$$q_1 = \frac{32}{15}\, p_2 + \frac{2}{3}\left(p_P + p_S + p_M\right), \qquad q_2 = \frac{4}{15}\, p_2, \quad \text{for Error Model 2.} \qquad (34)$$
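The leading-order relations can be evaluated directly. In the sketch below the coefficients 32/15, 2/3 and 4/15 are taken as read from Eq. (34); the function and argument names are illustrative assumptions.

```python
def rpgm_parameters(p2, p_prep=0.0, p_store=0.0, p_meas=0.0):
    """Leading-order local rate q1 and two-local rate q2 for Error Model 2."""
    q1 = 32.0 / 15.0 * p2 + 2.0 / 3.0 * (p_prep + p_store + p_meas)
    q2 = 4.0 / 15.0 * p2
    return q1, q2


# With perfect gates (p2 = 0) the noise is purely local, so q1 can be compared
# with the independent-error matching threshold q = 0.0293 quoted in the text.
q1, q2 = rpgm_parameters(0.0, p_prep=1.46e-2, p_store=1.46e-2, p_meas=1.46e-2)
print(f"q1 = {q1:.4f}, q2 = {q2:.4f}")  # q1 = 0.0292, q2 = 0.0000

# Gate errors only: both a local and a correlated contribution appear.
q1_gate, q2_gate = rpgm_parameters(0.96e-2)
print(f"q1 = {q1_gate:.4f}, q2 = {q2_gate:.4f}")
```

For vanishing gate error the local rate at $p_P = p_S = p_M = 1.46 \times 10^{-2}$ comes out close to the independent-error threshold, as expected when no correlated errors are present.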


See Fig. 8 for a simulation of error correction under faulty Λ(Z)-gates as the only error source, which gives rise to correlated noise between neighboring edge qubits. If we define x = (p_2 − p_{2,c}) L^{1/ν_0}, then the universal scaling ansatz states that the fidelity F should be a function of the scaling parameter x alone in the vicinity of the threshold [28]. We find very good agreement (with R^2 > 0.9991) for F = A + Bx + Cx^2, where we fit for the constants A, B, C, p_{2,c} and ν_0. This gives very tight bounds on the critical probability p_{2,c} = 9.6 × 10^{-3}. Interestingly, we also find ν_0 = 1.00 ± 0.02, which indicates that this model belongs to the same universality class as the purely local error model of the 3D-RPGM [28].
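The scaling ansatz can be illustrated with a minimal sketch. It assumes the quadratic form F = A + Bx + Cx^2 with x = (p_2 − p_{2,c}) L^{1/ν_0} quoted above; the fit constants A, B, C below are made-up illustration values, not the fitted ones, and the function names are hypothetical.

```python
def scaling_variable(p2, L, p2c, nu0):
    """Scaling parameter x = (p2 - p2c) * L**(1/nu0)."""
    return (p2 - p2c) * L ** (1.0 / nu0)


def fidelity(p2, L, A, B, C, p2c, nu0):
    """Quadratic scaling ansatz F = A + B*x + C*x**2."""
    x = scaling_variable(p2, L, p2c, nu0)
    return A + B * x + C * x * x


# p2c and nu0 as quoted in the text; A, B, C are illustrative assumptions.
P2C, NU0 = 9.6e-3, 1.0
A, B, C = 0.75, -2.0, 1.5

# At p2 = p2c the ansatz does not depend on L: all curves cross in one point.
crossing = [fidelity(P2C, L, A, B, C, P2C, NU0) for L in (8, 12, 16)]

# Away from the threshold, data points with equal x collapse onto each other.
f_small = fidelity(P2C + 1.0e-3, 8, A, B, C, P2C, NU0)
f_large = fidelity(P2C + 0.5e-3, 16, A, B, C, P2C, NU0)
```

With ν_0 = 1, halving the distance to threshold while doubling L leaves x unchanged, which is why `f_small` and `f_large` agree. This is the data collapse that the inset of Fig. 8 tests.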

In Fig. 9 the threshold trade-off curve between p_P + p_S + p_M and p_2 is displayed. Numerically, we obtain for the thresholds in the bulk

p_P = p_S = p_M = 1.46 × 10^{-2}, for p_2 = 0,
p_2 = 0.96 × 10^{-2}, for p_P = p_S = p_M = 0,
p_i = 0.58 × 10^{-2}, for p_P = p_S = p_M = p_2. (35)
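As a consistency check, the bulk thresholds (35) can be translated into the RPGM parameters. The sketch below assumes that the leading-order relations (34) read q_1 = (32/15) p_2 + (2/3)(p_P + p_S + p_M) and q_2 = (4/15) p_2; the function name is hypothetical.

```python
def rpgm_rates(p2, pP, pS, pM):
    """Leading-order local (q1) and two-local (q2) error rates, Eq. (34)."""
    q1 = 32.0 / 15.0 * p2 + 2.0 / 3.0 * (pP + pS + pM)
    q2 = 4.0 / 15.0 * p2
    return q1, q2


# The three bulk threshold points quoted in (35).
local_only = rpgm_rates(0.0, 1.46e-2, 1.46e-2, 1.46e-2)
gates_only = rpgm_rates(0.96e-2, 0.0, 0.0, 0.0)
all_equal = rpgm_rates(0.58e-2, 0.58e-2, 0.58e-2, 0.58e-2)
```

For purely local noise (p_2 = 0) the first point gives q_1 ≈ 2.9 × 10^{-2} and q_2 = 0, the bulk value of the purely local model quoted elsewhere in the text. With correlated errors present, the critical q_1 is lower.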


Error correction in V near D. In the presence of defects there are two modifications to error processes in V. First, the length scale for the minimum extension of a non-trivial error cycle shrinks. Second, there is a surface effect; the effective error rate for qubits in V next to the surface of defects is enhanced by a constant factor.

Figure 9: Trade-off curve for the threshold value in the presence of local and two-local errors. Horizontal axis: two-qubit gate error p_2; vertical axis: local error rate p_P + p_S + p_M.


1. Length scale for non-trivial errors: For comparison, consider a cluster cube of finite size 2L × 2L × 2L. A non-trivial error cycle must stretch across the entire cube and thus has a weight of at least L. The lowest weight errors which are misinterpreted by the error correction procedure occur with a probability q^{L/2}. The total error probability incurred by such errors may therefore be expected to decrease exponentially fast in L, which is confirmed in numerical simulations [14].

In  the  presence  of  defects,  the  dominant  sources  for  logical  error  are  1-chains  that  either  wind 
around  a  defect  or  that  begin  and  end  in  a  defect  and  intersect  a  correlation  surface  (2-chain)  in 
between;  see  Fig.  10.  The  relevant  length  scales  are  thus  the  thickness  (circumference)  and  the 
diameter  of  the  defects.  They  are  much  smaller  than  L. 

Specifically, consider a defect with circumference u and length l which bounds a correlation surface c_2, such that |{∂c_2}| = l. An error cycle winding around the defect has a weight of at least u + 8 ≈ u, and there are l such minimum weight cycles. Therefore, the probability p_E(u, l) for affecting K(c_2) by an error is, to lowest contributing order,

PE(n’ l) = ‘wiw f'2  ” 1  exp  (^“)  7 =jrn  (36) 

In the range of validity for the above expansion in powers of q, the error is still exponentially suppressed in the relevant length scale u.

2.  Surface  effects:  As  discussed  above,  the  error  level  is  enhanced  for  qubits  in  V  near  the 
surface  of  a  defect.  If  the  defect  is  primal  (dual),  the  enhancement  occurs  on  dual  (primal)  qubits. 
This  effect  will — if  anything — lower  the  threshold.  But  there  is  another  effect:  the  presence  of  the 
defect  changes  the  boundary  conditions.  In  case  of  a  primal  defect,  the  boundary  conditions  on 
the  defect  surface  become  rough  for  the  primal  lattice  and  smooth  for  the  dual  lattice.  Dual  error 
chains  cannot  end  in  a  primal  defect,  as  we  noted  earlier.  For  the  dual  lattice,  there  is  excess 
syndrome  available  at  the  defect  surface.  This  effect  will — if  anything — increase  the  threshold.  Our 
intuition  is  that  neither  effect  has  an  impact  on  the  threshold  value.  The  threshold  should,  if  the 
perturbations  at  the  boundary  are  not  too  strong,  still  be  set  by  the  bulk. 

We have performed numerical simulations for lattices of size L × L × 2L, where half of the lattice belongs to V and the other half to the defect region D. The error rate is doubled near the mutual boundary of the regions and there is no remaining error in D. Simulations are feasible with reasonable effort up to L ≈ 20. We find that finite-size effects (due to the smooth boundary conditions) are still noticeable up to these lattice sizes, but the intersection point of fidelity curves for nearby lattice sizes is slowly converging to a threshold value around that of the bulk (2.9%).

Figure 10: Sources for logical error in the presence of defects.

Figure 11: Topological error correction in the presence of defects. a) Short error cycles intersect with the correlation surfaces and thereby jeopardize the topological error correction. b) Correlated errors on S-qubits are suppressed exponentially in the qubit separation l.

Error correction in V near a junction between D and S. Near an S-qubit there exist relative error cycles of small length, see Fig. 11a, and the topological error correction breaks down. As a result, the effective error on an S-qubit is enhanced by its surroundings. To compute the effective error probabilities, we replace every low-weight error chain E(γ) that results in a logical error after error correction by an equivalent error E_S(γ) acting on the S-qubits. The error correction converts E(γ) into E(c_1(γ)) with c_1(γ) a relative 1-cycle. ‘Equivalent’ means that E(c_1(γ)) and E_S(γ) act in the same way on the stabilizer generators {K_{S,s} | s ∈ S} of the induced state |Ψ_algo⟩_S, i.e., [E(c_1(γ)) E_S(γ), K_{S,s}] = 0 for all s ∈ S. The relevant correlations to check are K(c_2) and K(c̄_2) displayed in Fig. 11a.

It is important to note that the effective error on the S-qubits is local. This arises because only error chains causing a 1-qubit error may have small length. Error chains causing a correlated error on the S-qubits are suppressed exponentially in the qubit separation; see Fig. 11b.

We compute the effective error channel on the S-qubit to first order in the error probabilities only. For Error Model 1 the error enhancement only affects sub-leading orders of p_1,

T_s^{(1)} = (1 − p_1) [I_s] + (p_1/3) ([X_s] + [Y_s] + [Z_s]). (37)

Figure 12: Shape of the defect near an S-qubit (red). a) Three-dimensional view. b) Cross-section through the cluster. Edges which belong to the defect are marked as “x”. Black dashed lines connect neighbors on the cluster. Blue underlay: faces in the defect. Even/odd qubits: ●/○.


For Error Model 2 the effective error channel on an S-qubit is not universal but depends on the precise shape of the defect double-tip near the S-qubit. For our calculation we use the defect shape displayed in Fig. 12. The defect is one-dimensional nearest to the S-qubit, and farther away becomes three-dimensional. The effective error channel then is

Ts  )  =  —  ~Pp  ~  -ps  ~  t^Pm  ~  Y5^21  ^ 5 ]  +  +  T^PS  +  ^s]  + 

/2  1  1  \  /  2  1  1  58  \  (38) 

+  (  -P2  +  -PS  +  ^Pm  )  [Ys\  +  f  -pp  +  -PS  +  -PM  +  —P2  j  [z8\. 

The  individual  contributions  to  (38)  are  listed  in  Appendix  B. 


Error correction in S. The S-qubits are protected by the concatenated Reed-Muller code. This code corrects for the errors (37)/(38) that remain after error correction in V and D.

The S-qubits are all measured in the eigenbases of (X_s ± Y_s)/√2. Then, an X- or a Y-error is equivalent to a Z-error with half the probability. This is easily verified for the case where X- and Y-errors occur with the same probability. W.l.o.g. assume the measurement basis is the eigenbasis of (X_s + Y_s)/√2. Then [X_s] + [Y_s] = [(X_s + Y_s)/√2] + [(X_s − Y_s)/√2] ≅ [I_s] + [Z_s], since (X_s + Y_s)/√2 does not change the measurement statistics and (X_s − Y_s)/√2 = iZ_s (X_s + Y_s)/√2. But the statement is, to leading and next-to-leading order in the error probability, also true when X- and Y-errors do not occur with equal probability; see Appendix C. We may thus convert the X- and Y-errors in (37) and (38) into Z-errors. The corresponding error probability p_Z is

p_Z = (2/3) p_1, for Error Model 1,
p_Z = (76/15) p_2 + (2/3) p_P + (4/3) p_M + (4/3) p_S, for Error Model 2. (39)

The fault-tolerance threshold of the concatenated [15,1,3] Reed-Muller code for independent Z-errors with probability p_Z is 1.09 × 10^{-2}. As we discuss all Reed-Muller error correction to leading order only, we take the leading order estimate

p_{Z,c} = 1/105 ≈ 0.95 × 10^{-2}. (40)

For Error Model 1, from (39) and (40) we obtain the threshold

p_{1,c} = 1/70 ≈ 1.4 × 10^{-2}, (in S). (41)


For Error Model 2 we obtain

p_{P,c} = p_{S,c} = p_{M,c} = 1/350 ≈ 0.29 × 10^{-2}, for p_2 = 0,
p_{2,c} = 1/532 ≈ 0.19 × 10^{-2}, for p_P = p_S = p_M = 0, (in S). (42)
p_{i,c} = 1/882 ≈ 0.11 × 10^{-2}, for p_P = p_S = p_M = p_2.
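The fractions in (41) and (42) follow from dividing the leading-order Reed-Muller threshold p_{Z,c} = 1/105 by the coefficients of the Z-error conversion. The sketch below assumes that (39) reads p_Z = (2/3) p_1 for Error Model 1 and p_Z = (76/15) p_2 + (2/3) p_P + (4/3) p_M + (4/3) p_S for Error Model 2; the helper names are hypothetical.

```python
from fractions import Fraction as F

P_ZC = F(1, 105)  # leading-order Reed-Muller threshold, Eq. (40)

# Coefficients of p_Z for Error Model 2, as assumed from Eq. (39).
COEFF = {"p2": F(76, 15), "pP": F(2, 3), "pM": F(4, 3), "pS": F(4, 3)}


def threshold_model1():
    """Solve (2/3) * p1 = p_Zc for p1."""
    return P_ZC / F(2, 3)


def threshold_model2(active):
    """Threshold when all error sources in `active` have equal strength
    and all other sources are switched off."""
    return P_ZC / sum(COEFF[name] for name in active)


p1c = threshold_model1()                            # Eq. (41)
local_c = threshold_model2(["pP", "pS", "pM"])      # p2 = 0
gate_c = threshold_model2(["p2"])                   # pP = pS = pM = 0
equal_c = threshold_model2(["p2", "pP", "pS", "pM"])
```

Exact rational arithmetic is used so that the results can be compared directly with the fractions 1/70, 1/350, 1/532 and 1/882.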


The  topological  error  correction  in  V  and  the  Reed-Muller  error  correction  in  S  run  separately, 
and  all  that  remains  is  to  check  which  mechanism  breaks  down  first.  By  comparison  of  Equation 
(32)  with  (41)  and  Equation  (35)  with  (42),  we  find  for  both  error  models  that  the  Reed-Muller 
error correction collapses first. It therefore sets the overall threshold. In Error Model 1, the critical error probability for local depolarizing error is 1.4 × 10^{-2}. In Error Model 2, for the case where preparation, gate, storage and measurement errors each have equal strength, the error threshold for the individual processes is 1.1 × 10^{-3}.


8  Overhead 


Denote by N the number of qubits in the algorithm-specific resource state |Ψ_algo⟩_Q. N is also the number of non-Clifford one-qubit rotations in a quantum circuit realizing the algorithm “algo”. What is the number N_ft of cluster qubits required to perform the same computation fault-tolerantly? The qubit overhead factor for Reed-Muller error correction is

O_RM ~ ( log N / log(p_c/p) )^γ, (43)

where γ = log_2 15 ≈ 3.91, and p is the actual and p_c the critical Reed-Muller error rate. The set S consists of |S| = N O_RM qubits.

We need to determine the additional overhead due to topological error correction. We choose a separation r between strands of a defect loop, and a strand diameter of r/2.

The dimensions of the cluster thus are width, depth < (3/2) r N O_RM, height = 4r. The most likely errors then occur with a probability of exp(−κ(p) r), cf. (36), and there are fewer than 3r (N O_RM)^2 locations for them (primal + dual). We require that 3r (N O_RM)^2 exp(−κ(p) r) ≈ 1, such that in the large N limit

r ~ (2/κ(p)) ln N. (44)

Double-logarithmic corrections are omitted. The total number of qubits is given by N_ft = width × depth × height < 9 r^3 (N O_RM)^2, such that

N_ft ~ N^2 (log N)^{3+2γ}. (45)

The overhead is polynomial. The values of the exponents in the above expression may be reduced in more resourceful adaptations of the presented scheme.
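The overhead estimate can be turned into a rough calculator. This is a sketch only: it assumes that (43) has the form O_RM ~ (log N / log(p_c/p))^γ, that the cluster dimensions are width = depth = (3/2) r N O_RM and height = 4r, and it treats the decay constant κ(p) of (36) as a free input. All function names and the numerical inputs are illustrative assumptions.

```python
import math

GAMMA = math.log2(15)  # exponent of the Reed-Muller overhead, Eq. (43)


def reed_muller_overhead(N, p, p_c):
    """Qubit overhead factor O_RM for actual error rate p < p_c."""
    if N <= 1 or not 0.0 < p < p_c:
        raise ValueError("need N > 1 and 0 < p < p_c")
    return (math.log(N) / math.log(p_c / p)) ** GAMMA


def strand_separation(N, kappa):
    """Large-N strand separation r ~ (2/kappa) ln N, Eq. (44)."""
    return 2.0 / kappa * math.log(N)


def cluster_qubits(N, p, p_c, kappa):
    """Upper estimate of N_ft = width * depth * height."""
    o_rm = reed_muller_overhead(N, p, p_c)
    r = strand_separation(N, kappa)
    width = depth = 1.5 * r * N * o_rm
    height = 4.0 * r
    return width * depth * height


# Hypothetical inputs: 10^6 non-Clifford rotations, p = 10^-3, kappa = 1.
n_ft = cluster_qubits(1e6, 1e-3, 1.0 / 105.0, 1.0)
```

The product of the three dimensions reproduces the bound 9 r^3 (N O_RM)^2; constants and double-logarithmic corrections are ignored, so the output is an order-of-magnitude guide at best.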

9 Discussion


We  have  described  a  scheme  for  fault-tolerant  cluster  state  universal  quantum  computation  which 
employs  topological  error  correction.  This  is  possible  because  of  a  link  between  cluster  states  and 
surface  codes.  In  addition  to  the  topological  method,  we  make  use  of  a  Reed-Muller  quantum  code 
which  ensures  that  non-Clifford  operations  can  be  performed  fault-tolerantly  by  local  measurements. 

The  error  threshold  is  1.4%  for  an  ad-hoc  error  model  with  local  depolarizing  error  and  0.11% 
for  a  more  detailed  error  model  with  preparation-,  gate-,  storage-  and  measurement  errors.  We 
have  not  tried  to  optimize  for  either  threshold  value  or  overhead  here;  the  foremost  purpose  of  this 
paper  is  to  explain  the  techniques.  With  regard  to  a  high  threshold,  the  obvious  bottleneck  is  the 
Reed-Muller  code.  The  error  threshold  imposed  by  this  code  on  the  cluster  region  S  is — depending 
on  the  error  model — a  factor  of  3  to  5  times  worse  than  the  threshold  obtained  from  the  topological 
error correction in V. To increase the threshold one may replace this code by another CSS code
with  property  (5)  that  has  a  higher  error  threshold,  provided  such  a  code  exists.  Alternatively,  one 
may  probe  the  Reed-Muller  code  in  error  detection,  as  in  magic  state  distillation  [17].  The  error 
detection  threshold  is  14%,  which  indicates  that  there  is  some  room  for  improvement. 

Part  of  the  investigations  in  this  paper  are  numerical  simulations,  and  we  would  like  to  comment 
on  their  impact  on  the  threshold  value.  Numerics  are  encapsulated  only  in  the  threshold  estimate 
for  topological  error  correction  which  is  much  higher  than  the  overall  threshold.  Our  final  threshold 
estimate  stems  from  the  Reed-Muller  code  and  is  analytical. 

The appendices are relevant for Error Model 2 only.

A  Connecting  sub-clusters 

The cluster C consists of a set of sub-clusters C_k, C = ∪_k C_k, which are prepared in sequence. Two successive sub-clusters C_k, C_{k+1} have an overlap, C_k ∩ C_{k+1} = H_k ⊂ V ∪ D. H_k is a set of locations for hand-over qubits. Each graph edge (corresponding to a Λ(Z)-gate in cluster state creation) can be unambiguously assigned to one sub-cluster. The set of edges ending in one vertex belongs either to one or to two sub-clusters. In the latter case, the vertex is the location for a hand-over qubit.

If k is odd, the cluster state creation procedure on C_k is the sequence 1 → 2 → 3 → 4. If k is even, the sequence is 3 → 4 → 1 → 2. As shown in Fig. 13b, connecting the sub-clusters proceeds smoothly. Consider, for example, the sub-cluster C_1. With the exception of a subset of the hand-over qubits to C_2, the qubits in C_1 are prepared at t = 0, entangled in steps 1 to 4, and measured at t = 5, 6. Specifically, the V- and D-qubits are measured at time t = 5, while the S-qubits are measured at time t = 6 with measurement bases adapted according to previous measurement outcomes. The sub-clusters C_k are chosen such that there is no temporal order among the measurements on qubits within S ∩ C_k, for all k. Then, the S-qubits wait for one time step in which a storage error may occur.

Figure 13: Connecting sub-clusters. a) Elementary cell of C. The old sub-cluster is displayed in blue, the new in black; V; hand-over qubits, qubits within one sub-cluster. b) Temporal order of operations. “P”: |+⟩-preparation, “E1-E4”: steps of parallel Λ(Z)-gates, M1, M2: measurement.

The hand-over qubits stay in the computation no longer than the other V- and D-qubits. They form two subsets, H_k = A ∪ B; see Fig. 13a. The qubits in A are prepared at t = 0, entangled in time steps 1 to 4 and measured at t = 5. They cause no change in the error model.

The qubits in B are not acted upon by a gate until time step 2 (since the potential interaction partner of step 1 isn’t there yet), so they are prepared at time t = 1. The final interaction involving the B-qubits is in step 5, and they are measured in step 6. Between preparation and measurement, the B-qubits are in the computation for four time steps, in each of which they are acted upon by a gate. No additional storage error occurs. There is one modification due to the B-qubits. The temporal order of Λ(Z)-gates involving the qubits b ∈ B is changed. As a result, the correlated
errors  on  the  edge  qubits  of  the  faces  {02}  =  b  are  not  among  pairs  of  opposite  edge  qubits  but 
among pairs of neighboring edge qubits. So, the error rates q_1 and q_2 in (34) are unchanged, but q_2 characterizes a slightly different process.

We  expect  this  to  be  a  minor  effect.  The  overall  threshold  is  still  set  by  the  threshold  for  Reed- 
Muller  error-correction,  which  is  some  five  times  smaller  than  the  simulated  threshold  for  topological 
error  correction. 


B Effective error channel on the S-qubits

The effective error on an S-qubit stems from the S-qubit itself and its immediate surroundings shown in Fig. 14, and from the two edge-qubits in the one-dimensional section of the defect, which are not protected by any syndrome. Of the latter, each contributes an error

/  4  2  2  \ 

-^defect  =  (  5^2  +  ^PS  +  ^PM  J  [Xs\ 

The preparation error does not contribute, because the corresponding Z-error on the defect qubit is absorbed in the Z-measurement.

The effective error of the center qubit s ∈ S stems from operations that act on s directly, from X- or Y-errors propagated to s by the Λ(Z)-gates and from short nontrivial cycles. Specifically, there are four non-trivial cycles of length 3. One of them is denoted as E(c_1) in Fig. 14. Because of the correlations in the forward-propagated errors these cycles have weight 2 and cause inconclusive syndrome at lowest order in the error probability. There are further error cycles of length 3, such as E(c_1') in Fig. 14. But they have weight 3 even for Error Model 2 and do not contribute to the lowest order error channel. We perform a count including all error sources in the cluster region displayed in Fig. 14, right. There is one convention that enters into the count. Namely, the error Z_a Z_b Z_c Z_d (see Fig. 14) is a non-trivial error cycle such that the errors Z_a Z_b and Z_c Z_d have the same weight


(46) 

Figure 14: Lattice around an S-qubit. Left: Location of the S-qubit (red) and the defect (gray). E(c_1') is an error cycle of length 3 and weight 3 for Error Models 1 and 2. Right: Detail. E(c_1) is a non-trivial error cycle of length 3 which has weight 2 in Error Model 2. Red: S-qubit, black: lattice L for topological error correction. The cluster edges (blue) correspond to Λ(Z)-gates whose temporal order is as indicated.


and the same syndrome but different effect on the computation, X_s vs. I_s. When the corresponding syndrome occurs, we assert that a logical X-error occurred and correct for it. We obtain

^central  =  +  ~^PS  +  ([ Xs]  +  [Ys ])  +  +  ~^PS  +  ~^PM  +  —  (^7) 

The sources (46) and (47) combined, E_central + 2 × E_defect, lead to the local error channel (38).


C Conversion of X- and Y-errors on S-qubits


Here we show that an X- or Y-error on an individual S-qubit s with probability p is equivalent to a Z-error on that qubit with probability p/2, p[X_s] ≅ p[Y_s] ≅ (p/2)[Z_s]. The qubit s may be measured in the eigenbasis of (X_s + Y_s)/√2 or of (X_s − Y_s)/√2. W.l.o.g. assume the qubit s is measured in the eigenbasis of (X_s + Y_s)/√2. Then,

X_s = ((X_s + Y_s)/√2) · ((I_s − iZ_s)/√2) ≅ (I_s − iZ_s)/√2.

The X-error is equivalent to a coherent Z-error and we need to check whether the coherences matter. More generally, for the described scenario with the subsequent measurement, a probabilistic local error channel (1 − p_X − p_Y − p_Z)[I_s] + p_X[X_s] + p_Y[Y_s] + p_Z[Z_s] is equivalent to a channel with coherent errors

ρ → (1 − q) ρ + q Z_s ρ Z_s + i q̃ (ρ Z_s − Z_s ρ), (48)

with

q = p_Z + (p_X + p_Y)/2, q̃ = (p_X − p_Y)/2. (49)

Now assume that all S-qubits are affected individually by the error channel (48). The Reed-Muller error correction at successive levels maps these channels to channels of the same form, one coding level higher up. The parameters q_l, q̃_l at coding level l obey recursion relations which, up to fourth order, read

Ql+ 1  =  105  qf  (1  -  qi)13  +  35  qf  (1  -  qi)12  +  1260  qf  (1  -  qi)n  +  630  qf  (1  -  qt)n, 
qi+i  =  70 qf  (1  -  <ft)12  -  1680 qf  qt  (1  -  qi)u . 


If we compare (50) to the recursion relation of q for probabilistic Z-error, a deviation first shows up at fourth order. The discussion of Reed-Muller error correction in this paper is confined to leading order. The leading order result for the threshold, q_c = 1/105, is not affected by the coherences in the error (48). Probabilistic X- and Y-errors influence the threshold by contributing half their weight to the probability of an effective Z-error; see (49).
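The operator identities behind this conversion can be checked numerically. The sketch below assumes the measured observable is (X_s + Y_s)/√2 and that (48) and (49) read ρ → (1 − q)ρ + q ZρZ + i q̃ (ρZ − Zρ) with q = p_Z + (p_X + p_Y)/2 and q̃ = (p_X − p_Y)/2. It uses plain 2×2 lists; all helper names are hypothetical.

```python
import math

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Y = [[0, -1j], [1j, 0]]
Z = [[1, 0], [0, -1]]


def mul(a, b):
    """2x2 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def lin(ca, a, cb, b):
    """Linear combination ca*a + cb*b."""
    return [[ca * a[i][j] + cb * b[i][j] for j in range(2)] for i in range(2)]


def dagger(a):
    """Hermitian conjugate."""
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]


def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


S = 1.0 / math.sqrt(2.0)
M = lin(S, X, S, Y)          # measured observable (X + Y)/sqrt(2)
U_X = lin(S, I2, -1j * S, Z)  # X = M (I - iZ)/sqrt(2)
U_Y = lin(S, I2, 1j * S, Z)   # Y = M (I + iZ)/sqrt(2)


def coherent_parameters(pX, pY, pZ):
    """Parameters (q, q_tilde) of Eq. (49)."""
    return pZ + (pX + pY) / 2.0, (pX - pY) / 2.0


def stripped_channel(rho, pX, pY, pZ):
    """Pauli channel with the factor M removed from the X and Y terms."""
    out = [[0, 0], [0, 0]]
    for w, k in [(1 - pX - pY - pZ, I2), (pX, U_X), (pY, U_Y), (pZ, Z)]:
        out = lin(1, out, w, mul(mul(k, rho), dagger(k)))
    return out


def coherent_channel(rho, q, qt):
    """Right-hand side of Eq. (48)."""
    out = lin(1 - q, rho, q, mul(mul(Z, rho), Z))
    return lin(1, out, 1j * qt, lin(1, mul(rho, Z), -1, mul(Z, rho)))


rho = [[0.7, 0.2 - 0.1j], [0.2 + 0.1j, 0.3]]  # arbitrary test state
q, qt = coherent_parameters(0.03, 0.01, 0.02)
```

The factor M is dropped in `stripped_channel` because it acts trivially on a subsequent measurement in its own eigenbasis; what remains is a mixture of the two coherent Z-rotations, which is the content of (48).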

With a combination of the quantum repeater and the cluster state approaches, we show that efficient quantum computation can be constructed even if all the entangling quantum gates only succeed with an arbitrarily small probability p. The required computational overhead scales efficiently both with 1/p and n, where n is the number of qubits in the computation. This approach provides an efficient way to combat noise in a class of quantum computation implementation schemes, where the dominant noise leads to probabilistic signaled errors with an error probability 1 − p far beyond any threshold requirement.

The celebrated threshold theorem in quantum computation has assured that, if the amount of noise per quantum gate is less than a small value, reliable quantum computation can be efficiently performed [1]. In terms of implementation, however, the experimental noise is typically orders of magnitude larger than the required threshold value. To overcome this problem, a practical route to noise reduction is by exploitation of certain properties of the noise. In carefully designed implementation schemes, the dominant noise leads to specific types of errors only, which can be corrected much more efficiently. Here, we consider an important noise model, which is relevant for several experimental approaches to quantum computation with the use of photons or trapped atoms or ions as the qubits [2-11]. In these schemes, the dominant noise leads to significant failure probability for the entangling gates only, and a gate failure is always signaled through built-in fast photon detections during the gate operation. The success probability p for each entangling gate is rather small for some typical experimental systems [4,10,11]. It is hard to use the standard methods of error correction in the considered scenario, because the error probability 1 − p (close to unity) is simply too large.

Naively, if a gate succeeds only with a certain probability p, one cannot have efficient computation, as the overall success probability (efficiency) scales down exponentially as p^n with the number n of gates. However, in this Letter we show that efficient quantum computation can be constructed with the required computational overhead (such as the computation time or the repetition number of the entangling gates) scaling up slowly (polynomially) with both n and 1/p. The demonstration of this result combines the ideas from the quantum repeater schemes [12,13] and the cluster state approach to computation [14,15]. It has been shown in quantum repeater schemes that, with probabilistic entangling operations, one can construct scalable quantum communication and Greenberger-Horne-Zeilinger correlations [13,16]. Recently, it has also been demonstrated in the context of linear optics computation [2] that the threshold requirement on the probability p for the entangling gates can be significantly improved using the cluster state approach to quantum computation [3,5,7,8]. In particular, Ref. [8] shows that for construction of one-dimensional (1D) cluster states, to get efficient scaling with n, in principle no threshold value is needed on p, although in practice p is still required to be sufficiently large as the computational overhead in that scheme has inefficient (superexponential) scaling with 1/p. Compared with these previous results, here we have the following two advances: (i) We propose a probabilistic computation scheme which has efficient scaling with both n and 1/p. This improvement is substantial as in current experiments 1/p is large [10,11]. (ii) Through explicit construction, we also demonstrate efficient scaling of the computational overhead for generation of the two-dimensional (2D) cluster states which are critical for realization of universal quantum computation.

PACS numbers: 03.67.Lx, 03.67.Pp, 03.67.Mn

To be more specific, we assume in this Letter that one can reliably perform two-qubit controlled phase flip (CPF) gates with a small success probability p, although the basic ideas here also apply for other kinds of entangling gates. We neglect the noise for all the single-bit operations, which is well justified for typical atomic or optical experiments. Our basic steps are as follows: First, we show how to efficiently prepare 1D cluster states from probabilistic CPF gates. Then, we give a construction to efficiently generate 2D cluster states from 1D chains. Efficient preparation of 2D cluster states, together with simple single-bit operations, realizes universal quantum computation.

With respect to a given lattice geometry, the cluster state is defined as the coeigenstate of all the operators A_i = X_i ∏_j Z_j, where i denotes an arbitrary lattice site and j runs over all the nearest neighbors of the site i. The X_i and Z_j denote, respectively, the Pauli spin and phase flip operators on the qubits at the sites i, j. In our construction of lattice cluster states with probabilistic CPF gates, we will make use of the following three properties of the cluster states: (i) If we have two chains of cluster states each with n qubits, we can join them to form a 1D cluster state of 2n qubits by successfully applying a CPF gate on the end qubits of the two chains. (ii) If we destroy the state of an end qubit of an n-qubit cluster chain, for instance, through an unsuccessful attempt of the CPF gate, we can remove this bad qubit by performing a Z measurement on its neighboring qubit, and recover a cluster state of n − 2 qubits. (iii) We can shrink a cluster state by performing X measurements on all the connecting qubits [see Fig. 1(c)]. These three properties of the cluster states, illustrated in Fig. 1, can be conveniently explained from their above definition [15,17].

If we have generated two sufficiently long cluster chains each of n_0 qubits, we can just try to connect them through a probabilistic CPF gate. If this attempt fails, through the property (ii), we can recover two (n_0 − 2)-qubit cluster chains through a Z measurement, and try to connect them again. As one continues with this process, the average number of the qubits in the connected chain is then given by n_1 = Σ_{i=0}^{n_0/2} 2(n_0 − 2i) p (1 − p)^i ≈ 2n_0 − 4(1 − p)/p, where the last approximation is valid when e^{−n_0 p/2} ≪ 1. So the average chain length goes up if n_0 > n_c = 4(1 − p)/p. We can iterate these connections to see how the computation overhead scales with the qubit number n. We measure the computation overhead in terms of the total computation time and the total number of attempts for the CPF gates. For the rth (r ≥ 1) round of successful connection, the chain length n_r, the total preparation time T_r, and the total number of attempts M_r scale up, respectively, by the recursion relations n_r = 2n_{r−1} − n_c, T_r = T_{r−1} + t_a/p, and M_r = 2M_{r−1} + 1/p. In writing the recursion relation for T_r, we have assumed that two cluster chains for each connection are prepared in parallel, and we neglect the time for single-bit operations (t_a denotes the time for each attempt of the CPF gate). From the above recursion relations, we conclude that if we can prepare cluster chains of n_0 (n_0 > n_c) qubits in time T_0 with M_0 attempts of the probabilistic gates, for a large cluster state, the preparation time T and the number of attempts M scale with the chain length n as T(n) = T_0 + (t_a/p) log_2[(n − n_c)/(n_0 − n_c)] and M(n) = (M_0 + 1/p)(n − n_c)/(n_0 − n_c) − 1/p.

FIG. 1. Illustration of the three properties of the cluster states which are important for our construction of such states with the probabilistic entangling gates: (a) extend cluster states with CPF gates; (b) recover cluster states by removing bad qubits; (c) shrink cluster states for more complicated links.
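The closed forms for T(n) and M(n) can be compared against a direct iteration of the recursion relations. The sketch below assumes n_r = 2n_{r−1} − n_c, T_r = T_{r−1} + t_a/p, M_r = 2M_{r−1} + 1/p with n_c = 4(1 − p)/p, and a truncated sum over i = 0, ..., n_0/2 for the mean connected length; the function names and the numerical inputs are illustrative assumptions.

```python
import math


def critical_length(p):
    """Critical chain length n_c = 4(1 - p)/p."""
    return 4.0 * (1.0 - p) / p


def mean_connected_length(n0, p):
    """Average length after connecting two n0-qubit chains, retrying
    with two qubits fewer per chain after each failed attempt."""
    return sum(2 * (n0 - 2 * i) * p * (1 - p) ** i for i in range(n0 // 2 + 1))


def connect_rounds(n0, T0, M0, p, t_a, rounds):
    """Iterate the recursion relations for chain length, time and attempts."""
    nc = critical_length(p)
    n, T, M = float(n0), float(T0), float(M0)
    for _ in range(rounds):
        n = 2.0 * n - nc
        T = T + t_a / p
        M = 2.0 * M + 1.0 / p
    return n, T, M


def closed_form(n, n0, T0, M0, p, t_a):
    """Closed-form T(n) and M(n) for chains grown from length n0 > n_c."""
    nc = critical_length(p)
    ratio = (n - nc) / (n0 - nc)
    return T0 + (t_a / p) * math.log2(ratio), (M0 + 1.0 / p) * ratio - 1.0 / p


# Hypothetical starting point: p = 0.1 (n_c = 36), n0 = 40, five rounds.
n5, T5, M5 = connect_rounds(40, 5.0, 7.0, 0.1, 1.0, 5)
T_cf, M_cf = closed_form(n5, 40, 5.0, 7.0, 0.1, 1.0)
```

Since n_r − n_c doubles in every round, the number of rounds needed is log_2[(n − n_c)/(n_0 − n_c)]; this is why the time grows only logarithmically and the number of attempts only linearly in n.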

In the above, we have shown that if one can prepare cluster chains longer than some critical length $n_c$, one can generate large-scale 1D cluster states very efficiently. The problem then reduces to how to efficiently prepare cluster chains up to the critical length $n_c$. If one wants to prepare an $n$-qubit cluster chain, we propose to use a repeater protocol which divides the task into $m = \log_2 n$ steps: For the $i$th ($i = 1, 2, \ldots, m$) step, we attempt to build a $2^i$-bit cluster state by connecting two $2^{i-1}$-bit cluster chains through a probabilistic CPF gate. If such an attempt fails, we discard all the qubits and restart from the beginning [18]. For the $i$th step, the recursion relations for the preparation time $T_i$ and the number of attempts $M_i$ are given by $T_i = (1/p)(T_{i-1} + t_a)$ [19] and $M_i = (1/p)(2M_{i-1} + 1)$, which, together with $T_1 = t_a/p$ and $M_1 = 1/p$, give the scaling rules $T(n) \approx t_a(1/p)^{\log_2 n}$ and $M(n) \approx (2/p)^{\log_2 n}/2$. The cost is more significant, but it is still a polynomial function of $n$. To construct an $n$-qubit cluster chain, in total we need $n - 1$ successful CPF gates. In a direct protocol, we need all these attempts to succeed simultaneously, which gives the scaling $T(n) \propto M(n) \propto (1/p)^{n-1}$. By dividing the task into a series of independent pieces, we improve the scaling with $n$ from exponential to polynomial (for $n < n_c$).
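To make the difference between the repeater protocol and the direct protocol concrete, the recursion for $T_i$ and $M_i$ can be iterated directly. This is a small sketch under the relations quoted above; the function names and the sample values are assumptions for illustration.

```python
def repeater_cost(m, p, ta=1.0):
    """Return (T_m, M_m) for a 2**m-qubit chain built in m doubling steps (m >= 1)."""
    T, M = ta / p, 1.0 / p            # T_1 and M_1
    for _ in range(2, m + 1):
        T = (T + ta) / p              # T_i = (1/p)(T_{i-1} + ta)
        M = (2.0 * M + 1.0) / p       # M_i = (1/p)(2 M_{i-1} + 1)
    return T, M

def direct_cost(n, p):
    """Scaling (1/p)**(n - 1) when all n - 1 gates must succeed at once."""
    return (1.0 / p) ** (n - 1)

# Illustrative comparison for an 8-qubit chain with p = 0.5 (assumed values).
print(repeater_cost(3, 0.5))   # repeater protocol, m = log2(8) = 3
print(direct_cost(8, 0.5))     # direct protocol
```

For these values the repeater protocol needs tens of attempts while the direct scaling is already above one hundred, and the gap widens quickly as $n$ grows or $p$ shrinks.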

To generate a cluster chain of length $n > n_c$, we simply combine the above two protocols. First, we use the repeater protocol to generate $n_0$-qubit chains with $n_0 > n_c$. Then it is straightforward to use the connect-and-repair protocol to further increase their length. For instance, with $n_0 = n_c + 1$ (which is a reasonable close-to-optimal choice), the overall scaling rules for $T$ and $M$ are (for $n > n_c$)

$$T(n) \approx t_a (1/p)^{\log_2(n_c+1)} + (t_a/p)\log_2(n - n_c), \quad (1)$$

$$M(n) \approx (2/p)^{\log_2(n_c+1)}(n - n_c)/2. \quad (2)$$

As the critical length is $n_c \approx 4/p$, $T$ and $M$ in our protocol scale with $1/p$ as $(1/p)^{\log_2(4/p)}$, which is much more efficient than the superexponential scaling $(1/p)^{4/p}$ in the previous work.
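The gap between the two scalings in $1/p$ is easiest to see on a logarithmic scale. The snippet below is only an illustration of the two expressions quoted above; the helper names and the sample value $p = 0.1$ are assumptions.

```python
import math

def log10_cost_this_protocol(p):
    """log10 of (1/p)**log2(4/p), the scaling quoted for the combined protocol."""
    return math.log2(4.0 / p) * math.log10(1.0 / p)

def log10_cost_previous(p):
    """log10 of (1/p)**(4/p), the superexponential scaling quoted for earlier work."""
    return (4.0 / p) * math.log10(1.0 / p)

p = 0.1  # assumed sample value
print(log10_cost_this_protocol(p))  # a bit above 5 orders of magnitude
print(log10_cost_previous(p))       # about 40 orders of magnitude
```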

We have shown that for any success probability $p$ of the probabilistic entangling gate, 1D cluster states of arbitrary length can be created efficiently. For universal quantum computation, however, such 1D cluster states are not sufficient. They need to be first connected and transformed into 2D cluster states (for instance, with a square lattice geometry) [15]. It is not obvious that such a connection can be done efficiently. First, in the connect-and-repair protocol, when an attempt fails, we need to remove the end qubits and all of their neighbors. This means that in a 2D geometry the lattice shrinks much faster to an irregular shape in the event of failure. Furthermore, a more important obstacle is that we need to connect many more boundary qubits if we want to join two 2D cluster states. For instance, for a square lattice of $n$ qubits, the number of boundary qubits scales as $\sqrt{n}$ (which is distinct from a 1D chain). If we need to connect all the corresponding boundary qubits of the two parts, the overall success probability is exponentially small.

To overcome this problem, we introduce a method which enables efficient connection by attaching a long leg (a 1D cluster chain) to each boundary qubit of the 2D lattice. The protocol is divided into the following steps: First, we try to build a “+-shape” cluster state by probabilistically connecting two cluster chains each of length $2n_l + 1$ (the value of $n_l$ will be specified below). This can be done through the probabilistic CPF gate together with a simple Hadamard gate $H$ and an $X$ measurement, as shown in Fig. 2(a) and explained in its caption. With on average $1/p$ repetitions, we get a +-shape state with the length of each of the four legs given by $n_l$. We use the +-shape states as the basic building blocks of large-scale 2D cluster states. In the +-shape state, we have attached four long legs to the center qubit. The leg qubits serve as ancillas to generate near-deterministic connections from the probabilistic CPF gates. The critical idea here is that if we want to connect two center qubits, we always start the connection along the end qubits of one of the legs (see illustration in Fig. 2). If such an attempt fails, we can delete the two end qubits and try the connection again along the same legs. If the leg is sufficiently long, we can almost certainly succeed before we reach (destroy) the center qubits. When we succeed, and if there are still redundant leg qubits between the two center ones, we can delete the intermediate leg qubits by performing simple single-bit $X$ measurements on all of them [see Figs. 1(c) and 2 for the third property of the cluster state]. With such a procedure, we can continuously connect the center qubits and form any complex lattice geometry [see the illustration for construction of the square lattice state in Figs. 2(b) and 2(c)]. What is important here is that after each connection of the center qubits, the newly formed shape still has ancillary legs of the same length on all the boundary qubits, which enables the subsequent near-deterministic connection of these new shapes.

Now we investigate for the 2D case how the computational overhead scales with the size of the cluster state. If the ancillary legs have length $n_l$, then for each connection of two center qubits we can try at most $n_l/2$ attempts of the probabilistic CPF gate, and the overall success probability is given by $p_c = 1 - (1-p)^{n_l/2}$. If we want to build a square lattice cluster state of $N$ qubits, we need about $2N$ connections of the center qubits (there are about $2N$ edges in an $N$-vertex square lattice). The probability for all these connections to be successful is given by $p_c^{2N}$. We require that this overall success probability is sufficiently large, with $p_c^{2N} > 1 - \epsilon$, where $\epsilon$ is a small number characterizing the overall failure probability. From that requirement, we find that $n_l \approx (2/p)\ln(2N/\epsilon)$. To construct a square lattice cluster state of $N$ qubits, we need to consume $N$ +-shape states, and each of the latter requires on average $2/p$ cluster chains with a length of $2n_l + 1$ qubits. So we need in total $2N/p$ cluster chains of $(2n_l + 1)$ qubits, which can be prepared in parallel with $(2N/p)\,M(2n_l + 1)$ CPF attempts within a time period $T(2n_l + 1)$ [see Eqs. (1) and (2) for expressions of $M(n)$ and $T(n)$]. This gives the resources for preparation of all the basic building blocks (the chains). Then we need to connect these blocks to form the square lattice. We assume that the connections of all the building blocks are done in parallel. The whole connection takes on average $2N/p$ CPF attempts, and consumes a time of at most $(t_a/p)\ln(2N/\epsilon)$. Summarizing these results, the temporal and the operational resources for preparation of an $N$-bit square lattice cluster state are approximately given by

$$T(N) \approx t_a (1/p)^{\log_2(4/p-3)} + \frac{t_a}{p}\log_2\left\{\frac{4}{p}\left[\ln(2N/\epsilon) - 1\right]\right\} + \frac{t_a}{p}\ln(2N/\epsilon), \quad (3)$$

$$M(N) \approx (2/p)^{2+\log_2(4/p-3)}\, N\left[\ln(2N/\epsilon) - 1\right] + 2N/p. \quad (4)$$

FIG. 2. Illustration of the steps for construction of the 2D square lattice cluster states from a set of cluster chains. (a) Construction of the basic +-shape states from cluster chains by applying first a Hadamard gate $H$ on the middle qubit of one chain, then a CPF gate to connect the two middle qubits, and finally an $X$ measurement on one middle qubit to remove it. (b),(c) Construction of the square lattice cluster state from the +-shape states through probabilistic CPF gates along the legs and $X$ measurements to remove the remaining redundant qubits.

In the 2D case, the temporal and the operational overheads still have very efficient scaling with the qubit number $N$: logarithmic for $T(N)$ and $N\ln(N)$ for $M(N)$. Their scalings with $1/p$ are almost the same as in the 1D case except for an additional factor of $1/p^2$ for $M(N)$. Through some straightforward variations of the above method, it is also possible to efficiently prepare any complicated graph state using probabilistic CPF gates [17]. This shows that in principle we do not need to impose any threshold on the success probability of the CPF gates for efficient quantum computation.
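The choice of leg length can be checked numerically: given $N$, $\epsilon$, and $p$, one computes $n_l \approx (2/p)\ln(2N/\epsilon)$ and verifies that $p_c^{2N}$ indeed exceeds $1 - \epsilon$. The sketch below assumes illustrative values for $N$, $\epsilon$, and $p$ and rounds $n_l$ up to an even integer (a convenience of this example, not a statement from the Letter).

```python
import math

def leg_length(N, eps, p):
    """Estimate n_l = (2/p) ln(2N/eps), rounded up to an even integer."""
    n = math.ceil((2.0 / p) * math.log(2.0 * N / eps))
    return n + (n % 2)

def overall_success(N, n_l, p):
    """Probability p_c**(2N) that all ~2N center-qubit connections succeed."""
    p_c = 1.0 - (1.0 - p) ** (n_l / 2)   # at most n_l/2 attempts per connection
    return p_c ** (2 * N)

# Assumed sample values: one million lattice qubits, 1% overall failure budget.
N, eps, p = 10**6, 0.01, 0.1
n_l = leg_length(N, eps, p)
print(n_l, overall_success(N, n_l, p))
```

Because $-\ln(1-p) > p$, the estimate for $n_l$ is slightly conservative, so the computed success probability lands above the target $1 - \epsilon$.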

Before  ending  the  Letter,  we  would  like  to  add  a  few 
remarks  on  other  sources  of  noise  that  have  not  been  taken 
into  account  in  the  above  discussions.  If  each  CPF  gate  has 
some  small  additional  infidelity  error,  one  might  wonder 
whether  such  an  error  scales  up  with  the  large  number  of 
attempts  M(N).  That  is  actually  not  the  case.  Most  of  the 
CPF  attempts  have  failed,  and  all  the  failed  CPF  gates  have 
no  contribution  to  the  final  state  infidelity.  In  practice,  we 
may  be  more  concerned  about  the  temporal  overhead  T(N) 
than  the  operational  overhead  M(N).  Each  qubit  has  a  finite 
coherence  time  and  we  need  to  finish  all  the  CPF  attempts 
within such a time scale. For typical probabilistic entangling experiments with atoms [10,11], the time $t_a$ for each CPF attempt is about 100 ns, while the qubit coherence time is usually longer than 1 s. If we take the success probability $p \sim 0.1$, Eq. (3) gives $T(N) \sim 1.6 \times 10^5\, t_a \sim 16$ ms for any large $N$ [20], which is still well within the qubit coherence time.
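The quoted estimate can be reproduced by evaluating the three terms of Eq. (3). In the sketch below, $p = 0.1$ and $t_a = 100$ ns follow the text, while the values of $N$ and $\epsilon$ are assumptions chosen only to show that the result barely depends on them.

```python
import math

def prep_time(N, eps, p, ta):
    """Evaluate Eq. (3); the result has the same time unit as ta."""
    L = math.log(2.0 * N / eps)
    chains = ta * (1.0 / p) ** math.log2(4.0 / p - 3.0)    # building the 1D chains
    growth = (ta / p) * math.log2((4.0 / p) * (L - 1.0))   # extending them past n_c
    linking = (ta / p) * L                                 # connecting the +-shapes
    return chains + growth + linking

ta = 100e-9  # 100 ns per CPF attempt, as in the text
for N in (10**6, 10**12):          # assumed lattice sizes
    print(N, prep_time(N, eps=0.01, p=0.1, ta=ta))
```

The first term dominates, so the total stays near 16 ms over many orders of magnitude in $N$.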

In  summary,  we  have  shown  that  cluster  states  in  two 
dimensions  can  be  generated  using  probabilistic  CPF  gates 
with  efficient  scaling  in  both  the  qubit  number  and  the 
inverse  of  the  success  probability.  This  result  opens  up  a 
prospect  to  realize  efficient  quantum  computation  with 
probabilistic  entangling  gate  operations.  Such  a  prospect 
is  relevant  for  several  experimental  systems  involving 
atoms,  ions,  or  photons  [2,4,10,11],  with  ongoing  efforts 
towards  probabilistic  quantum  information  processing. 

L. M. D. thanks Chris Monroe for helpful discussions. This work was supported by NSF grants (0431476 and EIA-0086038), the ARDA under ARO contracts, the A. P.

Note added. Recently, we became aware that in the latest version of Ref. [8] the authors have also made some interesting improvements to the construction efficiency of the 1D cluster state to overcome the inefficient scaling.

This paper describes a quantum cellular automaton capable of performing universal quantum computation. The automaton has an elementary transition function that acts on Margolus cells of 2 × 2 qubits, and both the “quantum input” and the program are encoded in the initial state of the system.

I. INTRODUCTION

A number of physical systems that are considered for the realization of a universal quantum computer, such as optical lattices [1] or arrays of microlenses [2], possess a translation symmetry in the arrangement of qubits and their mutual interaction. Quantum cellular automata (QCA) represent a suitable framework to explore the computational power of such physical systems, because they respect this symmetry. A priori, translation invariance may be regarded as a severe limitation since it constrains the degree of control that can be exerted on the quantum system. However, it has been demonstrated that one-dimensional QCA can efficiently simulate any quantum Turing machine [3].

Further, it has been shown that there exists a universal QCA which can simulate any other automaton with linear slowdown [4], and that every reversible QCA can be represented in a generalized Margolus partitioning scheme [5]. Proposals with an emphasis on experimental viability have outlined how generic physical systems can be used as quantum computation devices if equipped with a minimal amount of external control. Among the described mechanisms are global control via sequences of resonant light pulses [6] or modulation of a coupling constant [7,8], and individual control over one of the elementary cells [9].

At this point, one may abandon all algorithm-specific control during the process of computation and ask “How intricate do quantum cellular automata have to be such that they can perform useful tasks in quantum information processing?” A quick answer may be “Simple, by definition.” However, when QCAs are tuned for algorithmic application, it may occur that, while the simple composition is retained, the elementary cells and neighborhood schemes become complicated. An interesting facet of the answer to the above question has been provided in Ref. [10], where a very simple QCA for quantum data transmission has been devised (also see Ref. [5]). Motivated by a recent result [11], where universal computation via autonomous evolution of a 10-local Hamiltonian is described, here we consider quantum computation in the cellular automaton scenario. We explicitly construct a computationally universal two-dimensional QCA whose transition rule is based on a four-qubit unitary.

II.  CONSTRUCTION  OF  THE  AUTOMATON 

Consider a two-dimensional lattice of size $2s \times 2r$ with periodic boundary conditions, i.e., a torus. Each lattice site carries a qubit. The transition rule for the QCA is described in terms of a Margolus partitioning [12]. The lattice is partitioned into cells of size 2 × 2, and there is a separate partitioning for the time $t$ being even or odd, respectively. One may choose a coordinate system on the torus with axes parallel to what were the boundaries before identification. For $t$ even, the qubits in the upper left corner of each cell have both coordinates even, and for $t$ odd they have both coordinates odd. Thus, a cell in step $t$ overlaps with four cells of step $t-1$.
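The alternating Margolus partitioning can be expressed as a small coordinate map. The sketch below is an illustration only: the function names, the coordinate convention (the "upper left" corner is taken to be the site with the smaller coordinates, modulo the torus), and the sample lattice size are assumptions.

```python
def margolus_cell(x, y, t, width, height):
    """Return the corner (cx, cy) of the 2x2 cell containing site (x, y) at
    time step t on a torus of even size width x height."""
    shift = t % 2                       # 0: corners even/even, 1: corners odd/odd
    cx = (x - (x - shift) % 2) % width
    cy = (y - (y - shift) % 2) % height
    return cx, cy

def cell_sites(cx, cy, width, height):
    """The four lattice sites of the cell whose corner is (cx, cy)."""
    return [((cx + dx) % width, (cy + dy) % height)
            for dx in (0, 1) for dy in (0, 1)]

# A cell of an odd step overlaps four distinct cells of the preceding even step.
width, height = 6, 4
previous = {margolus_cell(x, y, 0, width, height)
            for (x, y) in cell_sites(1, 1, width, height)}
print(sorted(previous))
```

The wrap-around case is handled by the modulo: at odd $t$ the site $(0, 0)$ belongs to the cell whose corner sits at the last column and last row.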

The transition of the QCA from time $t$ to $t+1$ proceeds by simultaneously applying a unitary transition function $\tau$ to each cell. For a suitable choice of the 4-qubit unitary $\tau$ one can perform universal quantum computation with the described QCA. Specifically, a quantum logic network of local and next-neighbor unitary gates with width $2s$ and depth $r$ can be simulated.

With a labeling of particles as illustrated in Fig. 1(a), the following elementary transition function is chosen:

τ = S(1,3) S(2,4) H_1 exp(−i …

FIG. 1. (Color online) Transition of the QCA from $t$ to $t+1$. (a) Margolus cell of 2 × 2 qubits, with the part $|D\rangle_{12}$ of a data column and $|p\rangle_{34}$ of the program column. (b) Before and after the first transition. The program columns move left and the data columns move right. The dashed lines indicate the partitioning into Margolus cells.

FIG. 2. (Color online) The quantum logic network simulated by the QCA. The gates in each shaded box result from one application of the elementary transition function. The arrows denote classically controlled gates which are triggered by the program registers. In the second time slice of the displayed network, the two half-neighborhoods at the lower and upper end form a neighborhood, due to the toric topology.

Therein, $S(a,b)$ denotes a SWAP gate between qubits $a$ and $b$, $H_1$ is a Hadamard transformation on qubit 1, and $Z_c$ denotes a Pauli phase flip operator applied to qubit $c$.

If $|p\rangle_{34}$ is a state in the computational basis, it effectively stores two classical bits, $p^{(3)}$ and $p^{(4)}$. Then, the transition function $\tau$ amounts to a classically controlled unitary operation $U(p)$ applied to $|D\rangle_{12}$. The bit $p^{(4)}$ triggers a $\Lambda(Z)$ gate applied to $|D\rangle_{12}$, and $p^{(3)}$ a $\pi/4$-phase gate $\exp(-i\pi/8\, Z_1)$ applied to qubit 1 of $|D\rangle_{12}$. In this way, $|p\rangle$ encodes an elementary step of a program, carried out on the two-qubit “data” $|D\rangle_{12}$. The SWAP gates allow the quantum data and the program to pass by one another, such that $|D\rangle_{12}$ can interact with subsequent program registers.

The sets of qubits on the torus with the same first coordinate $x$, $0 \leq x \leq 2r-1$, are called columns. At time $t=0$ all even columns contain data registers $|D_i\rangle$, and all odd columns contain program registers $|p_j\rangle$. The initial state of the automaton is

$$|\psi(0)\rangle = \bigotimes_{i=0}^{r-1} |D_i(0)\rangle_{2i}\, |p_{i+1}\rangle_{2i+1}. \quad (2)$$

Here and in the following the labels inside the kets specify the state and the ones outside specify the location of the support within the lattice, i.e., the column. For example, $|D_1(0)\rangle_2$ is data register number 1 at time $t=0$, located on column 2. Of all data registers only $|D_0\rangle$ is used; the others are auxiliary. When the QCA starts to run, the data registers move right (counter-clockwise, as seen from the top) and the program registers move left (clockwise), by one column in each time step. When passing the data registers, the program registers $|p\rangle$ control unitary transformations $U(p)$ acting upon the data registers $|D\rangle$. In this way, a program specified by the data $p_1, \ldots, p_r$, with $p_1$ encoding the first and $p_r$ the last step, is carried out on the quantum data register $|D_0\rangle$. The program that is carried out corresponds to a quantum logic network of local and next-neighbor gates in a particular arrangement; see Fig. 2. Such networks are sufficient for universal quantum computation, as is discussed in detail further below. The same program steps that are applied to the register $|D_0\rangle$ are also carried out on the auxiliary data registers $|D_i\rangle$, $1 \leq i \leq r-1$, but in scrambled order. Therefore, these registers are not used.

In the course of computation, both data and program travel across half the torus. When the automaton has run for $r$ time steps, the computation is finished and the register $|D_0(r)\rangle_r$ can be read out from column $r$, via local measurements.
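The motion of the registers can be followed with a purely classical bookkeeping model that tracks only labels, not quantum states. The sketch below is an illustrative assumption-laden model: each column holds either a data register (with the list of program labels that have acted on it) or a program register, and in every step each data/program column pair swaps places while the program label is appended to the data register's history.

```python
def run_automaton(r):
    """Track which program labels act on each data register over r steps.

    Columns are 0..2r-1. At t = 0, even column 2i holds data register D_i and
    odd column 2i+1 holds program register p_{i+1} (labels 1..r).
    """
    cols = [None] * (2 * r)
    for i in range(r):
        cols[2 * i] = ("D", i, [])
        cols[2 * i + 1] = ("p", i + 1)
    for t in range(r):
        new = [None] * (2 * r)
        for left in range(t % 2, 2 * r, 2):       # left column of each column pair
            right = (left + 1) % (2 * r)
            _, i, history = cols[left]            # data sits on the left
            _, j = cols[right]                    # program sits on the right
            new[right] = ("D", i, history + [j])  # U(p_j) acts, data moves right
            new[left] = ("p", j)                  # program moves left, unchanged
        cols = new
    return cols

cols = run_automaton(4)
print(cols[4])   # data register 0 after r = 4 steps, found on column r
print(cols[6])   # an auxiliary register: same steps, cyclically shifted order
```

In this model register $D_0$ ends on column $r$ having seen $p_1, \ldots, p_r$ in order, while the auxiliary registers see the same labels starting from a different point of the cycle.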

Let $\tau_{i,j}$ denote the elementary transition function (1) applied to the cell $(i,j)$. Therein, $i$ is the column coordinate of the upper left qubit in the cell, and $j$ the respective coordinate within the column. Then, the unitary transition function $T_i$ acting upon two consecutive columns $i$, $i+1$, is

$$T_i = \bigotimes_{j=0}^{s-1} \tau_{i,[2j+i]_{2s}}. \quad (3)$$

Therein, $[2j+i]_{2s}$ is a shorthand for $2j+i \bmod 2s$ that will be used throughout the remainder of the paper.

If $|p\rangle$ is a state in the computational basis, then

$$T_i\left(|D\rangle_i \otimes |p\rangle_{i+1}\right) = |p\rangle_i \otimes \left[U(p)|D\rangle\right]_{i+1}. \quad (4)$$

Therein, $U(p)$ is a unitary transformation chosen by $p$, containing a Hadamard gate and possibly a $\Lambda(Z)$ and a $\pi/4$-phase gate, in accordance with (1).

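The global transition function $T: |\psi(t)\rangle \mapsto |\psi(t+1)\rangle$ is, for even $t$, given by $T_e = \bigotimes_{i=0}^{r-1} T_{2i}$, and for odd $t$ by $T_o = \bigotimes_{i=0}^{r-1} T_{2i+1}$. In both cases it can be written in the form

$$T = \bigotimes_{i=0}^{r-1} T_{[2i+t]_{2r}}. \quad (5)$$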

Now, the state of the QCA at time $t$ is

$$|\psi(t)\rangle = \bigotimes_{i=0}^{r-1} |D_i(t)\rangle_{[2i+t]_{2r}} \otimes |p_{[i+t+1]_r}\rangle_{[2i+t+1]_{2r}}, \quad (6)$$

with the data register $i$ at time $t$ given by

$$|D_i(t)\rangle = \left(\prod_{k=1}^{t} U(p_{[i+k]_r})\right) |D_i(0)\rangle. \quad (7)$$

The unitaries $U(p_{[i+k]_r})$ are ordered in ascending order with $k$, i.e., $U(p_{[i+1]_r})$ acts first. For $i=0$ and $t=r$, in particular, one finds that

$$|D_0(r)\rangle = \left(\prod_{k=1}^{r} U(p_k)\right) |D_0(0)\rangle \quad (8)$$

is the output quantum register, with the unitaries $U(p_1) \cdots U(p_r)$ applied in the correct order to the quantum register in its input state, $|D_0(0)\rangle$.

Before proving (6), let us recover therein some features of the QCA that were stated before. It is easy to see that, apart from being moved, the program registers remain unchanged throughout the evolution, and that there is no entanglement across columns. The data registers indeed move right, and the program registers left (with $i+t+1 = i'$, $|p_{[i+t+1]_r}\rangle_{[2i+t+1]_{2r}} = |p_{[i']_r}\rangle_{[2i'-t-1]_{2r}}$). After $r$ time steps, the output quantum register $|D_0(r)\rangle$ can be read out from column $r$.

Equation (6) is proved by induction. First note that for $t=0$ it reduces to (2). Further,

$$\begin{aligned}
T|\psi(t)\rangle &= \bigotimes_{i=0}^{r-1} T_{[2i+t]_{2r}}\, |D_i(t)\rangle_{[2i+t]_{2r}} |p_{[i+t+1]_r}\rangle_{[2i+t+1]_{2r}} \\
&= \bigotimes_{i=0}^{r-1} \left(U(p_{[i+t+1]_r}) |D_i(t)\rangle\right)_{[2i+t+1]_{2r}} \otimes |p_{[i+t+1]_r}\rangle_{[2i+t]_{2r}} \\
&= \bigotimes_{i=0}^{r-1} |D_i(t+1)\rangle_{[2i+(t+1)]_{2r}} \otimes |p_{[i+(t+1)+1]_r}\rangle_{[2i+(t+1)+1]_{2r}} \\
&= |\psi(t+1)\rangle. \quad (9)
\end{aligned}$$

Here, the first line follows by the definitions of $T$ and $|\psi(t)\rangle$, (5) and (6), and the second follows by (4). The third line follows by (7) and, for the $|p\rangle$ part, the substitution $i \to i+1$ under which the product is invariant. □

Finally, it is shown that the quantum logic network simulated by the described QCA is indeed universal, as stated. The controlled-NOT (CNOT), the Hadamard, and the $\pi/4$ phase gate $\exp(-i\pi/8\, Z)$ form a universal gate set [13]. For the described QCA, one can independently apply the Hadamard, the $\pi/4$ phase gate, and the identity to the simulated logical qubits, by choosing the following sequences of $p^{(3)}$ bits:

$$p^{(3)} = \begin{cases} 0\ 000\ 000\ 000, & \text{for } U = I, \\ 0\ 101\ 101\ 101, & \text{for } U = H, \\ 1\ 000\ 000\ 000, & \text{for } U = U_z[\pi/4], \end{cases} \quad (10)$$

with all $p^{(4)}$ bits zero. In the way they are constructed here (no claim that this is close to optimal), the one-qubit operations from the universal set require 10 successive applications of $p^{(3)}$-controlled gates and thus 20 time steps of the QCA. A next-neighbor CNOT gate that acts within this cycle can be constructed from a $\Lambda(Z)$ and two Hadamard gates. The long-distance CNOT gates may then be constructed with the help of next-neighbor SWAP gates, which themselves consist of three next-neighbor CNOT gates. This completes our construction of a QCA capable of performing universal quantum computation.
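The three bit sequences of Eq. (10) can be checked with 2 × 2 matrices. The sketch below rests on assumptions that go beyond the text: each of the 10 steps is modeled on a single logical qubit as an optional phase gate $\exp(-i\pi/8\, Z)$ (when the bit is 1) followed by a Hadamard, the bits are read left to right with the leftmost bit acting first, and the $\Lambda(Z)$ gate and the interleaved steps in which the qubit is idle are ignored. Equality is tested up to a global phase.

```python
import cmath
import math

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

S2 = 1.0 / math.sqrt(2.0)
IDENTITY = [[1, 0], [0, 1]]
H = [[S2, S2], [S2, -S2]]
PHASE = [[cmath.exp(-1j * math.pi / 8), 0],
         [0, cmath.exp(1j * math.pi / 8)]]       # exp(-i pi/8 Z)

def run_sequence(bits):
    """Compose one step per bit: optional phase gate first, then Hadamard."""
    total = IDENTITY
    for b in bits.replace(" ", ""):
        step = matmul(H, PHASE) if b == "1" else H
        total = matmul(step, total)              # later steps multiply from the left
    return total

def equal_up_to_phase(a, b, tol=1e-9):
    """True if a = exp(i*phi) * b for some global phase phi."""
    for i in range(2):
        for j in range(2):
            if abs(b[i][j]) > tol:               # first nonzero entry fixes the phase
                phase = a[i][j] / b[i][j]
                return abs(abs(phase) - 1) < tol and all(
                    abs(a[r][c] - phase * b[r][c]) < tol
                    for r in range(2) for c in range(2))
    return False

print(equal_up_to_phase(run_sequence("0 000 000 000"), IDENTITY))
print(equal_up_to_phase(run_sequence("0 101 101 101"), H))
print(equal_up_to_phase(run_sequence("1 000 000 000"), PHASE))
```

Under these assumptions the middle sequence reduces to three squared phase gates interleaved with Hadamards, which reproduces the Hadamard up to a global phase.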

Two remarks: (1) The described QCA may, with some right, be called a deterministic programmable quantum gate array, but this notion is already in use for a construction that has been proven not to exist [14]. Our QCA is consistent with this result. The program information is classical and all program states $\bigotimes_{i=1}^{r} |p_i\rangle_{2i-1}$ are orthogonal, as required in Ref. [14]. Further, from the viewpoint of temporal complexity, the described QCA is, within a constant, as efficient as a quantum logic network with local and next-neighbor gates. These issues have also been addressed in Ref. [15].


FIG. 3. (Color online) Boundary specifications for the described QCA on a planar sheet, with columns running from 0 to $2r-1$. The 2 × 2 cells displayed in white are acted upon by $\tau$, as usual. To the hatched 1 × 2 cells a SWAP gate is applied, and to the cross-hatched cells the identity operation. The column in gray underlay represents a data register moving right.

(2) That the described QCA lives on a torus simplifies the discussion, but is not essential. A planar sheet of size $n \times 2r$ is sufficient for simulation of the discussed networks of $r$ time steps and $n$ qubits. Figure 3 specifies the operations on the boundary that differ from $\tau$. When this modified QCA is run, in the bulk the data registers still move right and the program registers left. On the left and right boundary, however, the registers are reflected. As a consequence, on the left-hand side of the lattice, reflected program registers are acted upon by left-moving program registers as if they were data. More severely, on the right-hand side of the cluster, reflected data registers act upon right-moving data registers as program. Therefore, the state of the QCA is no longer a tensor product of the column states, but instead an entangled state supported by many columns grows from the right, by one column in each time step. However, none of that has an impact on the data register $|D_0\rangle$, which, as before, can be read out from column $r$ after $r$ transitions of the automaton.

III.  CONCLUSION 

This paper describes a quantum cellular automaton for universal quantum computation. The transition function from one time step to the next is generated by a four-qubit unitary transformation acting on Margolus cells of size 2 × 2. The program is encoded in the initial state of the system, and the automaton is left to its autonomous evolution from initialization to readout.

Coming back to our initially posed question, it is found that QCAs performing complex tasks in quantum information processing can indeed be constructed for compact cells and neighborhood schemes.

We review an ongoing program of interdisciplinary research aimed at developing hardware and protocols for quantum communication networks. Our primary experimental goals are to demonstrate quantum state mapping from storage/processing media (internal states of trapped atoms) to transmission media (optical photons), and to investigate a nanotechnology paradigm for cavity QED that would involve the integration of magnetic microtraps with photonic bandgap structures.


1  Introduction 

No one can dispute that we live in an age of information; living in such an age, we experience an ever-increasing need for technology that allows us to share information in a secure, efficient, and reliable way. Quantum technologies show great potential for revolutionizing the methods by which we collect and distribute information, in diverse engineering contexts that range from computer architecture to the Internet. Seminal theoretical research in the field has identified a portfolio of tasks for which quantum methods are provably superior to their classical counterparts. For example, it has recently been shown that the communication complexity of a fundamental task such as appointment-scheduling can be drastically reduced through the use of quantum resources [1]. It is also known that the elementary protocol of quantum state teleportation [2] could provide a means for distributing cryptographic key with absolute and verifiable security [3], and that related methods could be used to enhance the capacity of noisy communication channels [4].

Figure 1. Transmission of quantum information between nodes in a quantum network. A qubit is initially stored in the internal state of an atom trapped within a high-finesse optical cavity, at the sending node (each node contains multiple atoms). 1: the atomic state is mapped to that of an optical photon via cavity QED, 2: the photon travels through a fiber to the receiving node, 3: the photonic state is mapped into that of a trapped atom at the receiving node. Multiple nodes connected in this fashion form a quantum network.

An ongoing program of research at Caltech focuses on development of the hardware and error correction protocols required to construct a quantum network (see Fig. 1). Here quantum nodes with memory and local processing capabilities will be connected by quantum communication channels that allow robust transmission of coherent quantum information among nodes of the network. Such a network would be capable of performing distributed quantum computations [5]. In our proposed implementation, trapped neutral atoms will provide quantum memory at each node of the network, and optical cavities will be utilized both to perform quantum gates and to transfer quantum information between nodes [6]. With a long-term view towards developing integrated and robust hardware for quantum nodes, much of our work is aimed at exploring a revolutionary new technical paradigm for cavity QED based on magnetic microtraps and optical photonic bandgap structures [7]. In the following sections, we first provide an overview of this new technical paradigm and then describe the quantum repeater protocol that will enable robust quantum communication in the presence of channel decoherence.

2  Nanotechnology  for  optical  cavity  QED  with  neutral  atoms 

During  the  past  decade,  advances  in  semiconductor  crystal  growth  technique  have  led  to 
the  rapid  development  of  optical  microcavity  devices.  With  the  ultrahigh  precision  that 
can  now  be  obtained  over  layer  thicknesses,  high  reflectivity  mirrors  can  easily  be  grown  by 
depositing  alternating  layers  of  high-  and  low-index  materials  to  define  high-Q  cavities  in  the 
vertical  dimension.  Recently,  it  has  also  become  possible  to  microfabricate  high  reflectivity 
mirrors  with  horizontal  orientation  by  etching  periodic  arrays  of  holes  into  a  semiconductor 
substrate.  These  “photonic  crystals”  can  be  designed  to  fully  suppress  horizontal  light 
propagation  within  certain  frequency  bands,  called  photonic  bandgaps  (see  Fig.  2)  8,9 . 

When combined with high index-contrast slabs, in which the propagation of light is
already confined to a plane, photonic bandgap mirrors can be used to confine light within
“nanocavities” of extremely small volume. Such cavities can be produced simply by
introducing defects into the photonic bandgap structure, for example by omitting one hole
from the periodic array. The resonance wavelength of such a cavity can be lithographically
defined by adjusting the precise geometry of the holes immediately surrounding the defect.
In previous work this strategy has been employed to construct microcavity semiconductor
lasers 10 with mode volumes as small as $2.5\,(\lambda/2n_{\mathrm{slab}})^3$.

Using  the  same  fabrication  techniques,  it  is  also  possible  to  guide,  bend,  filter,  and  sort 
light  in  2-D  photonic  crystals.  Such  photonic  waveguides  could  clearly  be  used  as  optical 
channels  between  nanocavities,  and  could  also  be  used  to  define  simple  interferometers  to 
combine  cavity  inputs/outputs.  Tapered  photonic  waveguides  could  be  pigtailed  to  standard 
optical  fibers,  as  a  means  of  bringing  light  into  or  out  of  the  semiconductor  slab.  Using 
the  techniques  described  above,  we  can  easily  envision  fabricating  semiconductor  wafers 
in which multiple interconnected optical cavities are embedded at an areal density $\sim 10^6$
cm$^{-2}$. Such arrays of PBG nanocavities could replace the Fabry-Perot cavities that have
been  used  in  previous  experiments  on  optical  cavity  QED  with  neutral  atoms. 

The basic parameter that measures coupling strength between an atom and an optical
cavity is the so-called “vacuum Rabi frequency” $g = d \cdot E_1/\hbar$, where $d$ is the atomic
dipole-transition matrix element and $E_1$ is the electric field per photon in the cavity. To achieve
Figure 2. Schematic cross-section of the photonic crystal structure.


strong coupling we must have $g$ sufficiently large compared to both the atomic dipole decay
rate $\gamma_\perp$ and the cavity field decay rate $\kappa$. For the purpose of quantum information
processing, we require that both the critical photon number $m_0$ and the critical atom number $N_0$
be less than one, where $m_0 = \ldots$ and $N_0 = 2\kappa\gamma_\perp/g^2$. As the dipole matrix element $d$ is fixed
by atomic structure, the only way to increase $g/\gamma_\perp$ is to construct cavities in which $E_1$ is
as large as possible.

We have performed numerical calculations (via finite-difference time-domain modeling
12) of the mode properties for several possible PBG defect cavities. In one case we have
obtained $V_{\mathrm{eff}} = 0.09\,(\lambda/2)^3$ together with $Q \approx 1.6 \times 10^4$. This particular geometry
incorporates a central “defect” hole of reduced diameter $\approx 108$ nm. To couple most strongly
with the cavity field, an atom should be trapped at the center of this hole. While further
optimization of the geometry may still be possible, these parameters already lead to $g \sim 5.6$
GHz with $\kappa = 11$ GHz, or $m_0 \sim 5.4 \times 10^{-8}$ photons and $N_0 \sim 1.8 \times 10^{-3}$ atoms.
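
As a rough numerical cross-check of the quoted critical atom number, the sketch below evaluates $N_0 = 2\kappa\gamma_\perp/g^2$ for the cavity parameters given above. The atomic dipole decay rate is not stated in the text; the value used here (2.6 MHz, a typical figure for the Cs D2 line) is an assumption, as are the function and variable names.

```python
def critical_atom_number(g, kappa, gamma_perp):
    """Critical atom number N_0 = 2*kappa*gamma_perp / g**2.

    All three rates must be given in the same units.
    """
    return 2.0 * kappa * gamma_perp / g**2


# Cavity parameters quoted in the text (in Hz).
G_COUPLING = 5.6e9
KAPPA = 11e9
# Assumed atomic dipole decay rate for Cs (not given in the text).
GAMMA_PERP = 2.6e6

n0 = critical_atom_number(G_COUPLING, KAPPA, GAMMA_PERP)
print(f"N_0 = {n0:.2e}")
```

Under these assumptions the result is of the same order as the $N_0$ value quoted above.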

In addition to the creation of semiconductor nanostructures, modern microfabrication
techniques also allow the patterning of conducting wires and ferromagnetic materials at the
1-10 $\mu$m scale. As a result, it should be possible to construct magnetic microtraps with
field gradients on the order of $10^8$ G/cm and field curvatures $\approx 10^8$ G/cm$^2$ 14,15. With this
magnitude of field curvature, e.g., an Ioffe configuration trap would hold a Cs atom with
g ~ 0.35 in the axial coordinate and g ~ 0.035 in the radial coordinates 14, and provide a
well-defined bias field $\approx 1$ G at the trap center.

Our  initial  integration  scheme  will  be  to  deposit  micron-scale  wires  on  the  surface  of 
a  photonic  crystal  membrane,  such  that  each  PBG  cavity  is  surrounded  by  a  concentric 
microtrap.  Fig.  3  shows  one  possible  configuration.  Current-carrying  wires  arranged  in  a 
Figure 3. Schematic of an integrated PBG cavity and planar magnetic microtrap. (a) Top view showing
arrangement of the current-carrying wires. Arrows indicate the direction of current flow. (b) Cross-section
through the defect cavity (as indicated by the dashed line in (a)), showing wire deposition on the surface
of the semiconductor slab.


suitable  modification  of  Weinstein  and  Libbrecht’s  ‘Ioffe  c’  configuration  14  can  project  a 
magnetic  field  with  a  stable  minimum  at  the  geometric  center  of  the  PBG  cavity,  such  that 
one  or  more  atoms  could  be  confined  within  the  defect  hole  and  (as  discussed  above)  would 
therefore  experience  strong  coupling.  A  similar  geometry  can  be  envisioned  for  a  microtrap 
based  on  permanent  magnets,  rather  than  current-carrying  wires.  This  would  involve  the 
use  of  techniques  for  electroplating  ferromagnetic  material  (such  as  nickel  16 )  into  selected 
holes  around  the  perimeter  of  each  PBG  cavity. 


3  Quantum  repeater  architecture 

Any  quantum  communication  scheme  will  have  to  deal  with  losses  and  other  noise  (such  as 
phase shifts) in the quantum channel connecting the sending and receiving nodes.
Unfortunately, standard quantum error correction schemes as developed for quantum computing
will  not  be  practical  for  two  reasons.  First,  such  schemes  typically  work  only  if  the  error 
probability  is  sufficiently  small,  and  second,  they  would  require  a  large  overhead  in  qubits 
to  protect  a  single  qubit  against  one  error.  Here  we  review  an  error  correction  protocol 
that  solves  for  photon  absorption  errors  to  all  orders  (i.e.,  it  works  irrespective  of  the  error 
rate)  and  that  nevertheless  requires  only  a  moderate  overhead  in  resources.  Subsequently 
we  describe  a  quantum  repeater  scheme  that  allows  one  to  reinforce  the  quantum  signal  at 
intermediate  nodes,  thus  allowing  communication  over  long  distances. 
Quantum error correction schemes require entangled states of at least 5 qubits to protect
one  qubit  against  a  single  general  error.  In  the  specific  physical  setup  described  in  the 
present  proposal,  however,  not  all  possible  errors  are  equally  likely  to  occur.  In  particular, 
whereas  photon  absorption  is  a  common  error,  the  inverse  process,  the  spontaneous  creation 
of  a  photon  with  a  particular  polarization  and  optical  frequency  is  extremely  unlikely.  This 
fact  allows  one  to  encode  the  quantum  information  in  such  a  way  that  the  photon  absorption 
error  can  be  corrected  to  all  orders  with  just  two  entangled  atoms  per  node.  Without  going 
into the technical details (see 17, 18), we present here just the basic ideas.

1. The logical $|0\rangle$ and $|1\rangle$ are encoded in the absence and presence of a photon in a
particular mode, respectively, so that the error $|0\rangle \mapsto |1\rangle$ has been eliminated.

2.  We  use  an  auxiliary  qubit  in  the  sending  node  that  contains  no  information  about  the 
actual  qubit  we  wish  to  communicate,  to  send  a  photon.  This  way,  if  that  photon  is 
absorbed,  no  information  will  be  lost.  This  part  of  the  protocol  is  repeated  until  we 
know  that  the  photon  arrived.  Only  then  do  we  involve  the  qubit  that  contains  the 
actual  information. 

3. In order to detect photon absorption, we do not have to monitor the environment. The
auxiliary atom in the sending node is used twice: its state is communicated in two
transmissions to two atoms in the receiving node. Between the two transmissions a
NOT operation is applied to the sending atom (i.e., we interchange $|0\rangle \leftrightarrow |1\rangle$). The
two atoms receiving the information should, therefore, be in different states. Photon
absorption can thus be unambiguously detected by measuring whether the two atoms
are in the state $|0\rangle|0\rangle$.

4.  As  a  bonus,  the  application  of  the  NOT  operation  symmetrizes  and  thereby  corrects 
for  all  systematic  phase  errors. 

Two  atoms  per  node  are  sufficient  to  achieve  a  high  communication  fidelity  F  (defined  as 
the  overlap  between  the  actual  and  desired  final  state) .  If  there  is  need  for  further  correction 
of  the  remaining  phase  errors,  one  extra  atom  per  node  is  needed  18 . 

The probability of photon absorption increases exponentially with the length of the
communication channel. Thus, while the above error correction scheme in principle works just
as  well  for  large-distance  communication,  it  would  require  an  exponentially  large  number  of 
transmissions.  In  order  to  reduce  the  number  of  transmissions,  we  need,  just  as  for  classical 
communication,  intermediate  nodes — quantum  repeaters — where  the  signal  is  “amplified.” 
Of  course,  since  true  amplification  of  quantum  signals  is  not  possible,  the  procedure  used 
is  more  subtle  than  in  the  classical  case,  and  is  in  fact  a  variant  of  standard  entanglement 
purification  schemes  19 ,  modified  to  encompass  the  restrictions  on  the  amount  of  physical 
resources. 
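
The scaling argument above can be illustrated with a toy model. The sketch below assumes that a photon survives a channel of length L with probability exp(-L / L_att) and that each transmission is an independent trial, so the expected number of attempts is the inverse of the success probability. The attenuation length `l_att`, the function names, and the neglect of purification overhead and of the two-transmission structure of the protocol are all simplifying assumptions, not part of the scheme described here.

```python
import math


def expected_attempts_direct(length, l_att):
    """Expected number of transmissions until one photon survives the full channel."""
    p_success = math.exp(-length / l_att)
    return 1.0 / p_success


def expected_attempts_segmented(length, l_att, n_segments):
    """Total expected transmissions when the channel is split into equal
    segments that are each retried independently (repeater overhead ignored)."""
    per_segment = expected_attempts_direct(length / n_segments, l_att)
    return n_segments * per_segment


direct = expected_attempts_direct(10.0, 1.0)
segmented = expected_attempts_segmented(10.0, 1.0, 10)
print(f"direct: {direct:.0f} attempts, 10 segments: {segmented:.1f} attempts")
```

In this toy model the direct cost grows exponentially with the channel length, while the segmented cost grows only linearly once the segment length is held fixed.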

Several  quantum  repeater  schemes  have  been  discussed  in  Ref.  20 .  The  most  efficient 
scheme  takes  the  form  of  a  concatenated  error  correction  code.  It  makes  use  of  two  simple 
sub-procedures. 

1. Two imperfect EPR pairs shared between nodes A and B can be used to distill a single
pair of higher fidelity between nodes A and B.

2. Two imperfect entangled pairs, one connecting node A with B and one connecting B with C,
can be used to create a single EPR pair of lower fidelity between nodes A and C.

When one has more than one intermediate node, repeating procedure 2 would lead to a
fidelity that decreases exponentially with the total number of nodes. In order to achieve a
certain minimum final fidelity, one has to periodically purify the entangled states obtained
thus far with procedure 1. The whole protocol is then a concatenated procedure in that the
purification has to be repeated at different levels: starting at the lowest level one purifies
all EPR pairs between adjacent nodes, at the next level one purifies the lower-fidelity EPR
pairs between more distant nodes obtained by procedure 2, and so on.
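
The interplay of the two sub-procedures can be sketched numerically. The example below is only an illustration under stated assumptions: every pair is modeled as a Werner state of fidelity F, procedure 2 is modeled by the standard entanglement-swapping formula for Werner states, and procedure 1 by the textbook recurrence purification map for Werner states (valid for F > 1/2). These formulas and all function names are stand-ins chosen for the sketch and are not taken from the scheme of Ref. 20.

```python
def swap_fidelity(f1, f2):
    """Fidelity after connecting two Werner pairs A-B and B-C into one pair A-C."""
    return f1 * f2 + (1.0 - f1) * (1.0 - f2) / 3.0


def purify_fidelity(f):
    """Fidelity after one successful recurrence purification round
    acting on two Werner pairs of equal fidelity f."""
    e = (1.0 - f) / 3.0
    numerator = f * f + e * e
    denominator = f * f + 2.0 * f * e + 5.0 * e * e
    return numerator / denominator


def nested_link(f_elementary, levels, rounds_per_level):
    """Purify the elementary pairs, then alternate connection and purification.

    Each level doubles the distance spanned by the pair.
    """
    f = f_elementary
    for _ in range(rounds_per_level):
        f = purify_fidelity(f)
    for _ in range(levels):
        f = swap_fidelity(f, f)
        for _ in range(rounds_per_level):
            f = purify_fidelity(f)
    return f


without = nested_link(0.9, levels=3, rounds_per_level=0)
with_purification = nested_link(0.9, levels=3, rounds_per_level=1)
print(f"no purification: {without:.3f}, one round per level: {with_purification:.3f}")
```

In this model, connection alone drives the fidelity down at every level, while interleaved purification keeps it well above the unpurified value, at the cost of consuming additional pairs.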
We study the properties of quantum stabilizer codes that embed a finite-dimensional protected code space in
an  infinite-dimensional  Hilbert  space.  The  stabilizer  group  of  such  a  code  is  associated  with  a  symplectically 
integral  lattice  in  the  phase  space  of  2 N  canonical  variables.  From  the  existence  of  symplectically  integral 
lattices  with  suitable  properties,  we  infer  a  lower  bound  on  the  quantum  capacity  of  the  Gaussian  quantum 
channel  that  matches  the  one-shot  coherent  information  optimized  over  Gaussian  input  states. 
I. INTRODUCTION

A central problem in quantum information theory is to determine the quantum capacity
of a noisy quantum channel, that is, the maximum rate at which coherent quantum
information may be transmitted through the channel and recovered with arbitrarily good
fidelity [1,2]. A general solution to the corresponding problem for classical noisy channels
was found by Shannon in the pioneering paper that launched classical information theory
[3,4]. With the development of the theory of quantum error correction [5,6], considerable
progress has been made toward characterizing the quantum channel capacity [7], but it
remains less well understood than the classical capacity.

The asymptotic coherent information has been shown to provide an upper bound on the
capacity [8,9] and a matching lower bound has been conjectured, but not proven [10].
Unfortunately, the coherent information is not subadditive [11], so that its asymptotic
value is not easily computed. Therefore, it has been possible to verify the coherent
information conjecture in just a few simple cases [12].

One quantum channel of considerable intrinsic interest is the Gaussian quantum channel,
which might also be simple enough to be analytically tractable, thus providing a fertile
testing ground for the general theory of quantum capacities. A simple analytic formula for
the capacity of the Gaussian classical channel was found by Shannon [3,4]. The Gaussian
quantum channel was studied by Holevo and Werner [13], who computed the one-shot
coherent information for Gaussian input states, and derived an upper bound on the quantum
capacity.

Lower bounds on the quantum capacity of the Gaussian quantum channel were established
by Gottesman, Kitaev, and Preskill [14]. They developed quantum error-correcting codes
that protect a finite-dimensional subspace of an infinite-dimensional Hilbert space, and
showed that these codes may be used to transmit high-fidelity quantum information at a
nonzero asymptotic rate. In this paper, we continue the study of the Gaussian quantum
channel begun in [14]. Our main result is that the coherent information computed by Holevo
and Werner is in fact an achievable rate. This result lends nontrivial support to the
coherent information conjecture.

We define the Gaussian quantum channel and review the results of Holevo and Werner [13]
in Sec. II. In Sec. III, we describe the stabilizer codes for continuous quantum variables
introduced in [14], which are based on the concept of a symplectically integral lattice
embedded in phase space. In Secs. IV and V, we apply these codes to the Gaussian quantum
channel, and calculate an achievable rate arising from lattices that realize efficient
packings of spheres in high dimensions. This achievable rate matches the one-shot coherent
information $I_Q$ of the channel in cases where $2^{I_Q}$ is an integer. Rates achieved with
concatenated coding are calculated in Sec. VI; these fall short of the coherent information
but come close. In Sec. VII, we consider the Gaussian classical channel, and again find that
concatenated codes achieve rates close to the capacity. Section VIII contains some
concluding comments about the quantum capacity of the Gaussian quantum channel.

II.  THE  GAUSSIAN  QUANTUM  CHANNEL 

The Gaussian quantum channel is a natural generalization of the Gaussian classical
channel. In the classical case, we consider a channel such that the input $x$ and the output
$y$ are real numbers. The channel applies a displacement to the input by distance $\xi$,

$$y = x + \xi, \qquad (1)$$

where $\xi$ is a Gaussian random variable with mean zero and variance $\sigma^2$; the probability
distribution governing $\xi$ is

$$P(\xi) = \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\xi^2/2\sigma^2}. \qquad (2)$$


Similarly, acting on a quantum system described by canonical variables $q$ and $p$ that
satisfy the commutation relation $[q,p] = i\hbar$, we may consider a quantum channel that
applies a phase-space displacement described by the unitary operator

$$D(\alpha) = \exp(\alpha a^\dagger - \alpha^* a), \qquad (3)$$

where $\alpha$ is a complex number, $[a,a^\dagger] = 1$, and $q$, $p$ may be expressed in terms of $a$
and $a^\dagger$ as

$$q = \sqrt{\frac{\hbar}{2}}\,(a + a^\dagger), \qquad p = -i\sqrt{\frac{\hbar}{2}}\,(a - a^\dagger). \qquad (4)$$

This quantum channel is Gaussian if $\alpha$ is a complex Gaussian random variable with mean
zero and variance $\tilde\sigma^2$. In that case, the channel is the superoperator (trace-preserving
completely positive map) $\mathcal{E}$ that acts on the density operator $\rho$ according to

$$\rho \to \mathcal{E}(\rho) = \frac{1}{\pi\tilde\sigma^2} \int d^2\alpha\, e^{-|\alpha|^2/\tilde\sigma^2}\, D(\alpha)\,\rho\, D(\alpha)^\dagger. \qquad (5)$$

In other words, the position $q$ and momentum $p$ are displaced independently,

$$q \to q + \xi_q, \qquad p \to p + \xi_p, \qquad (6)$$

where $\xi_q$ and $\xi_p$ are real Gaussian random variables with mean zero and variance
$\sigma^2 = \hbar\tilde\sigma^2$.
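
The relation between the variance of the complex displacement and the variance of the position and momentum shifts can be checked by direct sampling. The sketch below is illustrative only: it assumes the conventional normalization $q = \sqrt{\hbar/2}\,(a + a^\dagger)$, $p = -i\sqrt{\hbar/2}\,(a - a^\dagger)$, under which a displacement by $\alpha$ shifts $q$ by $\sqrt{2\hbar}\,\mathrm{Re}\,\alpha$ and $p$ by $\sqrt{2\hbar}\,\mathrm{Im}\,\alpha$. The function names and the choice of units are hypothetical.

```python
import math
import random


def sample_shift(sigma_tilde_sq, hbar, rng):
    """Draw one (xi_q, xi_p) pair for the Gaussian quantum channel.

    alpha is complex Gaussian with E[|alpha|^2] = sigma_tilde_sq, so its real
    and imaginary parts each have variance sigma_tilde_sq / 2.
    """
    std = math.sqrt(sigma_tilde_sq / 2.0)
    re_alpha = rng.gauss(0.0, std)
    im_alpha = rng.gauss(0.0, std)
    scale = math.sqrt(2.0 * hbar)
    return scale * re_alpha, scale * im_alpha


def empirical_variance(values):
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


rng = random.Random(0)
HBAR = 1.0             # units with hbar = 1 (assumed for the example)
SIGMA_TILDE_SQ = 0.25  # arbitrary example value
shifts = [sample_shift(SIGMA_TILDE_SQ, HBAR, rng) for _ in range(100_000)]
var_q = empirical_variance([s[0] for s in shifts])
var_p = empirical_variance([s[1] for s in shifts])
print(var_q, var_p, HBAR * SIGMA_TILDE_SQ)
```

Both empirical variances should come out close to $\hbar\tilde\sigma^2$, the variance of the independent position and momentum shifts.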

To define the capacity, we consider a channel’s $n$th extension. In the classical case, a
message is transmitted consisting of the $n$ real variables

$$\vec{x} = (x_1, x_2, \ldots, x_n), \qquad (7)$$

and the channel applies the displacement

$$\vec{x} \to \vec{x} + \vec{\xi}, \qquad \vec{\xi} = (\xi_1, \xi_2, \ldots, \xi_n), \qquad (8)$$

where the $\xi_i$’s are independent Gaussian random variables, each with mean zero and
variance $\sigma^2$. A code consists of a finite number $m$ of $n$-component input signals

$$\vec{x}^{(a)}, \qquad a = 1,2,\ldots,m \qquad (9)$$

and a decoding function that maps output vectors to the index set $\{1,2,\ldots,m\}$. We refer
to $n$ as the length of the code.

If the input vectors were unrestricted, then for fixed $\sigma^2$ we could easily construct a
code with an arbitrarily large number of signals $m$ and a decoding function that correctly
identifies the index $(a)$ of the input with an arbitrarily small probability of error; even
for $n = 1$, we merely choose the distance between signals to be large compared to $\sigma$. To
obtain an interesting notion of capacity, we impose a constraint on the average power of
the signal,

$$\frac{1}{n} \sum_i \left(x_i^{(a)}\right)^2 \le P, \qquad (10)$$

for each $a$. We say that a rate $R$ (in bits) is achievable with power constraint $P$ if there
is a sequence of codes satisfying the constraint such that the $\beta$th code in the sequence
contains $m_\beta$ signals with length $n_\beta$, where

$$R = \lim_{\beta\to\infty} \frac{1}{n_\beta} \log_2 m_\beta, \qquad (11)$$

and the probability of a decoding error vanishes in the limit $\beta \to \infty$. The capacity of
the channel with power constraint $P$ is the supremum of all achievable rates.

The need for a constraint on the signal power to define the capacity of the Gaussian
classical channel may be understood on dimensional grounds. The classical capacity (in
bits) is a dimensionless function of the variance $\sigma^2$, but $\sigma^2$ has dimensions. Another
quantity with the dimensions of $\sigma^2$ is needed to construct a dimensionless variable, and
the power $P$ fills this role.

In contrast, no power constraint is needed to define the quantum capacity of the quantum
channel. Rather, Planck’s constant $\hbar$ enables us to define a dimensionless variance
$\tilde\sigma^2 = \sigma^2/\hbar$, and the capacity is a function of this quantity. In the quantum case, a
code consists of an encoding superoperator that maps an $m$-dimensional Hilbert space
$\mathcal{H}_m$ into the infinite-dimensional Hilbert space $\mathcal{H}^{\otimes N}$ of $N$ canonical quantum systems,
and a decoding superoperator that maps $\mathcal{H}^{\otimes N}$ back to $\mathcal{H}_m$. We say that the rate $R$ (in
qubits) is achievable if there is a sequence of codes such that

$$R = \lim_{\beta\to\infty} \frac{1}{N_\beta} \log_2 m_\beta, \qquad (12)$$

where arbitrary states in $\mathcal{H}_m$ may be recovered with a fidelity that approaches 1 as
$\beta \to \infty$. The quantum capacity $C_Q$ of the channel is defined as the supremum of all
achievable rates.

Holevo and Werner [13] studied a more general Gaussian channel that includes damping or
amplification as well as displacement. However, we will confine our attention in this paper
to channels that apply only displacements. Holevo and Werner derived a general upper bound
on the quantum capacity by exploiting the properties of the “diamond norm” (norm of
complete boundedness) of a superoperator. The diamond norm is defined as follows: First, we
define the trace norm of an operator $X$ as

$$\|X\|_{\mathrm{tr}} = \mathrm{tr}\sqrt{X^\dagger X}, \qquad (13)$$

which for a self-adjoint operator is just the sum of the absolute values of the
eigenvalues. Then a norm of a superoperator $\mathcal{E}$ may be defined as

$$\|\mathcal{E}\|_{\mathrm{so}} = \sup_{X \ne 0} \frac{\|\mathcal{E}(X)\|_{\mathrm{tr}}}{\|X\|_{\mathrm{tr}}}. \qquad (14)$$

The superoperator norm is not stable with respect to appending an ancillary system on which
$\mathcal{E}$ acts trivially. Thus, we define the diamond norm of $\mathcal{E}$ as

$$\|\mathcal{E}\|_\diamond = \sup_n \|\mathcal{E} \otimes I_n\|_{\mathrm{so}}, \qquad (15)$$

where $I_n$ denotes the $n$-dimensional identity operator. (This supremum is always attained
for some $n$ no larger than the dimension of the Hilbert space on which $\mathcal{E}$ acts.) Holevo and
Werner showed that the quantum capacity obeys the upper bound

$$C_Q(\mathcal{E}) \le \log_2 \|\mathcal{E} \circ T\|_\diamond, \qquad (16)$$

062301-2 


ACHIEVABLE  RATES  FOR  THE  GAUSSIAN  QUANTUM  CHANNEL 


PHYSICAL  REVIEW  A  64  062301 


where $T$ is the transpose operation defined with respect to some basis. In the case of the
Gaussian quantum channel, they evaluated this expression, obtaining

$$C_Q(\sigma^2) \le \log_2(\hbar/\sigma^2) \qquad (17)$$

for $\hbar/\sigma^2 > 1$, and $C_Q(\sigma^2) = 0$ for $\hbar/\sigma^2 \le 1$.

Holevo and Werner [13] also computed the coherent information of the Gaussian quantum
channel for a Gaussian input state. To define the coherent information of the channel $\mathcal{E}$
with input density operator $\rho$, one introduces a reference system $R$ and a purification of
$\rho$, a pure state $|\Phi\rangle$ such that

$$\mathrm{tr}_R(|\Phi\rangle\langle\Phi|) = \rho. \qquad (18)$$

Then the coherent information $I_Q$ is

$$I_Q(\mathcal{E},\rho) = S(\mathcal{E}(\rho)) - S(\mathcal{E} \otimes I_R(|\Phi\rangle\langle\Phi|)), \qquad (19)$$

where $S$ denotes the Von Neumann entropy,

$$S(\rho) = -\mathrm{tr}(\rho \log_2 \rho). \qquad (20)$$

It is conjectured [10,8,9] that the quantum capacity is related to the coherent information
by

$$C_Q(\mathcal{E}) = \lim_{n\to\infty} \frac{1}{n}\, C_n(\mathcal{E}), \qquad (21)$$

where

$$C_n(\mathcal{E}) = \sup_\rho I_Q(\mathcal{E}^{\otimes n},\rho). \qquad (22)$$

Unlike the mutual information that defines the classical capacity, the coherent information
is not subadditive in general, and therefore, the quantum capacity need not coincide with
the “one-shot” capacity $C_1$. Holevo and Werner showed that for the Gaussian quantum
channel, the supremum of $I_Q$ over Gaussian input states is

$$(I_Q)_{\max} = \log_2(\hbar/e\sigma^2), \qquad (23)$$

(where $e = 2.71828\ldots$) for $\hbar/e\sigma^2 > 1$, and $(I_Q)_{\max} = 0$ for $\hbar/e\sigma^2 \le 1$. According to
the coherent-information conjecture, Eq. (23) should be an achievable rate.

III. QUANTUM ERROR-CORRECTING CODES
FOR CONTINUOUS QUANTUM VARIABLES

The lattice codes developed in [14] are stabilizer codes [15,16] that embed a
finite-dimensional code space in the infinite-dimensional Hilbert space of $N$ “oscillators,”
a system described by $2N$ canonical variables $q_1, q_2, \ldots, q_N, p_1, p_2, \ldots, p_N$. That is, the
code space is the simultaneous eigenstate of $2N$ commuting unitary operators, the generators
of the code’s stabilizer group. Each stabilizer generator is a Weyl operator, a displacement
in the $2N$-dimensional phase space.

Such displacements may be parametrized by $2N$ real numbers
$\alpha_1, \alpha_2, \ldots, \alpha_N, \beta_1, \beta_2, \ldots, \beta_N$, and expressed as

$$U(\alpha,\beta) = \exp\left[ i\sqrt{2\pi} \sum_{i=1}^{N} (\alpha_i p_i + \beta_i q_i) \right]. \qquad (24)$$

Two such operators obey the commutation relation

$$U(\alpha,\beta)\,U(\alpha',\beta') = e^{2\pi i\,\omega(\alpha\beta,\alpha'\beta')}\, U(\alpha',\beta')\,U(\alpha,\beta), \qquad (25)$$

where

$$\omega(\alpha\beta,\alpha'\beta') = \alpha \cdot \beta' - \alpha' \cdot \beta \qquad (26)$$

is the symplectic form. Thus, Weyl operators commute if and only if their symplectic form is
an integer.

The $2N$ generators of a stabilizer code are commuting Weyl operators

$$U(\alpha^{(a)},\beta^{(a)}), \qquad a = 1,2,\ldots,2N. \qquad (27)$$

Thus, the elements of the stabilizer group are in one-to-one correspondence with the points
of a lattice $\mathcal{L}$ generated by the $2N$ vectors $v^{(a)} = (\alpha^{(a)},\beta^{(a)})$. These vectors may be
assembled into the generator matrix $M$ of $\mathcal{L}$ given by

$$M = \begin{pmatrix} v^{(1)} \\ v^{(2)} \\ \vdots \\ v^{(2N)} \end{pmatrix}. \qquad (28)$$

Then the requirement that the stabilizer generators commute, through Eq. (25), becomes the
condition that the antisymmetric matrix

$$A = M \omega M^T \qquad (29)$$

has integral entries, where $M^T$ denotes the transpose of $M$, $\omega$ is the $2N \times 2N$ matrix

$$\omega = \begin{pmatrix} 0 & I_N \\ -I_N & 0 \end{pmatrix}, \qquad (30)$$

and $I_N$ is the $N \times N$ identity matrix. If the generator matrix $M$ of a lattice $\mathcal{L}$ has the
property that $A$ is an integral matrix, then we will say that the lattice $\mathcal{L}$ is
symplectically integral.

Encoded operations that preserve the code subspace are associated with the code’s
normalizer group, the group of phase-space translations that commute with the code
stabilizer. The generator matrix of the normalizer is a matrix $M^\perp$ that may be chosen to be

$$M^\perp = A^{-1} M, \qquad (31)$$

so that

$$M^\perp \omega M^T = I; \qquad (32)$$

and

$$(M^\perp)\,\omega\,(M^\perp)^T = (A^{-1})^T. \qquad (33)$$

We will refer to the lattice generated by $M^\perp$ as the symplectic dual $\mathcal{L}^\perp$ of the
lattice $\mathcal{L}$.
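
The conditions in Eqs. (29)-(33) are straightforward to verify for a concrete generator matrix. The following sketch uses exact rational arithmetic; the helper names (`omega`, `gram_form`, `symplectic_dual`, and so on) and the example matrix are hypothetical choices made for illustration, and the dual is computed as $A^{-1}M$, which is the unique matrix satisfying Eq. (32) when $M$ is invertible.

```python
from fractions import Fraction


def omega(n):
    """The 2n x 2n matrix [[0, I], [-I, 0]] of Eq. (30)."""
    w = [[Fraction(0)] * (2 * n) for _ in range(2 * n)]
    for i in range(n):
        w[i][n + i] = Fraction(1)
        w[n + i][i] = Fraction(-1)
    return w


def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(row) for row in zip(*a)]


def inverse(a):
    """Gauss-Jordan inverse over the rationals (assumes a is invertible)."""
    n = len(a)
    aug = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(a)]
    for c in range(n):
        pivot_row = next(r for r in range(c, n) if aug[r][c] != 0)
        aug[c], aug[pivot_row] = aug[pivot_row], aug[c]
        pivot = aug[c][c]
        aug[c] = [x / pivot for x in aug[c]]
        for r in range(n):
            if r != c and aug[r][c] != 0:
                factor = aug[r][c]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]


def gram_form(m):
    """A = M omega M^T, Eq. (29)."""
    n = len(m) // 2
    return matmul(matmul(m, omega(n)), transpose(m))


def is_symplectically_integral(m):
    return all(x.denominator == 1 for row in gram_form(m) for x in row)


def symplectic_dual(m):
    """Generator matrix of the normalizer lattice, M_perp = A^{-1} M."""
    return matmul(inverse(gram_form(m)), m)


# Example for N = 1: a rectangular lattice whose unit cell has area 2.
M = [[2, 0], [0, 1]]
M_perp = symplectic_dual(M)
print(is_symplectically_integral(M), M_perp)
```

For this example the unit cell of the lattice has area 2 and that of its symplectic dual has area 1/2, so the dual lattice is finer than the stabilizer lattice.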

Another matrix that generates the same lattice as $M$ (and therefore defines a different set
of generators for the same stabilizer group) is

$$M' = RM, \qquad (34)$$

where $R$ is an integral matrix with $\det R = \pm 1$. This replacement changes the matrix $A$
according to

$$A \to R A R^T. \qquad (35)$$

By Gaussian elimination, an $R$ may be constructed such that

$$A = \begin{pmatrix} 0 & D \\ -D & 0 \end{pmatrix} \qquad (36)$$

and

$$(A^{-1})^T = \begin{pmatrix} 0 & D^{-1} \\ -D^{-1} & 0 \end{pmatrix}, \qquad (37)$$

where $D$ is a positive diagonal integral $N \times N$ matrix. In the important special case of a
symplectically self-dual lattice, both $A$ and $(A^{-1})^T$ are integral matrices; therefore
$D = D^{-1}$ and the standard form of $A$ is

$$A = \begin{pmatrix} 0 & I_N \\ -I_N & 0 \end{pmatrix}. \qquad (38)$$

Hence, the generator matrix of a symplectically self-dual lattice may be chosen to be a real
symplectic matrix: $M \omega M^T = \omega$.

If the lattice is rotated, then the generator matrix is transformed as

$$M \to MO, \qquad (39)$$

where $O$ is an orthogonal matrix. Therefore, it is convenient to characterize a lattice with
its Gram matrix

$$G = M M^T, \qquad (40)$$

which is symmetric, positive, and rotationally invariant. In the case of a symplectically
self-dual lattice, the Gram matrix $G$ may be chosen to be symplectic, and two symplectic
Gram matrices $G$ and $G'$ describe the same lattice if

$$G' = R G R^T, \qquad (41)$$

where $R$ is symplectic and integral. Therefore, the moduli space of symplectically
self-dual lattices in $2N$ dimensions may be represented as

$$\mathcal{A}_N = H(2N)/Sp(2N,\mathbb{Z}), \qquad (42)$$

where $H(2N)$ denotes the space of real symplectic positive $2N \times 2N$ matrices of
determinant 1 and $Sp(2N,\mathbb{Z})$ denotes the space of integral symplectic $2N \times 2N$ matrices.
The space $\mathcal{A}_N$ may also be identified as the moduli space of principally polarized
abelian varieties in complex dimension $N$ [17].

The encoded operations that preserve the code space but act nontrivially within the code
space comprise the quotient group $\mathcal{L}^\perp/\mathcal{L}$. The order of this group, the ratio of the
volume of the unit cell of $\mathcal{L}$ to that of $\mathcal{L}^\perp$, is $m^2$, where $m$ is the dimension of the
code space. The volume of the unit cell of $\mathcal{L}$ is $|\det M| = |\det A|^{1/2}$ and the volume of
the unit cell of $\mathcal{L}^\perp$ is $|\det M^\perp| = |\det A|^{-1/2}$; therefore, the dimension of the code
space is

$$m = |\mathrm{Pf}\,A| = |\det M| = \det D, \qquad (43)$$

where $\mathrm{Pf}\,A$ denotes the Pfaffian of $A$, the square root of its determinant. Thus, a
symplectically self-dual lattice, for which $|\det M| = |\det M^\perp| = 1$, corresponds to a code
with a one-dimensional code space. Given a $2N \times 2N$ generator matrix $M$ of a symplectically
self-dual lattice, we can rescale it as

$$M \to \sqrt{\lambda}\, M, \qquad (44)$$

where $\lambda$ is an integer, to obtain the generator matrix of a symplectically integral lattice
corresponding to a code of dimension

$$m = \lambda^N. \qquad (45)$$

The rate of this code, then, is

$$R = \log_2 \lambda. \qquad (46)$$

When an encoded state is subjected to the Gaussian quantum channel, a phase-space
displacement

$$(q,p) \to (q,p) + (\xi_q,\xi_p) \qquad (47)$$

is applied. To diagnose and correct this error, the eigenvalues of all stabilizer generators
are measured, which determines the value of $(\xi_q,\xi_p)$ modulo the normalizer lattice
$\mathcal{L}^\perp$. To recover, a displacement of minimal length is applied that returns the stabilizer
eigenvalues to their standard values, and so restores the quantum state to the code space.
We may associate with the origin of the normalizer lattice its Voronoi cell, the set of
points in $\mathbb{R}^{2N}$ that are closer to the origin than to any other lattice site. Recovery is
successful if the applied displacement lies in this Voronoi cell. Thus, we may estimate the
likelihood of a decoding error by calculating the probability that the displacement lies
outside the Voronoi cell.
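
The rate formula and the Voronoi-cell criterion can be made concrete in the simplest case. The sketch below assumes a single oscillator ($N = 1$) and a square self-dual lattice rescaled by $\sqrt{\lambda}$, so that the normalizer lattice is a square lattice of spacing $1/\sqrt{\lambda}$ in the lattice coordinates $(\alpha,\beta)$ and its Voronoi cell is a square centered on the origin. The function names are hypothetical, and shifts are expressed in those lattice coordinates rather than in physical units.

```python
import math


def code_rate(lam, n_osc):
    """Rate in qubits per oscillator for code dimension m = lam**n_osc."""
    m = lam ** n_osc
    return math.log2(m) / n_osc


def residual_logical_shift(shift, lam):
    """Nearest normalizer-lattice point to the shift, as integer multiples
    of the lattice spacing 1/sqrt(lam) along each axis."""
    spacing = 1.0 / math.sqrt(lam)
    return tuple(round(component / spacing) for component in shift)


def recovery_succeeds(shift, lam):
    """True if the shift lies in the Voronoi cell of the origin, so that the
    minimal-length correction undoes it without a logical error."""
    return all(k == 0 for k in residual_logical_shift(shift, lam))


print(code_rate(4, 3))                      # log2(4) = 2 qubits per oscillator
print(recovery_succeeds((0.1, -0.2), 4))    # inside the cell of half-width 0.25
print(recovery_succeeds((0.3, 0.0), 4))     # outside: decoded to a neighboring site
```

Increasing $\lambda$ raises the rate but shrinks the Voronoi cell, which is the trade-off that the achievable-rate estimates quantify.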

IV.  ACHIEVABLE  RATES  FROM  EFFICIENT 
SPHERE  PACKINGS 

One  way  to  establish  an  achievable  rate  for  the  Gaussian 
quantum  channel  is  to  choose  a  normalizer  lattice  C 1  whose 
shortest  nonzero  vector  is  sufficiently  large.  In  this  section, 
we  calculate  an  achievable  rate  by  demanding  that  the 
Voronoi  cell  surrounding  the  origin  contain  all  typical  dis¬ 
placements  of  the  origin  in  the  limit  of  large  N.  In  Sec.  V,  we 
will  use  a  more  clever  argument  to  improve  our  estimate  of 
the  rate. 

The  volume  of  a  sphere  with  unit  radius  in  n  dimensions 
is 
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V  = 


(48) 


and  from  the  Stirling  approximation  we  find  that 


V 


n 


2  7re  | 11,2 


(49) 


It was shown by Minkowski [18,19] that lattice sphere packings exist in n dimensions that fill a fraction at least $1/2^{n-1}$ of space. Correspondingly, if the lattice is chosen to be unimodular, so that its unit cell has unit volume, then kissing spheres centered at the lattice sites may be chosen to have a radius $r_n$ such that

$$V_n (r_n)^n \ge 2^{-(n-1)}, \tag{50}$$

or

$$r_n^2 \ge \frac{1}{4}\left(2/V_n\right)^{2/n} \ge \frac{n}{8\pi e}. \tag{51}$$
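The two bounds used here, the Stirling estimate of the unit-ball volume and the resulting lower bound on the Minkowski radius, are easy to check numerically. The following is a small sketch with hypothetical function names; it works with logarithms so that large n does not overflow.

```python
import math

def log_unit_ball_volume(n):
    """Natural log of V_n = pi^(n/2) / Gamma(n/2 + 1)."""
    return 0.5 * n * math.log(math.pi) - math.lgamma(0.5 * n + 1.0)

def log_stirling_bound(n):
    """Natural log of the upper bound (2 pi e / n)^(n/2)."""
    return 0.5 * n * math.log(2.0 * math.pi * math.e / n)

def minkowski_radius_squared(n):
    """r_n^2 = (1/4) (2 / V_n)^(2/n), the radius for which V_n r_n^n = 2^-(n-1)."""
    return 0.25 * math.exp((2.0 / n) * (math.log(2.0) - log_unit_ball_volume(n)))

for n in (2, 10, 100, 1000):
    assert log_unit_ball_volume(n) <= log_stirling_bound(n)
    assert minkowski_radius_squared(n) >= n / (8.0 * math.pi * math.e)
```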


FIG. 1. Two ways to estimate the rate achieved by a lattice code. Each site of the normalizer lattice has a Voronoi cell (represented here by a square) containing all points that are closer to that site than any other site. Displacements that move a site to a position within its Voronoi cell may be corrected. The volume of the Voronoi cell determines the rate of the code. In (a), the ball containing typical displacements lies within the cell, so that the error probability is small. In (b), the ball of typical displacements is not completely contained within the cell, but the region where neighboring balls overlap (shown in black) has a small volume, so that the error probability is still small.


This lower bound on the efficiency of sphere packings has never been improved in the nearly 100 years since Minkowski’s result. More recently, Buser and Sarnak [17] have shown that this same lower bound applies to lattices that are symplectically self-dual.

Now consider the case of n = 2N-dimensional phase space. For sufficiently large n, the channel will apply a phase-space translation by a distance that with high probability will be less than $\sqrt{n(\sigma^2+\varepsilon)}$, for any positive ε. Therefore, a code that may correct a shift this large will correct all likely errors. What rate can such a code attain? If the code is a lattice stabilizer code, and the dimension of the code space is m, then the unit cell of the code’s normalizer lattice has volume

$$\Delta = \frac{1}{m}\times(2\pi\hbar)^N. \tag{52}$$

Nonoverlapping spheres centered at the sites of the normalizer lattice may be chosen to have radius $r=\sqrt{n(\sigma^2+\varepsilon)}$, where

$$\left(\frac{2\pi e}{n}\right)^{n/2}\left[n(\sigma^2+\varepsilon)\right]^{n/2} \ge \frac{1}{m}\times 2^{-n}\times(2\pi\hbar)^N, \tag{53}$$

or

$$m = \left[\frac{\hbar}{4e(\sigma^2+\varepsilon)}\right]^N. \tag{54}$$

The error probability becomes arbitrarily small for large N if Eq. (54) is satisfied, for any positive ε. We conclude that the rate

$$R = \frac{1}{N}\log_2 m = \log_2\left(\frac{\hbar}{4e\sigma^2}\right) \tag{55}$$

is achievable, provided $\hbar/4e\sigma^2 \ge 1$. However, as noted in Sec. III, the rates that may be attained by this construction (rescaling of a symplectically self-dual lattice) are always of the form $\log_2\lambda$, where λ is an integer.

V. IMPROVING THE RATE

The achievable rate found in Eq. (55) falls two qubits short of the coherent information Eq. (23). We will now show that this gap may be closed by using tighter estimates of the error probability. We established Eq. (55) by filling phase space with nonoverlapping spheres, which is overly conservative. It is acceptable for the spheres to overlap, as long as the overlaps occupy an asymptotically negligible fraction of the total volume, as suggested in Fig. 1.

Our improved estimate applies another result obtained by Buser and Sarnak [17]. They note that the moduli space of symplectically self-dual lattices is compact and equipped with a natural invariant measure. Therefore, it makes sense to consider averaging over all lattices. Denote by $\langle\cdot\rangle$ the average over all symplectically self-dual lattices with specified dimension n = 2N, and let f(x) denote an integrable rotationally invariant function of the vector x (that is, a function of the length |x| of x). Then, Buser and Sarnak [17] show that

$$\left\langle \sum_{x\in\mathcal{L}\setminus\{0\}} f(x) \right\rangle = \int f(x)\,d^n x. \tag{56}$$

(Note that the sum is over all nonzero vectors in the lattice $\mathcal{L}$.) It follows that there must exist a particular symplectically self-dual lattice $\mathcal{L}$ such that

$$\sum_{x\in\mathcal{L}\setminus\{0\}} f(x) \le \int f(x)\,d^n x. \tag{57}$$

The statement that a unimodular lattice exists that satisfies Eq. (57) is the well-known Minkowski-Hlawka theorem [19]. Buser and Sarnak established the stronger result that the lattice may be chosen to be symplectically self-dual.

We may use this result to bound the probability of a decoding error, and establish that a specified rate is achievable. Our argument will closely follow de Buda [20], who performed a similar analysis of lattice codes for the Gaussian classical channel. However, the quantum case is considerably easier to analyze, because we can avoid complications arising from the power constraint [21-23].

A decoding error occurs if the channel displaces the origin to a point outside the Voronoi cell centered at the origin. The Voronoi cell has a complicated geometry, so that the error probability is not easy to analyze. But we may simplify the analysis with a trick [20]. Imagine drawing a sphere with radius

$$a = \sqrt{n(\sigma^2+\varepsilon)} \tag{58}$$

around each lattice site, where ε > 0; this value of a is chosen so that the typical displacement introduced by the channel has a length less than a; the probability of a shift larger than a thus becomes negligible for large n. It may be that these spheres overlap. However, a vector that is contained in the sphere centered at the origin, and is not contained in the sphere centered at any other lattice site, must be closer to the origin than any other lattice site. Therefore, the vector is contained in the origin’s Voronoi cell, and is a shift that may be corrected successfully. (See Fig. 1.)

Hence (ignoring the possibility of an atypical shift by |ξ| > a) we can upper bound the probability of error by estimating the probability that the shift moves any other lattice site into the sphere of radius a around the origin. We then find

$$P_{\rm error} \le \sum_{x\in\mathcal{L}^\perp\setminus\{0\}} \int_{|r|\le a} P(x-r)\,d^n r, \tag{59}$$

where P(ξ) denotes the probability of a displacement by ξ.

The Buser-Sarnak theorem [17] tells us that there exists a lattice whose unit cell has volume Δ, and which is related by rescaling to a symplectically self-dual lattice, such that

$$P_{\rm error} \le \frac{1}{\Delta}\int d^n x \int_{|r|\le a} P(x-r)\,d^n r; \tag{60}$$

by interchanging the order of integration, we find that

$$P_{\rm error} \le \frac{1}{\Delta} V_n a^n, \tag{61}$$

the ratio of the volume of the n-dimensional sphere of radius a to the volume of the unit cell.
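The interchange of integrals that leads from Eq. (60) to Eq. (61) can be spelled out as follows. The only input assumed here is that the displacement distribution P is normalized to unit total probability.

```latex
\begin{align*}
\int d^n x \int_{|r|\le a} d^n r \; P(x-r)
  &= \int_{|r|\le a} d^n r \left[ \int d^n x \; P(x-r) \right] \\
  &= \int_{|r|\le a} d^n r \cdot 1
   \qquad \text{(shift } x \to x + r \text{; } P \text{ is normalized)} \\
  &= V_n a^n .
\end{align*}
```

Dividing by the unit-cell volume Δ gives the right-hand side of Eq. (61).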

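Now the volume Δ of the unit cell of the normalizer lattice $\mathcal{L}^\perp$, and the dimension m of the code space, are related by

$$\Delta = (2\pi\hbar)^N m^{-1} = (2\pi\hbar\times 2^{-R})^N, \tag{62}$$

where R is the rate, and we may estimate the volume of the sphere as

$$V_n a^n \le \left(\frac{2\pi e}{n}\right)^{n/2}\left[n(\sigma^2+\varepsilon)\right]^{n/2}, \tag{63}$$

where n = 2N. Thus, we conclude that

$$P_{\rm error} \le \left[\frac{e(\sigma^2+\varepsilon)}{\hbar}\times 2^{R}\right]^N. \tag{64}$$

Therefore, the error probability becomes small for large N for any rate R such that

$$R < \log_2\left(\frac{\hbar}{e(\sigma^2+\varepsilon)}\right), \tag{65}$$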


where ε may be arbitrarily small. We conclude that the rate

$$R = \log_2\left(\frac{\hbar}{e\sigma^2}\right) \tag{66}$$

is achievable in the limit N → ∞, provided that $\hbar/e\sigma^2 > 1$. This rate matches the optimal value Eq. (23) of the one-shot coherent information for Gaussian inputs. We note, again, that the rates that we obtain from rescaling a symplectically self-dual lattice are restricted to $R = \log_2\lambda$, where λ is an integer. Thus, for specified σ², the achievable rate that we have established is really the maximal value of

$$R = \log_2\lambda, \qquad \lambda\in\mathbb{Z}, \tag{67}$$

such that the positive integer λ satisfies

$$\lambda < \frac{\hbar}{e\sigma^2}. \tag{68}$$
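As a numerical illustration of Eqs. (64) and (66)-(68), the sketch below evaluates the coherent-information rate, the largest integer-restricted rate, and the logarithm of the error bound. The function names and the default ħ = 1 are choices made for this example, not notation from the text.

```python
import math

def coherent_information(sigma2, hbar=1.0):
    """Rate of Eq. (66): log2(hbar / (e sigma^2))."""
    return math.log2(hbar / (math.e * sigma2))

def lattice_rate(sigma2, hbar=1.0):
    """Largest log2(lambda) with integer lambda >= 1 and lambda < hbar/(e sigma^2).

    Returns None when no positive integer satisfies Eq. (68).
    """
    bound = hbar / (math.e * sigma2)
    lam = math.ceil(bound) - 1  # largest integer strictly below the bound
    return math.log2(lam) if lam >= 1 else None

def log2_error_bound(rate, sigma2, eps, n_osc, hbar=1.0):
    """log2 of the right-hand side of Eq. (64) for N = n_osc oscillators."""
    return n_osc * (math.log2(math.e * (sigma2 + eps) / hbar) + rate)
```

For example, with σ² = 0.01 and ħ = 1 the bound ħ/eσ² is about 36.8, so the largest admissible integer is λ = 36, and the bound of Eq. (64) at that rate decays exponentially in N for sufficiently small ε.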


VI.  ACHIEVABLE  RATES  FROM  CONCATENATED 
CODES 

Another method for establishing achievable rates over the Gaussian quantum channel was described in [14], based on concatenated coding. In each of N “oscillators” described by canonical variables $p_i$ and $q_i$, a d-dimensional system (“qudit”) is encoded that is protected against sufficiently small shifts in $p_i$ and $q_i$. The encoded qudit is associated with a square lattice in two-dimensional phase space. Then, a stabilizer code is constructed that embeds a k-qudit code space in the Hilbert space of N qudits; these k encoded qudits are protected if a sufficiently small fraction of the N qudits are damaged. Let us compare the rates achieved by concatenated codes to the rates achieved with codes derived from efficient sphere packings.

We analyze the effectiveness of concatenated codes in two stages. First, we consider how likely each of the N qudits is to sustain damage if the underlying oscillator is subjected to the Gaussian quantum channel. The area of the unit cell of the two-dimensional square normalizer lattice that represents the encoded operations acting on the qudit is $2\pi\hbar/d$, and the minimum distance between lattice sites is $\delta=\sqrt{2\pi\hbar/d}$. A displacement of q by aδ, where a is an integer, is the operation $X^a$ acting on the code space, and a displacement of p by bδ is the operation $Z^b$, where X and Z are the Pauli operators acting on the qudit; these act on a basis $\{|j\rangle,\ j=0,1,2,\ldots,d-1\}$ for the qudit according to

$$X:\ |j\rangle \to |j+1\ (\mathrm{mod}\ d)\rangle, \qquad Z:\ |j\rangle \to \omega^{j}|j\rangle, \tag{69}$$

where $\omega=\exp(2\pi i/d)$.

Shifts in p or q may be corrected successfully provided that they satisfy

$$|\Delta q| < \delta/2 = \sqrt{\frac{\pi\hbar}{2d}}, \qquad |\Delta p| < \delta/2 = \sqrt{\frac{\pi\hbar}{2d}}. \tag{70}$$

If the shifts in q and p are Gaussian random variables with variance σ², then the probability that a shift causes an uncorrectable error is no larger than the probability that the shift exceeds $\sqrt{\pi\hbar/2d}$, or

$$p_X,\ p_Z \le 2\cdot\frac{1}{\sqrt{2\pi\sigma^2}}\int_{\sqrt{\pi\hbar/2d}}^{\infty} dx\, e^{-x^2/2\sigma^2} = \mathrm{erfc}\left(\sqrt{\pi\hbar/4d\sigma^2}\right), \tag{71}$$

where erfc denotes the complementary error function. Here, $p_X$ is the probability of an “X error” acting on the qudit, of the form $X^a$ for $a\not\equiv 0$ (mod d), and $p_Z$ denotes the probability of a “Z error” of the form $Z^b$ for $b\not\equiv 0$ (mod d). The X and Z errors are uncorrelated, and errors with a, b = ±1 are much more likely than errors with |a|, |b| > 1. By choosing $d\sim\hbar/\sigma^2$, we may achieve a small error probability for each oscillator.

The second stage of the argument is to determine the rate that may be achieved by a qudit code if $p_X$, $p_Z$ satisfy Eq. (71). We will consider codes of the Calderbank-Shor-Steane (CSS) type, for which the correction of X errors and Z errors may be considered separately [24,25]. A CSS code is a stabilizer code, in which each stabilizer generator is either a tensor product of I’s and powers of Z (measuring these generators diagnoses the X errors) or a tensor product of I’s and powers of X (for diagnosing the Z errors).

We can establish an achievable rate by averaging the error probability over CSS codes; we give only an informal sketch of the argument. Suppose that we fix the block size N and the number of encoded qudits k. Now select the generators of the code’s stabilizer group at random. About half of the N − k generators are of the Z type and about half are of the X type. Thus, the number of possible values for the eigenvalues of the generators of each type is about

$$d^{(N-k)/2}. \tag{72}$$

Now, we can analyze the probability that an uncorrectable X error afflicts the encoded quantum state (the probability of an uncorrectable Z error is analyzed in exactly the same way). Suppose that X errors act independently on the N qudits in the block, with a probability of error per qudit of $p_X$. Thus, for large N, the typical number of damaged qudits is close to $p_X N$. A damaged qudit may be damaged in any of d − 1 different ways [$X^a$, where a = 1, 2, ..., (d − 1)]. We will suppose, pessimistically, that all d − 1 shifts of the qudit are equally likely. The actual situation that arises in our concatenated coding scheme is more favorable (small values of |a| are more likely), but our argument will not exploit this feature.

Thus, with high probability, the error that afflicts the block will belong to a typical set of errors that contains a number of elements close to

$$N_{\rm typ} \sim \binom{N}{Np_X}(d-1)^{Np_X} \sim d^{\,N[H_d(p_X)+p_X\log_d(d-1)]}, \tag{73}$$

where

$$H_d(p) = -p\log_d p - (1-p)\log_d(1-p). \tag{74}$$
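The asymptotic estimate of the typical-set size in Eq. (73) can be compared with the exact count for a finite block. The sketch below, with hypothetical function names, computes both exponents in base d using log-gamma functions; the two should differ only by a correction of order (log N)/N per qudit.

```python
import math

def log_d_typical_exact(n_qudits, p, d):
    """log_d of C(N, Np) (d-1)^(Np), with Np rounded to the nearest integer."""
    k = round(n_qudits * p)
    log_binom = (math.lgamma(n_qudits + 1) - math.lgamma(k + 1)
                 - math.lgamma(n_qudits - k + 1))
    return (log_binom + k * math.log(d - 1)) / math.log(d)

def log_d_typical_estimate(n_qudits, p, d):
    """N [H_d(p) + p log_d(d-1)], the exponent on the right of Eq. (73)."""
    h_d = -(p * math.log(p) + (1 - p) * math.log(1 - p)) / math.log(d)
    return n_qudits * (h_d + p * math.log(d - 1) / math.log(d))

# Example: N = 10000 qudits with d = 5 and error probability 0.1.
exact = log_d_typical_exact(10000, 0.1, 5)
estimate = log_d_typical_estimate(10000, 0.1, 5)
```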

If a particular typical error occurs, then recovery will succeed as long as there is no other typical error that generates the same error syndrome. It will be highly unlikely that another typical error has the same syndrome as the actual error, provided that the number of possible error syndromes $d^{(N-k)/2}$ is large compared to the number of typical errors. Therefore, the X errors may be corrected with high probability for

$$\frac{1}{2}\left(1-\frac{k}{N}\right) > \frac{1}{N}\log_d N_{\rm typ} \sim H_d(p_X)+p_X\log_d(d-1), \tag{75}$$

or for a rate $R_d$ in qudits satisfying

$$R_d = \frac{k}{N} < 1 - 2H_d(p_X) - 2p_X\log_d(d-1). \tag{76}$$

Similarly, the Z errors may be corrected with high probability by a random CSS code if the rate satisfies

$$R_d < 1 - 2H_d(p_Z) - 2p_Z\log_d(d-1). \tag{77}$$

Converted to qubits, the rate becomes

$$R = (\log_2 d)\,R_d. \tag{78}$$

Under these conditions, the probability of error averaged over CSS codes becomes arbitrarily small for N large. It follows that there is a particular sequence of CSS codes with rate approaching Eqs. (76)-(78), and error probability going to zero in the limit N → ∞.

For given σ², the optimal rate that may be attained by concatenating a code that encodes a qudit in a single oscillator with a random CSS code is found by estimating $p_X$ and $p_Z$ using Eq. (71) and then choosing d to maximize the rate R given by Eqs. (76)-(78). The results are shown in Fig. 2. This rate (in qubits) may be expressed as

$$R = \log_2(C^2\hbar/\sigma^2), \tag{79}$$
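The optimization over d described here can be reproduced with a short brute-force search. This is a sketch under stated assumptions: the upper bound of Eq. (71) is used as if it were the actual error probability, the search range `d_max` is an arbitrary cutoff chosen for this example (it must be enlarged for very small σ²), and all function names are hypothetical.

```python
import math

def qudit_error_probability(d, sigma2, hbar=1.0):
    """Upper bound of Eq. (71) on p_X (equivalently p_Z)."""
    return math.erfc(math.sqrt(math.pi * hbar / (4.0 * d * sigma2)))

def entropy_d(p, d):
    """H_d(p) of Eq. (74); taken to be 0 at the endpoints p = 0 and p = 1."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p)) / math.log(d)

def concatenated_rate(d, sigma2, hbar=1.0):
    """Rate in qubits per oscillator from Eqs. (76)-(78) for qudit dimension d."""
    p = qudit_error_probability(d, sigma2, hbar)
    rate_qudits = 1.0 - 2.0 * entropy_d(p, d) - 2.0 * p * math.log(d - 1) / math.log(d)
    return math.log2(d) * rate_qudits

def best_concatenated_rate(sigma2, hbar=1.0, d_max=2000):
    """Return (rate, d) maximizing the rate over 2 <= d <= d_max."""
    return max((concatenated_rate(d, sigma2, hbar), d) for d in range(2, d_max + 1))

rate, d_opt = best_concatenated_rate(0.01)
coherent = math.log2(1.0 / (math.e * 0.01))
```

For σ² = 0.01 (with ħ = 1) the search returns a rate that lies below the coherent information but within one qubit of it.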




JIM  HARRINGTON  AND  JOHN  PRESKILL 


PHYSICAL  REVIEW  A  64  062301 


FIG. 2. Rates achieved by concatenated codes, compared to the one-shot coherent information optimized over Gaussian input states. Here, σ is the standard deviation of the magnitude of the phase-space displacement introduced by the channel, in units with ħ = 1. The rate is in units of qubits per oscillator.

where C² is a slowly varying function of σ²/ħ plotted in Fig. 3. It turns out that this rate is actually fairly close to $\log_2 d$; that is, the optimal dimension d of the qudit encoded in each oscillator is approximately C²ħ/σ². With this choice for d, the error rate for each oscillator is reasonably small, and the random CSS code reduces the error probability for the encoded state to a value exponentially small in N at a modest cost in rate. The rate achieved by concatenated coding lies strictly below the coherent information $I_Q$, but comes within one qubit of $I_Q$ for σ² > 1.88 × 10⁻⁴.

Both the concatenated codes and the codes derived from efficient sphere packings are stabilizer codes, and therefore, both are associated with lattices in 2N-dimensional phase space. But while the sphere-packing codes have been chosen so that the shortest nonzero vector on the lattice is large relative to the size of the unit cell, the concatenated codes correspond to sphere packings of poor quality. For the concatenated codes, the shortest vector of the normalizer lattice has length ℓ, where

$$\ell^2 = 2\pi\hbar/d, \tag{80}$$

and the rate R is close to $\log_2 d$. The efficient sphere packings have radius r = ℓ/2 close to $\sqrt{n\sigma^2}$, or

$$\ell^2 = \frac{8N\hbar}{e}\times 2^{-R}. \tag{81}$$

Hence, if we compare sphere-packing codes and concatenated codes with comparable rates, the sphere-packing codes have minimum distance that is larger by a factor of about $\sqrt{4N/\pi e}$. The concatenated codes achieve a high rate not because the minimum distance of the lattice is large, but rather because the decoding procedure exploits the hierarchical structure of the code.

FIG. 3. The slowly varying function C², defined by $R=\log_2(C^2/\sigma^2)$, where R is the rate achievable with concatenated codes. Units have been chosen such that ħ = 1. The horizontal lines are at C² = 1/e, corresponding to a rate equal to the coherent information, and at C² = 1/2e, corresponding to one qubit below the coherent information.

VII.  THE  CLASSICAL  GAUSSIAN  CHANNEL 

We have found that quantum stabilizer codes based on efficient sphere packings can achieve rates for the Gaussian quantum channel that match the one-shot coherent information, and that concatenated codes achieve rates that are below, but close to, the coherent information. Now, as an aside, we will discuss the corresponding statements for the classical Gaussian channel. We will see, in particular, that concatenated codes achieve rates that are close to the classical channel capacity.

Shannon’s expression for the capacity of the classical Gaussian channel may be understood heuristically as follows [3,4]. If the input signals have average power P, which is inflated by the Gaussian noise to P + σ², then if n real variables are transmitted, the total volume occupied by the space of output signals is the volume of a sphere of radius $\sqrt{n(P+\sigma^2)}$, or

$$V_{\rm total} = V_n\left[n(P+\sigma^2)\right]^{n/2}. \tag{82}$$

We will decode a received message as the signal state that is the minimal distance away. Consider averaging over all codes that satisfy the power constraint and have m signals. When a message is received, the signal that was sent will typically occupy a decoding sphere of radius $\sqrt{n(\sigma^2+\varepsilon)}$ centered at the received message, which has volume

$$V_{\rm decoding\ sphere} = V_n\left[n(\sigma^2+\varepsilon)\right]^{n/2}. \tag{83}$$

A decoding error may arise if another one of the m signals, aside from the one that was sent, is also contained in the decoding sphere. The probability that a randomly selected signal inside the sphere of radius $\sqrt{n(P+\sigma^2)}$ is contained in a particular decoding sphere of radius $\sqrt{n(\sigma^2+\varepsilon)}$ is the ratio of the volume of the spheres, so the probability of a decoding error may be upper bounded by m times that ratio, or

$$P_{\rm error} < m\left(\frac{\sigma^2+\varepsilon}{\sigma^2+P}\right)^{n/2} = \left(2^{2R}\cdot\frac{\sigma^2+\varepsilon}{\sigma^2+P}\right)^{n/2}, \tag{84}$$


where  R  is  the  rate  of  the  code.  If  the  probability  of  error 
averaged  over  codes  and  signals  satisfies  this  bound,  there  is 
a  particular  code  that  satisfies  the  bound  when  we  average 




ACHIEVABLE  RATES  FOR  THE  GAUSSIAN  QUANTUM  CHANNEL 




only over signals. If $P_{\rm error}<\delta$ when we average over signals, then we can discard at most half of all the signals (reducing the rate by at most 1/n bits) to obtain a new code with $P_{\rm error}<2\delta$ for all signals. Since ε may be chosen arbitrarily small for sufficiently large n, we conclude that there exist codes with arbitrarily small probability of error and rate R arbitrarily close to

$$C = \frac{1}{2}\log_2\left(1+\frac{P}{\sigma^2}\right), \tag{85}$$

which is the Shannon capacity. Conversely, for any rate exceeding C, the decoding spheres inevitably have nonnegligible overlaps, and the error rate cannot be arbitrarily small.
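The sphere-volume argument can be checked numerically: the bound of Eq. (84) decays with n exactly when the rate is below the capacity of Eq. (85), up to the slack ε. The following sketch uses hypothetical function names and works with base-2 logarithms of the bound.

```python
import math

def shannon_capacity(power, sigma2):
    """Capacity of Eq. (85) in bits per real variable."""
    return 0.5 * math.log2(1.0 + power / sigma2)

def log2_random_coding_bound(rate, power, sigma2, eps, n):
    """log2 of the right-hand side of Eq. (84) for block length n."""
    return 0.5 * n * (2.0 * rate + math.log2((sigma2 + eps) / (sigma2 + power)))

# With P = 3 and sigma^2 = 1 the capacity is exactly one bit.
capacity = shannon_capacity(3.0, 1.0)
below = log2_random_coding_bound(0.9, 3.0, 1.0, 0.01, 100)  # negative: bound is small
above = log2_random_coding_bound(1.1, 3.0, 1.0, 0.01, 100)  # positive: bound is vacuous
```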

Suppose that, instead of Shannon’s random coding, we use a lattice code based on an efficient packing of spheres. In this case, the power constraint may be imposed by including as signals all lattice sites that are contained in an n-dimensional ball of radius $\sqrt{nP}$, and the typical shifts by distance $\sqrt{n\sigma^2}$ must be correctable. Thus, decoding spheres of radius $\sqrt{n\sigma^2}$ are to be packed into a sphere of total radius $\sqrt{n(P+\sigma^2)}$. Suppose that the lattice is chosen so that nonoverlapping spheres centered at the lattice sites fill a fraction at least $2^{-(n-1)}$ of the total volume; the existence of such a lattice is established by Minkowski’s estimate [18,19]. Then the number m of signals satisfies

$$m V_n (n\sigma^2)^{n/2} \ge 2^{-(n-1)} V_n\left[n(P+\sigma^2)\right]^{n/2}, \tag{86}$$

or

$$m \ge 2^{-(n-1)}\left(1+\frac{P}{\sigma^2}\right)^{n/2}, \tag{87}$$

corresponding to the rate

$$R = \frac{1}{n}\log_2 m = \frac{1}{2}\log_2\left(1+\frac{P}{\sigma^2}\right) - 1, \tag{88}$$

which is one bit less than the Shannon capacity.

Much as in the discussion of quantum lattice codes in Sec. V, an improved estimate of the achievable rate is obtained if we allow the decoding spheres to overlap [20-23]. In fact, there are classical lattice codes with rate arbitrarily close to the capacity, such that the probability of error, averaged over signals, is arbitrarily small [23]. Unfortunately, though, because of the power constraint, the error probability depends on which signal is sent, and the trick of deleting the worst half of the signals would destroy the structure of the lattice. Alternatively, it may be shown that for any rate

$$R < \frac{1}{2}\log_2(P/\sigma^2), \tag{89}$$

there are lattice codes with maximal probability of error that is arbitrarily small [20]. This achievable rate approaches the capacity for large P/σ².

Now consider the rates that may be achieved for the Gaussian classical channel with concatenated coding. A d-state system (dit) is encoded in each of n real variables. If each real variable takes one of d possible values, with spacing 2Δx between the signals, then a shift by Δx may be corrected. By replacing the sum over d values by an integral, which may be justified for large d, we find an average power per signal

$$P \simeq \frac{1}{2d\,\Delta x}\int_{-d\Delta x}^{d\Delta x} x^2\,dx = \frac{1}{3}(d\,\Delta x)^2; \tag{90}$$

thus, the largest correctable shift may be expressed in terms of the average power as

$$\Delta x = \sqrt{3P}/d. \tag{91}$$

For the Gaussian channel with mean zero and variance σ², the probability p of an error in each real variable transmitted is no larger than the probability of a shift by a distance exceeding Δx, or

$$p \le \mathrm{erfc}\left(\sqrt{3P/2d^2\sigma^2}\right), \tag{92}$$

where erfc denotes the complementary error function.

We reduce the error probability further by encoding k < n dits in the block of n dits. Arguing as in Sec. VI, we see that a random code for dits achieves an asymptotic rate in bits given by

$$R = (\log_2 d)\left[1 - H_d(p) - p\log_d(d-1)\right]. \tag{93}$$

Given σ², using the expression Eq. (92) for p, and choosing d to optimize the rate in Eq. (93), we obtain a rate close to the Shannon capacity, as shown in Fig. 4. As for the concatenated quantum code, the rate of the concatenated classical code is close to $\log_2 d$, where $d\sim C(\sigma^2)\sqrt{P/\sigma^2}$, and C(σ²) is a slowly varying function.

FIG. 4. Rates for the Gaussian classical channel achievable with concatenated codes, compared to the Shannon capacity. Here, σ is the standard deviation of the displacement, in units with the power P = 1. The rate is in units of bits per signal.

VIII.  CONCLUSIONS 

We have described quantum stabilizer codes, based on symplectically integral lattices in phase space, that protect quantum information carried by systems described by continuous quantum variables. With these codes, we may establish lower bounds on the capacities of continuous-variable quantum channels.

For the Gaussian quantum channel, the best rate we know how to achieve with stabilizer coding matches the one-shot coherent information optimized over Gaussian inputs, at least when the value of the coherent information is log₂ of an integer. That our achievable rate matches the coherent information only for isolated values of the noise variance σ² seems to be an artifact of our method of analysis, rather than indicative of any intrinsic property of the channel. Hence, it is tempting to speculate that this optimal one-shot coherent information actually is the quantum capacity of the channel.

Conceivably, better rates may be achieved with nonadditive quantum codes that cannot be described in terms of symplectically integral lattices. We do not know much about how to construct these codes, or about their properties.

In the case of the depolarizing channel acting on qubits, Shor and Smolin discovered that rates exceeding the one-shot coherent information could be achieved. Their construction used concatenated codes, where the “outer code” is a random stabilizer code, and the “inner code” is a degenerate code with a small block size [11]. The analogous procedure for the Gaussian channel would be to concatenate an outer code based on a symplectically integral lattice with an inner code that encodes one logical oscillator in a block of several oscillators. This inner code, then, embeds an infinite-dimensional code space in a larger infinite-dimensional space, as do codes constructed by Braunstein [26] and Lloyd and Slotine [27]. However, we have not been able to find concatenated codes of this type that achieve rates exceeding the one-shot coherent information of the Gaussian channel.

Quantum error-correcting codes are constructed that embed a finite-dimensional code space in the infinite-dimensional Hilbert space of a system described by continuous quantum variables. These codes exploit the noncommutative geometry of phase space to protect against errors that shift the values of the canonical variables q and p. In the setting of quantum optics, fault-tolerant universal quantum computation can be executed on the protected code subspace using linear optical operations, squeezing, homodyne detection, and photon counting; however, nonlinear mode coupling is required for the preparation of the encoded states. Finite-dimensional versions of these codes can be constructed that protect encoded quantum information against shifts in the amplitude or phase of a d-state system. Continuous-variable codes can be invoked to establish lower bounds on the quantum capacity of Gaussian quantum channels.

I. INTRODUCTION

Classical information can be carried by either a discrete (digital) signal or a continuous (analog) signal. Although in principle an analog signal can be processed, digital computing is far more robust: a digital signal can be readily restandardized and protected from damage caused by the gradual accumulation of small errors.

Quantum information can also be carried by either a discrete (finite-dimensional) system, such as a two-level atom or an electron spin, or by a continuous (infinite-dimensional) system, such as a harmonic oscillator or a rotor. Even in the finite-dimensional case, quantum information is in a certain sense continuous: a state is a vector in a Hilbert space that can point in any direction. Nevertheless, we have known for nearly five years that cleverly encoded quantum states can be restandardized and protected from the gradual accumulation of small errors, or from the destructive effects of decoherence due to uncontrolled interactions with the environment [1,2].

One is tempted to wonder whether we can go still further and protect the quantum state of a system described by continuous quantum variables. Probably this is too much to hope for, since even the problem of protecting analog classical information seems to pose insuperable difficulties.

In this paper we achieve a more modest goal: we describe quantum error-correcting codes that protect a state of a finite-dimensional quantum system (or “qudit”) that is encoded in an infinite-dimensional system. These codes may be useful for implementing quantum computation and quantum communication protocols that use harmonic oscillators or rotors that are experimentally accessible.

We also explain how encoded quantum states can be processed fault tolerantly. Once encoded states have been prepared, a universal set of fault-tolerant quantum gates can be implemented using, in the language of quantum optics, linear optical operations, squeezing, homodyne detection, and photon counting. However, for preparation of the encoded states, nonlinear couplings must be invoked.

Our  continuous-variable  quantum  error-correcting  codes 
are  effective  in  protecting  against  sufficiently  weak  diffusive 
phenomena  that  cause  the  position  and  momentum  of  an  os¬ 
cillator  to  drift,  or  against  losses  that  cause  the  amplitude  of 
an  oscillator  to  decay.  By  concatenating  with  conventional 
finite-dimensional  quantum  codes,  we  can  also  provide  pro¬ 
tection  against  errors  that  heavily  damage  a  (sufficiently 
small)  subset  of  all  the  oscillators  in  a  code  block.  A  differ¬ 
ent  scheme  for  realizing  robust  and  efficient  quantum  com¬ 
putation  based  on  linear  optics  has  been  recently  proposed  by 
Knill,  Laflamme,  and  Milburn  [3,4]. 

We begin in Sec. II by describing codes that embed an
n-state quantum system in a larger d-state system, and that
protect the encoded quantum information against shifts in the
amplitude or phase of the d-state system. A realization of this
coding scheme based on a charged particle in a magnetic
field is discussed in Sec. III. Our continuous-variable codes
are obtained in Sec. IV by considering a $d \to \infty$ limit.
Formally, the code states of the continuous-variable codes are
nonnormalizable states, infinitely squeezed in both position
and momentum; in Sec. V we describe the consequences of
using more realistic approximate code states that are finitely
squeezed. In Sec. VI we outline the theory of more general
continuous-variable codes based on lattice sphere packings
in higher dimensional phase space.

We discuss in Sec. VII how continuous-variable codes
protect against quantum diffusion, amplitude damping, and
unitary errors. In Sec. VIII we establish a lower bound on the
quantum capacity of the Gaussian quantum channel.

We then proceed to develop schemes for fault-tolerant
manipulation of encoded quantum information, starting in
Sec. IX with a discussion of the symplectic operations that
can be implemented with linear optics and squeezing. In
Sec. X we discuss the measurement of the error syndrome
and error recovery, which can be achieved with symplectic
operations and homodyne detection. Completion of the
fault-tolerant universal gate set by means of photon counting is
described in Sec. XI, and the preparation of encoded states is
explained in Sec. XII. Finally, Sec. XIII contains some further
remarks about the physical realization of our coding
schemes, and Sec. XIV contains concluding comments.

1050-2947/2001/64(1)/012310(21)/$20.00    64 012310-1    ©2001 The American Physical Society

DANIEL GOTTESMAN, ALEXEI KITAEV, AND JOHN PRESKILL    PHYSICAL REVIEW A 64 012310

II.  SHIFT-RESISTANT  QUANTUM  CODES 

An  unusual  feature  of  our  codes  is  that  they  are  designed 
to  protect  against  a  different  type  of  error  than  has  been 
considered  in  previous  discussions  of  quantum  coding.  This 
distinction  is  more  easily  explained  if  we  first  consider  not 
the  case  of  a  continuous  quantum  variable,  but  instead  the 
(also  interesting)  case  of  a  “qudit,”  a  d-dimensional  quan¬ 
tum  system.  Quantum  codes  can  be  constructed  that  encode  k 
protected  qudits  in  a  block  of  N  qudits,  so  that  the  encoded 
qudits  can  be  perfectly  recovered  if  up  to  t  qudits  are  dam¬ 
aged,  irrespective  of  the  nature  of  the  damage  [5-8].  Error 
recovery  will  be  effective  if  errors  that  act  on  many  qudits  at 
once  are  rare.  More  precisely,  a  general  error  superoperator 
acting  on  N  qudits  can  be  expanded  in  terms  of  a  basis  of 
operators,  each  of  definite  “weight”  (the  number  of  qudits 
on  which  the  operator  acts  nontrivially).  Encoded  informa¬ 
tion  is  well  protected  if  the  error  superoperator  has  nearly  all 
its  support  on  operators  of  weight  t  or  less. 

But  consider  instead  a  different  situation,  in  which  the 
amplitude  for  an  error  to  occur  on  each  qudit  is  not  small,  but 
the  errors  are  of  a  restricted  type.  The  possible  errors  acting 
on  a  single  qudit  can  be  expanded  in  terms  of  a  unitary 
operator basis with $d^2$ elements, the “Pauli operators”:

$$X^a Z^b, \qquad a,b = 0,1,2,\ldots,d-1. \tag{1}$$

Here $X$ and $Z$ are generalizations of the Pauli matrices $\sigma_x$ and
$\sigma_z$, which act in a particular basis $\{|j\rangle,\ j=0,1,2,\ldots,d-1\}$
according to

$$X: |j\rangle \to |j+1 \ (\mathrm{mod}\ d)\rangle, \qquad Z: |j\rangle \to \omega^{j}|j\rangle, \tag{2}$$

where $\omega = \exp(2\pi i/d)$. Note that it follows that

$$ZX = \omega XZ. \tag{3}$$

For $N$ qudits, there is a unitary operator basis with $d^{2N}$ elements
consisting of all tensor products of single-qudit Pauli
operators.

We will now imagine that errors with $|a|,|b|$ small compared
to $d$ are common, but errors with large $|a|$ and $|b|$ are
rare. This type of error model could be expected to apply in
the case of a continuous quantum variable, which is formally
the $d \to \infty$ limit of a qudit. For example, decoherence causes
the position $q$ and momentum $p$ of a particle to diffuse with
some nonzero diffusion constant. In any finite time interval $q$
and $p$ will drift by some amount that may be small, but is
certainly not zero. How can we protect encoded quantum
information under these conditions?


Fortunately, the general “stabilizer” framework [9,10]
for constructing quantum codes can be adapted to this setting.
In this framework, one divides the elements of a unitary
operator basis into two disjoint and exhaustive classes: the
set $\mathcal{E}$ of “likely errors” that we want to protect against, and
the rest, the “unlikely errors.” A code subspace is constructed
as the simultaneous eigenspace of a set of commuting
“stabilizer generators” that generate an Abelian group,
the “code stabilizer.” The code can reverse errors in the set
$\mathcal{E}$ if, for each pair of errors $E_a$ and $E_b$, either $E_a^\dagger E_b$ lies in the
stabilizer group, or $E_a^\dagger E_b$ fails to commute with some element
of the stabilizer. (In the latter case, the two errors alter
the eigenvalues of the generators in distinguishable ways; in
the former case they do not, but we can successfully recover
from an error of type $a$ by applying either $E_a^\dagger$ or $E_b^\dagger$.) In
typical discussions of quantum coding, $\mathcal{E}$ is assumed to be
the set of all tensor products of Pauli operators with weight
up to $t$ (those that act trivially on all but at most $t$ qudits). But
the same principles can be invoked to design codes that protect
against errors in a set $\mathcal{E}$ with other properties.

Quantum codes for continuous variables have been described
previously by Braunstein [11] and by Lloyd and Slotine [12].
For example, one code they constructed can be
regarded as the continuous limit of a qudit code of the type
originally introduced by Shor in the binary ($d=2$) case, an
$[[N=9, k=1, 2t+1=3]]$ code that protects a single qudit
encoded in a block of 9 from arbitrary damage inflicted on
any one of the 9. The 8 stabilizer generators of the code can
be expressed as

$$Z_1 Z_2^{-1},\ Z_2 Z_3^{-1},\ Z_4 Z_5^{-1},\ Z_5 Z_6^{-1},\ Z_7 Z_8^{-1},\ Z_8 Z_9^{-1},$$
$$(X_1X_2X_3)\cdot(X_4X_5X_6)^{-1},\quad (X_4X_5X_6)\cdot(X_7X_8X_9)^{-1}, \tag{4}$$

and encoded operations that commute with the stabilizer and
hence act on the encoded qudit can be chosen to be

$$\bar Z = Z_1 Z_4 Z_7, \qquad \bar X = X_1 X_2 X_3. \tag{5}$$

In the $d \to \infty$ limit, we obtain a code that is the simultaneous
eigenspace of eight commuting operators acting on nine particles,
which are

$$q_1-q_2,\ q_2-q_3,\ q_4-q_5,\ q_5-q_6,\ q_7-q_8,\ q_8-q_9,$$
$$(p_1+p_2+p_3)-(p_4+p_5+p_6),\quad (p_4+p_5+p_6)-(p_7+p_8+p_9). \tag{6}$$

Logical operators that act in the code space are

$$\bar q = q_1+q_4+q_7, \qquad \bar p = p_1+p_2+p_3. \tag{7}$$

This code is designed to protect against errors in which one
of the particles makes a large jump in $q$ or $p$ (or both), while
the others hold still. But it provides little protection against
small diffusive motions of all the particles, which allow $q$
and $p$ to drift.

Entanglement  purification  protocols  for  continuous  vari¬ 
able  systems  have  also  been  proposed  —  good  entangled 
states  can  be  distilled  from  noisy  entangled  states  via  a  pro¬ 
tocol  that  requires  two-way  classical  communication  [13,14]. 
These  purification  protocols  work  well  against  certain  sorts 
of  errors,  but  their  reliance  on  two-way  classical  communi¬ 
cation  makes  them  inadequate  for  accurately  preserving  un¬ 
known  states  in  an  imperfect  quantum  memory,  or  for  robust 
quantum  computation. 

Returning to qudits, let us consider an example of a quantum
code that can protect against small shifts in both amplitude
and phase, but not against large shifts. It is already
interesting to discuss the case of a system consisting of a
single qudit, but where the dimension $n$ of the encoded system
is (of course) less than $d$. For example, a qubit ($n=2$)
can be encoded in a system with dimension $d=18$, and protected
against shifts by one unit in the amplitude or phase of
the qudit; that is, against errors of the form $X^aZ^b$ where
$|a|,|b| \le 1$. The stabilizer of this code is generated by the
two operators

$$X^6, \qquad Z^6, \tag{8}$$

and the commutation relations of the Pauli operators with
these generators are

$$(X^aZ^b)\cdot X^6 = \omega^{6b}\, X^6\cdot(X^aZ^b),$$
$$(X^aZ^b)\cdot Z^6 = \omega^{-6a}\, Z^6\cdot(X^aZ^b). \tag{9}$$

Therefore, a Pauli operator commutes with the stabilizer only
if $a$ and $b$ are both multiples of $3 = 18/6$; this property ensures
that the code can correct single shifts in both amplitude
and phase. Logical operators acting on the encoded qubit are

$$\bar X = X^3, \qquad \bar Z = Z^3, \tag{10}$$

which evidently commute with the stabilizer and are not contained
in it.

Since the code words are eigenstates of $Z^6$ with eigenvalue
one, the only allowed values of $j$ are multiples of three.
And since they are also eigenstates of $X^6$ with eigenvalue
one, the code words are invariant under a shift in $j$ by six
units. A basis for the two-dimensional code space is

$$|\bar 0\rangle = \frac{1}{\sqrt3}\left(|0\rangle+|6\rangle+|12\rangle\right),$$
$$|\bar 1\rangle = \frac{1}{\sqrt3}\left(|3\rangle+|9\rangle+|15\rangle\right). \tag{11}$$

If an amplitude error occurs that shifts $j$ by $\pm 1$, the error can
be diagnosed by measuring the stabilizer generator $Z^6$,
which reveals the value of $j$ modulo 3; the error is corrected
by adjusting $j$ to the nearest multiple of 3. Phase errors are
shifts in the Fourier transformed conjugate basis, and can be
corrected similarly.
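The amplitude-shift recovery just described can be illustrated with a small sketch. It is only an illustration under simplifying assumptions: a state is stored as a dictionary from basis label j to amplitude, only amplitude (X-type) shifts are modeled, and the helper names (`codeword`, `apply_x`, `amplitude_syndrome`, `recover`) are hypothetical, not taken from the paper.

```python
D = 18      # dimension of the physical qudit
STEP = 3    # code words are supported on multiples of 3 (eigenvalue one of Z^6)

def codeword(logical):
    """Support {j: amplitude} of the encoded basis state with label 0 or 1, as in Eq. (11)."""
    amp = 3 ** -0.5
    return {(3 * logical + 6 * s) % D: amp for s in range(3)}

def apply_x(state, a):
    """Amplitude shift X^a, which maps |j> to |j + a mod d>."""
    return {(j + a) % D: c for j, c in state.items()}

def amplitude_syndrome(state):
    """The value of j mod 3, which is what a measurement of Z^6 reveals."""
    residues = {j % STEP for j in state}
    assert len(residues) == 1, "state is not a shifted code word"
    return residues.pop()

def recover(state):
    """Move j back to the nearest multiple of 3."""
    s = amplitude_syndrome(state)
    shift = s if s <= STEP // 2 else s - STEP   # inferred shift: -1, 0 or +1
    return apply_x(state, -shift)
```

In this sketch a shift by one unit in either direction is undone, whereas a shift by two units is misread as a shift by minus one, and the recovery completes a logical error (a shift by three).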

This code is actually perfect, meaning that each possible
pair of eigenvalues of the generators $X^6$ and $Z^6$ is a valid
syndrome for correcting a shift. There are nine possible errors
$\{X^aZ^b,\ |a|,|b| \le 1\}$, and the Hilbert space of the qudit
contains nine copies of the two-dimensional code space, one
corresponding to each possible error. These “error spaces”
just barely fit in the qudit space for $d = 18 = 9\cdot 2$.

Similar perfect codes can be constructed that protect
against larger shifts. For $d = r_1 r_2 n$, consider the stabilizer
generators

$$X^{r_1 n}, \qquad Z^{r_2 n}. \tag{12}$$

There is an encoded qunit, acted on by logical operators

$$\bar X = X^{r_1}, \qquad \bar Z = Z^{r_2}, \tag{13}$$

which evidently commute with the stabilizer and satisfy

$$\bar Z \bar X = \omega^{r_1 r_2}\, \bar X \bar Z = e^{2\pi i/n}\, \bar X \bar Z. \tag{14}$$

The commutation relations of the Pauli operators with the
generators are

$$(X^aZ^b)\cdot X^{r_1 n} = \omega^{r_1 n b}\, X^{r_1 n}\cdot(X^aZ^b) = e^{2\pi i b/r_2}\, X^{r_1 n}\cdot(X^aZ^b),$$
$$(X^aZ^b)\cdot Z^{r_2 n} = \omega^{-r_2 n a}\, Z^{r_2 n}\cdot(X^aZ^b) = e^{-2\pi i a/r_1}\, Z^{r_2 n}\cdot(X^aZ^b). \tag{15}$$

The phases are trivial only if $a$ is an integer multiple of $r_1$
and $b$ an integer multiple of $r_2$. Therefore, this code can
correct all shifts with

$$|a| < \frac{r_1}{2}, \qquad |b| < \frac{r_2}{2}. \tag{16}$$

The number of possible error syndromes is $r_1 r_2 = d/n$, so the
code is perfect.
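The syndrome counting behind this statement can be checked with a short sketch. It assumes that the syndrome of $X^aZ^b$ is fully captured by the pair of residues (a mod r1, b mod r2), which is what the phases in Eq. (15) determine; the function names are hypothetical.

```python
def correctable_shifts(r1, r2):
    """All integer pairs (a, b) with |a| < r1/2 and |b| < r2/2, as in Eq. (16)."""
    a_vals = [a for a in range(-r1, r1 + 1) if 2 * abs(a) < r1]
    b_vals = [b for b in range(-r2, r2 + 1) if 2 * abs(b) < r2]
    return [(a, b) for a in a_vals for b in b_vals]

def syndrome(a, b, r1, r2):
    """Label of the phases in Eq. (15).

    X^a Z^b picks up exp(2*pi*i*b/r2) against X^(r1 n) and
    exp(-2*pi*i*a/r1) against Z^(r2 n), so the residues
    (a mod r1, b mod r2) identify the syndrome.
    """
    return (a % r1, b % r2)

def syndromes_are_distinct(r1, r2):
    """True if no two correctable shifts share a syndrome."""
    shifts = correctable_shifts(r1, r2)
    seen = {syndrome(a, b, r1, r2) for a, b in shifts}
    return len(seen) == len(shifts)
```

For odd r1 and r2 the correctable shifts use every one of the r1·r2 syndromes exactly once, which is the sense in which the code is perfect; a shift by a full multiple of r1 or r2 has the trivial syndrome and goes undetected.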

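Expressed in terms of $Z$ eigenstates, the code words contain
only values of $j$ that are multiples of $r_1$ (since $Z^{r_2 n} = 1$),
and are invariant under a shift of $j$ by $r_1 n$ (since
$X^{r_1 n} = 1$). Hence a basis for the $n$-dimensional code subspace
is

$$|\bar 0\rangle = \frac{1}{\sqrt{r_2}}\left(|0\rangle + |n r_1\rangle + \cdots + |(r_2-1)\,n r_1\rangle\right),$$
$$|\bar 1\rangle = \frac{1}{\sqrt{r_2}}\left(|r_1\rangle + |(n+1) r_1\rangle + \cdots + |((r_2-1)n+1)\, r_1\rangle\right),$$
$$\vdots$$
$$|\overline{n-1}\rangle = \frac{1}{\sqrt{r_2}}\left(|(n-1) r_1\rangle + |(2n-1) r_1\rangle + \cdots + |(r_2 n-1)\, r_1\rangle\right). \tag{17}$$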

If the states undergo an amplitude shift, the value of $j$
modulo $r_1$ is determined by measuring the stabilizer generator
$Z^{r_2 n}$, and the shift can be corrected by adjusting $j$ to the
nearest multiple of $r_1$. The code words have a similar form
in the Fourier transformed conjugate basis (the basis of $X$
eigenstates), but with $r_1$ and $r_2$ interchanged. Therefore,
amplitude shifts by less than $r_1/2$ and phase shifts by less than
$r_2/2$ can be corrected.

III. A QUDIT IN A LANDAU LEVEL

A single electron in a uniform magnetic field in two dimensions
provides an enlightening realization of our codes.
General translations in a magnetic field are noncommuting,
since an electron transported around a closed path acquires
an Aharonov-Bohm phase $e^{ie\Phi}$, where $\Phi$ is the magnetic
flux enclosed by the path. Two translations $T$ and $S$ commute
only if the operator $TST^{-1}S^{-1}$ translates an electron around
a path that encloses a flux $\Phi = k\Phi_0$, where $\Phi_0 = 2\pi/e$ is the
flux quantum and $k$ is an integer.

Translations commute with the Hamiltonian $H$, and two
translations $T_1$ and $T_2$ form a maximally commuting set if
they generate a lattice that has a unit cell enclosing one quantum
of flux. Simultaneously diagonalizing $H$, $T_1$, and $T_2$, we
obtain a Landau level of degenerate energy eigenstates, one
state corresponding to each quantum of magnetic flux. Then
$T_1$ and $T_2^{\,n}$ are the stabilizer generators of a code, where
$\bar Z = T_1^{1/n}$ and $\bar X = T_2$ are the logical operators on a code space
of dimension $n$.

Suppose the system is in a periodically identified box (a
torus), so that $T_1^{r_1} = (T_2^{\,n})^{r_2} = 1$ are translations around the
cycles of the torus. The number of flux quanta through the
torus, and hence the degeneracy of the Landau level, is
$n r_1 r_2$. The code, then, embeds an $n$-dimensional system in a
system of dimension $d = r_1 r_2 n$.

In this situation, the logical operations $\bar X$ and $\bar Z$ can be
visualized as translations of the torus in two different directions;
the stabilizer generator $\bar X^n$ is a translation by a fraction
$1/r_2$ of the length of the torus in one direction, and the stabilizer
generator $\bar Z^n$ is a translation by $1/r_1$ of the length in
the other direction. Therefore, for any state in the code space,
the wave function of the electron in a cell containing $n$ flux
quanta is periodically repeated altogether $r_1 r_2$ times to fill
the entire torus. Our code can be regarded as a kind of
“quantum repetition code”: identical “copies” of the wave
function are stored in each of the $r_1 r_2$ cells. But of course
there is only one electron, so if we detect the electron in one
cell its state is destroyed in all the cells.

This picture of the state encoded in a Landau level cautions
us about the restrictions on the type of error model that
the code can fend off successfully. If the environment
strongly probes one of the cells and detects nothing, the
wave function is suppressed in that cell. This causes an $\bar X$
error in the encoded state with a probability of about $1/(2r_2)$,
and a $\bar Z$ error with a probability of about $1/(2r_1)$. The code is
more effective if the typical errors gently deform the state in
each cell, rather than strongly deforming it in one cell.


IV.  CONTINUOUS  VARIABLE  CODES  FOR  A  SINGLE 
OSCILLATOR 

Formally, we can construct quantum codes for systems
described by continuous quantum variables by considering
the large-$d$ limit of the shift-resistant codes described in Sec.
II. We might have hoped to increase $d$ to infinity while holding
$r_1/d$ and $r_2/d$ fixed, maintaining the ability to correct
shifts in both amplitude and phase that are a fixed fraction of
the ranges of the qudit. However, since the perfect codes
satisfy

$$\frac{r_1}{d} = \frac{1}{n r_2}, \qquad \frac{r_2}{d} = \frac{1}{n r_1}, \tag{18}$$

this is not possible. Nonetheless, interesting codes can be
obtained as the amplitude and phase of the qudit approach
the position $q$ and momentum $p$ of a particle: we can hold
fixed the size of the shifts $\Delta q$ and $\Delta p$ that can be corrected,
as the ranges of $q$ and $p$ become unbounded.

Another option is to take $d \to \infty$ with $r_1/d = 1/m$ fixed and
$r_2 = m/n$ fixed, obtaining a rotor $Z = e^{i\theta}$ (or a particle in a
periodically identified finite box) that can be protected
against finite shifts in both the orientation $\theta$ of the rotor and
its (quantized) angular momentum $L$. The stabilizer of this
code is generated by

$$Z^{r_2 n} \to e^{i\theta m},$$
$$X^{r_1 n} = X^{d/r_2} \to e^{-2\pi i L (n/m)}, \tag{19}$$

and the logical operations are

$$\bar Z = Z^{r_2} \to e^{i\theta m/n},$$
$$\bar X = X^{r_1} \to e^{-2\pi i L/m}. \tag{20}$$

Since $\bar X$ shifts the value of $\theta$ by $2\pi/m$, and $\bar Z$ shifts the value
of $L$ by $m/n = r_2$, this code can correct shifts in $\theta$ with
$|\Delta\theta| < \pi/m$ and shifts in $L$ with $|\Delta L| < m/2n$.

Alternatively, we can consider a limit in which $r_1$ and $r_2$
both become large. We may write $r_1 = \alpha/\varepsilon$ and $r_2 = 1/(n\alpha\varepsilon)$,
where $d = n r_1 r_2 = 1/\varepsilon^2$, obtaining a code with stabilizer
generators

$$Z^{r_2 n} \to \left(e^{2\pi i q\varepsilon}\right)^{1/(\alpha\varepsilon)} = e^{2\pi i q/\alpha},$$
$$X^{r_1 n} \to \left(e^{-ip\varepsilon}\right)^{n\alpha/\varepsilon} = e^{-inp\alpha}, \tag{21}$$

and logical operations

$$\bar Z = e^{2\pi i q/(n\alpha)}, \qquad \bar X = e^{-ip\alpha}, \tag{22}$$

where $\alpha$ is an arbitrary real number. Using the identity
$e^{A}e^{B} = e^{[A,B]}e^{B}e^{A}$ (which holds if $A$ and $B$ commute with
their commutator) and the canonical commutation relation
$[q,p] = i$, we verify that

$$\bar Z \bar X = \omega\, \bar X \bar Z, \qquad \omega = e^{2\pi i/n}. \tag{23}$$
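As a worked check of Eq. (23), the short derivation below applies the stated identity to the logical operators of Eq. (22). The symbols $A$ and $B$ are labels introduced here for the two exponents; they are not notation from the text beyond the identity itself.

```latex
\begin{align*}
A &= \frac{2\pi i}{n\alpha}\, q, \qquad B = -i\alpha\, p,\\
[A,B] &= \frac{2\pi i}{n\alpha}\,(-i\alpha)\,[q,p]
       = \frac{2\pi}{n}\, i = \frac{2\pi i}{n},\\
\bar Z \bar X &= e^{A}e^{B} = e^{[A,B]}\,e^{B}e^{A}
       = e^{2\pi i/n}\,\bar X \bar Z .
\end{align*}
```

The same computation with $A$ replaced by $2\pi i q/\alpha$ gives a phase $e^{2\pi i} = 1$, consistent with the stabilizer generator $e^{2\pi i q/\alpha}$ commuting with $\bar X$.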
FIG. 1. Code words of the $n = 2$ code. The states $|\bar 0\rangle$, $|\bar 1\rangle$ are
superpositions of $q$ eigenstates, periodically spaced with period $2\alpha$;
the two basis states differ by a displacement in $q$ by $\alpha$. The states
$(|\bar 0\rangle \pm |\bar 1\rangle)/\sqrt2$ are superpositions of $p$ eigenstates, periodically
spaced with period $2\pi/\alpha$; the two basis states differ by a displacement
in $p$ by $\pi/\alpha$.


Since $\bar X$ translates $q$ by $\alpha$ and $\bar Z$ translates $p$ by $2\pi/(n\alpha)$, the
code protects against shifts with

$$|\Delta q| < \frac{\alpha}{2}, \qquad |\Delta p| < \frac{\pi}{n\alpha}. \tag{24}$$

Note that the shifts in momentum and position that the code
can correct obey the condition

$$\Delta p\, \Delta q < \frac{\pi}{2n}\,\hbar. \tag{25}$$

In typical situations, errors in $q$ and $p$ are of comparable
magnitude, and it is best to choose $\alpha = \sqrt{2\pi/n}$ so that

$$\bar Z = \exp\left(iq\sqrt{\frac{2\pi}{n}}\right), \qquad \bar X = \exp\left(-ip\sqrt{\frac{2\pi}{n}}\right). \tag{26}$$

Formally, the code words are coherent superpositions of
infinitely squeezed states, e.g., (up to normalization)

$$|\bar Z = \omega^{j}\rangle = \sum_{s=-\infty}^{\infty} |q = \alpha(j+ns)\rangle,$$
$$|\bar X = \omega^{j}\rangle = \sum_{s=-\infty}^{\infty} \left|p = \frac{2\pi}{n\alpha}(j+ns)\right\rangle. \tag{27}$$

(See Fig. 1.) Of course, realistic code words will be normalizable
finitely squeezed states, rather than nonnormalizable
infinitely squeezed states. But squeezing in at least one of $p$
and $q$ is required to comfortably fulfill condition (25).

The Wigner function associated with the code word wave
function $\psi^{(j)}(q) = \langle q|\bar Z = \omega^{j}\rangle$ is a sum of delta functions
positioned at the sites of a lattice in phase space, where three
quarters of the delta functions are positive and one quarter
are negative. Explicitly, we have

$$W^{(j)}(q,p) = \frac{1}{2\pi}\int_{-\infty}^{\infty} dx\, e^{ipx}\, \psi^{(j)}(q+x/2)^{*}\, \psi^{(j)}(q-x/2)$$
$$\propto \sum_{s,t=-\infty}^{\infty} (-1)^{st}\, \delta\!\left(p - \frac{\pi}{n\alpha}s\right) \delta\!\left(q - \alpha j - \frac{n\alpha}{2}t\right); \tag{28}$$

the $\delta$ functions are negative on the sublattice with $s$, $t$ odd. If
we integrate over $p$, the oscillating sign causes the terms with
odd $t$ to cancel in the sum over $s$, and the surviving positive
$\delta$ functions have support at $q = (n \times \text{integer} + j)\alpha$. If we
integrate over $q$, the terms with odd $s$ cancel in the sum over $t$,
and the surviving positive $\delta$ functions have support at
$p = (2\pi/n\alpha) \times \text{integer}$. Wigner functions for the $\bar X$ eigenstates
are similar, but with the roles of $q$ and $p$ interchanged.

It is also of interest to express the encoded states in terms
of the basis of coherent states. Consider for example the
encoded state with $\bar X = 1$, which is the unique simultaneous
eigenstate with eigenvalue one of the operators $e^{2\pi i q/\alpha}$ and
$e^{-ip\alpha}$. In fact starting with any state $|\psi\rangle$, we can construct
the encoded state (up to normalization) as

$$\left(\sum_{s=-\infty}^{\infty} e^{-isp\alpha}\right)\left(\sum_{t=-\infty}^{\infty} e^{2\pi i t q/\alpha}\right)|\psi\rangle = \sum_{s,t} \exp\left[i\left(-sp\alpha + 2\pi t q/\alpha + \pi st\right)\right]|\psi\rangle. \tag{29}$$

In particular, if we choose $|\psi\rangle$ to be the ground state $|0\rangle$ of
the oscillator, then the operator $\exp[i(-sp\alpha + 2\pi t q/\alpha + \pi st)]$
displaces it to a coherent state centered at the point
$(q,p) = (s\alpha, 2\pi t/\alpha)$ in the quadrature plane. Thus the encoded
state is an equally weighted superposition of coherent
states, with centers chosen from the sites of a lattice in the
quadrature plane whose unit cell has area $2\pi$. Since the coherent
states are overcomplete, the expansion is not unique;
indeed, if we choose $|\psi\rangle$ to be a coherent state rather than the
vacuum, then the lattice is rigidly translated, but the encoded
state remains invariant.

We can envision the stabilizer of the code as a lattice of
translations in phase space that preserve the code words: the
lattice generated by the translations $e^{2\pi i q/\alpha}$ and $e^{-inp\alpha}$. In
fact, this lattice need not be rectangular; we can encode an
$n$-dimensional system in the Hilbert space of a single oscillator
by choosing any two variables $Q$ and $P$ that satisfy the
canonical commutation relation $[Q,P] = i$, and constructing
the code space as the simultaneous eigenspace of $e^{2\pi i Q}$ and
$e^{-inP}$. The unit cell of the lattice has area $2\pi\hbar n$, in keeping
with the principle that each quantum state “occupies” an
area $2\pi\hbar$ in the phase space of a system with one continuous
degree of freedom.
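To make the correctable region of Eq. (24) concrete for the rectangular lattice, the sketch below models an error as a classical pair of shifts (u, v) in q and p, infers the smallest shift consistent with the measured values of q modulo α and p modulo 2π/(nα), and reports the residual logical operator. This is a simplified classical picture rather than a simulation of the quantum state; the function names are hypothetical, and the sign convention for the residual powers is left unspecified.

```python
import math

def centered_mod(x, period):
    """Representative of x modulo `period` in the interval [-period/2, period/2)."""
    return (x + period / 2) % period - period / 2

def residual_logical_error(u, v, n=2, alpha=None):
    """Powers (mod n) of the logical X and Z left over after correcting shifts (u, v).

    The stabilizer measurement reveals u mod alpha and v mod 2*pi/(n*alpha);
    recovery undoes the smallest shift with those residues, so whatever
    remains is a whole number of logical translations.
    """
    if alpha is None:
        alpha = math.sqrt(2 * math.pi / n)      # symmetric choice, as in Eq. (26)
    q_period = alpha
    p_period = 2 * math.pi / (n * alpha)
    du = centered_mod(u, q_period)              # inferred position shift
    dv = centered_mod(v, p_period)              # inferred momentum shift
    x_power = round((u - du) / q_period) % n
    z_power = round((v - dv) / p_period) % n
    return x_power, z_power
```

For n = 2 with the symmetric choice α = √π, a result of (0, 0) means recovery succeeded, which happens whenever both shifts are smaller than √π/2 in magnitude.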

V.  FINITE  SQUEEZING 

Strictly  speaking,  our  code  words  are  nonnormalizable 
states,  infinitely  squeezed  in  both  q  and  p.  In  practice,  we 
will  have  to  work  with  approximate  code  words  that  will  be 
finitely  squeezed  normalizable  states.  We  need  to  consider 
how  using  such  approximate  code  words  will  affect  the  prob¬ 
ability  of  error. 

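We will replace a position eigenstate $\delta(0)$ by a normalized
Gaussian of width $\Delta$ centered at the origin,

$$|\psi_0\rangle = \int_{-\infty}^{\infty} \frac{dq}{(\pi\Delta^2)^{1/4}}\, e^{-\frac12 q^2/\Delta^2}\, |q\rangle = \int_{-\infty}^{\infty} \frac{dp}{(\pi/\Delta^2)^{1/4}}\, e^{-\frac12 \Delta^2 p^2}\, |p\rangle. \tag{30}$$

A code word, formally a coherent superposition of an infinite
number of $\delta$ functions, becomes a sum of Gaussians
weighted by a Gaussian envelope function of width $\kappa^{-1}$; in
the special case of a two-dimensional code space, the approximate
code words become

$$|\tilde 0\rangle = N_0 \sum_{s=-\infty}^{\infty} e^{-\frac12 \kappa^2 (2s\alpha)^2}\, T(2s\alpha)\, |\psi_0\rangle,$$
$$|\tilde 1\rangle = N_1 \sum_{s=-\infty}^{\infty} e^{-\frac12 \kappa^2 [(2s+1)\alpha]^2}\, T[(2s+1)\alpha]\, |\psi_0\rangle, \tag{31}$$

where $T(a)$ translates $q$ by $a$, $N_{0,1}$ are normalization factors,
and we use, e.g., $|\tilde 0\rangle$ rather than $|\bar 0\rangle$ to denote the approximate
code word. We will assume that $\kappa\alpha$ and $\Delta/\alpha$ are small
compared to one, so that $N_0 \approx N_1 \approx (4\kappa^2\alpha^2/\pi)^{1/4}$; then in
momentum space, the approximate code word becomes, e.g.,

$$\frac{|\tilde 0\rangle + |\tilde 1\rangle}{\sqrt2} \approx \left(\frac{\kappa^2\alpha^2}{\pi}\right)^{1/4} \int_{-\infty}^{\infty} \frac{dp}{(\pi/\Delta^2)^{1/4}}\, e^{-\frac12 \Delta^2 p^2} \sum_{s=-\infty}^{\infty} e^{-\frac12 \kappa^2 (s\alpha)^2}\, e^{ip(\alpha s)}\, |p\rangle. \tag{32}$$

By applying the Poisson summation formula,

$$\sum_{m=-\infty}^{\infty} e^{-\pi a (m-b)^2} = a^{-1/2} \sum_{s=-\infty}^{\infty} e^{-\pi s^2/a}\, e^{2\pi i s b}, \tag{33}$$

this approximate code word can be rewritten as

$$\frac{|\tilde 0\rangle + |\tilde 1\rangle}{\sqrt2} \approx \left(\frac{\kappa^2\alpha^2}{\pi}\right)^{1/4} \int_{-\infty}^{\infty} \frac{dp}{(\pi/\Delta^2)^{1/4}}\, e^{-\frac12 \Delta^2 p^2}\, \frac{\sqrt{2\pi}}{\kappa\alpha} \sum_{m=-\infty}^{\infty} \exp\left[-\frac12\left(p - \frac{2\pi}{\alpha}m\right)^2\Big/\kappa^2\right] |p\rangle$$
$$\approx \int_{-\infty}^{\infty} \frac{dp}{(\pi\kappa^2)^{1/4}} \left(\frac{4\pi\Delta^2}{\alpha^2}\right)^{1/4} \sum_{m=-\infty}^{\infty} e^{-\frac12 \Delta^2 (2\pi m/\alpha)^2} \exp\left[-\frac12\left(p - \frac{2\pi}{\alpha}m\right)^2\Big/\kappa^2\right] |p\rangle, \tag{34}$$

again a superposition of Gaussians weighted by a Gaussian
envelope. (See Fig. 2.)

FIG. 2. Probability distribution in position space
$P(q) = \frac12 |\langle q|(|\tilde 0\rangle + |\tilde 1\rangle)|^2$ for an approximate code word with
$\Delta = \kappa = 0.25$. The dashed line is the distribution’s Gaussian envelope.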

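The approximate code words $|\tilde 0\rangle$, $|\tilde 1\rangle$ have a small overlap
if $\Delta$ is small compared to $\alpha$, and $\kappa$ is small compared to
$\pi/\alpha$. For estimating the error probability caused by the overlap,
let’s consider the special case where $q$ and $p$ are treated
symmetrically, $\alpha = \sqrt{\pi}$ and $\kappa = \Delta$; then

$$|\langle q|\tilde 0\rangle|^2 \approx \frac{2}{\sqrt{\pi}} \sum_{s=-\infty}^{\infty} e^{-4\pi\Delta^2 s^2} \exp\left[-(q - 2s\sqrt{\pi})^2/\Delta^2\right] \tag{35}$$

and

$$\frac12 \left|\langle p|\tilde 0\rangle + \langle p|\tilde 1\rangle\right|^2 \approx \frac{2}{\sqrt{\pi}} \sum_{m=-\infty}^{\infty} e^{-4\pi\Delta^2 m^2} \exp\left[-(p - 2m\sqrt{\pi})^2/\Delta^2\right]. \tag{36}$$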
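The narrow peaks in Eqs. (35) and (36) are what make the intrinsic error probability small. As a rough numerical illustration, the sketch below evaluates the bound that this section quotes as Eq. (38), read here as Error Prob < (2Δ/π) exp(-π/4Δ²), and compares it with the exact weight of a single Gaussian peak of the form exp(-q²/Δ²) lying farther than √π/2 from its center. Treating one peak in isolation is a simplifying assumption of this sketch, and the function names are hypothetical.

```python
import math

def error_bound(delta):
    """Bound of Eq. (38) as read here: (2*delta/pi) * exp(-pi / (4*delta**2))."""
    return 2 * delta / math.pi * math.exp(-math.pi / (4 * delta ** 2))

def single_peak_tail(delta):
    """Weight of a normalized peak proportional to exp(-q^2/delta^2) with |q| > sqrt(pi)/2.

    For this density the two-sided tail equals erfc(sqrt(pi) / (2*delta)).
    """
    return math.erfc(math.sqrt(math.pi) / (2 * delta))
```

Evaluating `error_bound` at Δ = 0.5 and Δ = 0.25 gives values of roughly one percent and below one part in a million, in line with the figures quoted in the text.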

To perform error recovery, we measure the value of $q$ and $p$
modulo $\sqrt{\pi}$ and then correct for the observed shift. In the
state $|\tilde 0\rangle$, the probability of failure is the probability that $q$ is
closer to an odd multiple of $\sqrt{\pi}$ than an even multiple, and in
the state $(|\tilde 0\rangle + |\tilde 1\rangle)/\sqrt2$, the error probability is the probability
that $p$ is closer to an odd multiple of $\sqrt{\pi}$ than an even
multiple. For both the amplitude and phase errors, the intrinsic
error probability arising from the imperfections of the
approximate code words becomes exponentially small
for small $\Delta$. Using the asymptotic expansion of the error
function,

$$\int_x^{\infty} dt\, e^{-t^2} = \frac{1}{2x}\, e^{-x^2}\left[1 - O(1/x^2)\right], \tag{37}$$

we may estimate the error probability by summing the contributions
from the tails of all the Gaussians, obtaining

$$\text{Error Prob} < \frac{2\Delta}{\pi}\, e^{-\pi/4\Delta^2}. \tag{38}$$

This error probability is about 1% for $\Delta \approx 0.5$, and is already
less than $10^{-6}$ for a still modest value $\Delta \approx 0.25$. Using
finitely squeezed approximate code words does not badly
compromise the error-correcting power of the code, since a
gentle spreading in $p$ and $q$ is just the kind of error the code
is intended to cope with.

The mean photon number of a finitely squeezed approximate
code word is

$$\langle a^\dagger a\rangle + \frac12 = \frac12 \langle p^2 + q^2\rangle \approx \Delta^{-2} \tag{39}$$

for small $\Delta$. Therefore, an error probability of order $10^{-6}$
can be achieved with Gaussian approximate code words that
have mean photon number of about $(0.25)^{-2} = 16$.

More generally, a finitely squeezed code word $|\psi\rangle$ can be
regarded as a perfect code word $|\xi\rangle$ that has undergone an
error; we may write

$$|\psi\rangle = \int du\, dv\, \eta(u,v)\, e^{i(-up + vq)}\, |\xi\rangle, \tag{40}$$

where $\eta(u,v)$ is an error “wave function.” In the special
case of a Gaussian finitely squeezed code word, we have

$$\eta(u,v) = \frac{1}{\sqrt{\pi\kappa\Delta}} \exp\left[-\frac12\left(u^2/\Delta^2 + v^2/\kappa^2\right)\right], \tag{41}$$

where $\Delta$ and $\kappa$ are the squeezing parameters defined above.

If $\eta(u,v)$ vanishes for $|u| > \alpha/2$ or $|v| > \pi/(n\alpha)$, then the
error is correctable. In this case, the interpretation of $\eta(u,v)$
as a wave function has a precise meaning, since there is an
unambiguous decomposition of a state into code word and
error. Indeed, if $|\xi_1\rangle$, $|\xi_2\rangle$ are perfect code words and $|\psi_1\rangle$,
$|\psi_2\rangle$ are the corresponding finitely squeezed code words with
error wave functions $\eta_1$, $\eta_2$, then

$$\langle \psi_1|\psi_2\rangle = \langle \xi_1|\xi_2\rangle \cdot \langle \eta_1|\eta_2\rangle, \tag{42}$$

where

$$\langle \eta_1|\eta_2\rangle = \int du\, dv\, \eta_1(u,v)^{*}\, \eta_2(u,v). \tag{43}$$

VI. CONTINUOUS VARIABLE CODES FOR MANY
OSCILLATORS

The continuous variable codes described in Sec. IV are
based on simple lattices in the two-dimensional phase space
of a single particle. We can construct more sophisticated
codes from lattices in the $2N$-dimensional phase space of $N$
particles. Then codes of higher quality can be constructed
that take advantage of efficient packings of spheres in higher
dimensions.

For a system of $N$ oscillators, a tensor product of Pauli
operators can be expressed in terms of the canonical variables
$q_i$ and $p_i$ as

$$U_{\alpha\beta} = \exp\left[i\sqrt{2\pi} \sum_{i=1}^{N} \left(\alpha_i p_i + \beta_i q_i\right)\right], \tag{44}$$

where the $\alpha_i$’s and $\beta_i$’s are real numbers. (In this setting, the
Pauli operators are sometimes called “Weyl operators.”)
Two such operators commute up to a phase:

$$U_{\alpha\beta}\, U_{\alpha'\beta'} = e^{2\pi i\, \omega(\alpha\beta,\, \alpha'\beta')}\, U_{\alpha'\beta'}\, U_{\alpha\beta}, \tag{45}$$

where

$$\omega(\alpha\beta, \alpha'\beta') \equiv \alpha\cdot\beta' - \alpha'\cdot\beta \tag{46}$$

is  the  symplectic  form.  Thus  two  Pauli  operators  commute  if 
and  only  if  their  symplectic  form  is  an  integer. 
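The commutation criterion can be tried out on the single-oscillator code of Sec. IV. The sketch below assumes the parametrization in which a Pauli operator is exp[i√(2π)(α·p + β·q)], so that the phase in Eq. (45) is exp(2πi ω); each operator is represented by a hypothetical pair (alphas, betas) of coefficient lists, and the tolerance in `commute` is an arbitrary choice made for floating-point comparison.

```python
import math

def symplectic_form(op1, op2):
    """omega = alpha . beta' - alpha' . beta for op = (alphas, betas), as in Eq. (46)."""
    (a1, b1), (a2, b2) = op1, op2
    dot = lambda x, y: sum(xi * yi for xi, yi in zip(x, y))
    return dot(a1, b2) - dot(a2, b1)

def commute(op1, op2, tol=1e-9):
    """Two Pauli operators commute iff their symplectic form is an integer."""
    w = symplectic_form(op1, op2)
    return abs(w - round(w)) < tol

# Single-oscillator code with n = 2 and lattice spacing sqrt(pi) (called a0 here).
n, a0 = 2, math.sqrt(math.pi)
root = math.sqrt(2 * math.pi)
stab_q = ([0.0], [root / a0])            # exp(2*pi*i*q/a0)
stab_p = ([-n * a0 / root], [0.0])       # exp(-i*n*p*a0)
z_bar = ([0.0], [root / (n * a0)])       # exp(2*pi*i*q/(n*a0))
x_bar = ([-a0 / root], [0.0])            # exp(-i*p*a0)
```

With these definitions the two stabilizer generators have symplectic form n, both logical operators commute with both generators, and the logical pair has symplectic form 1/n, reproducing the phase of Eq. (23).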

Now a general continuous variable stabilizer code is the
simultaneous eigenspace of commuting Pauli operators, the
code’s stabilizer generators. If the continuous variable phase
space is $2N$-dimensional and the code space is a finite-dimensional
Hilbert space, then there must be $2N$ independent
generators. The elements of the stabilizer group are in
one-to-one correspondence with the points of a lattice $\mathcal{L}$ in
phase space, via the relation

$$U(k_1, k_2, \ldots, k_{2N}) = \exp\left(i\sqrt{2\pi} \sum_{a=1}^{2N} k_a v_a\right). \tag{47}$$

Here $\{v_a,\ a = 1,2,\ldots,2N\}$ are the basis vectors of the lattice
(each a linear combination of $q$’s and $p$’s), the $k_a$’s are arbitrary
integers, and $U(k_1, k_2, \ldots, k_{2N})$ is the corresponding
element of the stabilizer. For the stabilizer group to be Abelian,
the symplectic inner product of any pair of basis vectors
must be an integer; that is, the antisymmetric $2N \times 2N$ matrix

$$A_{ab} = \omega(v_a, v_b) \tag{48}$$

has integral entries. The lattice $\mathcal{L}$ has a $2N \times 2N$ generator
matrix $M$ whose rows are the basis vectors,

$$M = \begin{pmatrix} v_1 \\ v_2 \\ \vdots \\ v_{2N} \end{pmatrix}. \tag{49}$$

In terms of $M$, the matrix $A$ can be expressed as

$$A = M \omega M^{T}, \tag{50}$$

where $\omega$ denotes the $2N \times 2N$ matrix

$$\omega = \begin{pmatrix} 0 & I \\ -I & 0 \end{pmatrix}, \tag{51}$$

and $I$ is the $N \times N$ identity matrix.

The generator matrix of a lattice is not unique. The replacement

$$M \to M' = RM \tag{52}$$

leaves the lattice unmodified, where $R$ is an invertible integral
matrix with determinant $\pm 1$ (whose inverse is also integral).
Under this replacement, the matrix $A$ changes according to

$$A \to A' = R A R^{T}. \tag{53}$$

By Gaussian elimination, an $R$ can be constructed such that
the antisymmetric matrix $A$ is transformed to

$$A' = \begin{pmatrix} 0 & D \\ -D & 0 \end{pmatrix}, \tag{54}$$

where $D$ is a positive diagonal $N \times N$ matrix.

There are also Pauli operators that provide a basis for the
operations acting on the code subspace; these are the Pauli
operators that commute with the stabilizer but are not contained
in the stabilizer. The operators that commute with the
stabilizer themselves form a lattice $\mathcal{L}^{\perp}$ that is dual (in the
symplectic form) to the stabilizer lattice. The basis vectors of
this lattice can be chosen to be $\{u_b,\ b = 1,2,3,\ldots,2N\}$ such
that

$$\omega(u_a, v_b) = \delta_{ab}; \tag{55}$$

then the generator matrix

$$M^{\perp} = \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_{2N} \end{pmatrix} \tag{56}$$

of $\mathcal{L}^{\perp}$ has the property

$$M^{\perp} \omega M^{T} = I. \tag{57}$$

It follows from Eq. (48) and Eq. (55) that the $\mathcal{L}$ basis
vectors can be expanded in terms of the $\mathcal{L}^{\perp}$ basis vectors as

$$v_a = \sum_b A_{ab}\, u_b, \tag{58}$$

or

$$M = A M^{\perp}, \tag{59}$$

and hence that

$$\omega(u_a, u_b) = (A^{-1})_{ba}, \tag{60}$$

or

$$M^{\perp} \omega (M^{\perp})^{T} = (A^{-1})^{T}. \tag{61}$$

If the lattice basis vectors are chosen so that $A$ has the standard
form Eq. (54), then

$$(A^{-1})^{T} = \begin{pmatrix} 0 & D^{-1} \\ -D^{-1} & 0 \end{pmatrix}. \tag{62}$$

In the special case of a self-dual lattice, corresponding to a
code with a one-dimensional code space, both $A$ and $A^{-1}$
must be integral; hence $D = D^{-1}$ and the standard form of $A$
is

$$A = \begin{pmatrix} 0 & I \\ -I & 0 \end{pmatrix}. \tag{63}$$

Since the code subspace is invariant under the translations
in $\mathcal{L}$, we can think of the encoded information as residing on
a torus, the unit cell of $\mathcal{L}$. The encoded Pauli operators
$\{\bar X^a \bar Z^b\}$ are a lattice of translations on this torus, corresponding
to the coset space $\mathcal{L}^{\perp}/\mathcal{L}$. The number of encoded Pauli
operators is the ratio of the volume of the unit cell of $\mathcal{L}$ to the
volume of the unit cell of $\mathcal{L}^{\perp}$, namely the determinant of $A$,
which is therefore the square of the dimension of the Hilbert
space of the code. Thus the dimension of the code space is

$$n = |\mathrm{Pf}\,A| = \det D, \tag{64}$$

where $\mathrm{Pf}\,A$ denotes the Pfaffian, the square root of the determinant
of the antisymmetric matrix $A$.

The stabilizer lattice unit cell has volume $|\mathrm{Pf}\,A|$ in units
with $h = 2\pi\hbar = 1$, and the unit cell of the lattice of encoded
operations has volume $|\mathrm{Pf}\,A|^{-1}$ in these units. So the code
fits an $n$-dimensional code space into $n$ units of phase space
volume,  as  expected. 
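The relations $A = M\omega M^T$ and $n = |\mathrm{Pf}\,A|$ can be checked numerically. The following is a minimal sketch, not part of the original paper: the function names and the two example generator matrices are illustrative assumptions, with phase-space coordinates ordered so that $\omega$ has the block form of Eq. (51).

```python
import math

def symplectic_form(N):
    """Return the 2N x 2N matrix omega = [[0, I], [-I, 0]] of Eq. (51)."""
    w = [[0.0] * (2 * N) for _ in range(2 * N)]
    for i in range(N):
        w[i][N + i] = 1.0
        w[N + i][i] = -1.0
    return w

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(col) for col in zip(*X)]

def det(X):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in X]
    n, d = len(a), 1.0
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(a[r][c]))
        if abs(a[piv][c]) < 1e-12:
            return 0.0
        if piv != c:
            a[c], a[piv] = a[piv], a[c]
            d = -d
        d *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= f * a[c][k]
    return d

def stabilizer_form(M):
    """A = M omega M^T, Eq. (50); rows of M are the lattice basis vectors."""
    N = len(M) // 2
    return matmul(matmul(M, symplectic_form(N)), transpose(M))

def is_integral(A, tol=1e-9):
    return all(abs(x - round(x)) < tol for row in A for x in row)

def code_dimension(M):
    """n = |Pf A| = sqrt(det A), Eq. (64); requires A integral (Abelian stabilizer)."""
    A = stabilizer_form(M)
    if not is_integral(A):
        raise ValueError("A is not integral, so the stabilizer is not Abelian")
    return round(math.sqrt(abs(det(A))))

# Hypothetical examples: a square-lattice code for one oscillator with n = 2,
# and a two-oscillator lattice whose standard form has D = diag(2, 3).
M_single = [[math.sqrt(2), 0.0], [0.0, math.sqrt(2)]]
s2, s3 = math.sqrt(2), math.sqrt(3)
M_two = [[s2, 0, 0, 0], [0, s3, 0, 0], [0, 0, s2, 0], [0, 0, 0, s3]]
```

In this sketch `code_dimension(M_single)` evaluates $\sqrt{\det A}$ for $A = 2\omega$, and `code_dimension(M_two)` does the same for $D = \mathrm{diag}(2,3)$, where $\det D = 6$.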

Codes of the CSS type (those analogous to the binary quantum codes
first constructed by Calderbank and Shor [15] and by Steane [16]) are
constructed by choosing one lattice $\mathcal{L}_q$ describing
stabilizer generators that are linear combinations of the $q$'s, and
another lattice $\mathcal{L}_p \subset \mathcal{L}_q^\perp$ describing
stabilizer generators that are linear combinations of the $p$'s.


012310-8 


ENCODING  A  QUBIT  IN  AN  OSCILLATOR 


PHYSICAL  REVIEW  A  64  012310 


FIG. 3. The stabilizer lattice and its dual for an n = 2 code of a
single oscillator. Solid lines indicate the stabilizer lattice; solid and
dotted lines together comprise the dual lattice. In units of $2\pi\hbar$,
the unit cell of the stabilizer lattice (shaded) has area 2, and the unit
cell of its dual has area 1/2.


(Here $\mathcal{L}_q^\perp$ denotes the Euclidean dual of the lattice
$\mathcal{L}_q$.) The generator matrix of a CSS code has the form

$$M = \begin{pmatrix} M_q & 0 \\ 0 & M_p \end{pmatrix}, \tag{65}$$

where $M_q$ and $M_p$ are $N\times N$ matrices, and the integral matrix
$A$ has the form

$$A = \begin{pmatrix} 0 & M_q M_p^T \\ -M_p M_q^T & 0 \end{pmatrix}. \tag{66}$$

For single-oscillator codes described in Sec. IV, $A$ is the
$2\times 2$ matrix

$$A = \begin{pmatrix} 0 & n \\ -n & 0 \end{pmatrix}, \tag{67}$$

where $n$ is the code's dimension. For a single-oscillator CSS code,
the lattice is rectangular, as shown in Fig. 3.

The closest packing of circles in two dimensions is achieved by the
hexagonal lattice. The generator matrix for a hexagonally encoded qunit
can be chosen to be

$$M = \left(\frac{2}{\sqrt{3}}\right)^{1/2} \sqrt{n} \begin{pmatrix} 1 & 0 \\ 1/2 & \sqrt{3}/2 \end{pmatrix}, \tag{68}$$

and the dual lattice is generated by

$$M^\perp = \frac{1}{n} M. \tag{69}$$

The shortest vector of the dual lattice has length
$(2/n\sqrt{3})^{1/2}$, compared to length $1/\sqrt{n}$ for the square
lattice. Therefore the size of the smallest uncorrectable shift is
larger for the hexagonal code than for the square lattice code, by the
factor $(2/\sqrt{3})^{1/2} \approx 1.07457$.

An important special class of quantum codes for many oscillators are
the concatenated codes. In particular, we can encode a qubit in each of
$N$ oscillators using the code of Sec. IV. Then we can use a binary
stabilizer code that encodes $k$ qubits in a block of $N$ oscillators,
and protects against arbitrary errors on any $t$ oscillators, where
$2t+1$ is the binary code's distance. The concatenated codes have the
important advantage that they can protect against a broader class of
errors than small diffusive shifts applied to each oscillator: if most
of the oscillators undergo only small shifts in $p$ and $q$, but a few
oscillators sustain more extensive damage, then concatenated codes
still work effectively.

For example, there is a binary [[7,1,3]] quantum code, well suited to
fault-tolerant processing, that encodes one logical qubit in a block of
seven qubits and can protect against heavy damage on any one of the
seven [2]. Given seven oscillators, we can encode a qubit in each one
that is resistant to quantum diffusion, and then use the [[7,1,3]]
block code to protect one logical qubit against severe damage to any
one of the oscillators.

For $n \ge 5$, there is a [[5,1,3]] polynomial code [17], also well
suited to fault-tolerant processing, encoding one qunit in a block of 5.
(Actually, [[5,1,3]] quantum codes exist for $n < 5$ as well [6,7], but
these codes are less conducive to fault-tolerant computing.) The larger
value of $n$ increases the vulnerability of each qunit to shift errors.
Hence, whether the [[7,1,3]] binary code or the [[5,1,3]] code should be
preferred depends on the relationship of the size of the typical shift
errors to the rate of large errors.

VII. ERROR MODELS

What sort of errors can be corrected by these codes? The codes are
designed to protect against errors that shift the values of the
canonical variables $p$ and $q$. In fact the Pauli operators are a
complete basis, so the action of a general superoperator $\mathcal{E}$
acting on the input density matrix $\rho$ of a single oscillator can be
expanded in terms of such shifts, as in

$$\mathcal{E}(\rho) = \int d\alpha\, d\beta\, d\alpha'\, d\beta'\; C(\alpha,\beta;\alpha',\beta')\, e^{i(\alpha p + \beta q)}\, \rho\, e^{-i(\alpha' p + \beta' q)}. \tag{70}$$

If the support of $C(\alpha,\beta;\alpha',\beta')$ is concentrated on
sufficiently small values of its arguments, then the input $\rho$ can be
recovered with high fidelity.

A useful model of decoherence is the special case of a “Pauli channel”
in which $C(\alpha,\beta;\alpha',\beta')$ is diagonal and the
superoperator can be expressed as

$$\mathcal{E}(\rho) = \int d\alpha\, d\beta\; P(\alpha,\beta)\, e^{i(\alpha p + \beta q)}\, \rho\, e^{-i(\alpha p + \beta q)}. \tag{71}$$

Since $\mathcal{E}$ is positive and trace preserving, we infer that
$P(\alpha,\beta) \ge 0$ and

$$\int d\alpha\, d\beta\; P(\alpha,\beta) = 1. \tag{72}$$

Thus, we may interpret $P(\alpha,\beta)$ as a probability distribution:
the phase space translation

$$(q,p) \to (q - \alpha,\; p + \beta) \tag{73}$$

is applied with probability $P(\alpha,\beta)$.

Weak interactions between an oscillator and its environment drive a
diffusive process that can be well modeled by a Pauli channel. If the
environment quickly “forgets” what it learns about the oscillator, the
evolution of the oscillator can be described by a master equation. Over
a short time interval $dt$, the shifts applied to the oscillator may be
assumed to be small, so that the Pauli operator can be expanded in
powers of $\alpha$ and $\beta$. Suppose that the shifts are
symmetrically distributed in phase space such that

$$\langle\alpha\rangle = \langle\beta\rangle = 0, \qquad \langle\alpha^2\rangle = \langle\beta^2\rangle, \qquad \langle\alpha\beta\rangle = 0, \tag{74}$$

where $\langle\cdot\rangle$ denotes the mean value determined by the
probability distribution $P(\alpha,\beta)$. Suppose further that the
shifts are diffusive, so that the mean square displacement increases
linearly with $dt$; we may write

$$\langle\alpha^2\rangle = \langle\beta^2\rangle = D\,dt, \tag{75}$$

where $D$ is a diffusion constant. We then obtain

$$\rho(t+dt) = \rho(t) + D\,dt\left( p\rho p - \tfrac{1}{2}p^2\rho - \tfrac{1}{2}\rho p^2 \right) + D\,dt\left( q\rho q - \tfrac{1}{2}q^2\rho - \tfrac{1}{2}\rho q^2 \right) + O(dt^{3/2}), \tag{76}$$

or

$$\dot\rho = -\frac{D}{2}[p,[p,\rho]] - \frac{D}{2}[q,[q,\rho]]. \tag{77}$$

The interpretation of $D$ as a diffusion constant can be confirmed by
computing

$$\frac{d}{dt}\,\mathrm{tr}\,(p^2\rho) = D = \frac{d}{dt}\,\mathrm{tr}\,(q^2\rho); \tag{78}$$

the mean square values of $p$ and $q$ increase with time as $Dt$.

More generally, the master equation contains a diffusive term
determined by the covariance of the distribution $P(\alpha,\beta)$, and
perhaps also a nondissipative drift term determined by the mean of
$P(\alpha,\beta)$. Our quantum error-correcting codes can successfully
suppress decoherence caused by quantum diffusion, if the recovery
operation is applied often enough; roughly, the time interval
$\Delta t$ between error correction steps should be small compared to
the characteristic diffusion time $D^{-1}$.

Interactions with the environment might also damp the amplitude of the
oscillator, as described by the master equation

$$\dot\rho = \Gamma\left( a\rho a^\dagger - \tfrac{1}{2}a^\dagger a\rho - \tfrac{1}{2}\rho a^\dagger a \right); \tag{79}$$

here $a = (q + ip)/\sqrt{2}$ is the annihilation operator and $\Gamma$
is a decay rate. This master equation cannot be obtained from a Pauli
channel, but as for quantum diffusion, the effects of amplitude damping
over short time intervals can be expressed in terms of small phase-space
displacements.

The master equation for amplitude damping can be obtained as the
$dt \to 0$ limit of the superoperator

$$\rho(t+dt) = \mathcal{E}(\rho(t)) = \left(\sqrt{\Gamma dt}\,a\right)\rho(t)\left(\sqrt{\Gamma dt}\,a^\dagger\right) + \left(1 - \frac{\Gamma dt}{2}a^\dagger a\right)\rho(t)\left(1 - \frac{\Gamma dt}{2}a^\dagger a\right). \tag{80}$$

For $dt$ small, the annihilation operator can be expanded in terms of
Pauli operators as

$$\sqrt{\Gamma dt}\,a \approx -\frac{i}{2}\left( e^{i\sqrt{\Gamma dt/2}\,q} - e^{-i\sqrt{\Gamma dt/2}\,q} \right) + \frac{1}{2}\left( e^{i\sqrt{\Gamma dt/2}\,p} - e^{-i\sqrt{\Gamma dt/2}\,p} \right). \tag{81}$$

Thus, if the time interval $\Delta t$ between error correction steps is
small compared to the damping time $\Gamma^{-1}$, the displacements
applied to code words are small, and error correction will be effective.

Aside from decoherence, we also need to worry about “unitary errors.”
For example, an inadvertent rotation of the phase of the oscillator
induces the unitary transformation

$$U(\theta) = \exp(i\theta a^\dagger a). \tag{82}$$

Like any unitary transformation, this phase rotation can be expanded in
terms of Pauli operators. It is convenient to introduce the notation
for the phase-space displacement operator

$$D(\gamma) = \exp(\gamma a^\dagger - \gamma^* a) = \exp\left\{ i\sqrt{2}\left[ (\mathrm{Im}\,\gamma)\,q - (\mathrm{Re}\,\gamma)\,p \right] \right\}, \tag{83}$$

where $\gamma$ is a complex number. The displacements satisfy the
identity

$$\mathrm{tr}\left( D(\gamma) D(\eta)^\dagger \right) = \pi\,\delta^2(\gamma - \eta), \tag{84}$$

so the operator $U(\theta)$ can be expanded in terms of displacements as

$$U(\theta) = \frac{1}{\pi}\int d^2\gamma\; u_\theta(\gamma)\, D(\gamma), \tag{85}$$

where

$$u_\theta(\gamma) = \mathrm{tr}\left( U(\theta) D(\gamma)^\dagger \right). \tag{86}$$

Evaluating the trace in the coherent state basis, we find that

$$u_\theta(\gamma) = \frac{i e^{-i\theta/2}}{2\sin(\theta/2)} \exp\left( -\frac{i}{2}|\gamma|^2 \cot(\theta/2) \right). \tag{87}$$

For small $\theta$, the coefficient

$$u_\theta(\gamma) \approx \frac{i}{\theta}\exp\left( -\frac{i}{\theta}|\gamma|^2 \right) \tag{88}$$

has a rapidly oscillating phase, and can be regarded as a distribution
with support concentrated on values of $\gamma$ such that
$|\gamma|^2 \sim \theta$; indeed, formally

$$\lim_{\theta\to 0} u_\theta(\gamma) = \pi\,\delta^2(\gamma). \tag{89}$$

Thus a rotation by a small angle $\theta$ can be accurately expanded in
terms of small displacements; error correction is effective if an
oscillator is slightly overrotated or underrotated.

VIII.  THE  GAUSSIAN  QUANTUM  CHANNEL 

At what rate can error-free digital information be conveyed by a noisy
continuous signal? In classical information theory, an answer is
provided by Shannon's noisy channel coding theorem for the Gaussian
channel [18]. This theorem establishes the capacity that can be
attained by a signal with specified average power, for a channel with
specified bandwidth and specified Gaussian noise power. The somewhat
surprising conclusion is that a nonzero rate can be attained for any
nonvanishing value of the average signal power.

A natural generalization of the Gaussian classical channel is the
Gaussian quantum channel. The Gaussian quantum channel is a Pauli
channel: $N$ oscillators are transmitted, and the $q$ and $p$
displacements acting on the oscillators are independent Gaussian random
variables with mean 0 and variance $\sigma^2$. A code is an
$M$-dimensional subspace of the Hilbert space of the $N$ oscillators,
and the rate $R$ of the code (in qubits) is defined as

$$R = \frac{1}{N}\log_2 M. \tag{90}$$

The quantum-information capacity $C_Q$ of the channel is the maximal
rate at which quantum information can be transmitted with fidelity
arbitrarily close to one.

The need for a constraint on the signal power to define the capacity of
the Gaussian classical channel can be understood on dimensional
grounds. The classical capacity (in bits) is a dimensionless function
of the variance $\sigma^2$, but $\sigma^2$ has dimensions. Another
quantity with the same dimensions as $\sigma^2$ is needed to construct
a dimensionless variable, and the power fulfills this role. But no
power constraint is needed to define the quantum capacity of the
quantum channel. The capacity (in qubits) is a function of the
dimensionless variable $\hbar/\sigma^2$, where $\hbar$ is Planck's
constant.

An upper bound on the quantum capacity of the Gaussian quantum channel
was derived by Holevo and Werner [19]; they obtained (reverting now to
units with $\hbar = 1$)

$$C_Q \le \log_2(1/\sigma^2), \tag{91}$$

for $0 < \sigma^2 < 1$, and $C_Q = 0$ for $\sigma^2 \ge 1$. They also
computed the coherent information $I_Q$ of the Gaussian quantum
channel, and maximized it over Gaussian signal states, finding [19]

$$(I_Q)_{\max} = \log_2(1/e\sigma^2), \tag{92}$$

for $0 < \sigma^2 < 1/e$ (where $e = 2.71828\ldots$). The coherent
information is conjectured to be an attainable rate [1-3]; if this
conjecture is true, then Eq. (92) provides a lower bound on $C_Q$.

Using our continuous variable codes, rigorous lower bounds on $C_Q$ can
be established. For $\sigma^2$ sufficiently small, a nonzero attainable
rate can be established asymptotically for large $N$ by either of two
methods. In one method, the $n = 2$ code described in Sec. IV is
invoked for each oscillator, and concatenated with a binary quantum
code. In the other method, which more closely follows Shannon's
construction, a code for $N$ oscillators is constructed as in Sec. VI,
based on a close packing of spheres in $2N$-dimensional phase space.
However (in contrast to the classical case), neither method works if
$\sigma^2$ is too large. For large $\sigma^2$, encodings can be chosen
that protect against $q$ shifts or against $p$ shifts, but not against
both.

To establish an attainable rate using concatenated coding (the method
that is easier to explain), we first recall a result concerning the
quantum capacities of binary channels [15,20]. If $X$ and $Z$ errors
are independent and each occur with probability $p_e$, then binary CSS
codes exist that achieve a rate

$$R > 1 - 2H_2(p_e) \equiv 1 + 2p_e\log_2 p_e + 2(1-p_e)\log_2(1-p_e); \tag{93}$$

this rate is nonzero for $p_e < .1100$.

Now, for the Gaussian quantum channel, if we use the $n = 2$ continuous
variable code, errors afflicting the encoded qubit are described by a
binary channel with independent $X$ and $Z$ errors. Since the code can
correct shifts in $q$ or $p$ that satisfy
$\Delta q, \Delta p < \sqrt{\pi}/2$, the error probability is

$$p_e < 2\cdot\frac{1}{\sqrt{2\pi\sigma^2}}\int_{\sqrt{\pi}/2}^{\infty} dx\; e^{-x^2/2\sigma^2}. \tag{94}$$


Since the expression bounding $p_e$ in Eq. (94) has the value .110 for
$\sigma \simeq .555$, we conclude that the Gaussian quantum channel has
nonvanishing quantum capacity $C_Q$ provided that

$$\sigma < .555. \tag{95}$$
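The two thresholds quoted above can be reproduced with a few lines of standard-library Python. This is a minimal sketch rather than anything from the original paper; the helper names and the bisection solver are illustrative choices. It uses the fact that the right-hand side of Eq. (94) equals the complementary error function $\mathrm{erfc}\left(\sqrt{\pi}/(2\sqrt{2}\,\sigma)\right)$.

```python
import math

def binary_entropy(p):
    """H2(p) in bits, with H2(0) = H2(1) = 0."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def css_rate_bound(p):
    """Right-hand side of Eq. (93): 1 - 2 H2(p)."""
    return 1.0 - 2.0 * binary_entropy(p)

def square_code_error_bound(sigma):
    """Right-hand side of Eq. (94): probability that a Gaussian shift
    of standard deviation sigma exceeds sqrt(pi)/2 in magnitude."""
    return math.erfc(math.sqrt(math.pi) / (2.0 * math.sqrt(2.0) * sigma))

def bisect(f, lo, hi, iters=200):
    """Root of f on [lo, hi], assuming f(lo) and f(hi) differ in sign."""
    flo = f(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        fmid = f(mid)
        if (fmid > 0) == (flo > 0):
            lo, flo = mid, fmid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Error probability at which the CSS rate bound of Eq. (93) reaches zero.
p_star = bisect(css_rate_bound, 1e-6, 0.5)

# Noise level at which the bound of Eq. (94) reaches that probability.
sigma_star = bisect(lambda s: square_code_error_bound(s) - p_star, 0.1, 2.0)
```

Here `p_star` should land near the value .1100 quoted after Eq. (93), and `sigma_star` near the value .555 of Eq. (95).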

One might expect to do better by concatenating the hexagonal $n = 2$
single-oscillator code with a binary stabilizer code, since the
hexagonal code can correct larger shifts than the code derived from a
square lattice. For the Gaussian quantum channel, the symmetry of the
hexagonal lattice ensures that $X$, $Y$, and $Z$ errors afflicting the
encoded qubit are equally likely. A shift is correctable if it lies
within the “Voronoi cell” of the dual lattice, the cell containing all
the points that are closer to the origin than to any other lattice
site. By integrating the Gaussian distribution over the hexagonal
Voronoi cell, we find that the probability $p_{e,\mathrm{total}}$ of an
uncorrectable error satisfies

$$p_{e,\mathrm{total}} < 1 - \frac{12}{2\pi\sigma^2}\int_0^r dx \int_0^{x/\sqrt{3}} dy\; e^{-(x^2+y^2)/2\sigma^2}, \tag{96}$$

where $r = (\pi/2\sqrt{3})^{1/2}$ is the size of the smallest
uncorrectable shift. For a binary quantum channel with equally likely
$X$, $Y$, and $Z$ errors, it is known [21] that there are stabilizer
codes achieving a nonvanishing rate for $p_{e,\mathrm{total}} < .1905$;
our bound on $p_{e,\mathrm{total}}$ reaches this value for
$\sigma \simeq .547$.
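The hexagonal-cell integral can be evaluated numerically. The sketch below is an illustration under stated assumptions, not the paper's own calculation: it rewrites the twelvefold triangle integral in polar coordinates, where the radial integral can be done in closed form, leaving $(6/\pi)\int_0^{\pi/6} d\phi\,[1 - e^{-r^2/(2\sigma^2\cos^2\phi)}]$ for the probability of a correctable shift. The function names, Simpson step count, and bisection bracket are arbitrary choices.

```python
import math

# Inradius of the hexagonal Voronoi cell for the n = 2 code (units with hbar = 1).
R_HEX = math.sqrt(math.pi / (2.0 * math.sqrt(3.0)))

def hex_error_bound(sigma, steps=2000):
    """One minus the Gaussian weight inside a regular hexagon of inradius R_HEX.

    Each of the 12 triangles spans polar angle 0..pi/6 and radius 0..r/cos(phi);
    the radial integral of the isotropic Gaussian is done analytically.
    """
    def integrand(phi):
        return 1.0 - math.exp(-R_HEX ** 2 / (2.0 * sigma ** 2 * math.cos(phi) ** 2))

    upper = math.pi / 6.0
    h = upper / steps  # steps must be even for composite Simpson
    total = integrand(0.0) + integrand(upper)
    for k in range(1, steps):
        total += (4.0 if k % 2 else 2.0) * integrand(k * h)
    inside = (6.0 / math.pi) * total * h / 3.0
    return 1.0 - inside

def bisect(f, lo, hi, iters=200):
    """Root of f on [lo, hi], assuming f(lo) and f(hi) differ in sign."""
    flo = f(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        fmid = f(mid)
        if (fmid > 0) == (flo > 0):
            lo, flo = mid, fmid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Noise level at which the bound reaches the stabilizer-code threshold .1905.
sigma_hex = bisect(lambda s: hex_error_bound(s) - 0.1905, 0.1, 2.0)
```

With these definitions `sigma_hex` should come out close to the value .547 quoted in the text.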

Somewhat surprisingly, for very noisy Gaussian quantum channels, square
lattice codes concatenated with CSS codes seem to do better than
hexagonal codes concatenated with stabilizer codes. The reason this
happens is that a CSS code can correct independent $X$ and $Z$ errors
that occur with total probability
$p_{e,\mathrm{total}} = p_X + p_Z - p_X\cdot p_Z$, which approaches
$.2079 > .1905$ as $p_X = p_Z \to .1100$. For a given value of
$\sigma$, the qubit encoded in each oscillator will have a lower error
probability if the hexagonal code is used. But if the square lattice is
used, a higher qubit error rate is permissible, and this effect
dominates when the channel is very noisy.

We remark that this analysis is readily extended to more general
Gaussian quantum channels. We may consider Pauli channels acting on a
single oscillator in which the probability distribution
$P(\alpha,\beta)$ is a more general Gaussian function, not necessarily
symmetric in $p$ and $q$. In that case, a symplectic transformation
(one preserving the commutator of $p$ and $q$) can be chosen that
transforms the covariance matrix of the Gaussian to a multiple of the
identity; therefore, this case reduces to that already discussed above.
We may also consider channels acting on $N$ oscillators that apply
shifts in the $2N$-dimensional phase space, chosen from a Gaussian
ensemble. Again there is a symplectic transformation that diagonalizes
the covariance matrix; therefore, this case reduces to $N$ independent
single-oscillator channels, each with its own value of $\sigma^2$.


IX. SYMPLECTIC OPERATIONS

To use these codes for fault-tolerant quantum computation, we will need
to be able to prepare encoded states, perform error recovery, and
execute quantum gates that act on the encoded quantum information. The
most difficult task is encoding; we will postpone the discussion of
encoding until after we have discussed encoded operations and error
recovery.

Suppose, for example, that we have $N$ oscillators, each encoding a
qunit. We wish to apply $U(n^N)$ transformations that preserve the code
subspace of the $N$ qunits. As is typical of quantum codes, we will
find that there is a discrete subgroup of $U(n^N)$ that we can
implement “easily;” but to complete a set of universal gates we must
add further transformations that are “difficult.” In the case of our
continuous variable codes, the easy gates will be accomplished using
linear optical elements (phase shifters and beam splitters), along with
elements that can “squeeze” an oscillator. For the “difficult” gates we
will require the ability to count photons.

The easy gates are the gates in the Clifford group. In general, the
Clifford group of a system of $N$ qunits is the group of unitary
transformations that, acting by conjugation, take tensor products of
Pauli operators to tensor products of Pauli operators (one says that
they preserve the “Pauli group”). Since for $N$ oscillators the tensor
products of Pauli operators have the form (44), the Clifford group
transformations, acting by conjugation, are linear transformations of
the $p$'s and $q$'s that preserve the canonical commutation relations.
Such transformations are called symplectic transformations. The
symplectic group has a subgroup that preserves the photon number

$$(\text{total photon number}) = \sum_{i=1}^{N} a_i^\dagger a_i. \tag{97}$$

The transformations in this subgroup can be implemented with linear
optics [22]. The full symplectic group also contains “squeeze
operators” that take an $a$ to a linear combination of $a$'s and
$a^\dagger$'s; equivalently, the squeeze operators rescale canonical
operators by a real number $\lambda$ along one axis in the quadrature
plane, and by $\lambda^{-1}$ along the conjugate axis, as in (for
example)

$$q_1 \to \lambda q_1, \qquad p_1 \to \lambda^{-1} p_1. \tag{98}$$

With squeezing and linear optics, we can in principle implement any
symplectic transformation.

Aside from the symplectic transformations, we will also assume that it
is easy to do displacements that shift $q$ and $p$ by constants. A
displacement of $q_1$ by $c$ is actually the limiting case of a
symplectic transformation on two oscillators $q_1$ and $q_2$:

$$q_1 \to q_1 + \varepsilon q_2, \qquad p_1 \to p_1 + \varepsilon p_2,$$
$$q_2 \to q_2 - \varepsilon q_1, \qquad p_2 \to p_2 - \varepsilon p_1, \tag{99}$$


where $\varepsilon \to 0$ with $\varepsilon q_2 = c$ held fixed.

Since for the code with stabilizer generators (21) the Pauli operators
acting on our encoded qunits are $\bar X = e^{-ip\alpha}$ and
$\bar Z = e^{2\pi i q/n\alpha}$, the Clifford group transformations
acting on $N$ qunits constitute a subgroup of the symplectic
transformations (including shifts) acting on $N$ oscillators, the
subgroup that preserves a specified lattice in phase space. Thus we can
do any encoded Clifford group gate we please by executing an
appropriate symplectic transformation (possibly including a shift).

A similar comment applies to the case of a qunit encoded in a qudit.
Since the logical Pauli operators are $\bar X = X^{r_1}$ and
$\bar Z = Z^{r_2}$, each Clifford group transformation in the
$n$-dimensional code space is also a Clifford group transformation on
the underlying qudit.

But we must also be sure that our implementation of the Clifford group
is fault tolerant. In previous discussions of quantum fault tolerance
for $[[N,k,2t+1]]$ codes, the central theme has been that propagation
of error from one qudit to another in the same code block must be very
carefully controlled [23,24]. For shift-resistant codes the main issue
is rather different. Since each qudit typically has a (small) error
anyway, propagation of error from one qudit to another is not
necessarily so serious. But what must be controlled is amplification of
errors: gates that turn small errors into large errors should be
avoided.

The Clifford group can be generated by gates that are fault-tolerant in
this sense. The Clifford group for qunits can be generated by three
elements. The SUM gate is a two-qunit gate that acts by conjugation on
the Pauli operators according to

$$\mathrm{SUM}:\quad X_1^a X_2^b \to X_1^a X_2^{b-a}, \qquad Z_1^a Z_2^b \to Z_1^{a+b} Z_2^b. \tag{100}$$

Here qunit 1 is said to be the control of the SUM gate, and qunit 2 is
said to be its target; in the binary ($n = 2$) case, SUM is known as
controlled-NOT, or CNOT. The Fourier gate $F$ acts by conjugation as

$$F:\quad X \to Z, \qquad Z \to X^{-1}; \tag{101}$$

for $n = 2$ the Fourier transform is called the Hadamard gate. The
phase gate $P$ acts as

$$P:\quad X \to (\eta)XZ, \qquad Z \to Z, \tag{102}$$

where the $n$-dependent phase $\eta$ is $\omega^{1/2}$ if $n$ is even
and 1 if $n$ is odd. Any element of the Clifford group can be expressed
as a product of these three generators. (In Ref. [8] another gate $S$
was included among the generators of the Clifford group, but in fact
the $S$ gate can be expressed as a product of SUM gates.)

For an $n$-dimensional system encoded in a continuous variable system,
these Clifford group generators can all be realized as symplectic
transformations. In the case where the stabilizer generators are
symmetric in $q$ and $p$,

$$\bar X = \exp\left( -ip\sqrt{\frac{2\pi}{n}} \right), \qquad \bar Z = \exp\left( iq\sqrt{\frac{2\pi}{n}} \right), \tag{103}$$

the required symplectic transformations are

$$\mathrm{SUM}:\quad q_1 \to q_1, \quad p_1 \to p_1 - p_2, \quad q_2 \to q_1 + q_2, \quad p_2 \to p_2,$$
$$F:\quad q \to p, \quad p \to -q,$$
$$P:\quad q \to q, \quad p \to p - q + c, \tag{104}$$

where the $n$-dependent shift $c$ is 0 for $n$ even and
$\sqrt{\pi/2n}$ for $n$ odd. Under these symplectic transformations,
small deviations of $q$ and $p$ from the stabilizer lattice remain
small; in this sense the transformations are fault tolerant.
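As a quick numerical check, the linear parts of the maps in Eq. (104) can be written as matrices and tested against the condition $S\omega S^T = \omega$. This is a minimal sketch with assumed conventions that are not in the original: coordinates are ordered $(q_1, q_2, p_1, p_2)$ for two oscillators and $(q, p)$ for one, the constant shift $c$ in the phase gate is dropped because it does not affect the linear part, and `worst_case_growth` (the largest absolute row sum) is an illustrative measure of how much a shift component can grow.

```python
def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(col) for col in zip(*X)]

def omega(N):
    """Symplectic form [[0, I], [-I, 0]] for N oscillators."""
    w = [[0] * (2 * N) for _ in range(2 * N)]
    for i in range(N):
        w[i][N + i] = 1
        w[N + i][i] = -1
    return w

def is_symplectic(S, tol=1e-12):
    """True if S omega S^T equals omega, i.e. S preserves the commutators."""
    N = len(S) // 2
    w = omega(N)
    lhs = matmul(matmul(S, w), transpose(S))
    return all(abs(lhs[i][j] - w[i][j]) <= tol
               for i in range(2 * N) for j in range(2 * N))

def worst_case_growth(S):
    """Largest absolute row sum: bounds the growth of the largest shift component."""
    return max(sum(abs(x) for x in row) for row in S)

# Linear parts of Eq. (104); rows give the new (q1, q2, p1, p2) or (q, p).
SUM = [[1, 0, 0, 0],     # q1 -> q1
       [1, 1, 0, 0],     # q2 -> q1 + q2
       [0, 0, 1, -1],    # p1 -> p1 - p2
       [0, 0, 0, 1]]     # p2 -> p2
FOURIER = [[0, 1],       # q -> p
           [-1, 0]]      # p -> -q
PHASE = [[1, 0],         # q -> q
         [-1, 1]]        # p -> p - q  (constant shift c omitted)

def squeeze(lam):
    """Squeeze of Eq. (98): symplectic, but it amplifies shifts along one axis."""
    return [[lam, 0.0], [0.0, 1.0 / lam]]
```

All three generators pass `is_symplectic` and have growth factor at most 2, whereas a strong squeeze such as `squeeze(4.0)` is symplectic but amplifies a shift by the factor $\lambda$.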


FIG. 4. Measurement of the error syndrome. (a) To diagnose the $q$
shift, an ancilla is prepared in the encoded $\bar X = 1$ state, a SUM
gate is executed with the data as control and the ancilla as target,
and the position of the ancilla is measured. (b) To diagnose the $p$
shift, the ancilla is prepared in the $\bar Z = 1$ state, a SUM gate is
executed with the ancilla as control and the data as target, and the
momentum of the ancilla is measured.


X.  ERROR  RECOVERY 

If we are willing to destroy the encoded state, then measuring the
encoded $\bar X$ or $\bar Z$ is easy: we simply conduct a homodyne
measurement of the $q$ or $p$ quadrature of the oscillator. For
example, suppose that we measure $q$ for a state in the code subspace.
If there are no errors and the measurement has perfect resolution, the
only allowed values of $q$ will be integer multiples of $\alpha$. If
there are errors or the measurement is imperfect, classical error
correction can be applied to the outcome, by adjusting it to the
nearest $\alpha\cdot k$, where $k$ is an integer. Then the outcome of
the measurement of $\bar Z$ is $\omega^k$.
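The classical post-processing of a destructive $q$ measurement can be sketched as follows. This is an illustrative example only: the function name is hypothetical, and it assumes $\omega = e^{2\pi i/n}$, so that $\omega^k$ depends only on $k$ modulo $n$.

```python
import cmath
import math

def decode_q_measurement(q_measured, alpha, n):
    """Round a homodyne outcome to the nearest multiple of alpha.

    Returns (k mod n, omega**k, residual), where omega = exp(2*pi*i/n) is
    assumed to be the primitive n-th root of unity and residual is the
    inferred shift q_measured - k*alpha.
    """
    k = round(q_measured / alpha)          # nearest integer multiple of alpha
    residual = q_measured - k * alpha      # what the classical correction removes
    logical = k % n                        # Python's % is nonnegative for n > 0
    eigenvalue = cmath.exp(2j * math.pi * logical / n)
    return logical, eigenvalue, residual

# Hypothetical n = 2 example with alpha = sqrt(pi) (units with hbar = 1).
alpha = math.sqrt(math.pi)
logical_a, eig_a, res_a = decode_q_measurement(3 * alpha + 0.2, alpha, 2)
logical_b, eig_b, res_b = decode_q_measurement(-2 * alpha - 0.3, alpha, 2)
```

In the first example the outcome is adjusted to $3\alpha$, giving an odd $k$; in the second it is adjusted to $-2\alpha$, giving an even $k$.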

To diagnose errors in a coded data state, we must measure the
stabilizer generators. This measurement can be implemented by “feeding”
the errors from the code block to a coded ancilla, and then measuring
the ancilla destructively, following the general procedure proposed by
Steane [25] (see Fig. 4). For example, to measure the generator
$e^{2\pi i q/\alpha}$ (i.e., the value of $q$ modulo $\alpha$), we
prepare the ancilla in the state
$(|\bar 0\rangle + |\bar 1\rangle)/\sqrt{2}$, the equally weighted
superposition of all $|q = s\cdot\alpha\rangle$, $s$ being an integer.
Then a SUM gate is executed with the data as control and the ancilla as
target, acting according to

$$q_1 \to q_1, \qquad q_2 \to q_1 + q_2, \tag{105}$$

where $q_1, q_2$ are the values of $q$ for the data and ancilla
respectively, prior to the execution of the SUM gate. By measuring $q$
of the ancilla, the value of $q_1 + q_2$ is obtained, and this value
modulo $\alpha$ determines the shift that should be applied to the data
to recover from the error.

Similarly, to measure the stabilizer generator $e^{-inp\alpha}$, we
prepare the ancilla in the state $|\bar 0\rangle$, the equally weighted
superposition of all $|p = s\cdot 2\pi/n\alpha\rangle$, $s$ being an
integer. Then a SUM gate is executed with the ancilla as control and
the data as target. Finally, the $p$ quadrature of the ancilla is
measured. The outcome reveals the value of $p_2 - p_1$ prior to the SUM
gate, where $p_1$ is the momentum of the data, and $p_2$ is the
momentum of the ancilla. The measured value modulo $2\pi/n\alpha$ then
determines the shift that should be applied to the data to recover from
the error.
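The reduction of the two syndrome outcomes to recovery shifts can be sketched as below. This is a hedged illustration, not the authors' procedure: the function names are hypothetical, the ancilla is assumed ideal, and the sketch assumes the recovery applies the smallest shift consistent with the syndrome (the representative in $[-\text{period}/2, \text{period}/2)$). The $q$ outcome $q_1 + q_2$ gives the data deviation directly, while the $p$ outcome $p_2 - p_1$ gives it with the opposite sign.

```python
import math

def centered_mod(x, period):
    """Representative of x modulo period in the interval [-period/2, period/2)."""
    return (x + period / 2.0) % period - period / 2.0

def recovery_shifts(q_outcome, p_outcome, alpha, n):
    """Shifts (dq, dp) to apply to the data, given the two ancilla outcomes.

    q_outcome is the measured q1 + q2, so q_outcome mod alpha is the data's
    deviation in q. p_outcome is the measured p2 - p1, so the data's
    deviation in p is -p_outcome modulo 2*pi/(n*alpha).
    Assumption: an ideal ancilla and minimal-shift recovery.
    """
    p_period = 2.0 * math.pi / (n * alpha)
    q_deviation = centered_mod(q_outcome, alpha)
    p_deviation = centered_mod(-p_outcome, p_period)
    return -q_deviation, -p_deviation

# Hypothetical n = 2 example with alpha = sqrt(pi), so both periods equal sqrt(pi).
alpha = math.sqrt(math.pi)
dq, dp = recovery_shifts(5 * alpha + 0.1, 3 * alpha - 0.2, alpha, 2)
```

In the example the data is inferred to sit 0.1 above a lattice point in $q$ and 0.2 above one in $p$, so both recovery shifts are negative.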

Of course, the ancilla used in the syndrome measurement can also be
faulty, resulting in errors in the syndrome and imperfect recovery.
Similarly, the measurement itself will not have perfect resolution, and
the shift applied to recover will not be precisely correct.
Furthermore, as is discussed in Sec. V, the ideal code words are
unphysical nonnormalizable states, so that the encoded information will
always be carried by approximate code words. For all these reasons,
deviations from the code subspace are unavoidable. But if a fresh
supply of ancilla oscillators is continuously available, we can prevent
these small errors from accumulating and eventually damaging the
encoded quantum information.

XI.  UNIVERSAL  QUANTUM  COMPUTATION 

Symplectic transformations together with homodyne measurements are
adequate for Clifford group computation and for error recovery
(assuming we have a supply of encoded states). But to achieve universal
computation in the code space, we need to introduce additional
operations. Fortunately, the quantum optics laboratory offers us
another tool that can be used to go beyond the symplectic computational
model: the ability to count photons.

There are a variety of ways in which photon counting can be exploited
to complete a universal set of fault-tolerant gates. We will describe
two possible ways, just to illustrate how universal fault-tolerant
quantum computation might be realized with plausible experimental
tools. For this discussion, we will consider the binary case $n = 2$.


modulo four; we can measure the eigenvalue of the encoded Hadamard
transformation by counting photons.

In fact the photon number in the code space is even: all code words are
invariant under a 180° rotation in the quadrature plane. Because of
this feature, the preparation of the Hadamard eigenstate has some fault
tolerance built in; if the photon count is off by one, the number will
be odd and an error will be detected. In that case we reject the state
we have prepared and make a new attempt. If the photon number is large,
then obtaining a reliable determination of the photon number modulo
four will require highly efficient photodetection. But on the other
hand, the photon number need not be very large: the mean value of
$a^\dagger a$ is about $\Delta^{-2}$, where $\Delta$ is the squeeze
factor, and we have seen that the intrinsic error rate due to imperfect
squeezing is quite small for $\Delta \sim 1/4$, or
$\langle a^\dagger a\rangle \sim 16$.

An  alternative  to  preparing  an  encoded  EPR  pair  and  de¬ 
structively  measuring  one  member  of  the  pair  is  to  prepare 
1 0 )  and  then  perform  a  quantum  nondemolition  measurement 
of  the  photon  number  modulo  4.  This  might  be  done  by 
coupling  the  oscillator  to  a  two-level  atom  as  proposed  in 
Ref.  [28].  Indeed,  since  only  one  bit  of  information  needs  to 
be  collected  (the  photon  number  is  either  0  or  2  modulo  4), 
the  measurement  could  be  made  in  principle  by  reading  out  a 
single  atom.  Suppose  that  the  coupling  of  oscillator  to  atom 
is  described  by  the  perturbation 


A.  Preparing  a  Hadamard  eigenstate 


H'  =  \a^acrz , 


(109) 


We  can  complete  the  universal  gate  set  if  we  have  the 
ability  to  prepare  eigenstates  of  the  Hadamard  operator  H 
[26,27].  For  this  purpose  it  suffices  to  be  able  to  destructively 
measure  H  of  an  encoded  qudit.  Assuming  we  are  able  to 
prepare  a  supply  of  the  encoded  Z  eigenstate  1 0),  we  can 
make  an  encoded  Einstein-Podolsky-Rosen  (EPR)  pair  using 
symplectic  gates.  Then  by  destructively  measuring  H  for  one 
encoded  qudit  in  the  pair,  we  prepare  the  other  qudit  in  an 
encoded  eigenstate  of  H  with  the  known  eigenvalue. 

But  how  can  we  destructively  measure  HI  The  Hadamard 
gate  acts  by  conjugation  on  the  encoded  Pauli  operators  ac¬ 
cording  to 


where  az  =  —  1  in  the  atomic  ground  state  | g)  and  crz  =  1  in 
the  atomic  excited  state  \e).  By  turning  on  this  coupling  for 
a  time  t  =  7t/4X  ,  we  execute  the  unitary  transformation 

U=  exp[  -  i(ir/4)a^ acrz].  (HO) 

Then  the  atomic  state  (\g)+\e))l ^2  evolves  as 


\J:-^(\g)  +  \e))^^=eia'a7T,4(\g)  +  e-iata7r,2\e)). 


(HD 


H:  X^Z,  Z^X.  (106) 

If  we  use  the  code  that  treats  q  and  p  symmetrically  so  that 
X=  exp(— ip^frr)  and  Z=  exp (iqy/ir),  then  the  Hadamard 
gate  can  be  implemented  by  the  symplectic  transformation. 

q^p,  p^~q  (107) 

(recalling  that  X2  =  Z2  =  I  on  the  code  subspace).  This  trans- 
formation  is  just  the  Fourier  transform 

F:  expj  i—a^a\  (108) 

(where  a  is  the  photon  number),  which  describes  the  natu¬ 
ral  evolution  of  the  oscillator  for  one  quarter  cycle.  Thus  the 
phase  of  the  Hadamard  operator  is  simply  the  photon  number 


By  measuring  the  atomic  state  in  the  basis  (|g)±  \e))/^2, 
we  read  out  the  value  of  the  photon  number  modulo  4  (as¬ 
sumed  to  be  either  2  or  4).  Since  this  is  a  nondemolition 
measurement,  it  can  be  repeated  to  improve  reliability.  By 
measuring  the  photon  number  mod  4  many  times  (perhaps 
with  rounds  of  error  correction  in  between  the  measure¬ 
ments),  we  obtain  a  Hadamard  eigenstate  with  excellent  fi¬ 
delity. 
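The single-atom readout of Eqs. (109)-(111) can be illustrated with a short sketch. It assumes the oscillator has a definite photon number `n` (an idealization) and uses hypothetical helper names; it is a check of the phase arithmetic, not a model of an experiment.

```python
import cmath
import math

def atom_state_after_coupling(n):
    """Amplitudes (g, e) of the atom after U = exp[-i(pi/4) n sigma_z]
    acts on (|g> + |e>)/sqrt(2), with sigma_z = -1 on |g> and +1 on |e>.
    Assumes the oscillator is in a photon-number eigenstate n."""
    g = cmath.exp(1j * math.pi * n / 4) / math.sqrt(2)
    e = cmath.exp(-1j * math.pi * n / 4) / math.sqrt(2)
    return g, e

def readout_probabilities(n):
    """Probabilities of finding the atom in (|g> + |e>)/sqrt(2) and
    (|g> - |e>)/sqrt(2), respectively."""
    g, e = atom_state_after_coupling(n)
    p_plus = abs((g + e) / math.sqrt(2)) ** 2
    p_minus = abs((g - e) / math.sqrt(2)) ** 2
    return p_plus, p_minus

if __name__ == "__main__":
    for n in range(0, 8):
        print(n, readout_probabilities(n))
```

Under these assumptions, photon numbers equal to 0 modulo 4 always give the "+" outcome and photon numbers equal to 2 modulo 4 always give the "-" outcome, while an odd photon number gives either outcome with probability 1/2. A single atom therefore distinguishes the two even cases but does not by itself flag an odd count.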

How does the ability to construct the Hadamard eigenstate enable us to achieve universal quantum computation? We can make contact with constructions that have been described previously in the literature by observing that the Hadamard eigenstate can be transformed by applying symplectic gates to the "$\pi/8$ phase state." First note that the two Hadamard eigenstates can be converted to one another by applying the encoded gate $\bar X\bar Z$, which can be implemented by shifting both $p$ and $q$. Therefore it is sufficient to consider the eigenstate corresponding to the eigenvalue 1,

$$|\psi_{H=1}\rangle = \cos(\pi/8)\,|\bar 0\rangle + \sin(\pi/8)\,|\bar 1\rangle. \tag{112}$$

By applying the symplectic single-qudit gate

$$H\cdot P^{-1} = \frac{1}{\sqrt 2}\begin{pmatrix}1 & 1\\ 1 & -1\end{pmatrix}\begin{pmatrix}1 & 0\\ 0 & -i\end{pmatrix} = \frac{1}{\sqrt 2}\begin{pmatrix}1 & -i\\ 1 & i\end{pmatrix}, \tag{113}$$

we obtain the $\pi/8$ state

$$|\psi_{\pi/8}\rangle = \frac{1}{\sqrt 2}\left(e^{-i\pi/8}|\bar 0\rangle + e^{i\pi/8}|\bar 1\rangle\right). \tag{114}$$

Now this $\pi/8$ state can be used to perform the nonsymplectic phase gate

$$S = \begin{pmatrix}e^{-i\pi/8} & 0\\ 0 & e^{i\pi/8}\end{pmatrix}, \tag{115}$$

which completes the universal gate set [29,30]. The gate is constructed by executing the circuit shown in Fig. 5. We perform a CNOT gate with the arbitrary single-qudit state $|\psi\rangle = a|\bar 0\rangle + b|\bar 1\rangle$ as the control, and the $\pi/8$ phase state as the target; then the target qudit is measured in the basis $\{|\bar 0\rangle, |\bar 1\rangle\}$. If the measurement outcome is $|\bar 0\rangle$ (which occurs with probability 1/2), then the control qudit has become $a e^{-i\pi/8}|\bar 0\rangle + b e^{i\pi/8}|\bar 1\rangle = S|\psi\rangle$ and we are done. If the measurement outcome is $|\bar 1\rangle$, then the control qudit has become $a e^{i\pi/8}|\bar 0\rangle + b e^{-i\pi/8}|\bar 1\rangle$, and we obtain $S|\psi\rangle$ by applying the symplectic single-qudit gate

$$e^{-i\pi/4}P = \begin{pmatrix}e^{-i\pi/4} & 0\\ 0 & e^{i\pi/4}\end{pmatrix}. \tag{116}$$

FIG. 5. Implementation of the $S$ gate. An ancilla is prepared in the state $|\psi_{\pi/8}\rangle$, and a CNOT gate is executed with the data as control and the ancilla as target; then the ancilla is measured in the basis $\{|\bar 0\rangle, |\bar 1\rangle\}$. A $P$ gate is applied to the data conditioned on the measurement outcome.

Completing the universal gate set by measuring the Hadamard transformation has some drawbacks. For one thing, while photon number modulo four corresponds to the Hadamard eigenvalue in the ideal code space, this correspondence will not apply to approximate code words unless they are of a special type.

Recall that the imperfections of the code words arising from finite squeezing can be described by an "embedded error" $|\eta\rangle$ as in Eq. (40); a Gaussian approximate code word has a Gaussian embedded error

$$\eta(u,v) = \frac{1}{\sqrt{\pi\kappa\Delta}}\exp\left(-\frac{1}{2}\left(u^2/\Delta^2 + v^2/\kappa^2\right)\right), \tag{117}$$

where $\Delta$ is the width in $q$ and $\kappa$ is the width in $p$. Symplectic gates act separately on the encoded qudit and the "embedded error" $|\eta\rangle$; for example, the Fourier transform gate and the SUM gate act on the error according to

$$F:\ |u,v\rangle \to |v,-u\rangle,$$
$$\mathrm{SUM}:\ |u_1,v_1;u_2,v_2\rangle \to |u_1,\,v_1+v_2;\,u_2-u_1,\,v_2\rangle. \tag{118}$$

By measuring the photon number modulo 4, we actually measure the product of the eigenvalue of the Hadamard gate acting on the code word and the eigenvalue of $F$ acting on the embedded error. The latter always equals 1 if we use symmetrically squeezed code words, with $\Delta = \kappa$.

Symmetric squeezing is not in itself sufficient to ensure that the measurement of the photon number modulo 4 will prepare the desired encoded Hadamard eigenstate. We also need to consider how the embedded error is affected by the preparation of the EPR pair that precedes the measurement. To prepare the EPR pair, we use the SUM gate. Suppose that we start with two symmetrically squeezed states. Then the SUM gate yields the error wave function

$$\eta'(u_1,v_1;u_2,v_2) \propto \exp\left\{-\frac{1}{2}\left[u_1^2 + (v_1-v_2)^2 + (u_1+u_2)^2 + v_2^2\right]/\Delta^2\right\}. \tag{119}$$

Not only is it not symmetric, but the error is entangled between the two oscillators. The Fourier transform measurement will not give the desired result when applied to either oscillator.

To ameliorate this problem, we could perform error correction after the preparation of the EPR pair and before the measurement, where the error correction protocol has been designed to produce symmetrically squeezed states. Or we could avoid preparing the EPR state by using the nondemolition measurement of photon number modulo 4, as described above.
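The measurement-based $S$ gate of Fig. 5 can be checked on two-component state vectors. The sketch below is illustrative only: it works with ideal logical amplitudes (no embedded error), assumes $S = \mathrm{diag}(e^{-i\pi/8}, e^{i\pi/8})$ with the ancilla in the $\pi/8$ state, and the function names are hypothetical.

```python
import cmath
import math

def s_gate(a, b):
    """Apply S = diag(e^{-i pi/8}, e^{+i pi/8}) to a|0> + b|1>."""
    return a * cmath.exp(-1j * math.pi / 8), b * cmath.exp(1j * math.pi / 8)

def teleport_s(a, b, outcome):
    """Run the Fig. 5 circuit on a normalized data state a|0> + b|1>
    and return the data amplitudes for the given ancilla outcome."""
    # Ancilla prepared in the pi/8 state.
    t0 = cmath.exp(-1j * math.pi / 8) / math.sqrt(2)
    t1 = cmath.exp(1j * math.pi / 8) / math.sqrt(2)
    # Joint amplitudes amp[control][target] before the CNOT.
    amp = [[a * t0, a * t1], [b * t0, b * t1]]
    # CNOT with the data as control: flip the target when control = 1.
    after = [[amp[0][0], amp[0][1]], [amp[1][1], amp[1][0]]]
    # Project the ancilla onto the measured value and renormalize.
    c0, c1 = after[0][outcome], after[1][outcome]
    norm = math.sqrt(abs(c0) ** 2 + abs(c1) ** 2)
    c0, c1 = c0 / norm, c1 / norm
    # Outcome 1 requires the correction e^{-i pi/4} P.
    if outcome == 1:
        c0 *= cmath.exp(-1j * math.pi / 4)
        c1 *= cmath.exp(1j * math.pi / 4)
    return c0, c1

if __name__ == "__main__":
    a, b = 0.6, 0.8j
    print(s_gate(a, b))
    print(teleport_s(a, b, 0))
    print(teleport_s(a, b, 1))
```

For a normalized input, both measurement branches return the same amplitudes as applying $S$ directly, which is the content of the two cases described for Fig. 5.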

B. Preparing a cubic phase state

Now we will describe another way to use photon counting to implement nonsymplectic gates, which is less sensitive to the code word quality. Again, we will complete the universal gate set by constructing the $\pi/8$ phase gate $S$.

For our binary ($n = 2$) code, the code subspace has the basis

$$|\bar 0\rangle = \sum_{s=-\infty}^{+\infty}|q = 2s\alpha\rangle,\qquad |\bar 1\rangle = \sum_{s=-\infty}^{+\infty}|q = (2s+1)\alpha\rangle. \tag{120}$$

(For now we ignore the embedded error due to imperfect squeezing; it will be taken into account later.) An $S$ gate acting on the encoded qudit is implemented (up to an irrelevant overall phase) by the unitary operator

$$W = \exp\left\{\frac{i\pi}{4}\left[2(q/\alpha)^3 + (q/\alpha)^2 - 2(q/\alpha)\right]\right\}. \tag{121}$$

Indeed, we can check that

$$2x^3 + x^2 - 2x \pmod 8 = \begin{cases}0, & \text{if } x = 2s,\\ 1, & \text{if } x = 2s+1.\end{cases} \tag{122}$$

The operator $W$ is the product of a symplectic gate and the cubic phase gate

$$V_\gamma = \exp(i\gamma q^3), \tag{123}$$

where $\gamma = \pi/(2\alpha^3)$. But how do we implement the cubic gate? In fact, if we are able to prepare a "cubic phase state"

$$|\gamma\rangle = \int dx\, e^{i\gamma x^3}|x\rangle, \tag{124}$$

then we can perform the gate $V_\gamma$ by executing the circuit shown in Fig. 6.

FIG. 6. Implementation of the cubic phase gate. An ancilla is prepared in the state $|\gamma\rangle$, and a SUM$^{-1}$ gate is executed with the data as control and the ancilla as target; then the position of the ancilla is measured. A symplectic gate $U(a)$ is then applied to the data, conditioned on the outcome $a$ of the measurement.

To understand how the circuit works, consider the more general problem of implementing a phase gate that acts on the position eigenstates according to

$$V_\phi:\ |q\rangle \to e^{i\phi(q)}|q\rangle \tag{125}$$

(where $\phi(q)$ is a real-valued function), using the prepared phase state

$$|\phi\rangle = \int dx\, e^{i\phi(x)}|x\rangle. \tag{126}$$

If we perform the gate SUM$^{-1}$ with position eigenstate $|q\rangle$ as control and $|\phi\rangle$ as target, and then measure the position of the target obtaining the outcome $|a\rangle$, the state of the control oscillator has become $e^{i\phi(q+a)}|q\rangle$. We can therefore complete the construction of $V_\phi$ by applying the transformation

$$U(a) = e^{i[\phi(q) - \phi(q+a)]}. \tag{127}$$

If the function $\phi(q)$ is cubic, then the argument of the exponential is quadratic and hence $U(a)$ is a symplectic transformation.

Now the problem of implementing universal quantum computation in the code subspace has been reduced to the problem of preparing the cubic phase state $|\gamma\rangle$. We can accomplish this task by preparing an EPR pair, and then performing a suitable photon counting measurement (a nonideal homodyne measurement) on one member of the pair.

Of course, the EPR pair will not be perfect. To be definite, let us suppose (although this assumption is not really necessary) that it is a Gaussian state

$$|\psi(\sigma_p,\sigma_q)\rangle = \left(\frac{\sigma_p}{\pi\sigma_q}\right)^{1/2}\int dq_1\,dq_2\,\exp\left[-\frac{1}{2}\sigma_p^2\left(\frac{q_1+q_2}{2}\right)^2\right]\exp\left[-\frac{1}{2}(q_1-q_2)^2/\sigma_q^2\right]|q_1,q_2\rangle \tag{128}$$

with $\sigma_p, \sigma_q \ll 1$.

Now suppose that the second oscillator is mixed with a coherent light beam, resulting in a large shift in momentum,

$$|\psi\rangle \to e^{iwq_2}|\psi\rangle,\qquad w \gg \sigma_q^{-1},\ \sigma_p^{-1}; \tag{129}$$

then the photon number is measured and $n$ photons are detected. Thus the state of the first oscillator becomes (up to normalization)

$$|\psi^{(n)}\rangle \approx \left(\frac{\sigma_p}{\pi\sigma_q}\right)^{1/2}\int dq_1\,|q_1\rangle\, e^{-\frac{1}{2}\sigma_p^2 q_1^2}\int dq_2\,\varphi_n^*(q_2)\,e^{iwq_2}\,e^{-\frac{1}{2}(q_1-q_2)^2/\sigma_q^2}, \tag{130}$$

where $|\varphi_n\rangle$ denotes the photon number eigenstate, the eigenstate with eigenvalue $n + \frac{1}{2}$ of the Hamiltonian $H = \frac{1}{2}(p^2 + q^2)$.

We can evaluate the $q_2$ integral in Eq. (130) by appealing to the semiclassical approximation. For $q_2$ in the classically allowed region and far from the classical turning points, we may write

$$\varphi_n^*(q_2) \approx \frac{1}{\sqrt{2\pi p(q_2)}}\exp\left[-i\int_0^{q_2}dx\,p(x)\right] + \frac{1}{\sqrt{2\pi p(q_2)}}\exp\left[+i\int_0^{q_2}dx\,p(x)\right], \tag{131}$$

where

$$p(x) = \sqrt{2n+1-x^2}. \tag{132}$$

For $w \gg \sigma_q^{-1}$, the rapid phase oscillations strongly suppress the contribution to the integral arising from the left-moving part of $\varphi_n^*(q_2)$. A contribution from the right-moving part survives provided that

$$|p(q_1) - w| < \sigma_q^{-1}. \tag{133}$$

When this condition is satisfied, it is a reasonable approximation to replace the Gaussian factor $e^{-\frac{1}{2}(q_1-q_2)^2/\sigma_q^2}$ in the $q_2$ integral by $\sqrt{2\pi\sigma_q^2}\,\delta(q_1-q_2)$, so that we obtain

$$|\psi^{(n)}\rangle \approx (2\sigma_p\sigma_q)^{1/2}\int dq_1\,|q_1\rangle\, e^{-\frac{1}{2}\sigma_p^2 q_1^2}\,\frac{1}{\sqrt{2\pi p(q_1)}}\exp\left[-i\int_0^{q_1}dx\,[p(x)-w]\right]. \tag{134}$$

The probability that $n$ photons are detected is given by the norm of this $|\psi^{(n)}\rangle$. The values of $n$ that occur with appreciable probability satisfy Eq. (133) for some $q_1$ with $|q_1| < \sigma_p^{-1}$; thus typical measurement outcomes are in the range

$$n + \frac{1}{2} \approx \frac{1}{2}\left(w \pm \sigma_q^{-1}\right)^2 + \frac{1}{2}\sigma_p^{-2}, \tag{135}$$

with a flat probability distribution

$$\mathrm{Prob}(n) = \langle\psi^{(n)}|\psi^{(n)}\rangle \sim \frac{\sigma_q}{w}. \tag{136}$$

Heuristically, after the momentum shift is applied, the oscillator that is measured has momentum of order $w \pm \sigma_q^{-1}$, and position of order $\sigma_p^{-1}$, so that the value of the energy is $n + \frac{1}{2} = \frac{1}{2}(p^2 + q^2) \sim \frac{1}{2}(w \pm \sigma_q^{-1})^2 + \frac{1}{2}\sigma_p^{-2}$.

For a particular typical outcome of the photon-counting measurement, since $|\psi^{(n)}\rangle$ has its support on $|q_1| < \sigma_p^{-1} \ll w$, we can Taylor expand $p(x)$ about $x = 0$ to express the phase of the wave function $\psi^{(n)}(q_1)$ as

$$\exp\left[-i\int_0^{q_1}\left(\sqrt{2n+1-x^2} - w\right)dx\right] \propto \exp\left[\frac{i}{6\sqrt{2n+1}}\,q_1^3 - i\left(\sqrt{2n+1} - w\right)q_1 + O(q_1^5/w^3)\right]. \tag{137}$$

This is a cubic phase state to good precision if $w$ is large enough.

The coefficient $\gamma'$ of $q_1^3$ in the phase of $\psi^{(n)}(q_1)$ is of order $n^{-1/2}$, while the phase $\gamma$ of the operator $V_\gamma$ that we wish to execute is of order one. However, we can construct $V_\gamma$ from $V_{\gamma'}$ as

$$V_\gamma = S_{\gamma/\gamma'}^{-1}\,V_{\gamma'}\,S_{\gamma/\gamma'}, \tag{138}$$

where $S_r$ is a squeeze operation that acts according to

$$S_r:\ q \to r^{1/3}q,\qquad p \to r^{-1/3}p. \tag{139}$$

Alternatively, we could squeeze the phase state $|\gamma'\rangle$ before we use it to implement the cubic phase gate.

Is this procedure fault tolerant? Before considering the errors introduced during the implementation of the cubic phase gate, we should check that the gate does not catastrophically amplify any preexisting errors. In general, a phase gate can transform a small position shift error into a potentially dangerous momentum shift error. Commuting $V(\phi) = e^{i\phi(q)}$ through the shift operator $e^{-iup}$, we find

$$e^{i\phi(q)}\,e^{-iup} = e^{-iup}\,e^{if_u(q)}\,e^{i\phi(q)}, \tag{140}$$

where $f_u(q) = \phi(q+u) - \phi(q)$; the operator $e^{if_u(q)}$ can be expanded in terms of momentum shift operators of the form $e^{ivq}$ by evaluating the Fourier transform

$$\tilde f_u(v) \propto \int dq\, e^{i[f_u(q) - vq]}. \tag{141}$$

Assuming we use a code where the parameter $\alpha$ is of order one, uncorrectable errors will be likely if $\tilde f_u(v)$ has significant support on values of $v$ that are order one.

Suppose that $V(\phi)$ acts on an approximate code word whose wave function is concentrated on values of $q$ in the domain $|q| < L$. Phase cancellations will strongly suppress $\tilde f_u(v)$, unless the stationary phase condition $f_u'(q) = v$ is satisfied for some value of $q$ in the domain of the approximate code word. Therefore, $V(\phi)$ can propagate a preexisting position shift $u$ to a momentum shift error of magnitude

$$|v| \sim \max_{|q|\lesssim L}|f_u'(q)|. \tag{142}$$

The cubic phase gate needed to implement the encoded $S$ gate is $W = e^{i\phi(q)}$, where $\phi(q) = \pi q^3/2\alpha^3$, so that $f_u(q) = 3\pi u q^2/2\alpha^3 + \cdots$ (ignoring small terms linear and constant in $q$), and $f_u'(q) = 3\pi u q/\alpha^3$; the gate transforms the position shift $u$ to a momentum shift

$$v \sim 3\pi L u/\alpha^3. \tag{143}$$

For $\alpha$ of order one, then, to ensure that $v$ is small we should use approximate code words with the property that the typical embedded position shift $u$ satisfies

$$|u| \ll L^{-1}. \tag{144}$$

In particular, if the approximate code word's embedded errors are Gaussian, where $\kappa$ is the typical size of a momentum shift and $\Delta$ is the typical size of a position shift, we require

$$\Delta \ll \kappa. \tag{145}$$

We assume that shift errors due to other causes are no larger than the embedded error.
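The estimate of Eq. (143) can be checked numerically. The sketch below assumes the cubic phase $\phi(q) = \pi q^3/2\alpha^3$ quoted above and differentiates $f_u(q) = \phi(q+u) - \phi(q)$ by a central difference; the function names and sample values are illustrative assumptions.

```python
import math

def phi(q, alpha):
    """Cubic phase phi(q) = pi q^3 / (2 alpha^3)."""
    return math.pi * q ** 3 / (2 * alpha ** 3)

def f_u(q, u, alpha):
    """Phase picked up when a position shift u is commuted through the gate."""
    return phi(q + u, alpha) - phi(q, alpha)

def momentum_shift(q, u, alpha, h=1e-5):
    """Stationary-phase momentum shift f_u'(q), by central difference."""
    return (f_u(q + h, u, alpha) - f_u(q - h, u, alpha)) / (2 * h)

if __name__ == "__main__":
    alpha, L, u = 1.0, 10.0, 1e-3
    print("numerical f_u'(L):", momentum_shift(L, u, alpha))
    print("3 pi L u / alpha^3:", 3 * math.pi * L * u / alpha ** 3)
```

With these sample values the two numbers agree to well under one percent; the small difference comes from the term $3\pi u^2/2\alpha^3$ that the text neglects. The induced momentum shift stays small only when $u L \ll 1$, which is the criterion (144).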

In the circuit Fig. 6 that implements the cubic phase gate, position shift errors in either the encoded state $|\psi\rangle$ or the ancilla state $|\gamma\rangle$ might cause trouble. A shift by $u$ in $|\psi\rangle$ is transformed to a phase error $e^{if_u(q)}$, and a shift by $u$ in $|\gamma\rangle$ infects $|\psi\rangle$ with a phase error $e^{if_u(q+a)}$. Therefore, we should require that position shift errors in both $|\psi\rangle$ and $|\gamma\rangle$ satisfy the criterion (144), where $L$ is the larger of the two wave packet widths.

When a cubic phase state is prepared by measuring half of an EPR pair, the packet width is of order $\sigma_p^{-1}$ and typical position shift errors have $u \sim \sigma_q$. However, we must also take into account that either the encoded state or the ancilla must be squeezed as in Eq. (139). Suppose that the ancilla is squeezed, by a factor of order $n^{1/6} \sim w^{1/3}$; the wave packet is rescaled so that, after squeezing, the width $L'$ and the typical shifts $u'$ are given by

$$L' \sim \sigma_p^{-1}w^{-1/3},\qquad u' \sim \sigma_q w^{-1/3}. \tag{146}$$

Then the condition $|u'| \ll L'^{-1}$ is satisfied provided that $\sigma_q \ll \sigma_p w^{2/3}$. We also require that the rescaled packet has a large width compared to 1, or $\sigma_p \ll w^{-1/3}$.

For the derivation of Eq. (137), we used the approximations $w\sigma_q \gg 1$ and $w\sigma_p \gg 1$. We also need to check that the remainder terms in the Taylor expansion give rise to a phase error that is acceptably small. This error has the form $e^{if(q_1)}$, where $f(q_1) = O(q_1^5/w^3)$, corresponding to a momentum shift

$$v \sim f'(q_1) \sim \sigma_p^{-4}w^{-3}. \tag{147}$$

Squeezing amplifies this momentum shift error to $v' \sim v w^{1/3} \sim \sigma_p^{-4}w^{-8/3}$, which will be small compared to 1 provided that $\sigma_p \gg w^{-2/3}$. To summarize, our implementation of the cubic phase gate works well if the approximate code words have embedded errors satisfying $\Delta \ll \kappa$, and if widths $\sigma_q$ and $\sigma_p$ of the approximate EPR state satisfy $w \gg \sigma_q^{-1}$ and

$$w^{-1/3} \gg \sigma_p \gg w^{-2/3}. \tag{148}$$

Finally, how accurately must we count the photons? An error $\Delta n$ in the photon number results in a phase error $e^{ivq_1}$ with $|v| \sim n^{-1/2}\Delta n$ in $\psi^{(n)}(q_1)$, which will be amplified by squeezing to $|v'| \sim |v|w^{1/3} \sim n^{-1/3}\Delta n$. Therefore, the precision of the photon number measurement should satisfy

$$\Delta n \ll n^{1/3} \tag{149}$$

to ensure that this error is acceptably small.
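As a cross-check on the construction of this subsection, the arithmetic identity of Eq. (122), which is what makes $W$ act as an $S$ gate on the code subspace, can be verified directly. The sketch assumes the code words are supported on $q = x\alpha$ with integer $x$ (ideal code words), and the helper names are hypothetical.

```python
import cmath
import math

def cubic_exponent_mod8(x):
    """Value of 2x^3 + x^2 - 2x modulo 8 for an integer x = q/alpha."""
    return (2 * x ** 3 + x ** 2 - 2 * x) % 8

def w_phase(x):
    """Phase applied by W = exp{(i pi/4)[2x^3 + x^2 - 2x]} at q = x*alpha."""
    return cmath.exp(1j * math.pi / 4 * (2 * x ** 3 + x ** 2 - 2 * x))

if __name__ == "__main__":
    for x in range(-4, 5):
        print(x, cubic_exponent_mod8(x), w_phase(x))
```

Even $x$ (the support of $|\bar 0\rangle$) gives phase 1 and odd $x$ (the support of $|\bar 1\rangle$) gives phase $e^{i\pi/4}$, so $W$ equals $S$ of Eq. (115) up to the overall phase $e^{i\pi/8}$.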

C.  Purification 

Either of the above two methods could be used to implement
a nonsymplectic phase transformation that completes the
universal gate set. Of course, experimental limitations might
make it challenging to execute the gate with very high fidelity.
One wonders whether it is possible to refine the method to
implement fault-tolerant universal gates of improved fidelity.

In fact, such refinements are possible. We have seen that
we can reach beyond the symplectic transformations and
achieve universal quantum computation if we have a supply
of appropriate "nonsymplectic states" that cannot be created
with the symplectic gates. If the nonsymplectic states have
the right properties, then we can carry out a purification
protocol to distill from our initial supply of noisy
nonsymplectic states a smaller number of nonsymplectic states
with much better fidelity [31,32].


FIG. 7. Construction of the three-qudit gate $\Lambda^2(Z)$. (a) A $\Lambda(P)$
gate can be constructed (up to an overall phase) from two $S$ gates,
an $S^{-1}$ gate, and two CNOT's. The circuit is executed from left to
right. (b) A $\Lambda^2(Z)$ gate can be constructed from two $\Lambda(P)$ gates,
a $\Lambda(P^{-1})$ gate, and two CNOT's.


An example of a nonsymplectic state that admits such a purification protocol is a variant of the state originally introduced by Shor [23], the three-qudit state

$$2^{-3/2}\sum_{a,b,c\in\{0,1\}}(-1)^{abc}\,|a\rangle_1|b\rangle_2|c\rangle_3; \tag{150}$$

it can be characterized as the simultaneous eigenstate of three commuting symplectic operators: $\Lambda(Z)_{1,2}X_3$ and its two cyclic permutations, where $\Lambda(Z)$ is the two-qudit conditional phase gate

$$\Lambda(Z):\ |a,b\rangle \to (-1)^{ab}|a,b\rangle. \tag{151}$$

As Shor explained, this nonsymplectic state can be employed to implement the Toffoli gate

$$T:\ |a,b,c\rangle \to |a,b,c\oplus ab\rangle, \tag{152}$$

and so provides an alternative way to complete the universal gate set.

To purify our supply of nonsymplectic states, symplectic
gates are applied to a pair of nonsymplectic states and then
one of the states is measured. Based on the outcome of the
measurement, the other state is either kept or discarded. If
the initial ensemble of states approximates the nonsymplectic
states with adequate fidelity, then as purification proceeds,
the fidelity of the remaining ensemble converges rapidly
toward one.

The details of the purification protocol will be described elsewhere; here we will only remark that these Shor states can be readily created using symplectic gates and $\pi/8$ phase gates. The Shor state is obtained if we apply the transformation

$$\Lambda^2(Z):\ |a,b,c\rangle \to (-1)^{abc}|a,b,c\rangle \tag{153}$$

to the state

$$H_1H_2H_3\,|0,0,0\rangle = 2^{-3/2}\sum_{a,b,c\in\{0,1\}}|a,b,c\rangle. \tag{154}$$

As shown in Fig. 7, $\Lambda^2(Z)$ can be applied by executing a circuit containing 5 $S$ gates, 4 $S^{-1}$ gates, and 8 CNOT gates.
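Two statements in this subsection are easy to check by direct enumeration: that the state (150) is a $+1$ eigenstate of $\Lambda(Z)_{1,2}X_3$ and its cyclic permutations, and that the gate counts quoted from Fig. 7 add up. The sketch below uses zero-based qudit labels and hypothetical helper names, and it assumes that $\Lambda(P^{-1})$ is built by inverting the $\Lambda(P)$ circuit, which exchanges the roles of $S$ and $S^{-1}$.

```python
import math
from itertools import product

def shor_state():
    """Amplitudes of 2^{-3/2} sum_{a,b,c} (-1)^{abc} |a,b,c>, Eq. (150)."""
    return {(a, b, c): (-1) ** (a * b * c) / math.sqrt(8)
            for a, b, c in product((0, 1), repeat=3)}

def apply_lambda_z_x(state, i, j, k):
    """Apply X on qudit k and the conditional phase Lambda(Z) on qudits i, j.
    The two operations act on different qudits, so their order is immaterial."""
    out = {}
    for bits, amp in state.items():
        new = list(bits)
        new[k] ^= 1                        # X flips qudit k
        sign = (-1) ** (new[i] * new[j])   # Lambda(Z) phase on qudits i, j
        out[tuple(new)] = sign * amp
    return out

def gate_counts():
    """Count S, S^{-1}, and CNOT gates in Lambda^2(Z) per the Fig. 7 caption."""
    lam_p = {"S": 2, "S_inv": 1, "CNOT": 2}
    # Assumption: Lambda(P^{-1}) is the inverse circuit of Lambda(P).
    lam_p_inv = {"S": 1, "S_inv": 2, "CNOT": 2}
    return {
        "S": 2 * lam_p["S"] + lam_p_inv["S"],
        "S_inv": 2 * lam_p["S_inv"] + lam_p_inv["S_inv"],
        "CNOT": 2 * lam_p["CNOT"] + lam_p_inv["CNOT"] + 2,
    }

if __name__ == "__main__":
    psi = shor_state()
    for i, j, k in [(0, 1, 2), (1, 2, 0), (2, 0, 1)]:
        print((i, j, k), apply_lambda_z_x(psi, i, j, k) == psi)
    print(gate_counts())
```

Under the stated assumption the totals come out to 5 $S$ gates, 4 $S^{-1}$ gates, and 8 CNOT gates, in agreement with the count given in the text.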

Therefore, if we can apply symplectic gates accurately,
and are also able to create a supply of $\pi/8$ states of
reasonable fidelity (or can otherwise implement $S$ gates of
reasonable fidelity), then we can use the purification protocol to
implement Toffoli gates with very good fidelity.

XII.  ENCODING 

Now we have discussed how to execute universal quantum
computation fault tolerantly, and how to perform error
recovery. But the discussion has all been premised on the
assumption that we can prepare encoded states. It is finally
time to consider how this can be done. In fact, preparing
simultaneous eigenstates of the stabilizer generators
$\exp(2\pi iq/\alpha)$ and $\exp(-inp\alpha)$ is a challenging task.

For the $[[N,k]]$ stabilizer codes that have been discussed
previously, encoding is not intrinsically difficult in that it can
be accomplished with Clifford group gates. Acting by
conjugation, Clifford group transformations take tensor products
of Pauli matrices to tensor products of Pauli operators. In
particular, there is a Clifford group transformation that takes
the state $|0\rangle^{\otimes N}$ (the simultaneous eigenstate with eigenvalue
one of all $N$ single-qudit $Z$'s) to the encoded $|\bar 0\rangle^{\otimes k}$ (the
simultaneous eigenstate with eigenvalue one of $(N-k)$ stabilizer
generators and $k$ encoded $Z$'s).

Where our codes are different, in both their finite-dimensional
and infinite-dimensional incarnations, is that a
single qudit or oscillator is required to obey two independent
stabilizer conditions, i.e., to be the simultaneous eigenstate
of two independent Pauli operators. Hence there is no Clifford
group encoder. In the continuous variable case, the
problem can be stated in more familiar language: the symplectic
transformations take Gaussian (coherent or squeezed)
states to Gaussian states. Hence no symplectic transformation
can take (say) the oscillator's ground state to a state in
the code subspace.

So encoding requires nonsymplectic operations, and as far
as we know it cannot be accomplished by counting photons
either; we must resort to a nonlinear coupling between
oscillators, such as a $\chi^{(3)}$ coupling. We will describe one
possible encoding scheme: First, we prepare a squeezed state, an
eigenstate of the momentum with $p = 0$. This state is already
an eigenstate with eigenvalue one of the stabilizer generator
$e^{-inp\alpha}$, but not an eigenstate of $e^{2\pi iq/\alpha}$; rather its value of $q$ is
completely indefinite. To obtain an encoded state, we must
project out the component with a definite value of $q$ modulo
$\alpha$.

This can be achieved by coupling the oscillator to another oscillator that serves as a meter, via the perturbation of the Hamiltonian

$$H' = \lambda q\,(b^\dagger b), \tag{155}$$

where $b$ is the annihilation operator of the meter.¹ This perturbation modifies the frequency of the meter,

$$\Delta\omega_{\rm meter} = \lambda q; \tag{156}$$

then if this coupling is turned on for a time $t = 2\pi/\lambda n\alpha$, the phase of the meter advances by

$$\Delta\theta_{\rm meter} = 2\pi q/n\alpha. \tag{157}$$

By reading out the phase, we can determine the value of $q$ modulo $n\alpha$, and apply a shift if necessary to obtain the state with $q = 0$ (mod $n\alpha$), the known state $|\bar 0\rangle$ in the code subspace. (See Fig. 8.)

¹ There is an extensive literature on the experimental realization and applications of this kind of coupling; see [33].

FIG. 8. Preparation of an encoded state. (a) An eigenstate of $p$ is prepared, which has an indefinite value of $q$. (b) The value of $q$ modulo $n\alpha$ is measured, projecting out a state that differs from the encoded $Z$ eigenstate by a shift in $q$.
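The bookkeeping of the encoding step (read the meter phase of Eq. (157), infer $q$ modulo $n\alpha$, then shift) can be sketched as follows. The functions are hypothetical illustrations that assume an ideal, noiseless phase readout; the sample values $n = 2$ and $\alpha = \sqrt{\pi}$ are assumptions chosen only for the example.

```python
import math

def meter_phase(q, n, alpha):
    """Meter phase advance 2*pi*q/(n*alpha), reduced to [0, 2*pi)."""
    return (2 * math.pi * q / (n * alpha)) % (2 * math.pi)

def corrective_shift(theta, n, alpha):
    """Shift in q that returns the oscillator to q = 0 (mod n*alpha),
    given an ideal readout theta of the meter phase."""
    q_mod = theta * n * alpha / (2 * math.pi)  # inferred q modulo n*alpha
    return -q_mod

if __name__ == "__main__":
    n, alpha = 2, math.sqrt(math.pi)   # illustrative values only
    q = 7.3
    theta = meter_phase(q, n, alpha)
    q_new = q + corrective_shift(theta, n, alpha)
    print(q_new / (n * alpha))         # close to an integer
```

The phase determines $q$ only modulo $n\alpha$, which is exactly the information needed: after the shift the position is an integer multiple of $n\alpha$, while nothing has been learned about which multiple.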

Of course, in practice the state squeezed in $p$ prepared in
the first step will be only finitely squeezed, and the
measurement of $q$ modulo $n\alpha$ will have imperfect resolution. If the
squeezed state is Gaussian and the measurement has a
Gaussian acceptance, then this procedure will produce an
approximate code word of the sort described in Sec. V.

If we are able to prepare "good enough" encoded states,
we can distill better ones. The distillation protocol is similar
to the error recovery procedure, but where the ancilla used
for syndrome measurement may be fairly noisy. We might
improve the convergence of the distillation procedure by
discarding the data oscillator if the measurement of the ancilla
oscillator yields a value of $q$ or $p$ that is too distant from the
values allowed by the code stabilizer.

So far, we have described how to prepare encoded states
for the "single-oscillator" codes described in Sec. IV. To
prepare an encoded state for one of the $N$-oscillator codes
described in Sec. VI, we proceed in two steps. First we prepare
each of $N$ oscillators in a single-oscillator encoded state.
Then we apply a symplectic transformation to obtain the
encoded state of the $N$-oscillator code.

A particular known encoded state of a lattice stabilizer
code can itself be regarded as a code with an
$(n=1)$-dimensional code space. Hence it can be characterized
by a self-dual symplectic lattice. For example, the $\bar X = 1$
state of a qunit encoded in a single oscillator is the
simultaneous eigenstate with eigenvalue one of the operators
$e^{-ip\alpha}$ and $e^{2\pi iq/\alpha}$: the state associated with the self-dual
lattice whose basis vectors are $p\alpha/\sqrt{2\pi}$ and $q\sqrt{2\pi}/\alpha$.

One encoded state can be transformed to another by symplectic
gates if there is a symplectic linear transformation
that takes the self-dual lattice associated with the first state to
the self-dual lattice associated with the second. In fact, such
a symplectic transformation exists for any pair of self-dual
lattices.

A linear transformation acting on the $p$'s and $q$'s modifies the generator matrix $M$ of a lattice according to

$$M \to MS; \tag{158}$$

this transformation is symplectic if

$$S\omega S^T = \omega, \tag{159}$$

where

$$\omega = \begin{pmatrix}0 & I\\ -I & 0\end{pmatrix}. \tag{160}$$

We saw in Sec. VI that we can always choose the generator matrix $M$ of a self-dual lattice so that the matrix $A$ has the form

$$A = M\omega M^T = \omega; \tag{161}$$

that is, so that $M$ is a symplectic matrix. Therefore, the generator matrices $M_1$ and $M_2$ of two self-dual lattices can each be chosen to be symplectic; then the linear transformation

$$S = M_1^{-1}M_2 \tag{162}$$

that takes one lattice to the other is also symplectic. Thus, while the task of preparing the encoded states of the single-oscillator codes can be accomplished only by introducing a nonlinear coupling between oscillators, proceeding from single-oscillator encoded states to many-oscillator encoded states can be achieved with linear optical operations and squeezing.
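The closure argument behind Eq. (162) can be illustrated in the smallest case, a single oscillator, where $\omega$ is the $2\times 2$ matrix with entries $(0, 1; -1, 0)$. The two generator matrices below (a squeeze and a shear) are arbitrary illustrative choices, not lattices taken from the text, and exact rational arithmetic is used so that the equality tests are exact.

```python
from fractions import Fraction as Fr

OMEGA = [[Fr(0), Fr(1)], [Fr(-1), Fr(0)]]

def matmul(A, B):
    """Product of two 2x2 matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(A):
    return [[A[j][i] for j in range(2)] for i in range(2)]

def inverse(A):
    """Inverse of an invertible 2x2 matrix."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [[A[1][1] / det, -A[0][1] / det],
            [-A[1][0] / det, A[0][0] / det]]

def is_symplectic(S):
    """Test the condition S omega S^T = omega of Eq. (159)."""
    return matmul(matmul(S, OMEGA), transpose(S)) == OMEGA

if __name__ == "__main__":
    M1 = [[Fr(2), Fr(0)], [Fr(0), Fr(1, 2)]]   # a squeeze (illustrative)
    M2 = [[Fr(1), Fr(1)], [Fr(0), Fr(1)]]      # a shear (illustrative)
    S = matmul(inverse(M1), M2)                # Eq. (162)
    print(is_symplectic(M1), is_symplectic(M2), is_symplectic(S))
    print(matmul(M1, S) == M2)                 # M1 -> M1 S = M2, Eq. (158)
```

Because symplectic matrices form a group, $S = M_1^{-1}M_2$ passes the same test as $M_1$ and $M_2$, and it maps the first generator matrix onto the second.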

XIII.  PHYSICAL  FAULT  TOLERANCE? 

In a physical setting, making use of the continuous variable
quantum error-correcting codes proposed here (or
"digital" quantum codes that have been proposed previously)
is a daunting challenge. We must continually measure
the stabilizer operators (the "error syndrome") to diagnose
the errors; to recover we must apply frequent shifts of the
canonical variables that are conditioned on the measurement
outcomes. Cold ancilla oscillators must be provided that are
steadily consumed by the syndrome measurements. The
ancillas must be discarded (or refreshed) to rid the system of
excess entropy that has been introduced by the accumulated
errors.

An alternative to this complex scheme was suggested in
Ref. [34]. Perhaps we can engineer a quantum system whose
(degenerate) ground state is the code subspace. Then the
natural coupling of the system to its environment will allow
the system to relax to the code space, removing errors
introduced by quantum and thermal noise, or through the
imperfect execution of quantum gates. Such a system, if it could be
built, would be a highly stable quantum memory.


Continuous  variable  coding  suggests  possible  approaches 
to  implementing  this  type  of  physical  fault  tolerance.  For 
example,  the  Hamiltonian 

H  =  2  —  [cosp+  cos(27 rnq)]  (163) 

has  an  n- fold  degenerate  (but  nonnormalizable)  ground  state 
that  is  just  the  code  space  of  a  continuous  variable  code. 
(The  operators  cos  p  and  cos  2 imq  commute  and  can  be  si¬ 
multaneously  diagonalized.)  The  low-lying  states  of  a  real 
system  whose  Hamiltonian  is  a  reasonable  approximation  to 
H  would  resemble  the  approximate  code  words  described  in 
Sec.  V. 
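
The commutation claim in the parenthesis can be checked in a few lines. The following is a sketch only, assuming the convention $[q,p]=i$ (with $\hbar = 1$) and the standard identity $e^{A}e^{B} = e^{B}e^{A}e^{[A,B]}$ for operators whose commutator is a c-number; neither assumption is spelled out in this section.

```latex
% Assumption: [q,p] = i, so [p,q] = -i; n is a positive integer.
\begin{align*}
[\,ip,\; 2\pi i n q\,] &= (i)(2\pi i n)\,[p,q] = 2\pi i n ,\\
e^{ip}\, e^{2\pi i n q} &= e^{2\pi i n q}\, e^{ip}\, e^{2\pi i n}
                         = e^{2\pi i n q}\, e^{ip} ,\\
e^{-ip}\, e^{2\pi i n q} &= e^{2\pi i n q}\, e^{-ip}\, e^{-2\pi i n}
                          = e^{2\pi i n q}\, e^{-ip} .
\end{align*}
% The same holds with q -> -q, so
% cos p = (e^{ip} + e^{-ip})/2 commutes with
% cos(2 pi n q) = (e^{2 pi i n q} + e^{-2 pi i n q})/2.
```

The phase $e^{\pm 2\pi i n}$ equals 1 only because n is an integer, which is why the period of the second cosine must be tied to the code dimension.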

One  possible  way  to  realize  physical  fault  tolerance  is 
suggested  by  the  codes  for  an  electron  in  a  Landau  level, 
described  in  Sec.  III.  The  wave  functions  in  the  code  space 
are  doubly  periodic  with  a  unit  cell  that  encloses  n  flux 
quanta,  where  n  is  the  code’s  dimension.  If  we  turn  on  a 
tunable  periodic  potential  whose  unit  cell  matches  that  of  the 
code,  then  the  Landau  level  is  split  into  n  energy  bands,  and 
the  code  words  are  the  states  with  vanishing  Bloch  momen¬ 
tum.  Therefore,  an  encoded  state  could  be  prepared  by  turn¬ 
ing  on  the  potential,  waiting  for  dissipative  effects  to  cause 
the  electrons  to  relax  to  the  bottom  of  the  lowest  band,  and 
then  adiabatically  turning  off  the  potential.  If  dissipative  ef¬ 
fects  cause  electrons  to  relax  to  the  bottom  of  a  band  on  a 
timescale  that  is  short  compared  to  spontaneous  decay  from 
one  band  to  another,  then  more  general  encoded  states  could 
be  prepared  by  a  similar  method.  Furthermore,  turning  on  the 
potential  from  time  to  time  would  remove  the  accumulated 
Bloch  momentum  introduced  by  errors,  allowing  the  electron 
to  relax  back  to  the  code  space. 

XIV.  CONCLUDING  COMMENTS 

We  have  described  codes  that  protect  quantum  states  en¬ 
coded  in  a  finite-dimensional  subspace  of  the  Hilbert  space 
of  a  system  described  by  continuous  quantum  variables. 
With  these  codes,  continuous  variable  systems  can  be  used 
for  robust  storage  and  fault-tolerant  processing  of  quantum 
information. 

For example, the coded information could reside in the
Hilbert space of a single-particle system described by canonical
quantum variables q and p. In practice, these variables
might describe the states of a mode of the electromagnetic
field in a high-finesse microcavity, or the state of the
center of mass motion of an ion in a trap. Or the continuous
Hilbert space could be the state space of a rotor described by
an angular variable $\theta$ and its conjugate angular momentum
L; in practice, these variables might be the phase and charge
of a superconducting quantum dot. Our coding scheme can
also be applied to a charged particle in a magnetic field.

Our  codes  are  designed  to  protect  against  small  errors  that 
occur  continually — diffusive  drifts  in  the  values  of  the  ca¬ 
nonical  variables.  The  codes  are  less  effective  in  protecting 
against  large  errors  that  occur  rarely.  In  some  settings,  we 
may  desire  protection  against  both  kinds  of  errors.  One  way 
to  achieve  that  would  be  to  concatenate  our  continuous- 
variable  codes  with  conventional  finite-dimensional  quantum 
codes.

When we consider how to manipulate continuous-variable
quantum  information  fault  tolerantly,  the  issues  that  arise  are 
rather  different  than  in  previous  discussions  of  quantum  fault 
tolerance.  With  continuous  variable  codes,  propagation  of 
error  from  one  oscillator  to  another  is  not  necessarily  a  seri¬ 
ous  problem.  More  damaging  are  processes  that  amplify  a 
small  shift  of  the  canonical  variables  to  a  large  shift.  We 
have  described  how  to  implement  a  universal  set  of  fault- 
tolerant  quantum  gates;  with  these,  harmful  error  amplifica¬ 
tion  can  be  avoided  as  the  encoded  state  is  processed. 

Apart  from  encouraging  the  intriguing  possibility  that 
continuous  quantum  variables  might  prove  useful  for  the 
construction  of  robust  quantum  memories  and  computers, 
these  new  quantum  codes  also  have  important  theoretical 
applications.  In  this  paper  we  have  discussed  an  application 
to  the  theory  of  the  quantum  capacity  of  the  Gaussian  quan¬ 
tum channel. Furthermore, quantum codes can be invoked to
investigate the efficacy of quantum cryptographic protocols,
even in cases where the protocol makes no direct use of the
encoded states [35]. With continuous-variable codes, we can
demonstrate the security of key distribution protocols based
on the transmission of continuous variable quantum information.
This application is discussed in a separate paper [36].

We have fabricated and characterized donor-mode nanocavities formed by a single defect cavity
defined within a two-dimensional photonic crystal slab. Quantum dots emitting in the 1.1-1.3
micron range were used as luminescence sources, and a design using fractional edge dislocations
was used to demonstrate well-confined dipole modes with high quality factors. By applying the
fractional dislocation geometry, the measured quality factor could be increased to values as high as
2800. This compares with typical quality factors of around 1500 measured from more conventional
shallow donor mode cavities with larger mode volumes. © 2001 American Institute of Physics.
[DOI: 10.1063/1.1427748]


The combination of a high quality factor (Q) and a small
mode volume is an important attribute of high performance
nanocavities. High Q microcavities will become very
important for control of optical emission processes,1 for
strong coupling experiments, and for many other industrial
applications of novel light emitters. State of the art lithography
and etching enable us to construct microcavities with many
different kinds of geometries, such as micropillars,2,3
microdisks,4 AlOx-apertured vertical cavities,5 and photonic
band-gap cavities.6 Photonic crystal nanocavities7-9 have
recently attracted much attention because such cavities are
predicted to provide the desired combination of both small
mode volume and high Q values. Our photonic crystal cavities
are constructed in thin slabs, which are perforated with
triangular hole arrays that define photonic crystal
mirrors. In the simplest defect cavity designs, regardless of
whether we increase the number of defects forming the cavity,
it has so far not been possible to demonstrate both small
mode volume and high Q. The maximum Q for donor cavities
is around 1500 from extended modes in empty lattice
photonic crystal cavities.10 Previous work to obtain higher Q
values has focused on thin slabs patterned with hexagonal
hole patterns missing several holes within a photonic crystal
lattice. However, those cavities are not suitable for many
applications since they have relatively large mode volumes
and many modes exist within the semiconductor emission
range. Typical well-confined donor mode cavities have small
mode volumes, but also suffer from relatively low Q values,
which are limited by vertical scattering losses. To address the
problem, Vuckovic et al.11 recently predicted by 3D-FDTD
(finite difference time domain) calculations that single defect
cavities with fractional edge dislocations can have well-localized
modes with surprisingly high Q values of up to
30 000. In this letter, we report the experimental demonstration
that, by using fractional edge dislocations in photonic
crystal cavities, it is possible to measure microcavities with
high Q's as well as small mode volumes.

In Fig. 1, we describe the structure used in this work.
Our single defect photonic crystal nanocavity
is constructed within a thin slab of GaAs, which is perforated
with a triangular lattice of holes. A single smaller
hole replaces a larger hole at the center of this slab to
define the optical cavity. This cavity supports doubly degenerate
shallow donor states. In our design, we have modified
this geometry in two important ways: (1) In the x direction,
in line with the smaller hole, ellipsoids rather than circles are
fabricated at lattice spacing (a) with an elongated major axis
of $2r(1 + p/2a')$, where r is the radius of the original hole,
$a' = \sqrt{3}a/2$ is the original distance between adjacent lattice rows in the y
direction, and p/a is an elongation parameter. (2) Holes are
moved away from the x axis by p/2 toward the +y or -y direction.
In the +y or -y direction, the minor axis of the ellipsoids
is still 2r. These two operations correspond to adding a fractional
edge dislocation in the y direction. It also should be
noted that Vuckovic et al.11 used a major axis of the ellipsoids
of 2r+p, which is bigger than the value $2r(1 + p/2a')$ used
in this letter.

We used self-assembled quantum dots as the
light-emitting material, grown on (001) GaAs by molecular
beam epitaxy. Three stacked InAs quantum dot (QD) layers
(QD density: $3 \times 10^{10}$ dots/cm$^2$) were clad by Al0.16Ga0.84As
layers on top of a 400 nm Al0.94Ga0.06As layer. A GaAs cap
layer was then added to protect the top of the final layer. The
cavity thickness (d) is 240 nm. The ground state emission of
the QDs used in this report showed linewidths as narrow as 40
meV. The ground state emission at room temperature is
peaked around 1240 nm.

FIG. 1. Schematic of two-dimensional photonic crystal slab cavity with
fractional edge dislocation.

FIG. 2. Scanning electron microscopy image of photonic
crystal slab with fractional edge dislocation.

The patterns to form the hexagonal-arrayed photonic crystal
defect cavity12 were lithographically defined. The photonic crystal
cavities are surrounded by twenty layers of photonic crystal
for good optical confinement in plane. The lattice spacing
(a) used in this work is lithographically controlled to 370
nm (d/a = 0.65). After lithography, the beam-written patterns
were transferred through the active membrane by using
an Ar+ ion beam assisted with a Cl2 jet, and the
Al0.94Ga0.06As layer under the cavities was subsequently oxidized
in steam to define a perforated dielectric slab structure
on top of an AlOx cladding layer. Then, the AlOx layer was
completely dissolved in potassium hydroxide solution. Figure
2 shows images taken by scanning electron microscopy
(SEM) of photonic crystal cavities containing a single donor
defect with fractional edge dislocations. We fabricated
samples with four different p/a values (p = 0, 0.10a, 0.15a,
and 0.20a). The hole radius r defining the photonic
crystals ranged from 0.28a to 0.29a, whereas the radius of
the smaller hole defining the cavity ranged from 0.20a to
0.23a. In the four samples, the geometry parameters have
slight fluctuations, but the imperfections could be measured
by careful SEM characterization and compared to model results.

Local microphotoluminescence pumping of the center of the
cavity, and measuring the luminescence normal to the surface,
allows us to quickly compare the performance of different
cavity designs. The 830 nm pump laser light was absorbed
by the InAs quantum dots and the wetting layers, and
the pump light could be focused onto the sample with a spot
diameter of 2 μm. Light emission from the photonic crystal
slab surface was then detected by an optical spectrum analyzer.
The peak pumping power was 1.4 mW, which corresponds
to a power density of 45 kW/cm$^2$.
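
The quoted power density follows from the pump power and spot size. The sketch below reproduces the arithmetic under the assumption of a uniformly illuminated circular spot; the function name is illustrative and not from the original text.

```python
import math

def power_density_kw_per_cm2(power_mw, spot_diameter_um):
    """Average power density over a uniformly filled circular spot (assumed profile)."""
    radius_cm = (spot_diameter_um / 2.0) * 1e-4   # 1 um = 1e-4 cm
    area_cm2 = math.pi * radius_cm ** 2
    power_kw = power_mw * 1e-6                    # 1 mW = 1e-6 kW
    return power_kw / area_cm2

# Values quoted in the text: 1.4 mW peak power, 2 um spot diameter.
density = power_density_kw_per_cm2(1.4, 2.0)
print(f"{density:.1f} kW/cm^2")
```

The result, about 44.6 kW/cm$^2$, rounds to the 45 kW/cm$^2$ stated above. A Gaussian beam profile would give a different peak value, so the agreement suggests the quoted figure is a simple area average.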

Figure 3 shows the p/a dependence of the photoluminescence
spectrum measured from four cavities with almost the
same r/a and r'/a. In this case, we selected a 2 nm bandwidth
for the spectrum measurement to detect weak emission
from the samples. The shallow donor (SD1 and SD2) peaks
have distinct y and x polarizations, respectively. By considering
the polarization dependence and simulation results, SD1
and SD2 were found to match numerically predicted shallow
donor modes. We believe that the degeneracy of the shallow
donor modes for p/a = 0 is lifted because of fabrication
fluctuations in our quantum dot nanocavities. In Fig. 3, it can be
observed that, for p/a = 0, only two donor modes are seen,
whereas a group of peaks labeled (X) emerges just above the
SD2 resonance in cavities with higher p/a asymmetries. All
the peaks shift toward longer wavelengths as p/a increases.
However, the shift of the SD1 resonance seems to be more
pronounced than that of the SD2 resonance. This can be understood
by assuming that the added edge dislocation overlaps better
with the SD1 mode than with the SD2 mode. Figure 4 shows
the influence of the p/a asymmetry on the Q as well as the
normalized resonance energy ($a/\lambda$) taken from the SD1
resonances. The SD1 resonance energy decreases monotonically and
almost linearly as p/a increases, for the same reason as described
above. Though the shift is small, the SD2 frequency
dependence has the same trend. On the other hand, Q
continues to increase as p/a increases for this plot. In our
samples, a maximum Q was obtained at p/a = 0.20. Typical
spectra taken from the nanocavities are shown in Fig. 5,
where the resolution width of the spectrometer was set to 0.2
nm. The measured Q was as high as 2800. Therefore, by
adding a fractional edge dislocation, we could increase the
measured Q to twice the value measured from a symmetric
cavity. To compare the measured Q with our simulation results,
we have carefully measured the geometries of our fabricated
structures, and modeled these with our three-dimensional
FDTD code. Indeed, we find that the calculated
Q values for our experimentally realized geometries (of
4400) are much closer to the measured values, and that the
simulated mode volume is $0.43(\lambda/n)^3$.

FIG. 3. PL spectrum of samples with different elongation parameter (p/a)
ranging from 0 to 0.20. (Horizontal axis: wavelength in nm.)

FIG. 4. Elongation parameter (p/a) dependence of Q and frequency ($a/\lambda$).

FIG. 5. PL spectra of the sample with Q of 2800.

To  confirm  that  the  measured  mode  is  localized  to  the 
defect  region,  we  also  changed  the  spatial  location  of  pump¬ 
ing  position  to  several  points  ten  lattice  layers  away  from  the 
central  hole.12  We  could  not  see  the  SD1  resonance  in  such 
measurements,  which  leads  us  to  believe  that  the  shallow 
donor  modes  are  well  confined.  This  is  a  strong  indicator  that 
the  measured  cavity  mode  has  both  a  high  Q  as  well  as  a 
small  mode  volume. 

In order to verify whether the resonances appear in an
appropriate range of emission, we have compared the observed
spectra with spectra obtained by 3D-FDTD simulations,
as shown in Fig. 6. For the simulation, spectral intensities
at several points in the middle of the slab were added.
Therefore, the simulated spectra reflect the modes confined in the slab,
while the detected spectra reflect the leaky part of the modes.
That is why we also observe inhomogeneous emission from
the QD layer in the spectrum. The SD1 and SD2 resonance
frequencies are very similar to the values predicted by the simulation.
As the QD emission covers only the SD1, SD2, and
air band region, we did not see any of the prominent peaks around
$a/\lambda = 0.33$ that are shown in the simulated Fourier spectra.
However, we believe those modes do not leak into air significantly,
since they are guided modes, and the original intensity
of the QD emission in this frequency range is too
weak to be observed. Therefore, we conclude that the measured
spectra show good agreement with the simulated
ones. Though our predictions indicate that Q values of 30 000
are possible in geometries similar to the ones presented
here, our measurement results are enough to show the advantage
of using the fractional edge dislocation over the conventional
designs. We believe it will become feasible to obtain
the high Q predicted once the optimum photonic crystal parameters
are precisely fabricated, and we are presently working
on more precise fabrication and similar designs
to obtain even higher measured quality factors.

FIG. 6. Simulated Fourier spectrum of optical modes around a band gap.

In summary, we have characterized donor modes supported
in single defect nanocavities defined within two-dimensional
photonic crystal slabs with fractional edge dislocations.
The mode modulated by the dislocation was found
to be well-confined around the single defect as predicted.
High quality factors of up to 2800 were measured by luminescence
from quantum dot emitters at 1.2 μm. These values
were significantly higher than the values of 1500 measured
for the simpler symmetric shallow donor mode cavities without
the fractional edge dislocation. Our measured Q values
were limited by fabrication precision and fine design optimization,
and these results indicate that further improvement of
Q is very likely in these kinds of geometries.

Conversion of conventional gravitational-wave interferometers into quantum nondemolition

The LIGO-II gravitational-wave interferometers (ca. 2006-2008) are designed to have sensitivities near the
standard  quantum  limit  (SQL)  in  the  vicinity  of  100  Hz.  This  paper  describes  and  analyzes  possible  designs  for 
subsequent  LIGO-III  interferometers  that  can  beat  the  SQL.  These  designs  are  identical  to  a  conventional  broad 
band  interferometer  (without  signal  recycling),  except  for  new  input  and/or  output  optics.  Three  designs  are 
analyzed:  (i)  a  squeezed-input  interferometer  (conceived  by  Unruh  based  on  earlier  work  of  Caves)  in  which 
squeezed vacuum with frequency-dependent (FD) squeeze angle is injected into the interferometer’s dark port;
(ii) a variational-output interferometer (conceived in a different form by Vyatchanin, Matsko and Zubova), in
which homodyne detection with FD homodyne phase is performed on the output light; and (iii) a squeezed-variational
interferometer with squeezed input and FD-homodyne output. It is shown that the FD squeezed-input
light can be produced by sending ordinary squeezed light through two successive Fabry-Perot filter
cavities before injection into the interferometer, and FD-homodyne detection can be achieved by sending the
output light through two filter cavities before ordinary homodyne detection. With anticipated technology
(power squeeze factor $e^{-2R} = 0.1$ for input squeezed vacuum and net fractional loss of signal power in arm
cavities and output optical train $\epsilon_* = 0.01$) and using an input laser power $I_0$ in units of that required to reach
the SQL (the planned LIGO-II power, $I_{\rm SQL}$), the three types of interferometer could beat the amplitude SQL at
100 Hz by the following amounts $\mu \equiv \sqrt{S_h}/\sqrt{S_h^{\rm SQL}}$ and with the following corresponding increase $V = 1/\mu^3$ in
the volume of the universe that can be searched for a given noncosmological source. Squeezed input:
$\mu \simeq \sqrt{e^{-2R}} \simeq 0.3$ and $V \simeq 1/0.3^3 \simeq 30$ using $I_0/I_{\rm SQL} = 1$. Variational-output: $\mu \simeq \epsilon_*^{1/4} \simeq 0.3$ and $V \simeq 30$, but only if
the optics can handle a ten times larger power, $I_0/I_{\rm SQL} \simeq 1/\sqrt{\epsilon_*} = 10$. Squeezed-variational:
$\mu = 1.3(e^{-2R}\epsilon_*)^{1/4} \simeq 0.24$ and $V \simeq 80$ using $I_0/I_{\rm SQL} = 1$; and $\mu \simeq (e^{-2R}\epsilon_*)^{1/4} \simeq 0.18$ and $V \simeq 180$ using
$I_0/I_{\rm SQL} = \sqrt{e^{-2R}/\epsilon_*} \simeq 3.2$.
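
The figures quoted in the abstract can be reproduced directly from its scaling formulas. The sketch below is illustrative only: the function and variable names are assumptions, and the 1.3 prefactor for the unit-power squeezed-variational case is simply taken from the abstract rather than derived.

```python
import math

E2R = 0.1    # power squeeze factor e^{-2R} quoted in the abstract
EPS = 0.01   # net fractional loss epsilon_* quoted in the abstract

def volume_gain(mu):
    """Increase in searchable volume, V = 1/mu^3."""
    return 1.0 / mu ** 3

# Each entry: (mu, required input power in units of I_SQL).
designs = {
    "squeezed input": (math.sqrt(E2R), 1.0),
    "variational output": (EPS ** 0.25, 1.0 / math.sqrt(EPS)),
    "squeezed variational, unit power": (1.3 * (E2R * EPS) ** 0.25, 1.0),
    "squeezed variational, optimal power": ((E2R * EPS) ** 0.25,
                                            math.sqrt(E2R / EPS)),
}

for name, (mu, power) in designs.items():
    print(f"{name}: mu = {mu:.2f}, V = {volume_gain(mu):.0f}, "
          f"I0/I_SQL = {power:.1f}")
```

The unrounded values are close to, but not identical with, the rounded figures in the abstract (for example, $\mu \approx 0.32$ gives $V \approx 32$ rather than exactly 30), which is consistent with the abstract quoting one significant figure.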

I. INTRODUCTION AND SUMMARY

In  an  interferometric  gravitational-wave  detector,  laser 
light  is  used  to  monitor  the  motions  of  mirror-endowed  test 
masses,  which  are  driven  by  gravitational  waves  h(t).  The 
light  produces  two  types  of  noise:  photon  shot  noise ,  which  it 
superposes  on  the  interferometer’s  output  signal,  and  fluctu¬ 
ating  radiation-pressure  noise ,  by  which  it  pushes  the  test 
masses in a random manner that can mask their gravity-wave-induced
motion. The shot-noise spectral density scales
with the light power $I_0$ entering the interferometer as
$S_h^{\rm shot} \propto 1/I_0$; the radiation-pressure noise scales as $S_h^{\rm rp} \propto I_0$.

In  the  first  generation  of  kilometer-scale  interferometers 
[e.g.,  the  Laser  Interferometric  Gravitational  Wave  Observa¬ 
tory’s  LIGO-I  interferometers,  2002-2003  [1]],  the  laser 
power  will  be  low  enough  that  shot-noise  dominates  and 
radiation-pressure  noise  is  unimportant.  Tentative  plans  for 
the next generation interferometers (LIGO-II, ca. 2006-2008)
include increasing $I_0$ to the point that $S_h^{\rm rp} = S_h^{\rm shot}$ at the
interferometers’ optimal gravitational-wave frequency,
$\Omega/2\pi \simeq 100$ Hz. The resulting net noise
$S_h = S_h^{\rm rp} + S_h^{\rm shot} = 2S_h^{\rm shot}$ is the lowest that can be achieved with conventional
interferometer designs. Further increases of light power will
drive the radiation-pressure noise upward, increasing the net
noise, while reductions of light power will drive the shot
noise upward, also increasing the net noise.
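
The trade-off described above can be illustrated numerically. This is a minimal sketch, assuming only the two scalings stated in the text (shot noise proportional to $1/I_0$, radiation-pressure noise proportional to $I_0$) and normalizing both so that they are equal at $I_0 = I_{\rm SQL}$; the names are illustrative.

```python
def net_noise(x, s_shot_at_sql=1.0):
    """Net noise spectral density versus x = I0 / I_SQL.

    Assumed model: shot noise ~ 1/x, radiation-pressure noise ~ x,
    with both equal to s_shot_at_sql when x = 1.
    """
    shot = s_shot_at_sql / x
    radiation_pressure = s_shot_at_sql * x
    return shot + radiation_pressure

# Scan input powers from 0.25 to 4.0 times I_SQL.
powers = [0.25 * k for k in range(1, 17)]
best = min(powers, key=net_noise)

print(best, net_noise(best))   # minimum sits at x = 1 with noise 2 * S_shot
print(net_noise(0.5), net_noise(2.0))  # both sides of the minimum are noisier
```

Raising or lowering the power away from $x = 1$ increases the sum, which is the statement that the net noise $2S_h^{\rm shot}$ at equal shot and radiation-pressure noise is the floor for a conventional design.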

This minimum achievable noise is called the “standard
quantum limit” (SQL) [2] and is denoted $S_h^{\rm SQL} \equiv h_{\rm SQL}^2$. It can
be regarded as arising from the effort of the quantum properties
of the light to enforce the Heisenberg uncertainty principle
on the interferometer test masses, in just the manner of
the Heisenberg microscope. Indeed, a common derivation of
the SQL is based on the uncertainty principle for the test
masses’ position and momentum [3]: The light makes a sequence
of measurements of the difference $x$ of test-mass positions.
If a measurement is too accurate, then by state reduction
it will narrow the test-mass wave function so tightly ($\Delta x$
very small) that the momentum becomes highly uncertain
(large $\Delta p$), producing a wave function spreading that is so
rapid as to create great position uncertainty at the time of the
next measurement. There is an optimal accuracy for the first
measurement: an accuracy that produces only a factor $\sqrt{2}$
spreading and results in optimal predictability for the next
measurement. This optimal accuracy corresponds to $h_{\rm SQL}$.

Despite this apparent intimate connection of the SQL to
test-mass quantization, it turns out that the test-mass quantization
has no influence whatsoever on the output noise in
gravitational-wave interferometers [4]. The sole forms of
quantum noise in the output are photon shot noise and photon
radiation-pressure noise.$^1$

Braginsky (the person who first recognized the existence
of the SQL for gravitational-wave detectors and other high-precision
measuring devices [5]) realized, in the mid 1970s,
that  the  SQL  can  be  overcome,  but  to  do  so  would  require 
significant  modifications  of  the  experimental  design.  Bragin¬ 
sky  gave  the  name  quantum  nondemolition  (QND)  to  de¬ 
vices  that  can  beat  the  SQL;  this  name  indicates  the  ability  of 
QND  devices  to  prevent  their  own  quantum  properties  from 
demolishing  the  information  one  is  trying  to  extract  [6]. 

The  LIGO-I  interferometers  are  now  being  assembled  at 
the LIGO sites, in preparation for the first LIGO
gravitational-wave searches. In parallel, the LIGO scientific
community  (LSC)  is  deeply  immersed  in  research  and  devel¬ 
opment  for  the  LIGO-II  interferometers  [7],  and  a  small  por¬ 
tion  of  the  LSC  is  attempting  to  invent  practical  designs  for 
the  third  generation  of  interferometers,  LIGO-III.  This  paper 
is  a  contribution  to  the  LIGO-III  design  effort. 

In  going  from  LIGO-II  to  LIGO-III,  a  large  number  of 
noise  sources  must  be  reduced.  Perhaps  the  most  serious  are 
the  photon  shot  noise  and  radiation  pressure  noise  (“optical 
noise”),  and  thermal  noise  in  the  test  masses  and  their  sus¬ 
pensions  [7,8].  In  this  paper  we  shall  deal  solely  with  the 
shot  noise  and  radiation  pressure  noise  (and  the  associated 
SQL);  we  shall  tacitly  assume  that  all  other  noise  sources, 
including  thermal  noise,  can  be  reduced  sufficiently  to  take 
full  advantage  of  the  optical  techniques  that  we  propose  and 
analyze. 

Because LIGO-II is designed to operate at the SQL, in
moving to LIGO-III there are just two ways to reduce the
optical noise: increase the masses m of the mirrored test
masses (it turns out that $h_{\rm SQL} \propto 1/\sqrt{m}$), or redesign the interferometers
so they can perform QND. The transition from
LIGO-I to LIGO-II will already (probably) entail a mass increase,
from m = 11 kg to m = 30 kg, in large measure because
the SQL at 11 kg was unhappily constraining [7]. Any
large further mass increase would entail great danger of unacceptably
large noise due to energy coupling through the
test-mass suspensions and into or from the overhead supports
(the seismic isolation system); a larger mass would also entail
practical problems due to the increased test-mass dimensions.
Accordingly, there is strong motivation for trying to
pursue the QND route.

$^1$In brief, the reasons for this are the following: The interferometer’s
measured output, in general, is one quadrature of the electric
field [the quadrature defined in Eqs. (54) and (10) below], and this output
observable commutes with itself at different times by virtue of Eqs. (7)
with $a \to b$. This means that the digitized data points (collected at a
rate of 20 kHz) are mutually commuting Hermitian observables.
One consequence of this is that reduction of the state of the interferometer
due to data collected at one moment of time will not
influence the data collected at any later moment of time. Another
consequence is that, when one Fourier analyzes the interferometer
output, one puts all information about the initial states of the test
masses into data points near zero frequency, and when one then
filters the output to remove low-frequency noise (noise at
$f = \Omega/2\pi \lesssim 10$ Hz), one thereby removes from the data all information
about the test-mass initial states; the only remaining test-mass
information is that associated with Heisenberg-picture changes of
the test-mass positions at $f \gtrsim 10$ Hz, changes induced by external
forces: light pressure (which is quantized) and thermal- and
seismic-noise forces (for which quantum effects are unimportant).
See Ref. [4] for further detail.

Our  Caltech  and  Moscow  University  research  groups  are 
jointly  exploring  three  approaches  to  QND  interferometer 
design: 

First: The conversion of conventional interferometers into
QND interferometers by modifying their input and/or output
optics (this paper). This approach achieves QND by creating
and manipulating correlations between photon shot noise and
radiation pressure noise; see below. It is the simplest of our
three approaches, but has one serious drawback: an uncomfortably
high light power, $W_{\rm circ} \gtrsim 1$ MW, that must circulate
inside the interferometers’ arm cavities [9]. It is not clear
whether the test-mass mirrors can be improved sufficiently to
handle this high a power in a sufficiently noise-free way.

Second: A modification of the interferometer design (including
using two optical cavities in each arm) so as to make
its output signal be proportional to the relative speeds of the
test masses rather than their relative positions [10,11]. Since
the test-mass speed is proportional to momentum, and momentum
(unlike position) is very nearly conserved under
free test-mass evolution on gravity-wave time scales
($\sim 0.01$ sec), the relative speed is very nearly a “QND observable”
[12] and thus is beautifully suited to QND measurements.
Unfortunately, the resulting speed-meter interferometer,
like our input-output-modified interferometers,
suffers from a high circulating light power [9], $W_{\rm circ} \gtrsim 1$ MW.

Third: Radical redesigns of the interferometer aimed at
achieving QND performance with $W_{\rm circ}$ well below 1 MW
[13]. These, as currently conceived by Braginsky, Gorodetsky
and Khalili, entail transferring the gravitational-wave
signal to a single, small test mass via light pressure, and
using a local QND sensor to read out the test mass’s motions
relative to a local inertial frame.

In this paper we explore the first approach. The foundation
for this approach is the realization that: (i) photon shot
noise and radiation-pressure noise together enforce the SQL
only if they are uncorrelated; see, e.g., Ref. [4]; (ii) whenever
carrier light with side bands reflects off a mirror (in our
case, the mirrors of an interferometer’s arm cavities), the
reflection ponderomotively squeezes the light’s side bands,
thereby creating correlations between their radiation-pressure
noise in one quadrature and shot noise in the other; (iii) these
correlations are not accessed by a conventional interferometer
because of the particular quadrature that its photodiode
measures; (iv) however, these correlations can be accessed
by (conceptually) simple modifications of the interferometer’s
input and/or output optics, and by doing so one can
beat the SQL. These correlations were first noticed explicitly
by Unruh [14], but were present implicitly in Braginsky’s
earlier identification of the phenomenon of ponderomotive
squeezing [15,16].

FIG. 1. Schematic diagram of a squeezed-input interferometer.

In this paper we study three variants of QND interferometers that rely on ponderomotive-squeeze correlations:

(i) Squeezed-input interferometer. Unruh [14] (building on earlier work of Caves [17]) invented this design nearly 20 years ago, and since then it has been reanalyzed by several other researchers [18,19]. In this design, squeezed vacuum is sent into the dark port of the interferometer (“modified input”) and the output light is monitored with a photodetector as in conventional interferometers.

For a broad-band squeezed-input interferometer, the squeeze angle must be a specified function of frequency that changes significantly across the interferometer’s operating gravity-wave band. (This contrasts with past experiments employing squeezed light to enhance interferometry [20,21], where the squeeze angle was constant across the operating band.) Previous papers on squeezed-input interferometers have ignored the issue of how, in practice, one might achieve the required frequency-dependent (FD) squeeze angle. In Sec. V C, we show that it can be produced via ordinary, frequency-independent squeezing (e.g., by nonlinear optics [22]), followed by filtration through two Fabry-Perot cavities with suitably adjusted bandwidths and resonant-frequency offsets from the light’s carrier frequency. A schematic diagram of the resulting squeezed-input interferometer is shown in Fig. 1 and is discussed in detail below. Our predicted performance for such an interferometer agrees with that of previous research.

(ii) Variational-output interferometer. Vyatchanin, Matsko and Zubova invented this design conceptually in the early 1990s [23-25]. It entails a conventional interferometer input (ordinary vacuum into the dark port), but a modified output: instead of photodetection, one performs homodyne detection with a homodyne phase that depends on frequency in essentially the same way as the squeeze angle of a squeezed-input interferometer. Vyatchanin, Matsko and Zubova did not know how to achieve FD homodyne detection in practice, so they proposed approximating it by homodyne detection with a time-dependent (TD) homodyne phase. Such TD homodyne detection can beat the SQL, but (by contrast with FD homodyne) it is not well-suited to gravitational-wave searches, where little is known in advance about the gravitational waveforms or their arrival times. In this paper (Sec. V and Appendix C), we show that the desired FD homodyne detection can be achieved by sending the interferometer’s output light through two successive Fabry-Perot cavities that are essentially identical to those needed in our variant of a squeezed-input interferometer, and by then performing conventional homodyne detection with fixed homodyne angle. A schematic diagram of the resulting variational-output interferometer is shown in Fig. 2.

PHYSICAL REVIEW D 65 022002

FIG. 2. Schematic diagram of a squeezed-variational interferometer. A variational-output interferometer differs from this solely by replacing the input squeezed vacuum by ordinary vacuum.

(iii) Squeezed-variational interferometer. This design (not considered in the previous literature²) is the obvious combination of the first two; one puts squeezed vacuum into the dark port and performs FD homodyne detection on the output light. The optimal performance is achieved by squeezing the input at a fixed (frequency-independent) angle; filtration cavities are needed only at the output (for the FD homodyne detection) and not at the input; cf. Fig. 2.

² A design similar to it has previously been proposed and analyzed [24] for a simple optical meter, in which the position of a movable mirror (test mass) is monitored by measuring the phase or some other quadrature of a light wave reflected from the mirror. In this case it was shown that the SQL can be beat by a combination of phase-squeezed input light and TD homodyne detection.

In Sec. IV we compute the spectral density of the noise for all three designs, ignoring the effects of optical losses. We find (in agreement with previous analyses [18,19]) that, when the FD squeeze angle is optimized, the squeezed-input interferometer has its shot noise and radiation-pressure noise both reduced in amplitude (at fixed light power) by $e^{-R}$, where $R$ is the (frequency-independent) squeeze factor; see Fig. 4 below. This enables a lossless squeezed-input interferometer to beat the SQL by a factor $e^{-R}$ (when the power is optimized) but no more. By contrast, the lossless, variational-output interferometer, with optimized FD homodyne phase, can have its radiation-pressure noise completely removed from the output signal, and its shot noise will scale with light power as $1/I_0$ as for a conventional interferometer. As a result, the lossless variational-output interferometer can beat the SQL in amplitude by $\sqrt{I_{\rm SQL}/2I_0}$, where $I_{\rm SQL}$ is the light power required by a conventional interferometer to reach the SQL. The optimized, lossless, squeezed-variational interferometer has its radiation-pressure noise completely removed, and its shot noise reduced by $e^{-R}$, so it can beat the SQL in amplitude by $e^{-R}\sqrt{I_{\rm SQL}/2I_0}$.
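The lossless scalings quoted above can be evaluated numerically. The following is a minimal sketch, assuming the illustrative values $e^{-2R} = 0.1$ and $I_0 = 10\,I_{\rm SQL}$ (the ones used for the noise curves of Fig. 4); the function name `lossless_sql_factors` is hypothetical and not part of the paper.

```python
import math

def lossless_sql_factors(power_squeeze, power_ratio):
    """Amplitude factors by which each lossless design beats the SQL.

    power_squeeze : e^{-2R}, the power squeeze factor (assumed value)
    power_ratio   : I_0 / I_SQL for the homodyne-based designs (assumed value)
    """
    e_minus_r = math.sqrt(power_squeeze)               # e^{-R}
    variational = math.sqrt(1.0 / (2.0 * power_ratio))  # sqrt(I_SQL / 2 I_0)
    return {
        # squeezed-input: e^{-R}, reached when the power is optimized
        "squeezed_input": e_minus_r,
        # variational-output: sqrt(I_SQL / 2 I_0)
        "variational_output": variational,
        # squeezed-variational: e^{-R} sqrt(I_SQL / 2 I_0)
        "squeezed_variational": e_minus_r * variational,
    }

factors = lossless_sql_factors(power_squeeze=0.1, power_ratio=10.0)
for name, value in factors.items():
    print(f"{name}: {value:.3f}")
```

With these assumed inputs the squeezed-variational factor is simply the product of the other two, which is the content of the last sentence of the paragraph above.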

Imperfections in squeezing, in the filter cavities, and in the homodyne local-oscillator phase will produce errors $\Delta\lambda$ in the FD squeeze angle $\lambda(\Omega)$ of a squeezed-input or squeezed-variational interferometer, and $\Delta\zeta$ in the FD homodyne phase $\zeta(\Omega)$ of a variational-output or squeezed-variational interferometer. At the end of Sec. VI E, we shall show that, to keep these errors from seriously compromising the most promising interferometer’s performance, $|\Delta\lambda|$ must be no larger than ~0.05 radian, and $|\Delta\zeta|$ must be no larger than ~0.01 radian. This translates into constraints of order five percent on the accuracies of the filter cavity finesses and about 0.01 on their fractional frequency offsets and on the homodyne detector’s local-oscillator phase.

The performance will be seriously constrained by unsqueezed vacuum that leaks into the interferometer’s optical train at all locations where there are optical losses, whether those losses are fundamentally irreversible (e.g., absorption) or reversible (e.g., finite transmissivity of an arm cavity’s end mirror). We explore the effects of such optical losses in Sec. VI. The dominant losses and associated noise production occur in the interferometer’s arm cavities and FD filter cavities. The filter cavities’ net losses and noise will dominate unless the number of bounces the light makes in them is minimized by making them roughly as long as the arm cavities. This suggests that they be 4 km long and reside in the beam tubes alongside the interferometer’s arm cavities. To separate the filters’ inputs and outputs, they might best be triangular cavities with two mirrors at the corner station and one in the end station.

Our  loss  calculations  reveal  the  following: 

The squeezed-input interferometer is little affected by losses in the interferometer’s arm cavities or in the output optical train, so long as the fractional energy loss $\epsilon$ is small compared to the squeeze factor $e^{-2R}$, as is likely to be the case. However, losses in the input optical train (most seriously the filter cavities and a circulator) influence the noise by constraining the net squeeze factor $e^{-2R}$ of the light entering the arm cavities. The resulting noise, expressed in terms of $e^{-2R}$, is the same as in a lossless squeezed-input interferometer (discussed above): With the light power optimized so $I_0 = I_{\rm SQL}$, the squeezed-input interferometer can beat the amplitude SQL by a factor $\mu \equiv \sqrt{S_h}/\sqrt{S_h^{\rm SQL}} \simeq \sqrt{e^{-2R}} \simeq 0.3$ (where $e^{-2R} \simeq 0.1$ is a likely achievable value of the power squeeze factor).

The variational-output and squeezed-variational interferometers are strongly affected by losses in the interferometer’s arm cavities and in the output optical train (most seriously: a circulator, the two filter cavities, the mixing with the homodyne detector’s local-oscillator field, and the photodiode inefficiency). The net fractional loss $\epsilon_*$ of signal power and (for squeezed-variational) the squeeze factor $e^{-2R}$ for input power together determine the interferometer’s optimized performance: The amplitude SQL can be beat by an amount $\mu = (e^{-2R}\epsilon_*)^{1/4}$, and the input laser power required to achieve this optimal performance is $I_0/I_{\rm SQL} \simeq \sqrt{e^{-2R}/\epsilon_*}$. In particular, the variational-output interferometer (no input squeezing; $e^{-2R} = 1$), with the possibly achievable loss level $\epsilon_* \sim 0.01$, can beat the SQL by the same amount as our estimate for the squeezed-input interferometer, $\mu \simeq \epsilon_*^{1/4} \simeq 0.3$, but requires ten times higher input optical power, $I_0/I_{\rm SQL} \simeq 1/\sqrt{\epsilon_*} \simeq 10$, which could be a very serious problem. By contrast, the squeezed-variational interferometer with the above parameters has an optimized performance $\mu \simeq (0.1\times0.01)^{1/4} \simeq 0.18$ (substantially better than squeezed-input or variational-output), and achieves this with an optimizing input power $I_0/I_{\rm SQL} = \sqrt{0.1/0.01} \simeq 3.2$. If the input power is pulled down from this optimizing value to $I_0/I_{\rm SQL} = 1$ so it is the same as for the squeezed-input interferometer, then the squeezed-variational performance is debilitated by a factor 1.3, to $\mu \simeq 0.24$, which is still somewhat better than for squeezed-input.
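The numbers in the two preceding paragraphs follow directly from the quoted formulas $\mu = (e^{-2R}\epsilon_*)^{1/4}$ and $I_0/I_{\rm SQL} \simeq \sqrt{e^{-2R}/\epsilon_*}$. A minimal sketch that reproduces them is given below; the function name `optimized_performance` is hypothetical, and the inputs are the text's "possibly achievable" values $e^{-2R} = 0.1$ and $\epsilon_* = 0.01$.

```python
import math

def optimized_performance(power_squeeze, signal_loss):
    """Return (mu, I_0/I_SQL) for a homodyne-based design with losses.

    power_squeeze : e^{-2R} (set to 1.0 for no input squeezing)
    signal_loss   : net fractional loss of signal power, epsilon_*
    """
    mu = (power_squeeze * signal_loss) ** 0.25
    power_ratio = math.sqrt(power_squeeze / signal_loss)
    return mu, power_ratio

# Squeezed-input: mu ~ sqrt(e^{-2R}) at I_0 = I_SQL
squeezed_input_mu = math.sqrt(0.1)

# Variational-output: no input squeezing, e^{-2R} = 1
variational = optimized_performance(1.0, 0.01)

# Squeezed-variational: both squeezing and losses
squeezed_variational = optimized_performance(0.1, 0.01)

print(f"squeezed-input        mu = {squeezed_input_mu:.2f}, I0/I_SQL = 1")
print(f"variational-output    mu = {variational[0]:.2f}, I0/I_SQL = {variational[1]:.1f}")
print(f"squeezed-variational  mu = {squeezed_variational[0]:.2f}, I0/I_SQL = {squeezed_variational[1]:.1f}")
```

The sketch only evaluates the optimized case; the reduced performance at $I_0/I_{\rm SQL} = 1$ depends on the full lossy noise spectrum derived in Sec. VI and is not modeled here.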

It  will  require  considerable  research  and  development  to 
actually  achieve  performances  at  the  above  levels,  and  there 
could  be  a  number  of  unknown  pitfalls  along  the  way.  For 
example,  ponderomotive  squeezing,  which  underlies  all  three 
of  our  QND  configurations,  has  never  yet  been  seen  in  the 
laboratory  and  may  entail  unknown  technical  difficulties. 

Fortunately, the technology for producing squeezed vacuum via nonlinear optics is rather well developed [22] and has even been used to enhance the performance of interferometers [20,21]. Moreover, much effort is being invested in the development of low-loss test-mass suspensions, and this gives the prospect for new (ponderomotive) methods of generating squeezed light that may perform better than traditional nonlinear optics. These facts, plus the fact that, in a squeezed-input configuration, the output signal is only modestly squeezed and thus is not nearly so delicate as the highly-squeezed output of an optimally performing squeezed-variational configuration, make us feel more confident of success with squeezed-input interferometers than with squeezed-variational ones.

On the other hand, the technology for a squeezed-variational interferometer is not much different from that for a squeezed-input one: Both require input squeezing and both require filter cavities with roughly the same specifications; the only significant differences are the need for conventional, frequency-independent homodyne detection in the squeezed-variational interferometer, and its higher degree of output squeezing corresponding to higher sensitivity. Therefore, the squeezed-variational interferometer may turn out to be just as practical as the squeezed-input, and may achieve significantly better overall performance at the same laser power.

FIG. 3. Gravitational-wave interferometer with two inputs (the carrier which has power $I_0$ entering the bright port, and quantum field $a$ entering the dark port) and one relevant output (the quantum field $b$ leaving the dark port).

This paper is organized as follows: In Sec. II we sketch our mathematical description of the interferometer, including our use of the Caves-Schumaker [26,27] formalism for two-photon quantum optics, including light squeezing (cf. Appendix A); and we write down the interferometer’s input-output relation in the absence of losses [Eq. (16); cf. Appendix B for derivation]. In Sec. III, relying on our general lossless input-output relation (16), we derive the noise spectral density $S_h(f)$ for a conventional interferometer and elucidate thereby the SQL. In Sec. IV, we describe mathematically our three QND interferometer designs and, using our lossless input-output relation (16), derive their lossless noise spectral densities. In Sec. V, we show that FD homodyne detection can be achieved by filtration followed by conventional homodyne detection, and in Appendix C we show that the required filtration can be achieved by sending the light through two successive Fabry-Perot cavities with suitably chosen cavity parameters. We list and discuss the required cavity parameters in Sec. V. In Sec. VI, we compute the effects of optical losses on the interferometers’ noise spectral density; our computation relies on an input-output relation (97) and (101) derived in Appendix B. In Sec. VII we discuss and compare the noise performances of our three types of interferometers. Finally, in Sec. VIII we briefly recapitulate and then list and briefly discuss a number of issues that need study, as foundations for possibly implementing these QND interferometers in LIGO-III.

This paper assumes that the reader is familiar with modern quantum optics and its theoretical tools as presented, for example, in Refs. [28].


II.  MATHEMATICAL  DESCRIPTION  OF  THE 
INTERFEROMETER 

A.  Input  and  output  fields 

Figure 3 shows the standard configuration for a gravitational-wave interferometer. In this subsection we focus on the beam splitter’s input and output. In our equations we idealize the beam splitter as infinitesimally thin and write the input and output fields as functions of time (not time and position) at the common centers of the beams as they strike the splitter.

At the beam splitter’s bright port the input is a carrier field, presumed to be in a perfectly coherent state with power $I_0 \sim 10$ kW (achieved via power recycling [29]), angular frequency $\omega_0 = 1.78\times10^{15}\ {\rm sec}^{-1}$ (1.06 micron light), and excitation confined to the $\cos(\omega_0 t)$ quadrature [i.e., the mean field arriving at the beam splitter is proportional to $\cos(\omega_0 t)$].

At the dark port the input is a (quantized) electromagnetic field with the positive-frequency part of the electric field given by the standard expression

$$ E^{(+)}_{\rm in} = \int_0^\infty \sqrt{\frac{2\pi\hbar\omega}{\mathcal{A}c}}\; a_\omega\, e^{-i\omega t}\, \frac{d\omega}{2\pi}. \tag{1} $$

Here $\mathcal{A}$ is the effective cross sectional area of the beam and $a_\omega$ is the annihilation operator, whose commutation relations are

$$ [a_\omega, a_{\omega'}] = 0, \qquad [a_\omega, a^\dagger_{\omega'}] = 2\pi\delta(\omega-\omega'). \tag{2} $$

Throughout this paper we use the Heisenberg picture, so $E^{(+)}$ evolves with time as indicated. However, our creation and annihilation operators $a^\dagger_\omega$ and $a_\omega$ are fixed in time, with their usual Heisenberg-picture time evolutions always factored out explicitly as in Eq. (1).

We split the field (1) into side bands about the carrier frequency $\omega_0$, $\omega = \omega_0 \pm \Omega$, with side-band frequencies $\Omega$ in the gravitational-wave range ~60 to ~6000 sec$^{-1}$ (10 to 1000 Hz), and we define

$$ a_+ = a_{\omega_0+\Omega}, \qquad a_- = a_{\omega_0-\Omega}. \tag{3} $$

As in Eq. (2), we continue to use a prime on the subscript to denote frequency $\Omega'$: $a_{+'} = a_{\omega_0+\Omega'}$. Correspondingly, the commutation relations (2) imply for the only nonzero commutators

$$ [a_+, a^\dagger_{+'}] = 2\pi\delta(\Omega-\Omega'), \qquad [a_-, a^\dagger_{-'}] = 2\pi\delta(\Omega-\Omega'); \tag{4} $$

and expression (1) for the dark-port input field becomes

$$ E^{(+)}_{\rm in} = \sqrt{\frac{2\pi\hbar\omega_0}{\mathcal{A}c}}\; e^{-i\omega_0 t} \int_0^\infty \left(a_+ e^{-i\Omega t} + a_- e^{+i\Omega t}\right) \frac{d\Omega}{2\pi}. \tag{5} $$

Here (and throughout this paper) we approximate $\omega_0 \pm \Omega \simeq \omega_0$ inside the square root, since $\Omega/\omega_0 \sim 3\times10^{-13}$ is so small; and we formally extend the integrals over $\Omega$ to infinity, for ease of notation.

Because the radiation pressure in the optical cavities produces squeezing, and because this ponderomotive squeezing is central to the operation of our interferometers, we shall find it convenient to think about the interferometer not in terms of the single-photon modes, whose annihilation operators are $a_+$ and $a_-$, but rather in terms of the correlated two-photon modes (Appendix A and Refs. [26,27]) whose field amplitudes are

$$ a_1 = \frac{a_+ + a_-^\dagger}{\sqrt{2}}, \qquad a_2 = \frac{a_+ - a_-^\dagger}{\sqrt{2}\,i}. \tag{6} $$

The commutation relations (4) imply the following values for the commutators of these field amplitudes and their adjoints:

$$ [a_1, a^\dagger_{2'}] = -[a_2, a^\dagger_{1'}] = i\,2\pi\delta(\Omega-\Omega') \tag{7a} $$

and all others vanish [though some would be of order $(\Omega/\omega_0)$ if we had not approximated $\omega_0 \pm \Omega \simeq \omega_0$ inside the square root in Eq. (5); cf. [26,27]]:

$$ [a_1, a_{1'}] = [a_1, a^\dagger_{1'}] = [a^\dagger_1, a^\dagger_{1'}] = [a_1, a_{2'}] = [a^\dagger_1, a^\dagger_{2'}] = 0, \tag{7b} $$

and similarly with $1 \leftrightarrow 2$. In terms of these two-photon amplitudes, Eq. (5) and $E_{\rm in} = E^{(+)}_{\rm in} + E^{(+)\dagger}_{\rm in}$ imply that the full electric field operator for the dark-port input is

$$ E_{\rm in} = \sqrt{\frac{4\pi\hbar\omega_0}{\mathcal{A}c}} \left[ \cos(\omega_0 t) \int_0^\infty \left(a_1 e^{-i\Omega t} + a_1^\dagger e^{+i\Omega t}\right) \frac{d\Omega}{2\pi} + \sin(\omega_0 t) \int_0^\infty \left(a_2 e^{-i\Omega t} + a_2^\dagger e^{+i\Omega t}\right) \frac{d\Omega}{2\pi} \right]. \tag{8} $$

Thus, we see that $a_1$ is the field amplitude for photons in the $\cos\omega_0 t$ quadrature and $a_2$ is that for photons in the $\sin\omega_0 t$ quadrature [26,27]. These and other quadratures will be central to our analysis.

The output field at the beam splitter’s dark port is described by the same equations as the input field, but with the annihilation operators $a$ replaced by $b$; for example,

$$ E_{\rm out} = \sqrt{\frac{4\pi\hbar\omega_0}{\mathcal{A}c}} \left[ \cos(\omega_0 t) \int_0^\infty \left(b_1 e^{-i\Omega t} + b_1^\dagger e^{+i\Omega t}\right) \frac{d\Omega}{2\pi} + \sin(\omega_0 t) \int_0^\infty \left(b_2 e^{-i\Omega t} + b_2^\dagger e^{+i\Omega t}\right) \frac{d\Omega}{2\pi} \right]. \tag{9} $$

We shall find it convenient to introduce explicitly the cosine and sine quadratures of the output field, $E_1(t)$ and $E_2(t)$, defined by

$$ E_{\rm out} = E_1(t)\cos(\omega_0 t) + E_2(t)\sin(\omega_0 t); \qquad E_j(t) = \sqrt{\frac{4\pi\hbar\omega_0}{\mathcal{A}c}} \int_0^\infty \left(b_j e^{-i\Omega t} + b_j^\dagger e^{+i\Omega t}\right) \frac{d\Omega}{2\pi}. \tag{10} $$

B. Interferometer arms and gravitational waves

LIGO’s interferometers are generally optimized for the waves from inspiraling neutron-star and black-hole binaries, sources that emit roughly equal power into all logarithmic frequency intervals $\Delta\Omega/\Omega \sim 1$ in the LIGO band $\sim 10\ {\rm Hz} \lesssim f = \Omega/2\pi \lesssim 1000\ {\rm Hz}$. Optimization turns out to entail making the lowest point in the interferometer’s dimensionless noise spectrum $f \times S_h(f)$ as low as possible. Because of the relative contributions of shot noise, radiation pressure noise, and thermal noise, this lowest point turns out to be at $f = \Omega/2\pi \simeq 100$ Hz. To minimize the noise at this frequency, one makes the end mirrors of the interferometer’s arm cavities (Fig. 3) as highly reflecting as possible (we shall idealize them as perfectly reflecting until Sec. VI), and one gives their corner mirrors transmissivities $T \simeq 0.033$, so the cavities’ half bandwidths are

$$ \gamma \equiv \frac{Tc}{4L} \simeq 2\pi \times 100\ {\rm Hz}. \tag{11} $$

Here $L = 4$ km is the cavities’ length (the interferometer “arm length”). We shall refer to $\gamma$ as the interferometer’s optimal frequency, and when analyzing QND interferometers, we shall adjust their parameters so as to beat the SQL by the maximum possible amount at $\Omega \simeq \gamma$. In Table I we list $\gamma$, $L$ and other parameters that appear extensively in this paper, along with their fiducial numerical values.

TABLE I. Interferometer parameters and their fiducial values.

Parameter                          Symbol            Fiducial value
light frequency                    $\omega_0$        $1.8\times10^{15}\ {\rm s}^{-1}$
arm cavity 1/2-bandwidth           $\gamma$          $2\pi\times100\ {\rm s}^{-1}$
gravitational wave frequency       $\Omega$          (none)
mirror mass                        $m$               30 kg
arm length                         $L$               4 km
light power to beam splitter       $I_0$             (none)
light power to reach SQL           $I_{\rm SQL}$     $1.0\times10^{4}$ W
gravitational wave SQL             $h_{\rm SQL}$     $2\times10^{-24}\,(\gamma/\Omega)\ {\rm Hz}^{-1/2}$
opto-mechanical coupling const     $\mathcal{K}$     $(I_0/I_{\rm SQL})\,2\gamma^4/[\Omega^2(\gamma^2+\Omega^2)]$
fractional signal-power loss       $\epsilon_*$      0.01
max power squeeze factor           $e^{-2R}$         0.1

In this and the next few sections we assume, for simplicity, that the mirrors and beam splitter are lossless; we shall study the effects of losses in Sec. VI below. We assume that the carrier light (frequency $\omega_0$) excites the arm cavities precisely on resonance.

We presume that all four mirrors (“test masses”) have masses $m \simeq 30$ kg, as is planned for LIGO-II.

We label the two arms $n$ for north and $e$ for east, and denote by $X_n$ and $X_e$ the changes in the lengths of the cavities induced by the test-mass motions. We denote by

$$ x \equiv X_n - X_e \tag{12} $$

the changes in the arm-length difference, and we regard $x$ as a quantum mechanical observable (though it could equally well be treated as classical [4]). In the absence of external forces, we idealize $x$ as behaving like a free mass (no pendular restoring forces). This idealization could easily be relaxed, but because all signals below ~10 Hz are removed in the data analysis, the pendular forces have no influence on the interferometer’s ultimate performance.

The arm-length difference evolves in response to the gravitational wave and to the back-action influence of the light’s fluctuating radiation pressure. Accordingly, we can write it as

$$ x(t) = x_0 + \frac{p_0}{m/4}\,t + \int_{-\infty}^{+\infty} (Lh + x_{\rm BA})\, e^{-i\Omega t}\, \frac{d\Omega}{2\pi}. \tag{13} $$

Here $x_0$ is the initial value of $x$ when a particular segment of data begins to be collected, $p_0$ is the corresponding initial generalized momentum, $m/4$ is the reduced mass³ associated with the test-mass degree of freedom $x$, $h$ is the Fourier transform of the gravitational-wave field


and $x_{\rm BA}$ is the influence of the radiation-pressure back action. Notice our notation: $x$, $x_{\rm BA}$, and $h$ are the $\Omega$-dependent Fourier transforms of $x(t)$, $x_{\rm BA}(t)$, and $h(t)$.

Elsewhere [4] we discuss the fact that $x_0$ and $p_0$ influence the interferometer output only near zero frequency $\Omega \sim 0$, and their influence is thus removed when the output data are filtered. For this reason, we ignore them and rewrite $x(t)$ as

$$ x(t) = \int_{-\infty}^{+\infty} (Lh + x_{\rm BA})\, e^{-i\Omega t}\, \frac{d\Omega}{2\pi}. \tag{15} $$

³ In each arm of the interferometer, the quantity measured is the difference between the positions of the two mirrors’ centers of mass; this degree of freedom behaves like a free particle with reduced mass $m_r = m \times m/(m+m) = m/2$. The interferometer output is the difference, between the two arms, of this free-particle degree of freedom; that difference behaves like a free particle with reduced mass $m_r/2 = m/4$.

C. Output field expressed in terms of input

Because we have idealized the beam splitter as infinitesimally thin, the input field emerging from it and traveling toward the arm cavities has the coherent laser light in the same $\cos\omega_0 t$ quadrature as the dark-port field amplitude $a_1$. We further idealize the distances between the beam splitter and the arm-cavity input mirrors as integral multiples of the carrier wavelength $\lambda_0 = 2\pi c/\omega_0$ and as small compared to $2\pi c/\gamma \sim 3000$ km. (These idealizations could easily be relaxed without change in the ultimate results.)

Relying on these idealizations, we show in Appendix B that the annihilation operators $b_j$ for the beam splitter’s output quadrature fields $E_j(t)$ are related to the input annihilation operators $a_j$ and the gravitational-wave signal $h$ by the linear relations

$$ b_1 = \Delta b_1 = a_1 e^{2i\beta}, \qquad b_2 = \Delta b_2 + \sqrt{2\mathcal{K}}\,\frac{h}{h_{\rm SQL}}\,e^{i\beta}, \qquad \Delta b_2 = (a_2 - \mathcal{K}a_1)\,e^{2i\beta}. \tag{16} $$

Here and below, for any operator $A$, $\Delta A \equiv A - \langle A \rangle$. This input-output equation and the quantities appearing in it require explanation.

The quantities $\Delta b_j$ are the noise-producing parts of $b_j$, which remain when the gravitational-wave signal is turned off. The $a_j$ impinge on the arm cavities at a frequency $\omega_0 + \Omega$ that is off resonance, so they acquire the phase shift $2\beta$ upon emerging, where

$$ \beta \equiv \arctan(\Omega/\gamma). \tag{17} $$

If the test masses were unable to move, then $\Delta b_j$ would just be $a_j e^{2i\beta}$; however, the fluctuating light pressure produces the test-mass motion $x_{\rm BA}$, thereby inducing a phase shift in the light inside the cavity, which shows up in the emerging light as the term $-\mathcal{K}a_1$ in $b_2$ (cf. Appendix B). The quantity

$$ \mathcal{K} = \frac{(I_0/I_{\rm SQL})\,2\gamma^4}{\Omega^2(\gamma^2+\Omega^2)} \tag{18} $$

is the coupling constant by which this radiation-pressure back-action converts input $a_1$ into output $\Delta b_2$. In this coupling constant, $I_{\rm SQL}$ is the input laser power required, in a conventional interferometer (Sec. III), to reach the standard quantum limit:

$$ I_{\rm SQL} = \frac{mL^2\gamma^4}{4\omega_0} \simeq 1.0\times10^{4}\ {\rm W}. \tag{19} $$

In Eq. (16), the gravitational-wave signal shows up as the classical piece $\sqrt{2\mathcal{K}}\,h/h_{\rm SQL}$ of $b_2$. Here, as we shall see below,

$$ h_{\rm SQL} \equiv \sqrt{\frac{8\hbar}{m\Omega^2L^2}} \simeq 2\times10^{-24}\,\frac{\gamma}{\Omega}\ {\rm Hz}^{-1/2} \tag{20} $$

is the standard quantum limit for the square root of the single-sided spectral density of $h(t)$, $\sqrt{S_h}$.
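The fiducial values quoted with Eqs. (11) and (17)-(20) can be checked numerically. The following is a minimal sketch, assuming SI values for $\hbar$ and $c$ and the parameters stated in the text ($T \simeq 0.033$, $L = 4$ km, $m \simeq 30$ kg, $\omega_0 = 1.78\times10^{15}\ {\rm s}^{-1}$); the function and variable names are illustrative only.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J s
C = 2.99792458e8        # speed of light, m/s

T = 0.033               # corner-mirror power transmissivity
L = 4.0e3               # arm length, m
M = 30.0                # mirror mass, kg
OMEGA_0 = 1.78e15       # carrier angular frequency, 1/s

# Eq. (11): arm-cavity half bandwidth
gamma = T * C / (4.0 * L)

# Eq. (19): laser power needed to reach the SQL at Omega = gamma
i_sql = M * L**2 * gamma**4 / (4.0 * OMEGA_0)

def h_sql(omega):
    """Eq. (20): standard quantum limit on sqrt(S_h), in Hz^{-1/2}."""
    return math.sqrt(8.0 * HBAR / (M * omega**2 * L**2))

def coupling(omega, power_ratio):
    """Eq. (18): opto-mechanical coupling K for I_0/I_SQL = power_ratio."""
    return power_ratio * 2.0 * gamma**4 / (omega**2 * (gamma**2 + omega**2))

def beta(omega):
    """Eq. (17): half of the phase shift acquired in the arm cavity."""
    return math.atan(omega / gamma)

print(f"gamma / 2 pi    = {gamma / (2.0 * math.pi):.1f} Hz")
print(f"I_SQL           = {i_sql:.3g} W")
print(f"h_SQL(gamma)    = {h_sql(gamma):.2e} Hz^-1/2")
print(f"K(gamma), I0=I_SQL = {coupling(gamma, 1.0):.3f}")
print(f"2 pi c / gamma  = {2.0 * math.pi * C / gamma / 1e3:.0f} km")
```

Because $T \simeq 0.033$ is itself rounded, the computed values agree with the quoted fiducial numbers only to within a few percent.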


III.  CONVENTIONAL  INTERFEROMETER 

In an (idealized) conventional interferometer, the beam splitter’s output quadrature field $E_2(t)$ is measured by means of conventional photodetection.⁴ The Fourier transform of this measured quadrature is proportional to the field amplitude $b_2 = \Delta b_2 + \sqrt{2\mathcal{K}}\,(h/h_{\rm SQL})\,e^{i\beta}$; cf. Eqs. (10) and (16). Correspondingly, we can think of $b_2 = b_2(\Omega)$ as the quantity measured, and when we compute, from the output, the Fourier transform $h = h(\Omega)$ of the gravitational-wave signal, the noise in that computation will be

$$ h_n(\Omega) = \frac{h_{\rm SQL}}{\sqrt{2\mathcal{K}}}\,\Delta b_2\,e^{-i\beta}. \tag{21} $$

⁴ Here and throughout this paper we regard some particular quadrature $E_\zeta(t)$ as being measured directly. This corresponds to superposing on $E_\zeta(t)$ carrier light with the same quadrature phase as $E_\zeta$ and then performing direct photodetection, which produces a photocurrent whose time variations are proportional to $E_\zeta(t)$. For a conventional interferometer the carrier light in the desired quadrature, that of $E_2(t)$, can be produced by operating with the dark port biased slightly away from the precise dark fringe. In future research it might be necessary to modify the QND designs described in this paper so as to accommodate the modulations that are actually used in the detection process; see Sec. VIII and especially footnote 13.

This noise is an operator for the Fourier transform of a random process, and the corresponding single-sided spectral density $S_h(f)$ associated with this noise is given by the standard formula [3,26,27]

$$ \tfrac{1}{2}\,2\pi\delta(\Omega-\Omega')\,S_h(f) = \langle {\rm in}|\,h_n(\Omega)\,h_n^\dagger(\Omega')\,|{\rm in}\rangle_{\rm sym}. \tag{22} $$

Here $f = \Omega/2\pi$ is frequency, $|{\rm in}\rangle$ is the quantum state of the input light field (the field operators $a_1$ and $a_2$), and the subscript “sym” means “symmetrize the operators whose expectation value is being computed,” i.e., replace $h_n(\Omega)h_n^\dagger(\Omega')$ by $\tfrac{1}{2}[h_n(\Omega)h_n^\dagger(\Omega') + h_n^\dagger(\Omega')h_n(\Omega)]$. Note that when Eq. (21) for $h_n$ is inserted into Eq. (22), the phase factor $e^{-i\beta}$ cancels, i.e., it has no influence on the noise $S_h$. This allows us to replace Eq. (21) by

$$ h_n(\Omega) = \frac{h_{\rm SQL}}{\sqrt{2\mathcal{K}}}\,\Delta b_2. \tag{23} $$

For a conventional interferometer, the dark-port input is in its vacuum state, which we denote by

$$ |{\rm in}\rangle = |0_a\rangle. \tag{24} $$

For this vacuum input, the standard relations $a_+|0_a\rangle = a_-|0_a\rangle = 0$, together with Eqs. (6) and (7), imply [26,27]

$$ \langle 0_a|\,a_j a^\dagger_{k'}\,|0_a\rangle_{\rm sym} = \tfrac{1}{2}\,2\pi\delta(\Omega-\Omega')\,\delta_{jk}. \tag{25} $$

Comparing this relation with Eq. (22) and its generalization to multiple random processes, we see that (when $|{\rm in}\rangle = |0_a\rangle$) $a_1(\Omega)$ and $a_2(\Omega)$ can be regarded as the Fourier transforms of classical random processes with single-sided spectral densities and cross-spectral density given by [4]

$$ S_{a_1}(f) = S_{a_2}(f) = 1, \qquad S_{a_1 a_2}(f) = 0. \tag{26} $$

Combining Eqs. (16) and (23)-(25) [or, equally well, Eqs. (16), (23), and (26)], we obtain for the noise spectral density of the conventional interferometer

$$ S_h = \frac{h_{\rm SQL}^2}{2}\left(\frac{1}{\mathcal{K}} + \mathcal{K}\right). \tag{27} $$

Since $1/\mathcal{K} + \mathcal{K} \ge 2$ for every positive $\mathcal{K}$, this spectral density is limited, at all frequencies $\Omega$, by the standard quantum limit

$$ S_h \ge h_{\rm SQL}^2 = \frac{8\hbar}{m\Omega^2L^2}. \tag{28} $$

Recall that $\mathcal{K}$ is a function of frequency $\Omega$ and is proportional to the input laser power $I_0$ [Eq. (18)]. In our conventional interferometer, we adjust the laser power to $I_0 = I_{\rm SQL}$ [Eq. (19)], thereby making $\mathcal{K}(\Omega=\gamma) = 1$, which minimizes $S_h$ at the interferometer’s optimal frequency $\Omega = \gamma$. The noise spectral density then becomes [cf. Eqs. (27) and (18)]

$$ S_h = \frac{4\hbar}{mL^2\Omega^2}\left[\frac{2\gamma^4}{\Omega^2(\gamma^2+\Omega^2)} + \frac{\Omega^2(\gamma^2+\Omega^2)}{2\gamma^4}\right]. \tag{29} $$

This optimized conventional noise is shown as a curve in Fig. 4, along with the standard quantum limit $h_{\rm SQL}$ and the noise curves for several QND interferometers to be discussed below. This conventional noise curve is currently a tentative goal for LIGO-II, when operating without signal recycling [7].

FIG. 4. The square root of the spectral density $\sqrt{S_h}$ of the gravitational-wave noise for several interferometer designs, as a function of angular frequency $\Omega$, with optical losses assumed negligible; $\sqrt{S_h}$ is measured in units of the standard quantum limit at frequency $\Omega = \gamma$, and $\Omega$ is measured in units of $\gamma$. The noise curves shown are: (i) the standard quantum limit itself, $h_{\rm SQL}(\Omega)$ [Eq. (20)]; (ii) the noise for a conventional interferometer with laser power $I_0 = I_{\rm SQL}$ [Eq. (29)]; (iii) the noise for a squeezed-input interferometer with $I_0 = I_{\rm SQL}$, squeeze factor $e^{-2R} = 0.1$, and (a) optimized FD squeeze angle $\lambda = -\Phi(\Omega)$ [Eq. (49); solid curve], (b) optimized frequency-independent squeeze angle [Eq. (52); dashed curve]; (iv) the noise for a variational-output interferometer with $I_0 = 10\,I_{\rm SQL}$ and optimized frequency-dependent homodyne phase $\zeta = \Phi(\Omega)$ [Eq. (58)]; and (v) the noise for a squeezed-variational interferometer with $I_0 = 10\,I_{\rm SQL}$, input squeeze factor $e^{-2R} = 0.1$, and optimized input squeeze angle $\lambda = \pi/2$ and output homodyne phase $\zeta = \Phi(\Omega)$ [Eq. (73)].


IV.  STRATEGIES  TO  BEAT  THE  SQL,  AND  THEIR 
LOSSLESS  PERFORMANCE 

A.  Motivation:  Ponderomotive  squeezing 

The  interferometer’s  input-output  relations  A bi  =  a1e2lf3, 
\b2  =  (a2  —  JCai)e2lf3  can  be  regarded  as  consisting  of  the 
uninteresting  phase  shift  e2lf3,  and  a  rotation  in  the  {ax  ,a2} 
plane  (i.e.,  {cos  a)0t, sin  co0t\  plane),  followed  by  a  squeeze: 

bj  =  S\r,(f>)R\- 0)aje2if3R(- 0)S(r,(f>).  (30) 


This  spectral  density  is  limited,  at  all  frequencies  12,  by  the 
standard  quantum  limit 
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Here  R(  —  0)  is  the  rotation  operator  and  S(r,(p)  the  squeeze 
operator  for  two-photon  quantum  optics;  see  Appendix  A  for 
a  very  brief  summary,  and  Refs.  [26,27]  for  extensive  detail. 
The  rotation  angle  0 ,  squeeze  angle  (p  and  squeeze  factor  r 
are  given  by 


1 

6  =  arctan(  /C/2) ,  <p  =  —  arccot(  /C/2) ,  r  =  arcshinh(  /C/2) . 

(31) 

Note  that,  because  the  coupling  constant  /C  depends  on  fre¬ 
quency  12  [Eq.  (18)],  the  rotation  angle,  squeeze  angle,  and 
squeeze  factor  are  frequency  dependent.  This  frequency  de¬ 
pendence  will  have  major  consequences  for  the  QND  inter¬ 
ferometer  designs  discussed  below. 

The rotate-and-squeeze transformation (30) for the two-photon amplitudes implies corresponding rotate-and-squeeze relations for the one-photon creation and annihilation operators

$$ b_\pm = S^\dagger(r,\phi) R^\dagger(-\theta)\, a_\pm e^{\pm 2i\beta}\, R(-\theta) S(r,\phi). \tag{32} $$

Denote by $|0_{a+}\rangle$ the vacuum for the in mode at frequency $\omega_0 + \Omega$, by $|0_{a-}\rangle$ that for the in mode at $\omega_0 - \Omega$, and by $|0_{a\pm}\rangle$ the vacuum for one or the other of these modes; and denote similarly the vacua for the out modes, $|0_{b\pm}\rangle$. Then $|0_{a\pm}\rangle$ is the state annihilated by $a_\pm$ and $|0_{b\pm}\rangle$ is that annihilated by $b_\pm$. Correspondingly, the rotate-squeeze relation (32) implies that

$$ b_\pm |0_{b\pm}\rangle = S^\dagger R^\dagger a_\pm e^{\pm 2i\beta} R S |0_{b\pm}\rangle = 0, \tag{33} $$

where the parameters of the squeeze and rotation operators are those given in Eqs. (31) and (32). This equation implies that $e^{\pm 2i\beta} R S |0_{b\pm}\rangle$ is annihilated by $a_\pm$ and therefore is the in vacuum $|0_{a\pm}\rangle$ for the in mode $\omega_0 \pm \Omega$:

$$ e^{\pm 2i\beta} R S |0_{b\pm}\rangle = |0_{a\pm}\rangle. \tag{34} $$

Applying $R^\dagger$ and noting that $R^\dagger |0_{a\pm}\rangle = |0_{a\pm}\rangle$ (the vacuum is rotation invariant), we obtain

$$ |0_{a\pm}\rangle = e^{\pm 2i\beta} S(r,\phi) |0_{b\pm}\rangle. \tag{35} $$

Thus, the in vacuum is equal to a squeezed out vacuum, aside from an uninteresting, frequency-dependent phase shift. The meaning of this statement in the context of a conventional interferometer is the following.

For a conventional interferometer, the in state is

$$ |\mathrm{in}\rangle = |0_{a\pm}\rangle = e^{\pm 2i\beta} S(r,\phi) |0_{b\pm}\rangle; \tag{36} $$

and because we are using the Heisenberg picture where the state does not evolve, the light emerges from the interferometer in this state. However, in passing through the interferometer, the light's quadrature amplitudes evolve from $a_j$ to $b_j$. Correspondingly, at the output we should discuss the properties of the unchanged state in terms of a basis built from the out vacuum $|0_{b\pm}\rangle$. Equation (35) says that in this out language, the light has been squeezed at the angle $\phi$ and squeeze-factor $r$ given by Eq. (31). This squeezing is produced by the back-action force of fluctuating radiation pressure on the test masses. That back action has the character of a ponderomotive nonlinearity first recognized by Braginsky and Manukin [15] (see footnote 5). The correlations inherent in this squeezing form the foundation for the QND interferometers discussed below.

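One can also deduce this ponderomotive squeezing from the in-out relations $\Delta b_1 = a_1 e^{2i\beta}$, $\Delta b_2 = (a_2 - \mathcal{K} a_1) e^{2i\beta}$ [Eq. (16)], the expressions

$$ \frac{1}{2}\, 2\pi \delta(\Omega - \Omega')\, S_{b_j}(f) = \langle \mathrm{in}| \Delta b_j \Delta b_j'^\dagger |\mathrm{in}\rangle_{sym}, $$

$$ \frac{1}{2}\, 2\pi \delta(\Omega - \Omega')\, S_{b_1 b_2}(f) = \langle \mathrm{in}| \frac{1}{2}\left(\Delta b_1 \Delta b_2'^\dagger + \Delta b_1'^\dagger \Delta b_2\right) |\mathrm{in}\rangle_{sym} \tag{37} $$

for the spectral densities and cross spectral densities of $b_1$ and $b_2$, and the spectral densities $S_{a_1} = S_{a_2} = 1$, $S_{a_1 a_2} = 0$ [Eqs. (26)]. These imply that for a conventional interferometer

$$ S_{b_1} = 1, \qquad S_{b_2} = 1 + \mathcal{K}^2, \qquad S_{b_1 b_2} = -\mathcal{K}. \tag{38} $$

Rotating $\Delta b_j$ through the angle $\phi = \frac{1}{2}\,\mathrm{arccot}(\mathcal{K}/2)$ to obtain

$$ b_1' = b_1 \cos\phi + b_2 \sin\phi, \qquad b_2' = b_2 \cos\phi - b_1 \sin\phi, \tag{39} $$

and using Eqs. (37) and (38), we obtain

$$ S_{b_1'} = e^{-2r} = \left(\sqrt{1 + (\mathcal{K}/2)^2} - \mathcal{K}/2\right)^2 \simeq 1/\mathcal{K}^2 \ \ \text{if } \mathcal{K} \gg 1, $$

$$ S_{b_2'} = e^{+2r} = \left(\sqrt{1 + (\mathcal{K}/2)^2} + \mathcal{K}/2\right)^2, \qquad S_{b_1' b_2'} = 0, \tag{40} $$

which represents a squeezing of the input vacuum noise in the manner described formally by Eqs. (36) and (31).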
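The diagonalization that leads to Eq. (40) can be checked numerically. The following is a minimal sketch, not part of the original analysis: it builds the spectral-density matrix of Eq. (38), rotates it by the angle $\phi$ of Eq. (31) according to Eq. (39), and compares the result with $e^{\mp 2r}$. The function names and the sample values of $\mathcal{K}$ are illustrative assumptions.

```python
import numpy as np

def ponderomotive_covariance(K):
    """Spectral-density matrix of (b1, b2) for vacuum input, Eq. (38)."""
    return np.array([[1.0, -K], [-K, 1.0 + K**2]])

def squeeze_parameters(K):
    """Squeeze angle phi and squeeze factor r of Eq. (31), for K > 0."""
    phi = 0.5 * np.arctan2(2.0, K)   # (1/2) arccot(K/2)
    r = np.arcsinh(K / 2.0)
    return phi, r

def rotate_covariance(S, phi):
    """Spectral densities of the rotated quadratures b1', b2' of Eq. (39)."""
    M = np.array([[np.cos(phi), np.sin(phi)],
                  [-np.sin(phi), np.cos(phi)]])
    return M @ S @ M.T

if __name__ == "__main__":
    for K in (0.5, 2.0, 10.0):   # illustrative coupling constants
        phi, r = squeeze_parameters(K)
        S_rot = rotate_covariance(ponderomotive_covariance(K), phi)
        expected = np.diag([np.exp(-2 * r), np.exp(2 * r)])
        print(K, np.allclose(S_rot, expected))
```

The off-diagonal entries of the rotated matrix vanish only at the angle of Eq. (31), which is one way to see that $\phi$ is fixed by $\mathcal{K}$ and therefore by frequency.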

This ponderomotive squeezing is depicted by the noise ellipse of Fig. 5. For a conventional interferometer ($b_2$ measured via photodetection; see footnote 6), the signal is the arrow along the $b_2$ axis, and the square root of the noise spectral density $S_{b_2}$ is the projection of the noise ellipse onto the $b_2$ axis. For a detailed discussion of this type of graphical representation of noise in two-photon quantum optics see, e.g., Ref. [26].


Footnote 5: Recently it has been recognized that this ponderomotive nonlinearity acting on a movable mirror in a Fabry-Perot resonator may provide a practical method for generating bright squeezed light [30].

Footnote 6: See footnote 4.

FIG. 5. Noise ellipses for a conventional interferometer. Left: Noise for vacuum that enters the interferometer's dark port. Right: Noise for ponderomotively squeezed vacuum that exits at the dark port along with the gravitational-wave signal; the ponderomotive squeeze has moved the point P to the new point P' [$b_1 = a_1$, $b_2 = a_2 - \mathcal{K} a_1$, Eqs. (16)]. These noise ellipses have dimensions and shapes described by the noise spectral densities (26), (38) and (40), and by the squeeze equations (36) and (31). The minor radius of the output noise ellipse is $\sqrt{S_{b_1'}} = e^{-r}$, and its major radius is $\sqrt{S_{b_2'}} = e^{+r}$, where $r$ is the squeeze factor; cf. Eqs. (31) and (40). The conventional interferometer measures $b_2$, which contains the indicated noise [cf. Eq. (23)] and the indicated signal [$\delta b_2 = \sqrt{2\mathcal{K}}\, h/h_{SQL}$; cf. Eq. (16)]. For a detailed discussion of noise ellipses in 2-photon quantum optics see, e.g., Ref. [26].

B.  Squeezed-input  interferometer 

Interferometer designs that can beat the SQL (28) are sometimes called "QND interferometers." Unruh [14] has devised a QND interferometer design based on (i) putting the input electromagnetic fluctuations at the dark port ($a_1$ and $a_2$) into a squeezed state, and (ii) using standard photodetection to measure the interferometer's output field. We shall call this a squeezed-input interferometer. The squeezing of the input has been envisioned as achieved using nonlinear crystals [20,21], but one might also use ponderomotive squeezing.

The squeezed-input interferometer is identical to the conventional interferometer of Sec. III, except for the choice of the in state $|\mathrm{in}\rangle$ for the dark-port field. Whereas a conventional interferometer has $|\mathrm{in}\rangle = |0_a\rangle$, the squeezed-input interferometer has

$$ |\mathrm{in}\rangle = S(R,\lambda) |0_a\rangle, \tag{41} $$

where $R$ is the largest squeeze factor that the experimenters are able to achieve ($e^{-2R} \simeq 0.1$ in the LIGO-III time frame), and $\lambda = \lambda(\Omega)$ is a squeeze angle that depends on side-band frequency. One adjusts $\lambda(\Omega)$ so as to minimize the noise in the output quadrature amplitude $b_2$, which (i) contains the gravitational-wave signal and (ii) is measured by standard photodetection. As we shall see, the optimized $\lambda$ is strongly frequency dependent. By contrast, we shall idealize the squeeze factor $R$ as independent of side-band frequency $\Omega$ except when otherwise stated (Secs. IV D and VI F).

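The gravitational-wave noise for such an interferometer is proportional to

$$ \langle \mathrm{in}| h_n h_n'^\dagger |\mathrm{in}\rangle = \langle 0_a| h_{ns} h_{ns}'^\dagger |0_a\rangle \tag{42} $$

[Eq. (22)], where $h_{ns}$ is the squeezed gravitational-wave noise operator

$$ h_{ns} = S^\dagger(R,\lambda)\, h_n\, S(R,\lambda) \tag{43} $$

and $h_n' \equiv h_n(\Omega')$. By inserting expression (21) for $h_n$ into Eq. (43) and then combining the interferometer's ponderomotive squeeze relation $\Delta b_2 = (a_2 - \mathcal{K} a_1) e^{2i\beta}$ with the action of the squeeze operator on $a_1$ and $a_2$ [Eq. (A8)], we obtain

$$ h_{ns} = -\frac{h_{SQL}}{\sqrt{2\mathcal{K}}} \sqrt{1 + \mathcal{K}^2}\, e^{i\beta} \Big( a_1 \{\cosh R \cos\Phi - \sinh R \cos[\Phi - 2(\Phi + \lambda)]\} - a_2 \{\cosh R \sin\Phi - \sinh R \sin[\Phi - 2(\Phi + \lambda)]\} \Big), \tag{44} $$

where

$$ \Phi \equiv \mathrm{arccot}\, \mathcal{K}. \tag{45} $$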


We can read the spectral density of the gravitational-wave noise off of Eq. (44) by recalling that in the $|0_a\rangle$ vacuum state [which is relevant because of Eq. (42)], $a_1$ and $a_2$ can be regarded as random processes with spectral densities $S_{a_1} = S_{a_2} = 1$ and vanishing cross spectral density [Eqs. (26)]:

$$ S_h = \frac{h_{SQL}^2}{2}\left(\frac{1}{\mathcal{K}} + \mathcal{K}\right)\left(\cosh 2R - \cos[2(\lambda + \Phi)] \sinh 2R\right). \tag{46} $$

It is straightforward to verify that this noise is minimized by making it proportional to $\cosh 2R - \sinh 2R = e^{-2R}$, which is achieved by choosing for the input squeeze angle

$$ \lambda(\Omega) = -\Phi(\Omega) \equiv -\mathrm{arccot}\, \mathcal{K}(\Omega). \tag{47} $$

The result is

$$ S_h = \frac{h_{SQL}^2}{2}\left(\frac{1}{\mathcal{K}} + \mathcal{K}\right) e^{-2R}. \tag{48} $$

This says that the squeezed-input interferometer has the same noise spectral density as the conventional interferometer, except for an overall reduction by $e^{-2R}$, where $R$ is the squeeze factor for the dark-port input field (a result deduced by Unruh [14] and later confirmed by Jaekel and Reynaud [18] using a different method); see Fig. 4. This result implies that the squeezed-input interferometer can beat the amplitude SQL by a factor $e^{-R}$.
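To make the frequency dependence of Eq. (48) concrete, the sketch below evaluates $S_h / h_{SQL}^2(\Omega)$ for a conventional and a squeezed-input interferometer. It is an illustration under stated assumptions rather than part of the original analysis: the coupling constant is taken as $\mathcal{K} = 2 (I_0/I_{SQL}) \gamma^4 / [\Omega^2(\gamma^2 + \Omega^2)]$, the form implied by Eq. (29) together with the statement that $\mathcal{K}$ is proportional to $I_0$, and the function and parameter names are hypothetical.

```python
import math

def coupling(x, power_ratio=1.0):
    """Coupling constant K, with x = Omega/gamma and power_ratio = I_0/I_SQL (assumed form)."""
    return 2.0 * power_ratio / (x**2 * (1.0 + x**2))

def noise_relative_to_sql(x, power_ratio=1.0, exp_minus_2R=1.0):
    """S_h / h_SQL(Omega)^2 from Eq. (48); exp_minus_2R = 1 recovers the conventional Eq. (27)."""
    K = coupling(x, power_ratio)
    return 0.5 * (1.0 / K + K) * exp_minus_2R

if __name__ == "__main__":
    for x in (0.5, 1.0, 2.0):
        conventional = noise_relative_to_sql(x)
        squeezed = noise_relative_to_sql(x, exp_minus_2R=0.1)
        print(f"Omega/gamma = {x}: conventional {conventional:.3f}, squeezed-input {squeezed:.3f}")
```

At $\Omega = \gamma$ with $I_0 = I_{SQL}$ the conventional ratio equals unity (the noise touches the SQL), and the squeezed-input ratio equals $e^{-2R}$, so the amplitude noise sits a factor $e^{-R}$ below the SQL there.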

When the laser power $I_0$ of the squeezed-input interferometer is optimized for detection at the frequency $\Omega = \gamma$ ($I_0 = I_{SQL}$ as for a conventional interferometer), the noise spectrum becomes

$$ S_h = \frac{4\hbar}{mL^2\Omega^2}\left[\frac{2\gamma^4}{\Omega^2(\gamma^2+\Omega^2)} + \frac{\Omega^2(\gamma^2+\Omega^2)}{2\gamma^4}\right] e^{-2R}. \tag{49} $$

This optimized noise is shown in Fig. 4 for $e^{-2R} = 0.1$, along with the noise spectra for other optimized interferometer designs.

In previous discussions of this squeezed-input scheme [14,18,19], no attention has been paid to the practical problem of how to produce the necessary frequency dependence

$$ \lambda(\Omega) = -\Phi(\Omega) = -\mathrm{arccot}\, \frac{2\gamma^4}{\Omega^2(\gamma^2+\Omega^2)} \tag{50} $$

of the squeeze angle. In Sec. V C, we shall show that this $\lambda(\Omega)$ can be achieved by squeezing at a frequency-independent squeeze angle (using, e.g., a nonlinear crystal for which the squeeze angle will be essentially frequency-independent because the gravity-wave bandwidth, <1000 Hz, is so small compared to usual optical bandwidths) and then filtering through two Fabry-Perot cavities. This squeezing and filtering must be performed before injection into the interferometer's dark port; see Fig. 1 for a schematic diagram.

The signal and noise for this squeezed-input interferometer are depicted in Fig. 6.

FIG. 6. Noise ellipses for a squeezed-input interferometer. Left: Noise for squeezed vacuum that enters the interferometer's dark port. The field is squeezed at the angle $\lambda = -\Phi$. Right: Noise for the field that exits at the dark port along with the gravitational-wave signal. This output field results from the interferometer's ponderomotive squeezing of the input field [e.g., point P goes to point P' in accord with $b_1 = a_1$, $b_2 = a_2 - \mathcal{K} a_1$; Eqs. (16)]. If the input field had been vacuum as in a conventional interferometer (Fig. 5), then the output would have been squeezed in the manner of the dashed ellipse. The two squeezes (input and ponderomotive) result in the shaded ellipse, whose projection along the axis measured by the photodetector ($b_2$ axis) has been minimized by the choice of squeeze angle, $\lambda = -\Phi$.

We comment, in passing, on two other variants of a squeezed-input interferometer:

(i) If, for some reason, the filter cavities cannot be implemented successfully, one can still inject squeezed vacuum at the dark port with a frequency-independent phase that is optimized for the lowest point in the noise curve, $\Omega = \gamma$; i.e., (with the input power optimized to $I_0 = I_{SQL}$):

$$ \lambda = -\Phi(\gamma) = -\pi/4; \tag{51} $$

cf. Eq. (50). In this case the noise spectrum is

$$ S_h = \frac{h_{SQL}^2}{2}\left(\frac{1}{\mathcal{K}} + \mathcal{K}\right)\left[(\cosh R \cos\Phi - \sinh R \sin\Phi)^2 + (\cosh R \sin\Phi - \sinh R \cos\Phi)^2\right] \tag{52} $$

[Eq. (46), translated into gravitational-wave noise via Eq. (23)]. This noise spectrum is shown as a dashed curve in Fig. 4, for $e^{-2R} = 0.1$. The SQL is beat by the same factor $\mu = \sqrt{e^{-2R}} \simeq 0.32$ as in the case of a fully optimized squeezed-input interferometer, but the frequency band over which the SQL is beat is significantly smaller than in the optimized case, and the noise is worse than for a conventional interferometer outside that band.

(ii) Caves [17], in a paper that preceded Unruh's and formed a foundation for Unruh's ideas, proposed a squeezed-input interferometer with the squeeze angle set to $\lambda = \pi/2$ independent of frequency. In this case, Eq. (46), translated into gravitational-wave noise via Eq. (23), says that

$$ S_h = \frac{h_{SQL}^2}{2}\left[\frac{1}{\mathcal{K} e^{2R}} + \mathcal{K} e^{2R}\right]. \tag{53} $$

Since $\mathcal{K}$ is proportional to the input laser power $I_0$, Caves' interferometer produces the same noise spectral density as a conventional interferometer [Eq. (27)] but with an input power that is reduced by a factor $e^{-2R}$. This is a well-known result.
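The three squeeze-angle choices discussed in this subsection are all special cases of Eq. (46), and the reductions can be checked numerically. The sketch below is illustrative only: the helper names are hypothetical, noise is expressed in units of $h_{SQL}^2$, and `arccot` is taken on the branch $(0, \pi)$, which is an assumption about the convention used for $\Phi$.

```python
import math

def arccot(x):
    """Inverse cotangent on the branch (0, pi); assumed convention for Phi = arccot K."""
    return math.atan2(1.0, x)

def squeezed_input_noise(K, R, lam):
    """S_h / h_SQL^2 for squeeze factor R and squeeze angle lam, from Eq. (46)."""
    Phi = arccot(K)
    return 0.5 * (1.0 / K + K) * (
        math.cosh(2 * R) - math.cos(2 * (lam + Phi)) * math.sinh(2 * R)
    )

def optimal_noise(K, R):
    """Eq. (48): squeeze angle lam = -Phi."""
    return 0.5 * (1.0 / K + K) * math.exp(-2 * R)

def caves_noise(K, R):
    """Eq. (53): squeeze angle lam = pi/2."""
    return 0.5 * (1.0 / (K * math.exp(2 * R)) + K * math.exp(2 * R))

if __name__ == "__main__":
    K, R = 0.7, 1.15   # illustrative values; exp(-2R) is roughly 0.1
    print(math.isclose(squeezed_input_noise(K, R, -arccot(K)), optimal_noise(K, R)))
    print(math.isclose(squeezed_input_noise(K, R, math.pi / 2), caves_noise(K, R)))
```

Scanning `lam` at fixed `K` and `R` shows that no other angle does better than `lam = -arccot(K)`, which is the content of Eq. (47).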

C.  Variational-output  interferometer 

Vyatchanin, Matsko and Zubova [23-25] have devised a QND interferometer design based on (i) leaving the dark-port input field in its vacuum state, $|\mathrm{in}\rangle = |0_a\rangle$, and (ii) changing the output measurement from standard photodetection (measurement of $b_2$) to homodyne detection at an appropriate, frequency-dependent (FD) homodyne phase $\zeta(\Omega)$, i.e., measurement of

$$ b_\zeta = b_1 \cos\zeta + b_2 \sin\zeta. \tag{54} $$

In their explorations of this idea, Vyatchanin, Matsko and Zubova [23-25] did not identify any practical scheme for achieving such a FD homodyne measurement, so they approximated it by homodyne detection with a homodyne phase that depends on time rather than frequency, a technique that they call a "quantum variational measurement."

In Sec. V below, we show that the optimized FD homodyne measurement can, in fact, be achieved by filtering the interferometer output through two Fabry-Perot cavities and then performing standard, balanced homodyne detection at a frequency-independent homodyne phase; see Fig. 2 for a schematic diagram. We shall call such a scheme a variational-output interferometer. The word "variational" refers to (i) the fact that the measurement entails monitoring a frequency-varying quadrature of the output field, as well as (ii) the fact that the goal is to measure variations of the classical force acting on the interferometer's test mass (the original Vyatchanin-Matsko-Zubova motivation for the word).

FIG. 7. Noise ellipses for a variational-output interferometer. Left: Noise for the ordinary vacuum that enters the interferometer's dark port. Right: Noise for the field that exits at the dark port along with the gravitational-wave signal. These noise ellipses are the same as for a conventional interferometer, Fig. 5, but here the quantity measured is the quadrature amplitude $b_\Phi$ with frequency dependent phase $\Phi = \mathrm{arccot}\, \mathcal{K}$. It is informative to compare the measured phase $\Phi$ with the angle of ponderomotive squeeze $\phi = \frac{1}{2}\,\mathrm{arccot}(\mathcal{K}/2)$. They are related by $\tan\Phi = \frac{1}{2}\tan 2\phi = \tan\phi/(1 - \tan^2\phi)$, so $\Phi$ is always larger than $\phi$; but for large $\mathcal{K}$ (strong beating of the SQL), they become small and nearly equal.

The monitored FD amplitude $b_\zeta$ [Eq. (54)] can be expressed in terms of the interferometer's dark-port input amplitudes $a_1$, $a_2$ and the Fourier transform of the gravitational-wave field $h$ as

$$ b_\zeta = \sin\zeta \left( \sqrt{2\mathcal{K}}\, \frac{h}{h_{SQL}}\, e^{i\beta} + [a_2 + (\cot\zeta - \mathcal{K}) a_1]\, e^{2i\beta} \right); \tag{55} $$

cf. Eqs. (16) and (54). Correspondingly, the operator describing the Fourier transform of the interferometer's gravitational-wave noise is

$$ h_n(\Omega) = \frac{h_{SQL}}{\sqrt{2\mathcal{K}}}\, e^{i\beta} [a_2 + a_1 (\cot\zeta - \mathcal{K})]; \tag{56} $$

cf. Eq. (23).

The radiation-pressure-induced back action of the measurement on the interferometer's test masses is embodied in the $-\mathcal{K} a_1$ term of this equation; cf. Eq. (16) and subsequent discussion. It should be evident that by choosing

$$ \zeta = \Phi \equiv \mathrm{arccot}\, \mathcal{K}, \tag{57} $$

we can completely remove the back-action noise from the measured interferometer output; cf. Fig. 7. This optimal choice of the FD homodyne phase, together with the fact that the input state is vacuum, $|\mathrm{in}\rangle = |0_a\rangle$, leads to the gravitational-wave noise

$$ S_h = \frac{h_{SQL}^2}{2}\, \frac{1}{\mathcal{K}} = \frac{1}{I_0/I_{SQL}} \left(\frac{4\hbar}{mL^2\Omega^2}\right) \frac{\Omega^2(\gamma^2+\Omega^2)}{2\gamma^4}. \tag{58} $$

Cf. Eqs. (22) and (25).

This noise for an optimized variational-output interferometer is entirely due to shot noise of the measured light, and continues to improve $\propto 1/I_0$ even when the input light power $I_0$ exceeds $I_{SQL}$. Figure 4 shows this noise, along with the noise spectra for other optimized interferometer designs.

It is interesting that the optimal frequency-dependent homodyne phase $\Phi$ for this variational-output interferometer is the same, aside from sign, as the optimal frequency-dependent squeeze angle for the squeezed-input interferometer; cf. Eq. (47).
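The cancellation of back action at $\zeta = \Phi$ can be illustrated with a short numerical sketch. It rests on one step that is derived here rather than quoted: treating $a_1$ and $a_2$ in Eq. (56) as uncorrelated processes with unit spectral density [Eqs. (26)] gives $S_h = (h_{SQL}^2 / 2\mathcal{K}) [1 + (\cot\zeta - \mathcal{K})^2]$ for a general homodyne phase. The function names are hypothetical and noise is in units of $h_{SQL}^2$.

```python
import math

def back_action_coefficient(K, zeta):
    """Coefficient of a_1 inside the brackets of Eq. (56): cot(zeta) - K."""
    return 1.0 / math.tan(zeta) - K

def variational_noise(K, zeta):
    """S_h / h_SQL^2 for vacuum input and homodyne phase zeta (derived from Eqs. (56) and (26))."""
    c = back_action_coefficient(K, zeta)
    return (1.0 + c * c) / (2.0 * K)

if __name__ == "__main__":
    K = 3.0                               # illustrative coupling constant
    zeta_opt = math.atan2(1.0, K)         # Phi = arccot K, Eq. (57)
    print("a_1 coefficient at zeta = Phi:", back_action_coefficient(K, zeta_opt))
    print("noise at zeta = Phi:", variational_noise(K, zeta_opt), "vs 1/(2K) =", 1 / (2 * K))
    print("noise at zeta = pi/2:", variational_noise(K, math.pi / 2), "vs Eq. (27):", 0.5 * (1 / K + K))
```

Setting $\zeta = \pi/2$ corresponds to measuring $b_2$ and reproduces the conventional spectrum of Eq. (27), while $\zeta = \Phi$ leaves only the shot-noise term of Eq. (58).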


D.  Comparison  of  squeezed-input  and  variational-output 
interferometers 

The squeezed-input and variational-output interferometers described above are rather idealized, most especially because they assume perfect, lossless optics. When we relax that assumption in Sec. VI below, we shall see that, for realistic squeeze factors $e^{-2R}$ and losses, the two interferometers have essentially the same performance, but the variational-output interferometer requires ~10 times higher input power $I_0$. In this section we shall seek insight into the physics of these interferometers by comparing them in the idealized, lossless limit.

Various comparisons are possible. The noise curves in Fig. 4 illustrate one comparison: When the FD homodyne angle has been optimized, a lossless variational-output interferometer reduces shot noise below the SQL and completely removes back-action noise; by contrast, when the FD squeeze angle has been optimized, a squeezed-input interferometer reduces shot noise and reduces but does not remove back-action noise; cf. Eqs. (58) and (48).

In variational-output interferometers, after optimizing the FD homodyne angle, the experimenter has further control of just one input/output parameter: the laser intensity or equivalently $I_0/I_{SQL} = \mathcal{K}(\Omega = \gamma)$. When $I_0/I_{SQL}$ is increased, the shot noise decreases; independent of its value, the back-action noise has already been removed completely; cf. Eq. (58). By contrast, in squeezed-input interferometers, after optimizing the FD squeeze phase, the experimenter has control of two parameters: $I_0/I_{SQL}$, which moves the minimum of the noise curve back and forth in frequency but does not lower its minimum [17], and the squeeze factor $R$, which reduces the noise by $e^{-2R}$; cf. Eq. (48).

Present technology requires that $R$ be approximately constant over the LIGO frequency band. However, in the same spirit as our assumption that the FD homodyne phase can be optimized at all frequencies, it is instructive to ask what can be achieved with an unconstrained, frequency-dependent (FD) squeeze factor $R(\Omega)$, when coupled to an unconstrained FD squeeze angle $\lambda(\Omega)$.

One instructive choice is $\lambda(\Omega) = -\mathrm{arccot}\, \mathcal{K}$ as in our previous, optimized interferometer [Eq. (47)], and $e^{-2R(\Omega)} = 1/(1 + \mathcal{K}^2)$. In this case, since $(1/\mathcal{K} + \mathcal{K})/(1 + \mathcal{K}^2) = 1/\mathcal{K}$ in Eq. (48), the squeezed-input interferometer has precisely the same noise spectrum as the lossless variational-output interferometer

$$ S_h = \frac{h_{SQL}^2}{2\mathcal{K}}, \tag{59} $$

[Eq. (58)], and achieves it with precisely the same laser power.

FIG. 8. Noise ellipses for a squeezed-input interferometer whose input squeeze is inverse to the interferometer's ponderomotive squeeze ("IIS interferometer").


Another instructive choice is an input squeeze that is inverse to the interferometer's ponderomotive squeeze (a configuration we shall call "inversely input squeezed" or IIS): Let the dark-port input field before squeezing be described by annihilation operators $c_\pm$, so

$$ c_\pm |\mathrm{in}\rangle = 0, \tag{60} $$

i.e., the pre-squeeze field is vacuum. Then, denoting by $c_1, c_2$ the quadrature amplitudes of this pre-squeeze field, the IIS input squeezing is

$$ a_1 = c_1, \qquad a_2 = c_2 + \mathcal{K} c_1, \tag{61} $$

where $\mathcal{K}(\Omega)$ is the interferometer's frequency-dependent coupling constant (18). The interferometer's ponderomotively squeezed output noise is then

$$ \Delta b_1 = a_1 e^{2i\beta} = c_1 e^{2i\beta}, \qquad \Delta b_2 = (a_2 - \mathcal{K} a_1) e^{2i\beta} = c_2 e^{2i\beta} \tag{62} $$

[cf. Eq. (16)], i.e., the noise of the output light is that of the vacuum with a phase shift, but since the vacuum state is insensitive to phase, it is actually just the noise of the vacuum.
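Equations (61) and (62) amount to composing two linear maps on the quadrature pair, which the following sketch makes explicit. It is an illustration, not part of the original analysis: the overall phase $e^{2i\beta}$ is dropped, the pre-squeeze field is given unit, uncorrelated spectral densities as in Eqs. (26), and the function names and the sample value of $\mathcal{K}$ are assumptions.

```python
import numpy as np

def iis_input_squeeze(K):
    """Matrix taking pre-squeeze (c1, c2) to the dark-port input (a1, a2), Eq. (61)."""
    return np.array([[1.0, 0.0], [K, 1.0]])

def ponderomotive_map(K):
    """Matrix taking (a1, a2) to (Delta b1, Delta b2), Eq. (16), overall phase dropped."""
    return np.array([[1.0, 0.0], [-K, 1.0]])

K = 2.5                                              # illustrative coupling constant
total = ponderomotive_map(K) @ iis_input_squeeze(K)  # net map from (c1, c2) to (Delta b1, Delta b2)
input_cov = iis_input_squeeze(K) @ np.eye(2) @ iis_input_squeeze(K).T
output_cov = total @ np.eye(2) @ total.T

if __name__ == "__main__":
    print("net map:\n", total)
    print("input spectral densities:\n", input_cov)
    print("output spectral densities:\n", output_cov)
```

The net map is the identity, so the output spectral densities are those of vacuum, while the input matrix differs from Eq. (38) only in the sign of its off-diagonal entry.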

Figure 8 illustrates this: The IIS input light is squeezed in a manner that gets perfectly undone by the ponderomotive squeeze, so the output light has no squeeze at all. The fact that the input squeeze is inverse to the ponderomotive squeeze shows up in this diagram as an input noise ellipse that is the same as the output ellipse of the ponderomotively squeezed vacuum, Fig. 5, except for a reflection in the horizontal axis.

Because the output of the IIS interferometer is $b_2$ (ordinary photodetection) and the output light's state is the ordinary vacuum, its gravitational-wave noise is

$$ S_h = \frac{h_{SQL}^2}{2\mathcal{K}}, \tag{63} $$

cf. Eqs. (23), (22) and (26) (with $a_j$ replaced by $b_j$). Notice that this is identically the same noise spectral density as for our previous example [Eq. (59)] and as for a variational-output interferometer, and it is achieved in all three cases with the same light power.

The fact that our two squeezed-input examples produce the same noise spectrum using different squeeze angles and squeeze factors should not be surprising. The noise spectrum is a single function of $\Omega$ and it is being shaped jointly by the two squeeze functions $\lambda(\Omega)$ and $R(\Omega)$.

The fact that the IIS interferometer and the variational-output interferometer produce the same noise spectra results from a reciprocity between the IIS and the variational-output configurations: The IIS interferometer has its input squeezed at the angle $-\Phi = -\mathrm{arccot}\, \mathcal{K}$ and it has vacuum-noise output, whereas the variational-output interferometer has vacuum-noise input and is measured at the homodyne angle $+\Phi = +\mathrm{arccot}\, \mathcal{K}$.

Note that the IIS interferometer has a different input squeeze angle [$\lambda(\Omega) = -\frac{1}{2}\,\mathrm{arccot}(\mathcal{K}/2)$; cf. Eq. (31)] from that of the angle-optimized squeezed-input interferometer of Sec. IV B [$\lambda(\Omega) = -\mathrm{arccot}\, \mathcal{K}$; cf. Eq. (47)]. This difference shows clearly in the noise ellipses of Fig. 8 (the IIS interferometer) and Fig. 6 (the angle-optimized interferometer). Moreover, this difference implies that by optimizing the IIS interferometer's squeeze angle (changing it to $\lambda(\Omega) = -\mathrm{arccot}\, \mathcal{K}$), while keeping its squeeze factor unchanged [$R(\Omega) = \mathrm{arcsinh}(\mathcal{K}/2)$; cf. Eq. (31)], we can improve its noise performance slightly. The improvement is from Eq. (63) to

$$ S_h = \frac{h_{SQL}^2}{2\mathcal{K}} \left[ \frac{1 + \mathcal{K}^2}{1 + \frac{1}{2}\left(\mathcal{K}^2 + \mathcal{K}\sqrt{\mathcal{K}^2 + 4}\right)} \right] \tag{64} $$

[which can be derived by setting $\lambda(\Omega) = -\Phi = -\mathrm{arccot}\, \mathcal{K}$ and $R(\Omega) = \mathrm{arcsinh}(\mathcal{K}/2)$ in Eq. (46), or by inserting $R(\Omega) = \mathrm{arcsinh}(\mathcal{K}/2)$ into Eq. (48); note that (48) is valid for any angle-optimized, squeezed-input interferometer but not for the IIS interferometer]. The improvement factor in square brackets is quite modest; it lies between 0.889 and unity.

We reiterate, however, that the above comparison of interferometer designs is of pedagogical interest only. In the real world, the noise of a QND interferometer is strongly influenced by losses, which we consider in Sec. VI below.


E. Squeezed-variational interferometer

The squeezed-input and variational-output techniques are complementary. By combining them, one can beat the SQL more strongly than using either one alone. We call an interferometer that uses the two techniques simultaneously a squeezed-variational interferometer.

The dark-port input of such an interferometer is squeezed by the maximum achievable squeeze factor $R$ at a (possibly frequency dependent) squeeze angle $\lambda(\Omega)$, so

$$ |\mathrm{in}\rangle = S(R,\lambda) |0_a\rangle. \tag{65} $$

The dark-port output is subjected to FD homodyne detection with (possibly frequency dependent) homodyne angle $\zeta(\Omega)$; i.e., the measured quantity is the same output quadrature as for a variational-output interferometer, $b_\zeta$ [Eq. (55)], so the gravitational-wave noise operator is also the same

$$ h_n(\Omega) = \frac{h_{SQL}}{\sqrt{2\mathcal{K}}}\, e^{i\beta} [a_2 + a_1 (\cot\zeta - \mathcal{K})] \tag{66} $$

[Eq. (56)].

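As for a squeezed-input interferometer, the gravitational-wave noise is proportional to

$$ \langle \mathrm{in}| h_n h_n'^\dagger |\mathrm{in}\rangle = \langle 0_a| h_{ns} h_{ns}'^\dagger |0_a\rangle \tag{67} $$

[Eq. (22)], where $h_{ns}$ is the squeezed gravitational-wave noise operator

$$ h_{ns} = S^\dagger(R,\lambda)\, h_n\, S(R,\lambda). \tag{68} $$

By inserting expression (66) for $h_n$ into Eq. (68) and invoking the action of the squeeze operator on $a_1$ and $a_2$ [Eq. (A8)], we obtain

$$ h_{ns} = -\frac{h_{SQL}}{\sqrt{2\mathcal{K}}} \sqrt{1 + \tilde{\mathcal{K}}^2}\, e^{i\beta} \Big( a_1 \{\cosh R \cos\tilde\Phi - \sinh R \cos[\tilde\Phi - 2(\tilde\Phi + \lambda)]\} - a_2 \{\cosh R \sin\tilde\Phi - \sinh R \sin[\tilde\Phi - 2(\tilde\Phi + \lambda)]\} \Big), \tag{69} $$

where

$$ \tilde{\mathcal{K}} \equiv \mathcal{K} - \cot\zeta, \qquad \tilde\Phi \equiv \mathrm{arccot}\, \tilde{\mathcal{K}}. \tag{70} $$

As for a squeezed-input interferometer [see passage following Eq. (45)], we can read the gravitational-wave spectral density off of Eq. (69) by regarding $a_1$ and $a_2$ as random processes with unit spectral densities and vanishing cross spectral density. The result is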


$$ S_h = \frac{h_{SQL}^2}{2\mathcal{K}} (1 + \tilde{\mathcal{K}}^2) \left\{ e^{-2R} + \sinh 2R\, [1 - \cos 2(\tilde\Phi + \lambda)] \right\}. \tag{71} $$

This noise is minimized by setting the input squeeze angle $\lambda$ and output homodyne phase $\zeta$ to

$$ \lambda = \pi/2, \qquad \zeta = \Phi \equiv \mathrm{arccot}\, \mathcal{K}, \tag{72} $$

which produces $\tilde{\mathcal{K}} = 0$ and $\lambda = \tilde\Phi = \pi/2$, so

$$ S_h = \frac{h_{SQL}^2}{2\mathcal{K}}\, e^{-2R} = \frac{e^{-2R}}{I_0/I_{SQL}} \left(\frac{4\hbar}{mL^2\Omega^2}\right) \frac{\Omega^2(\gamma^2+\Omega^2)}{2\gamma^4}; \tag{73} $$

see Fig. 4.

Equation (72) says that, to optimize the (lossless) squeezed-variational interferometer, one should squeeze the dark-port input field at the frequency-independent squeeze angle $\lambda = \pi/2$ (which ends up squeezing the interferometer's shot noise), and measure the output field at the same FD homodyne phase $\zeta = \Phi$ as for a variational-output interferometer; see Fig. 9. Doing so produces an output, Eq. (73), in which the radiation-pressure-induced back-action noise has been completely removed, and the shot noise has been reduced by the input squeeze factor $e^{-2R}$.

Because the optimal input squeeze angle is frequency independent, the squeezed-variational interferometer needs no filter cavities on the input. However, they are needed on the output to enable FD homodyne detection; see Fig. 2 for a schematic diagram.

FIG. 9. Noise ellipses for a squeezed-variational interferometer. Left: Noise for the squeezed vacuum that enters the interferometer's dark port. Right: Noise for the field that exits at the dark port along with the gravitational-wave signal.

V. FD HOMODYNE DETECTION AND SQUEEZING

Each of the QND schemes discussed above requires homodyne detection with a frequency-dependent phase (FD homodyne detection) and/or input squeezed vacuum with a frequency-dependent squeeze angle (FD squeezed vacuum). In this section we sketch how such FD homodyne detection and squeezing can be achieved.

A. General method for FD homodyne detection

The goal of FD homodyne detection is to measure the electric-field quadrature


for  which  the  quadrature  amplitude  is 

bc=bx  cos  £+Z?2  sin  £,  f  =  £(D);  (75) 

cf.  Eqs.  (10)  and  (54).  If  £  were  frequency  independent,  the 
measurement  could  be  made  by  conventional  balanced  ho¬ 
modyne  detection,  with  homodyne  phase  £.  In  this  subsec¬ 
tion  we  shall  show  that,  when  £  depends  on  frequency,  the 
measurement  can  he  achieved  in  two  steps:  first  send  the 
light  through  an  appropriate  filter  (assumed  to  he  lossless), 
and  then  perform  conventional  balanced  homodyne  detec¬ 
tion. 

The filter puts onto the light a phase shift $\alpha$ that depends on frequency. Let the phase shift be $\alpha_+$ for light frequency $\omega_0+\Omega$, and $\alpha_-$ for $\omega_0-\Omega$. The input to the filter has amplitudes (annihilation operators) $b_\pm$ at these two sidebands, and the filter output has amplitudes (denoted by a tilde)

$$\tilde b_\pm=b_\pm e^{i\alpha_\pm}. \tag{76}$$

The corresponding quadrature amplitudes are

$$b_1=\frac{b_++b_-^\dagger}{\sqrt2},\qquad b_2=\frac{b_+-b_-^\dagger}{\sqrt2\,i} \tag{77}$$

at the input [Eqs. (6)], and the analogous expression with tildes at the output. Combining Eqs. (77) with and without tildes, and Eq. (76), we obtain for the output quadrature amplitudes in terms of the input

$$\tilde b_1=e^{i\alpha_m}(b_1\cos\alpha_p-b_2\sin\alpha_p),\qquad \tilde b_2=e^{i\alpha_m}(b_2\cos\alpha_p+b_1\sin\alpha_p). \tag{78}$$

Here

$$\alpha_m\equiv\tfrac12(\alpha_+-\alpha_-),\qquad \alpha_p\equiv\tfrac12(\alpha_++\alpha_-). \tag{79}$$

The light with the output amplitudes $\tilde b_1$, $\tilde b_2$ is then subjected to conventional balanced homodyne detection with frequency-independent homodyne angle $\theta$, which measures an electric-field quadrature with amplitude

$$\tilde b_\theta=\tilde b_1\cos\theta+\tilde b_2\sin\theta=e^{i\alpha_m}[b_1\cos(\theta-\alpha_p)+b_2\sin(\theta-\alpha_p)]. \tag{80}$$

If we adjust the filter and the constant homodyne phase so that

$$\theta-\alpha_p\equiv\theta-\tfrac12(\alpha_++\alpha_-)=\zeta(\Omega), \tag{81}$$

then, aside from the frequency-dependent phase shift $\alpha_m$, the output quadrature amplitude will be equal to our desired FD amplitude:

$$\tilde b_\theta=e^{i\alpha_m}b_\zeta. \tag{82}$$

The phase shift $\alpha_m(\Omega)$ is actually unimportant; it can be removed from the signal in the data analysis [as can be the phase shift $\beta(\Omega)$ produced by the interferometer's arm cavities].

To recapitulate: FD homodyne detection with homodyne phase $\zeta(\Omega)$ can be achieved by filtering and conventional homodyne detection, with the filter's phase shifts $\alpha_\pm$ (at $\omega=\omega_0\pm\Omega$) and the constant homodyne phase $\theta$ adjusted to satisfy Eq. (81).
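The filter-then-homodyne identity of Eqs. (76)-(82) can be checked numerically. The sketch below is only an illustration: it replaces the sideband operators $b_+$ and $b_-^\dagger$ by arbitrary complex numbers (legitimate here because every relation involved is linear), and the function names and sample values are assumptions made for the example, not part of the paper.

```python
import cmath
import math


def quadratures(b_plus, b_minus_dag):
    """Quadrature amplitudes b_1, b_2 of Eq. (77) from sideband amplitudes."""
    b1 = (b_plus + b_minus_dag) / math.sqrt(2)
    b2 = (b_plus - b_minus_dag) / (math.sqrt(2) * 1j)
    return b1, b2


def filter_then_homodyne(b_plus, b_minus_dag, alpha_plus, alpha_minus, theta):
    """Apply the sideband phase shifts of Eq. (76), then read out quadrature theta."""
    bt_plus = b_plus * cmath.exp(1j * alpha_plus)
    # The dagger of (b_- e^{i alpha_-}) carries the conjugate phase factor.
    bt_minus_dag = b_minus_dag * cmath.exp(-1j * alpha_minus)
    bt1, bt2 = quadratures(bt_plus, bt_minus_dag)
    return bt1 * math.cos(theta) + bt2 * math.sin(theta)


def fd_quadrature(b_plus, b_minus_dag, zeta):
    """Desired quadrature b_zeta of Eq. (75) evaluated on the unfiltered field."""
    b1, b2 = quadratures(b_plus, b_minus_dag)
    return b1 * math.cos(zeta) + b2 * math.sin(zeta)


# Arbitrary sample values (assumptions for illustration only).
b_plus, b_minus_dag = 0.3 + 0.7j, -1.1 + 0.2j
alpha_plus, alpha_minus, theta = 0.9, -0.4, math.pi / 2

alpha_m = 0.5 * (alpha_plus - alpha_minus)   # Eq. (79)
alpha_p = 0.5 * (alpha_plus + alpha_minus)   # Eq. (79)
zeta = theta - alpha_p                       # Eq. (81)

measured = filter_then_homodyne(b_plus, b_minus_dag, alpha_plus, alpha_minus, theta)
expected = cmath.exp(1j * alpha_m) * fd_quadrature(b_plus, b_minus_dag, zeta)
```

With these definitions `measured` and `expected` agree to rounding error, which is the content of Eq. (82).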


B. Realization of the filter

The desired FD homodyne phase is

$$\zeta=\Phi(\Omega)\equiv\operatorname{arccot}\mathcal K=\operatorname{arccot}\left[\frac{\Lambda^4}{\Omega^2(\gamma^2+\Omega^2)}\right]=\arctan\left[\frac{\Omega^2(\gamma^2+\Omega^2)}{\Lambda^4}\right], \tag{83}$$

where

$$\Lambda^4=(I_0/I_{SQL})\,2\gamma^4 \tag{84}$$

[cf. Eqs. (18) and (45)]. Recall that $\gamma\simeq2\pi\times100$ Hz is the optimal frequency of operation of the interferometer, and to beat the SQL by a moderate amount will require $I_0/I_{SQL}\sim10$, so $\Lambda^4\sim20\gamma^4$, i.e., $\Lambda\sim2\gamma$.

In Appendix C we show that this desired FD phase can be achieved by filtering the light with two successive lossless Fabry-Perot filter cavities, followed by conventional homodyne detection at homodyne angle

$$\theta=\pi/2 \tag{85}$$

[i.e., homodyne measurement of $\tilde b_2$ at the filter output; cf. Eq. (80)].$^7$ The two filter cavities (denoted I and II) produce phase shifts $\alpha_{I\pm}$ and $\alpha_{II\pm}$ on the $\omega_0\pm\Omega$ side bands, so upon emerging from the second cavity, the net phase shifts are

$$\alpha_\pm=\alpha_{I\pm}+\alpha_{II\pm}. \tag{86}$$

Each cavity ($J$ = I or II) is characterized by two parameters: its decay rate (bandwidth) $2\delta_J$, and its fractional resonant-frequency offset from the light's carrier frequency $\omega_0$,

$$\xi_J=\frac{\omega_0-\omega_{{\rm res}\,J}}{\delta_J}. \tag{87}$$

Here $\omega_{{\rm res}\,J}$ is the resonant frequency of cavity $J$. In terms of these parameters, the phase shifts produced in the $\omega_0\pm\Omega$ side bands by cavity $J$ are

$$\alpha_{J\pm}=2\arctan(\xi_J\pm\Omega/\delta_J). \tag{88}$$

The filters' parameters must be adjusted so that the net phase shift (86), together with the final homodyne angle $\theta=\pi/2$, produce the desired FD phase, Eqs. (81) and (83).
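The matching condition just stated can be explored numerically. The following sketch is not the closed-form solution of Appendix C; it is an independent route, written under the assumption that each cavity contributes $\alpha_{J\pm}=2\arctan(\xi_J\pm\Omega/\delta_J)$. Under that assumption the product $\prod_{J,\pm}[1+i(\xi_J\pm\Omega/\delta_J)]$ must be proportional to $\Omega^4+\gamma^2\Omega^2+i\Lambda^4$, so $-\delta_J^2(1+i\xi_J)^2$ are the two roots of $w^2+\gamma^2w+i\Lambda^4=0$. The function names are hypothetical.

```python
import cmath
import math


def filter_parameters(lam, gamma):
    """Return [(xi_I, delta_I), (xi_II, delta_II)] for theta = pi/2.

    Assumes alpha_J(+/-) = 2*arctan(xi_J +/- Omega/delta_J); solves
    w^2 + gamma^2 w + i*lam^4 = 0 and sets delta_J*(1 + i*xi_J) = sqrt(-w).
    """
    disc = cmath.sqrt(gamma**4 - 4j * lam**4)
    roots = [(-gamma**2 + disc) / 2, (-gamma**2 - disc) / 2]
    params = []
    for w in roots:
        s = cmath.sqrt(-w)  # principal root, so the half bandwidth Re(s) > 0
        params.append((s.imag / s.real, s.real))
    return params


def achieved_phase(omega, params, theta=math.pi / 2):
    """Homodyne phase theta - (alpha_+ + alpha_-)/2 of Eq. (81)."""
    half_sum = sum(
        math.atan(xi + sign * omega / delta)
        for xi, delta in params
        for sign in (+1.0, -1.0)
    )
    return theta - half_sum


def desired_tan_phase(omega, lam, gamma):
    """tan(Phi) from Eq. (83)."""
    return omega**2 * (gamma**2 + omega**2) / lam**4


gamma, lam = 1.0, 2.0  # units of gamma; lam ~ 2*gamma as estimated in the text
params = filter_parameters(lam, gamma)
```

For $\Lambda=2\gamma$ this gives offsets near $2.14$ and $-0.36$ and half bandwidths near $0.81\gamma$ and $1.96\gamma$, and `math.tan(achieved_phase(omega, params))` reproduces `desired_tan_phase(omega, lam, gamma)` at any sideband frequency tried.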

In Appendix C we derive the following values for the filter parameters $\xi_I$, $\delta_I$, $\xi_{II}$, and $\delta_{II}$ as functions of the parameters $\Lambda$ and $\gamma$ that appear in the desired FD homodyne phase. Define the following four functions of $\Lambda$ and $\gamma$:

$$P\equiv\frac{4\Lambda^4}{\gamma^4},\qquad Q\equiv1+\sqrt{1+P^2}, \tag{89a}$$


$^7$The fact that only two cavities are needed to produce the desired FD homodyne phase (83) is a result of the simple quadratic form of $\tan\Phi(\Omega^2)$. If the desired phase were significantly more complicated, a larger number of filter cavities would be needed; cf. Eq. (C3) and associated analysis. It would be interesting to explore what range of FD homodyne phases can be achieved, with what accuracy, using what number of cavities.




KIMBLE,  LEVIN,  MATSKO,  THORNE,  AND  VYATCHANIN 


PHYSICAL  REVIEW  D  65  022002 


FIG. 10. The parameters characterizing the two Fabry-Perot cavities that are used, together with conventional homodyne detection at phase $\theta=\pi/2$, to produce FD homodyne detection at the desired frequency-dependent phase (83). The quantities $\xi_I$ and $\xi_{II}$ are the filters' fractional frequency offsets from the light's carrier frequency (87); $\delta_I/\gamma$ and $\delta_{II}/\gamma$ are the filters' half bandwidths in units of the half bandwidth of the interferometer's identical arm cavities. The functional forms of these parameters are Eqs. (89).


Then  in  terms  of  these  functions,  the  filter  parameters  are 

(89c) 


V1+( 2T0* 


fn= 

*i= 

y 

Sr= 

y 


l 


2A_ 


-V  1  + 


1 


(2  A_) 


2’ 


s&Ve’ 


8(-&  )Ve' 


(89d) 

(89e) 

(89f) 


Note that, when the cavity half bandwidths $\delta_J$ are expressed in terms of the half bandwidth $\gamma$ of the interferometer's arm cavities, as in Eqs. (89e) and (89f), then the filter parameters depend on only one characteristic of the desired FD homodyne phase: the quantity $(\Lambda/\gamma)^4=2I_0/I_{SQL}$. Figure 10 depicts the filter parameters as functions of this quantity.

As Fig. 10 shows, the half bandwidths of the two filter cavities are within a factor $\sim2$ of that of the interferometer's arm cavities. This is so for the entire range of laser powers, $I_0/I_{SQL}$, that are likely to be used in QND interferometers, at least in the early years (e.g., LIGO-III; ca. 2008-2010). Moreover, the filter cavities' fractional frequency offsets $\xi_J$ are of order unity ($-0.5<\xi_J\lesssim2$). Thus, the desired properties of the filter cavities are not much different from those of the interferometer's arm cavities.

In Sec. VI below, we shall see that the most serious limitation on the sensitivities of variational-output and squeezed-variational interferometers is optical loss in the filter cavities. To minimize losses, the cavities should be very long (so the cavities' stored light encounters the mirrors a minimum number of times). This suggests placing the filter cavities in the interferometer's 4-km-long arms, alongside the interferometer's arm cavities.


C. Squeezing with frequency-dependent squeeze angle

Just as the variational-output and squeezed-variational interferometers require homodyne detection at a FD phase, so a squeezed-input interferometer requires squeezing at a FD angle $\lambda(\Omega)$.

The nonlinear-optics techniques currently used for squeezing will produce a squeeze angle that is nearly constant over the very narrow frequency band of gravitational-wave interferometers, $|\omega-\omega_0|\lesssim$ (a few) $\times10^{-12}\,\omega_0$. What we need is a way to change the squeeze angle from its constant nonlinear-optics-induced value to the desired frequency-dependent value, $\lambda=-\Phi(\Omega)$ [Eq. (50)].

Just as FD homodyne detection can be achieved by sending the light field through appropriate filters followed by a frequency-independent homodyne device, so also FD squeezing can be achieved by squeezing the input field in the standard frequency-independent way, and then sending it through appropriate filters. Moreover, since the necessary squeeze angle (50) has the same frequency dependence $-\Phi(\Omega)$ as the homodyne phase (57) and (18) (aside from sign and the value of a multiplicative constant in $\mathcal K$), the filters needed in FD squeezing are nearly the same as those needed in FD homodyne detection: The filtering can be achieved by sending the squeezed input field through two Fabry-Perot cavities before injecting it into the interferometer, and the cavity parameters are given by Eqs. (89a)-(89f), with certain sign changes and with $P=8$:

[65  IQ 

2-1+ Vy’  A±  =  -<2±y-,  (90a) 


1  (  l 

^I_  2A+  V1+(2 A  +  )2’ 

1  _  /l  ~ 

2AZ  +  V  1  +  (2 A_)2' 


* 
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(-&  )Ve’ 


4=  /  i 
y  *  fWcT 


(90b) 

(90c) 

(90d) 

(90e) 


The details of the calculations are essentially the same as Appendix C, but with Eq. (C1) changed into the following expression for the initial frequency-independent squeeze angle $\theta$ and the cavities' frequency-dependent phase shifts $\alpha_{J\pm}$:
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tan  <F  ( O )  =  — 


n2(r2+n2) 

2/ 

ai+  +  <*/-  +  an+  +  «//- 


=  tan  I  0- 


(91) 


VI.  INFLUENCE  OF  OPTICAL  LOSSES  ON  QND 
INTERFEROMETERS 

A.  The  role  of  losses 

It is well known that, when one is working with squeezed light, any source of optical loss (whether fundamentally irreversible or not) can debilitate the light's squeezed state. This is because, wherever the squeezed light can leave one's optical system, vacuum field can (and must) enter by the inverse route; and the entering vacuum field will generally be unsqueezed [31].

All  of  the  QND  interferometers  discussed  in  this  paper 
rely  on  squeezed-light  correlations  in  order  to  beat  the 
SQL — with  the  squeezing  always  produced  ponderomotively 
inside  the  interferometer  and,  in  some  designs,  also  present 
in  the  dark-port  input  field.  Thus,  optical  loss  is  a  serious 
issue  for  all  the  QND  interferometers. 

In  this  section  we  shall  study  the  influence  of  optical 
losses  on  the  optimized  sensitivities  of  our  three  types  of 
QND  interferometers. 

B.  Sources  of  optical  loss 

The sources of optical loss in our interferometers are the following:

(i) For light inside the interferometer's arm cavities and inside the Fabry-Perot filter cavities: scattering and absorption on the mirrors and finite transmissivity through the end mirrors. We shall discuss these quantitatively at the end of the present subsection. (In addition, wave front errors and birefringence produced in the arm cavities and filters, e.g., via power-dependent changes in the shapes and optical properties of the mirrors, will produce mode mismatching and thence losses in subsequent elements of the output optical train.)

(ii) For squeezed vacuum being injected into the interferometer: fractional photon losses $\epsilon_{\rm circ}$ in the circulator$^8$ used to do the injection, in the beam splitter $\epsilon_{\rm bs}$, and in mode matching into the interferometer $\epsilon_{\rm mm}$.

(iii) For the signal light traveling out of the interferometer: In addition to losses in the arm cavities and filter cavities, also fractional photon losses in the beam splitter $\epsilon_{\rm bs}$, in the circulator $\epsilon_{\rm circ}$, in mode matching into each of the filter cavities $\epsilon_{\rm mm}$, in mode matching with the local-oscillator light used in the homodyne detection $\epsilon_{\rm lo}$, and in the photodiode inefficiency $\epsilon_{\rm pd}$.

$^8$The circulator is a four-port optical device that separates spatially the injected input and the returning output from the interferometer; see Fig. 1. It can be implemented via a Faraday rotator in conjunction with two linear polarizers.

It is essential to pursue research and development with the aim of driving these fractional photon losses down to

$$\epsilon_{\rm circ}\sim\epsilon_{\rm bs}\sim\epsilon_{\rm mm}\sim\epsilon_{\rm lo}\sim\epsilon_{\rm pd}\sim0.001. \tag{92}$$

These loss levels are certainly daunting. However, it is well to keep in mind that attaining the absolute lowest loss levels will likely be an essential component of any advanced interferometer that attempts to challenge and surpass the SQL. In the current case, discussions with Stan Whitcomb and the laboratory experience of one of the authors (H.J.K.) lead us to suggest that it may be technically plausible to achieve the levels of Eq. (92) in the LIGO-III time frame, though a vigorous research effort will be needed to determine the actual feasibility.

The arm cavities are a dangerous source of losses because the light bounces back and forth in them so many times. We denote by $\mathcal L$ the probability that a photon in an arm cavity gets lost during one round trip through the cavity, due to scattering and absorption in each of the two mirrors and transmission through the end mirror. With much research and development by the LIGO-III time frame this loss coefficient per round trip may be as low as

$$\mathcal L\sim20\times10^{-6}. \tag{93}$$

A fraction

$$\epsilon\equiv\frac{2\mathcal L}{T}=\frac{\mathcal L}{2\gamma L/c}\sim0.0012 \tag{94}$$

of the carrier photons that impinge on each arm cavity gets lost in the cavity [cf. Eq. (B25) on resonance so $\mathcal E=\epsilon$]. (Note the absence of any subscript on this particular $\epsilon$.) For sideband light the net fractional loss [denoted $\mathcal E(\Omega)$; Eq. (100) below] is also of order $\epsilon$.

Each filter cavity, $J$ = I or II, has an analogous loss coefficient $\mathcal L_J\sim\mathcal L$ and fractional loss of resonant photons

$$\epsilon_J=\frac{2\mathcal L_J}{T_J}=\frac{\mathcal L_J}{2\delta_JL_J/c}. \tag{95}$$

Because (as we shall see) the filter cavities' losses place severe limits on the interferometer sensitivity, we shall minimize their net fractional loss in our numerical estimates by making the filter cavities as long as possible: $L_J=L=4$ km. Then the ratio of Eqs. (95) and (94) gives

$$\epsilon_J=\epsilon(\gamma/\delta_J)\sim(0.5\ {\rm to}\ 2)\,\epsilon. \tag{96}$$
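The numbers in Eqs. (93)-(96) are easy to reproduce. The short sketch below evaluates the resonant fractional loss $\mathcal L/(2\delta L/c)$ for the quoted values; the function name and the choice of an SI value for $c$ are assumptions made for the illustration.

```python
import math

C_LIGHT = 299_792_458.0  # speed of light in m/s


def resonant_fractional_loss(round_trip_loss, half_bandwidth, length):
    """Fractional loss of resonant photons, L / (2 * delta * L_cav / c).

    round_trip_loss : loss probability per round trip (dimensionless)
    half_bandwidth  : cavity half bandwidth in rad/s (gamma or delta_J)
    length          : cavity length in metres
    """
    return round_trip_loss / (2.0 * half_bandwidth * length / C_LIGHT)


gamma = 2.0 * math.pi * 100.0  # arm-cavity half bandwidth, rad/s
eps_arm = resonant_fractional_loss(20e-6, gamma, 4000.0)

# A 4-km filter cavity whose half bandwidth is half (or twice) gamma:
eps_narrow = resonant_fractional_loss(20e-6, 0.5 * gamma, 4000.0)
eps_wide = resonant_fractional_loss(20e-6, 2.0 * gamma, 4000.0)
```

Here `eps_arm` comes out close to 0.0012, and the two filter-cavity cases bracket it at twice and half that value, in line with Eq. (96).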


C.  Input-output  relation  for  lossy  interferometer 

We show in Appendix B that, accurate to first order in the arm-cavity losses (and ignoring beam-splitter losses which we shall deal with separately below), the relation between the input to the interferometer's beam splitter (field amplitudes $a_j$) and the output from the beam splitter (field amplitudes $b_j$) takes the following form:

$$b_1=\Delta b_1,\qquad b_2=\Delta b_2+\sqrt{2\mathcal K_*}\,\frac{h}{h_{SQL}}\,e^{i\beta_*} \tag{97}$$

[cf. the last sentence of Appendix B; also the lossless input-output relation (16) and Fig. 3]. Here, accurate to first order in $\epsilon$,

$$\beta_*\equiv\arctan\left(\frac{\Omega/\gamma}{1+\epsilon/2}\right) \tag{98}$$

is the loss-modified$^9$ phase $\beta$ [Eq. (17)], and the coupling coefficient is reduced slightly by the losses:$^{10}$

$$\mathcal K_*\equiv\frac{2(I_0/I_{SQL})\gamma^4}{\Omega^2[\gamma^2(1+\epsilon/2)^2+\Omega^2]} \tag{99}$$

[cf. Eq. (18)], where

$$\mathcal E=\frac{2\gamma^2}{\gamma^2+\Omega^2}\,\epsilon=\frac{2\epsilon}{1+(\Omega/\gamma)^2} \tag{100}$$

is the net fractional loss of sideband photons in the arm cavities [cf. Eq. (B25)]. Accurate to first order in the losses, the output quadrature noise operators in Eq. (97) have the form

$$\Delta b_1=a_1e^{2i\beta_*}\left(1-\tfrac12\mathcal E\right)+\sqrt{\mathcal E}\,e^{i\beta_*}n_1,$$

$$\Delta b_2=a_2e^{2i\beta_*}\left(1-\tfrac12\mathcal E\right)+\sqrt{\mathcal E}\,e^{i\beta_*}n_2-\mathcal K_*\left(a_1+\sqrt{\epsilon/2}\,n_1\right)e^{2i\beta_*} \tag{101}$$

[cf. last sentence of Appendix B and cf. Eq. (16)]. Here $n_1$ and $n_2$ are the net quadrature field amplitudes that impinge on the interferometer's arm cavities at their various sites of optical loss. We shall call $n_j$ the quadrature amplitudes of the arm cavities' loss-noise field. They are complete analogs of the input and output fields' quadrature amplitudes $a_j$ and $b_j$: they are related to the loss-noise field's annihilation and creation operators $n_\pm$ and $n_\pm^\dagger$ in the standard way [analog of Eqs. (6)], they have the standard commutation relations [analog of Eqs. (7)], and they commute with the dark-port input field amplitudes $a_j$.

Equations (101) have a simple physical interpretation. The dark-port input field $a_j$ at frequency $\omega_0\pm\Omega$ gets attenuated by a fractional amount $\mathcal E/2$ while in the interferometer (corresponding to a photon-number fractional loss $\mathcal E$), and the lost field gets replaced, in the output light, by a small bit of loss-noise field $\sqrt{\mathcal E}\,n_j$. The phase shift $\beta$ that the interferometer cavities put onto the loss-noise field is half that put onto the dark-port input field because of the different routes by which the $a_j$ and $n_j$ get into the arm cavities.

The radiation-pressure back-action force on the test mass is produced by a beating of the laser's carrier light against the in-phase quadrature of the inside-cavity noise field $a_1+\sqrt{\epsilon/2}\,n_1$. Thus, it is $a_1+\sqrt{\epsilon/2}\,n_1$ that appears in the output light's back-action noise (last term of $\Delta b_2$).

$^9$The loss modification, i.e., the difference between $\beta_*$ and $\beta$, turns out to influence the gravitational-wave noise only at second order in $\epsilon$ and thus is unimportant; see footnote 12 below.

$^{10}$As is discussed in footnote 16, in Eq. (99) for $\mathcal K_*$, strictly speaking, $I_0$ is not the input power to the interferometer, but rather is the input power reduced by the losses that occur in the input optics, beam splitter, and arm cavities. We ignore this delicacy since its only effect in our final formulas is a slight renormalization of $I_0$.

FIG. 11. The output light $b$ is sent through a lossy output optical train, including a beam splitter, circulator, cavity filters I and II, a mixer with local oscillator light and a photodiode. The result (aside from an unimportant phase shift $\alpha_m$) is the desired measured quantity $\tilde b_\zeta$. This actual process, sketched on the left side of the equality sign, is mathematically equivalent to the idealized process sketched on the right side: The cavities' loss effects are introduced first, producing $\tilde b$, which is then sent through an idealized, lossless optical train including the filters.


D. Noise from losses in the output optical train and the homodyne filters

The output quadrature operators $b_j$ get fed through an output optical train including the beam splitter, circulator (if present), filter cavities (if present in the output as opposed to the input), local-oscillator mixer, and photodiode. Losses in all these elements will modify the $\Delta b_j$. In analyzing these modifications, we shall not assume, initially, that the FD homodyne phase is $\Phi(\Omega)$; rather, we shall give it an arbitrary value $\zeta(\Omega)$ (as we did in our lossless analysis, Sec. IV C), and shall optimize $\zeta$ at the end. The optimal $\zeta$ will turn out to be affected negligibly by the losses; i.e., it will still be $\Phi(\Omega)=\operatorname{arccot}\mathcal K$.

By analogy with the effects of arm-cavity losses [factors $\mathcal E$ in Eqs. (101)], the effects of the optical-train losses on the output fields $b_j$ can be computed in the manner sketched in Fig. 11: The process of sending the quadrature amplitudes $b_j$ through the optical train is equivalent to (i) sending $b_j$ through a "loss device" to obtain loss-modified fields $\tilde b_j$, and then (ii) sending $\tilde b_j$ through the lossless optical train.$^{11}$

$^{11}$Yanbei Chen [32] has shown that it does not matter whether the losses are placed before or after the lossless train.

Because the filter cavities have frequency offsets $\xi_J$ that make their losses different in the upper and lower side bands, the influence of the losses is most simply expressed in terms of the annihilation operators for the side bands $\tilde b_\pm$, rather than in terms of the quadrature amplitudes $\tilde b_j$. In terms of $\tilde b_\pm$, the equation describing the influence of losses is identical to that in the case of the arm cavities with fixed mirrors, Eqs. (101) with $\mathcal K=0$:
$$\tilde b_\pm=\Big(1-\tfrac12\sum_J\mathcal E_{J\pm}\Big)b_\pm+\sum_J\sqrt{\mathcal E_{J\pm}}\,n_{J\pm}. \tag{102}$$

Here (i) the sum is over the two filter cavities $J$ = I and II (which must be treated specially) and over the rest of the output optical train, denoted $J$ = OT; (ii) $\mathcal E_{J\pm}$ is the net fractional loss of photons in element $J$; (iii) $n_{J\pm}$ is the annihilation operator for the loss-noise field introduced by element $J$; (iv) for each filter I or II, the analog of the phase factor $2\beta$ of Eq. (101) gets put onto the light in the subsequent lossless filter and thus is absent here; and (v) we have absorbed a phase factor into the definition of $n_{J\pm}$.

The net fractional photon loss in a filter cavity must be identical to that in an arm cavity, Eq. (100), if written in terms of the cavity's half bandwidth ($\gamma$ for arm cavity, $\delta_J$ for filter cavity) and the difference between the field's frequency $\omega=\omega_0\pm\Omega$ and the cavity's resonant frequency $\omega_{\rm res}$ ($\omega-\omega_{\rm res}=\pm\Omega$ for arm cavity; $\omega-\omega_{\rm res}=\xi_J\delta_J\pm\Omega$ for filter cavity). Therefore, Eq. (100) implies that

$$\mathcal E_{J\pm}=\frac{2\epsilon_J}{1+(\xi_J\pm\Omega/\delta_J)^2}\quad\text{for }J={\rm I, II}. \tag{103}$$

For the remainder of the optical train, the net fractional photon loss $\epsilon_{OT}$ is the sum of the contributions from the various elements and is independent of frequency:

$$\mathcal E_{OT\pm}=\epsilon_{OT}=\epsilon_{\rm bs}+\epsilon_{\rm circ}+2\epsilon_{\rm mm}+\epsilon_{\rm lo}+\epsilon_{\rm pd}\sim0.006. \tag{104}$$

By expressing $b_\pm$ and $n_{J\pm}$ in terms of $b_j$ and $n_{Jj}$ (for $j=1,2$) via the analog of Eq. (6), inserting these expressions into Eq. (102), then computing $\tilde b_j$ via the analog of Eq. (6), we obtain

$$\tilde b_1=\left(1-\tfrac12\mathcal E_{OTF}\right)b_1-\frac i4\sum_J(\mathcal E_{J+}-\mathcal E_{J-})\,b_2+\frac12\sum_J\Big[\big(\sqrt{\mathcal E_{J+}}+\sqrt{\mathcal E_{J-}}\big)n_{J1}+i\big(\sqrt{\mathcal E_{J+}}-\sqrt{\mathcal E_{J-}}\big)n_{J2}\Big], \tag{105a}$$

$$\tilde b_2=\left(1-\tfrac12\mathcal E_{OTF}\right)b_2+\frac i4\sum_J(\mathcal E_{J+}-\mathcal E_{J-})\,b_1+\frac12\sum_J\Big[\big(\sqrt{\mathcal E_{J+}}+\sqrt{\mathcal E_{J-}}\big)n_{J2}-i\big(\sqrt{\mathcal E_{J+}}-\sqrt{\mathcal E_{J-}}\big)n_{J1}\Big]. \tag{105b}$$

Here

$$\mathcal E_{OTF}\equiv\frac12\sum_J(\mathcal E_{J+}+\mathcal E_{J-})=\epsilon_{OT}+\frac12(\mathcal E_{I+}+\mathcal E_{I-}+\mathcal E_{II+}+\mathcal E_{II-})=\epsilon_{OT}+\sum_{J={\rm I,II}}\ \sum_{s=+,-}\frac{\epsilon_J}{1+(\xi_J+s\,\Omega/\delta_J)^2} \tag{106}$$

is the net, $\Omega$-dependent loss factor for the entire output optical train including the filter cavities. From Eqs. (94), (104) and (106) and Fig. 10, we infer that

$$\mathcal E_{OTF}\sim0.009 \tag{107}$$

with only a weak dependence on frequency, which we shall neglect.
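To get a feel for Eq. (106), the sum can be evaluated directly. The sketch below does so under stated assumptions: the filter offsets and half bandwidths passed in are illustrative values (roughly what the matching condition of Sec. V B gives for $\Lambda\simeq2\gamma$, not numbers read from Fig. 10), each filter's resonant loss is taken as $\epsilon\gamma/\delta_J$ with $\epsilon=0.0012$, and the function name is hypothetical.

```python
def output_train_loss(omega, filters, eps_ot=0.006):
    """Net loss factor of Eq. (106).

    omega   : sideband frequency Omega (same units as the delta_J)
    filters : iterable of (xi_J, delta_J, eps_J) for each filter cavity
    eps_ot  : frequency-independent loss of the rest of the train, Eq. (104)
    """
    total = eps_ot
    for xi, delta, eps_j in filters:
        for sign in (+1.0, -1.0):
            total += eps_j / (1.0 + (xi + sign * omega / delta) ** 2)
    return total


gamma = 1.0    # work in units of the arm-cavity half bandwidth
eps = 0.0012   # arm-cavity resonant loss, Eq. (94)

# Illustrative (assumed) filter parameters: (xi_J, delta_J, eps_J = eps*gamma/delta_J)
filters = [
    (2.14, 0.81 * gamma, eps * gamma / (0.81 * gamma)),
    (-0.36, 1.96 * gamma, eps * gamma / (1.96 * gamma)),
]

loss_at_gamma = output_train_loss(gamma, filters)
loss_at_dc = output_train_loss(0.0, filters)
```

For these inputs the result stays a little below $10^{-2}$ and changes only slightly between $\Omega=0$ and $\Omega=\gamma$, which is the weak frequency dependence the text chooses to neglect.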

In Eqs. (105a), (105b) the terms $i\times$(quantity linear in $\mathcal E_{J\pm}$)$\,b_j$ [the $b_2$ term in $\tilde b_1$ and the $b_1$ term in $\tilde b_2$] will contribute amounts second order in the losses ($\propto\mathcal E_{J\pm}^2$) to the signal and/or noise, and thus can be neglected. We shall flag our neglect of these terms below, when they arise.


E. Computation of noise spectra for variational-output and squeezed-variational interferometers

The output of a squeezed-variational interferometer or variational-output interferometer is the frequency-dependent quadrature $\tilde b_\zeta$ depicted in Fig. 11. This quantity, when split into signal $\propto h$ plus noise, takes the following form:

$$\tilde b_\zeta=\tilde b_1\cos\zeta+\tilde b_2\sin\zeta=\sin\zeta\left[\left(1-\tfrac12\mathcal E_{OTF}\right)\sqrt{2\mathcal K_*}\,\frac{h}{h_{SQL}}\,e^{i\beta_*}+\frac{\Delta\tilde b_\zeta}{\sin\zeta}\right]; \tag{108}$$

cf. Eqs. (97) and (105a), (105b). Here we have omitted an imaginary part of the factor $1-\frac12\mathcal E_{OTF}$ [arising from the $b_2$ term in $\tilde b_1$, Eq. (105a)] because its modulus is second order in the losses ($\propto\mathcal E_{J\pm}^2$) and therefore it contributes negligibly to the signal strength.

Equation (108) implies that the gravitational-wave noise operator is

$$h_n=\left(1+\tfrac12\mathcal E_{OTF}+\tfrac14\mathcal E\right)\frac{h_{SQL}}{\sqrt{2\mathcal K}}\,e^{-i\beta_*}\,(\Delta\tilde b_2+\Delta\tilde b_1\cot\zeta), \tag{109}$$

where we have used Eq. (99) for $\mathcal K_*$.

For a squeezed-variational interferometer, the dark-port input field $a_j$ is in a squeezed state, with squeeze factor $R$ and squeeze angle $\lambda(\Omega)$ (which, after optimization, will turn out to be $\lambda=\pi/2$ as for a lossless interferometer). For a variational-output interferometer, $a_j$ is in its vacuum state, which corresponds to squeezing with $R=0$, so we lose no generality by assuming a squeezed input. Since all the noise fields except $a_j$ are in their vacuum states, the light's full input state is

$$|{\rm in}\rangle=|0_n\rangle\otimes|0_{n_{OT}}\rangle\otimes|0_{n_I}\rangle\otimes|0_{n_{II}}\rangle\otimes S(R,\lambda)|0_a\rangle, \tag{110}$$

where the notation should be obvious.

The gravitational-wave noise is proportional to

$$\langle{\rm in}|h_nh_{n'}^\dagger|{\rm in}\rangle=\langle0|h_{ns}h_{ns'}^\dagger|0\rangle, \tag{111}$$

where $|0\rangle$ is the vacuum state of all the noise fields $a$, $n$, $n_{OT}$, $n_I$, and $n_{II}$; and $h_{ns}$ is the usual squeezed noise operator

$$h_{ns}=S^\dagger(R,\lambda)\,h_n\,S(R,\lambda). \tag{112}$$

We bring this squeezed-noise operator into an explicit form by (i) inserting Eq. (109) into Eq. (112), then (ii) replacing the $\Delta\tilde b_j$'s by expressions (105a), (105b) [with $\Delta$ put onto all the $b$'s, i.e., with the signal removed], then (iii) replacing the $\Delta b_j$'s by expressions (101), and then (iv) invoking Eqs. (A8) for the action of the squeeze operators on the $a_j$'s. The result is

$$h_{ns}=\left(1-\tfrac14\mathcal E\right)\times[\text{Eq. (69)}]+\frac{h_{SQL}}{\sqrt{2\mathcal K}}\Big\{\frac12\sum_J\Big(\big[(\sqrt{\mathcal E_{J+}}+\sqrt{\mathcal E_{J-}})\cot\zeta-i(\sqrt{\mathcal E_{J+}}-\sqrt{\mathcal E_{J-}})\big]n_{J1}+\big[(\sqrt{\mathcal E_{J+}}+\sqrt{\mathcal E_{J-}})+i(\sqrt{\mathcal E_{J+}}-\sqrt{\mathcal E_{J-}})\cot\zeta\big]n_{J2}\Big)+\big(-\mathcal Ke^{i\beta}\sqrt{\epsilon/2}+\sqrt{\mathcal E}\cot\zeta\big)e^{i\beta}n_1+\sqrt{\mathcal E}\,e^{i\beta}n_2\Big\}, \tag{113}$$

where we have omitted terms, arising from $b_2$ in Eq. (105a) and from $b_1$ in Eq. (105b), which contribute amounts $O(\mathcal E_J^2)$ to $S_h$; and we have omitted a term$^{12}$ proportional to $\beta_*-\beta$ which contributes an amount $O(\epsilon^2)$.

By virtue of Eq. (111) and the argument preceding Eqs. (26), we can regard all of the quadrature noise operators $a_j$, $n_j$, $n_{Jj}$ in this $h_{ns}$ as random processes with unit spectral densities and vanishing cross-spectral densities. Correspondingly, the gravitational-wave noise is the sum of the squared moduli of the coefficients of the quadrature noise operators in Eq. (113):

$^{12}$This term is an imaginary part, $2i(\beta_*-\beta)\mathcal K=-\frac12i\epsilon\mathcal K\sin2\beta$, of the quantity $\tilde{\mathcal K}$, which enters Eq. (69) via Eq. (70). Because this imaginary part produces a correction to the loss-free part of $h_n$ that is 90° out of phase with the loss-free part and is of order $\epsilon$, it produces a correction to $S_h$ that is quadratic in $\epsilon$ and thus negligible.


$$S_h=\frac{h_{SQL}^2}{2\mathcal K}\Big[\left(1-\tfrac12\mathcal E\right)(1+\tilde{\mathcal K}^2)\{e^{-2R}+\sinh2R\,[1-\cos2(\tilde\Phi+\lambda)]\}+\mathcal K^2\frac{\epsilon}{2}+(1-\tilde{\mathcal K}\cot\zeta)\,\mathcal E+(1+\cot^2\zeta)\,\mathcal E_{OTF}\Big], \tag{114}$$

where

$$\tilde{\mathcal K}\equiv\mathcal K-\cot\zeta,\qquad \tilde\Phi\equiv\operatorname{arccot}\tilde{\mathcal K} \tag{115}$$

[Eq. (70)]. In Eq. (114), the terms containing $e^{-2R}$ and $\sinh2R$ come from $a_1$ and $a_2$ [squeezed vacuum entering the dark port; cf. Eq. (71)] modified by losses in the arm cavities [the factor $(1-\mathcal E/2)$]; the next two terms come from $n_1$ and $n_2$ [shot noise due to vacuum entering at loss points in the arm cavities]; and the last term comes from $n_{J1}$ and $n_{J2}$ [shot noise due to vacuum entering at loss points in the output optical train, including the filters].

As for the lossless interferometer [Eqs. (72) and (73)], the noise (114) is minimized by setting the input squeeze angle $\lambda$ and output homodyne phase $\zeta$ to

$$\lambda=\pi/2,\qquad \zeta=\Phi\equiv\operatorname{arccot}\mathcal K \tag{116}$$

[aside from a negligible correction $\delta\zeta=(\mathcal E+2\mathcal E_{OTF})e^{-2R}/(\mathcal K+\mathcal K^{-1})$]. This optimization produces $\tilde{\mathcal K}=0$ and $\lambda=\tilde\Phi=\pi/2$, so

$$S_h=\frac{h_{SQL}^2}{2}\left[\frac{\left(1-\tfrac12\mathcal E\right)e^{-2R}+\mathcal E+\mathcal E_{OTF}}{\mathcal K}+\mathcal K\left(\frac{\epsilon}{2}+\mathcal E_{OTF}\right)\right]. \tag{117}$$

Note that the optimization has entailed a squeezed input with frequency-independent squeeze phase, as in the lossless interferometer; so no filters are needed in the input. The output filters must produce a FD homodyne angle $\zeta=\Phi(\Omega)$ that is the same as in the lossless case and therefore can be achieved by two long, Fabry-Perot cavities.

It is instructive to compare the noise (117) for a lossy squeezed-variational interferometer with that of Eq. (73) for one without optical losses. In the absence of losses, the output's FD homodyne detection can completely remove the radiation-pressure back-action noise from the signal; only the shot noise, $\propto1/\mathcal K\propto1/I_0$, remains. Losses in the interferometer's arm mirrors prevent this back-action removal from being perfect: they enable a bit of vacuum field $n$ to leak into the arm cavities, and this field produces radiation-pressure noise that remains in the output after the FD homodyne detection [the $\mathcal K\epsilon/2$ term in Eq. (117)].

The $\mathcal K\mathcal E_{OTF}$ noise in Eq. (117) has the same dependence on laser power, $\propto\mathcal K\propto I_0$, as the radiation-pressure noise. Nevertheless, it is actually shot noise, not radiation-pressure noise. It is produced by the vacuum loss-noise fields that leak into the output signal light when it encounters each lossy optical element. Those fields' shot noise gets weighted by the factor $\cot\zeta=\cot\Phi=\mathcal K$ in the homodyne process, which accounts for their proportionality to $\mathcal K\propto I_0$.

A  reasonable  estimate  for  the  amount  of  input-light 
squeezing  that  might  be  achieved  in  LIGO-III  is  [33] 

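$$e^{-2R} \simeq 0.1. \tag{118}$$

By contrast, Eqs. (94), (96), (100) and (106) suggest

$$(\mathcal{E} + \mathcal{E}_{\rm OTF}) \sim 0.01. \tag{119}$$

This motivates our neglecting $\mathcal{E} + \mathcal{E}_{\rm OTF}$ compared to $e^{-2R}$ in
expression (117), and rewriting the noise (117) as

$$S_h \simeq \frac{h_{\rm SQL}^2}{2}\left[\frac{e^{-2R}}{\mathcal{K}} + \mathcal{K}\,\mathcal{E}_*\right], \tag{120}$$

where

$$\mathcal{E}_* \equiv \tfrac{1}{2}\mathcal{E} + \mathcal{E}_{\rm OTF} \simeq 0.01; \tag{121}$$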


cf.  Eqs.  (94)  and  (107). 

Equation (120) is our final form for the noise spectrum of
a lossy squeezed-variational interferometer. When we set the
input squeeze factor to unity, $e^{-2R} = 1$, it becomes the noise
spectrum for a lossy variational-output interferometer:

$$S_h \simeq \frac{h_{\rm SQL}^2}{2}\left[\frac{1}{\mathcal{K}} + \mathcal{K}\,\mathcal{E}_*\right]. \tag{122}$$
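As a small numerical illustration of Eqs. (120) and (122), the sketch below evaluates the bracketed noise factor as a function of $\mathcal{K}$ and locates its minimum analytically. The function names are illustrative only, and the default values $e^{-2R} = 0.1$, $\mathcal{E}_* = 0.01$ are the fiducial numbers quoted in the text.

```python
import math


def noise_sv(K, squeeze=0.1, loss=0.01):
    """S_h / h_SQL^2 as read from Eq. (120): (1/2) * (e^{-2R}/K + K * E_*)."""
    return 0.5 * (squeeze / K + K * loss)


def noise_vo(K, loss=0.01):
    """S_h / h_SQL^2 as read from Eq. (122): Eq. (120) with e^{-2R} = 1."""
    return noise_sv(K, squeeze=1.0, loss=loss)


def optimum(squeeze=0.1, loss=0.01):
    """Return (K_opt, mu), where mu = sqrt(S_h)/h_SQL at the minimum.

    Setting d/dK of (squeeze/K + K*loss) to zero gives K_opt = sqrt(squeeze/loss).
    """
    K_opt = math.sqrt(squeeze / loss)
    return K_opt, math.sqrt(noise_sv(K_opt, squeeze, loss))


K_sv, mu_sv = optimum(squeeze=0.1, loss=0.01)   # squeezed-variational
K_vo, mu_vo = optimum(squeeze=1.0, loss=0.01)   # variational-output
print(f"squeezed-variational: K_opt = {K_sv:.2f}, mu = {mu_sv:.3f}")
print(f"variational-output:   K_opt = {K_vo:.2f}, mu = {mu_vo:.3f}")
```

With the fiducial values the two minima come out near 0.18 and 0.32 of the SQL, matching the numbers used later in the discussion.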


Errors $\Delta\lambda \equiv \lambda - \pi/2$ in the input squeeze angle and
$\Delta\zeta \equiv \zeta - \operatorname{arccot}\mathcal{K}$ in the output homodyne phase will increase
the noise spectral density. By performing a power series
expansion of expression (114), we obtain for the noise increase

$$\Delta S_h \simeq \frac{h_{\rm SQL}^2}{\mathcal{K}}\left[\sinh 2R\,\Delta\lambda^2 - 2(1+\mathcal{K}^2)\sinh 2R\,\Delta\lambda\,\Delta\zeta + \tfrac{1}{2}(1+\mathcal{K}^2)^2 e^{2R}\,\Delta\zeta^2\right] \simeq \frac{h_{\rm SQL}^2}{2\mathcal{K}}\,e^{2R}\left[\Delta\lambda - (1+\mathcal{K}^2)\Delta\zeta\right]^2, \tag{123}$$

where the second expression is accurate in the limit $e^{2R} \gg e^{-2R}$.
Numerical evaluations show that, for $e^{-2R} = 0.1$ and
$\mathcal{E}_* = 0.01$ (see above), and for $\mathcal{K} \sim 1$ to 3 (the range of
greatest interest; cf. Sec. VII), $\Delta\sqrt{S_h}$ will be less than $\tfrac{1}{4}\sqrt{S_h}$ so
long as: (i) the input squeeze angle is accurate to $|\Delta\lambda| \lesssim 0.05$,
and (ii) the FD output homodyne phase is accurate to
$|\Delta\zeta| \lesssim 0.01$. At $\mathcal{K} = 1$ the FD phase’s required accuracy is
reduced to $|\Delta\zeta| \lesssim 0.04$. The FD phase $\zeta$ is determined by the
filter cavities’ half bandwidths $\delta_J$ and fractional frequency
offsets $\xi_J$ and the local oscillator phase or equivalently the
final, conventional homodyne detector’s homodyne phase $\theta$.
The filter cavities’ half bandwidths $\delta_J$ (or equivalently their
finesses) are fixed by the mirror coatings. Coating-produced
errors in $\delta_J$ can be compensated to some degree by tuning
the fractional frequency offsets $\xi_J$ (via adjusting the mirror
positions) and by tuning the local oscillator phase or
equivalently $\theta$. Finesse errors as large as five per cent,
$|\Delta\delta_J|/\delta_J \lesssim 0.05$, can be compensated to yield the required
$|\Delta\zeta| \lesssim 0.01$ by tuning the offsets and homodyne phase to one
percent accuracy, $|\Delta\xi_J| \lesssim 0.01$, $|\Delta\theta| \lesssim 0.01$ [Eqs. (83), (C1),
(C2) and Fig. 10]. These requirements are challenging.
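The tolerances quoted above can be spot-checked with a few lines of arithmetic. The sketch below is only a rough check under stated assumptions: it takes the leading-order form of the noise increase, read here as $\Delta S_h \simeq (h_{\rm SQL}^2/2\mathcal{K})\,e^{2R}[\Delta\lambda - (1+\mathcal{K}^2)\Delta\zeta]^2$, compares it with the baseline noise of Eq. (120), and applies one error at a time. The function name is hypothetical.

```python
import math


def fractional_amplitude_increase(K, d_lambda, d_zeta, squeeze=0.1, loss=0.01):
    """Return Delta(sqrt(S_h)) / sqrt(S_h) for small angle errors.

    Assumes the leading-order (large e^{2R}) form of the noise increase,
    and the baseline noise S_h/h_SQL^2 = (1/2)(e^{-2R}/K + K*E_*).
    """
    e2R = 1.0 / squeeze
    baseline = 0.5 * (squeeze / K + K * loss)
    increase = e2R / (2.0 * K) * (d_lambda - (1.0 + K**2) * d_zeta) ** 2
    return math.sqrt(1.0 + increase / baseline) - 1.0


# One error at a time, at the two ends of the K range discussed in the text.
print(fractional_amplitude_increase(K=3.0, d_lambda=0.0, d_zeta=0.01))
print(fractional_amplitude_increase(K=1.0, d_lambda=0.0, d_zeta=0.04))
print(fractional_amplitude_increase(K=1.0, d_lambda=0.05, d_zeta=0.0))
```

In these cases the amplitude noise grows by roughly a quarter or less. Errors of opposite sign in the two angles add inside the square, so the combined tolerance is tighter than either one alone.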


F.  Computation  of  the  noise  spectrum  for  a  squeezed-input 
interferometer 

For a squeezed-input interferometer, as for squeezed-variational,
the losses in the input optical train (including the
filter cavities) influence the noise only through their impact
on the squeeze factor $e^{-2R} \simeq 0.1$ of the dark-port vacuum
when it enters the arm cavities, an impact that may make $R$
frequency dependent, $R = R(\Omega)$. By contrast, losses in the
arm cavities and in the output optical train will produce noise
in much the same manner as they do for a squeezed-variational
interferometer. More specifically:

The effect of arm-cavity and output-train losses on the
squeezed noise operator $h_{ns}$ can be read off of the
squeezed-variational formula (113) as follows: (i) Set $\zeta = \pi/2$ so the
quantity measured is $b_2$ [no output filtering; Eq. (108)]; (ii)
correspondingly set $\cot\zeta = 0$, $\tilde{\mathcal{K}} = \mathcal{K}$, and $\tilde\Phi = \Phi = \operatorname{arccot}\mathcal{K}$
[Eqs. (115)]; (iii) in the sum over $J$ include only $J = {\rm OT}$ and
not $J = {\rm I, II}$ since there are no output filters. The result is


hn=\  l-^)x[Eq.  (44)]+  --j==(  —  ICy]e/2e2iPnl 


^£(£^Yi2~f'  V^OT'^OT'2) • 


(124) 


Here the prime on the subscript OT indicates that we must
omit losses due to mode matching into the output filters and
mixing with the local oscillator, since there are no output
filters or homodyne detection. Correspondingly,

$$\mathcal{E}_{\rm OT'} = \mathcal{E}_{\rm bs} + \mathcal{E}_{\rm circ} + \mathcal{E}_{\rm pd} \simeq 0.003 \tag{125}$$

is the net fractional photon loss in the output optical train.

Treating the quadrature noise operators as random processes
with unit spectral density and vanishing cross spectral
densities, we read off $S_h$ from Eq. (124):


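$$S_h = \frac{h_{\rm SQL}^2}{2}\left[\frac{\mathcal{E} + \mathcal{E}_{\rm OT'}}{\mathcal{K}} + \frac{\mathcal{E}}{2}\mathcal{K} + \left(1 - \tfrac{1}{2}\mathcal{E}\right)\left(\frac{1}{\mathcal{K}} + \mathcal{K}\right)\left\{\cosh 2R - \cos[2(\lambda + \Phi)]\sinh 2R\right\}\right]. \tag{126}$$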


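As in the lossless case, the noise is minimized by squeezing
the dark-port input at the FD angle $\lambda(\Omega) = -\Phi(\Omega) \equiv -\operatorname{arccot}\mathcal{K}$
[Eq. (47)]. The result is

$$S_h = \frac{h_{\rm SQL}^2}{2}\left[\left(1 - \tfrac{1}{2}\mathcal{E}\right)\left(\frac{1}{\mathcal{K}} + \mathcal{K}\right)e^{-2R} + \frac{\mathcal{E} + \mathcal{E}_{\rm OT'}}{\mathcal{K}} + \frac{\mathcal{E}}{2}\mathcal{K}\right]. \tag{127}$$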


For our estimated squeezing $e^{-2R} \simeq 0.1$ and losses $\mathcal{E}_{\rm OT'} \sim \mathcal{E} \sim 0.003$
in the LIGO-III time frame, the loss parameters
are small compared to the squeeze, and thus contribute
negligibly to the noise, so $S_h$ is well approximated by the
lossless formula

$$S_h \simeq \frac{h_{\rm SQL}^2}{2}\left(\frac{1}{\mathcal{K}} + \mathcal{K}\right)e^{-2R}. \tag{128}$$

KIMBLE,  LEVIN,  MATSKO,  THORNE,  AND  VYATCHANIN

PHYSICAL  REVIEW  D  65  022002

FIG. 12. Universal noise curve for conventional and QND
interferometers [Eqs. (130)].

However,  it  is  important  to  keep  in  mind  that  the  input 
squeeze  factor  e~2R  is  constrained  not  only  by  the  physics  of 
the  squeezing  apparatus,  but  also  by  frequency  dependent 
losses  in  the  input  optical  train  and  mode  matching  into  the 
arm  cavities. 

By expanding expression (126) in powers of $\Delta\lambda \equiv \lambda + \operatorname{arccot}\mathcal{K}$,
we see that the fractional increase in noise due to
errors in the FD squeeze angle is

$$\frac{\Delta\sqrt{S_h}}{\sqrt{S_h}} \simeq e^{2R}\sinh 2R\,\Delta\lambda^2 \simeq \frac{e^{4R}}{2}\,\Delta\lambda^2. \tag{129}$$

For $e^{-2R} = 0.1$, this fractional noise increase will be less than
1/4 so long as $\Delta\lambda$ is less than 0.07. This translates into
accuracies of ~7 percent for the prefilter squeeze angle,
~15 percent for the filter cavities’ fractional frequency
offsets ($|\Delta\xi_J| \lesssim 0.15$), and ~10 percent for the cavities’ half
bandwidths or equivalently their finesses ($|\Delta\delta_J|/\delta_J \lesssim 0.1$).
These constraints are significantly less severe than those for
a squeezed-variational interferometer (end of Sec. VI E); but,
as we shall see, the potential performance of this
squeezed-input interferometer is poorer by a factor ~1.5-2 than that
of the squeezed-variational one.
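A short check of the squeeze-angle tolerance, offered as a sketch: it assumes the fractional amplitude-noise increase of Eq. (129) reads $e^{2R}\sinh 2R\,\Delta\lambda^2$ and plugs in the fiducial squeeze factor. The helper name is hypothetical.

```python
import math


def squeeze_angle_penalty(d_lambda, squeeze=0.1):
    """Fractional increase of sqrt(S_h), assuming Eq. (129) in the form
    e^{2R} * sinh(2R) * d_lambda**2, with squeeze = e^{-2R}."""
    two_R = -math.log(squeeze)
    return math.exp(two_R) * math.sinh(two_R) * d_lambda**2


def max_angle_error(max_penalty, squeeze=0.1):
    """Invert the relation above for the largest tolerable angle error."""
    two_R = -math.log(squeeze)
    return math.sqrt(max_penalty / (math.exp(two_R) * math.sinh(two_R)))


print(squeeze_angle_penalty(0.07))   # fractional noise increase at 0.07 rad
print(max_angle_error(0.25))         # angle error that costs exactly 1/4
```

Under this reading an error of 0.07 rad costs just under one quarter in amplitude noise, consistent with the statement in the text.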


VII.  DISCUSSION  OF  THE  INTERFEROMETERS’  NOISE 
SPECTRA 

The noise spectra for our three lossy QND interferometers,
Eqs. (120), (122) and (128), all have the same universal
form, a form identical to that for a conventional broadband
interferometer, Eq. (27). Only the parameters $\mu$ and $\sigma$
characterizing the noise differ from one interferometer to
another. This universal form can be written as

$$S_h = h_{\rm SQL}^2(\Omega)\,\frac{\mu^2}{2}\left(\frac{\sigma^2}{\Omega_*^2} + \frac{\Omega_*^2}{\sigma^2}\right), \tag{130a}$$

where $\Omega_*$ is the following function of angular frequency

$$\Omega_* \equiv \frac{\Omega}{\gamma}\sqrt{\frac{1 + \Omega^2/\gamma^2}{2}} \tag{130b}$$

and $h_{\rm SQL}(\Omega)$ is given by Eq. (20). Notice that $\Omega_* = 1$ when
$\Omega = \gamma \simeq 100$ Hz; $\Omega_* \simeq (\Omega/\gamma)/\sqrt{2}$ when $\Omega \ll \gamma$, and
$\Omega_* \simeq (\Omega/\gamma)^2/\sqrt{2}$ when $\Omega \gg \gamma$.

FIG. 13. Universal noise curve plotted as a function of angular
frequency $\Omega$ for various values of the dimensionless frequency
parameter $\sigma$.

This universal noise curve is plotted as a function of $\Omega_*$
in Fig. 12. Its two parameters are the minimum value $\mu$ of
the noise, i.e., the minimum amplitude noise in units of the
SQL, and the dimensionless frequency $\sigma$ (in units of $\Omega_*$) at
which the noise takes on this minimum value.

Figure 13 shows this universal noise curve plotted as a
function of angular frequency $\Omega$. Notice that, because of the
relation (130b) between $\Omega_*$ and $\Omega$, the shape of the noise
curve depends modestly on the location $\sigma$ of its minimum.
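The universal curve is easy to evaluate directly. The following sketch assumes Eq. (130a) in the form $S_h/h_{\rm SQL}^2 = (\mu^2/2)(\sigma^2/\Omega_*^2 + \Omega_*^2/\sigma^2)$ and Eq. (130b) as $\Omega_* = (\Omega/\gamma)\sqrt{(1+\Omega^2/\gamma^2)/2}$; the function names are illustrative.

```python
import math


def omega_star(x):
    """Dimensionless frequency of Eq. (130b); x is Omega/gamma."""
    return x * math.sqrt((1.0 + x * x) / 2.0)


def universal_noise(x, mu, sigma):
    """Amplitude noise sqrt(S_h)/h_SQL from Eq. (130a) at x = Omega/gamma."""
    w = omega_star(x)
    return mu * math.sqrt(0.5 * (sigma**2 / w**2 + w**2 / sigma**2))


# Limits quoted in the text: Omega_* = 1 at Omega = gamma, and the two
# asymptotic power laws far below and far above gamma.
print(omega_star(1.0))
print(omega_star(0.01), 0.01 / math.sqrt(2.0))
print(omega_star(100.0), 100.0**2 / math.sqrt(2.0))

# The curve touches its minimum value mu where Omega_* = sigma.
print(universal_noise(1.0, mu=0.18, sigma=1.0))
```

Because the bracket in Eq. (130a) is of the form $y + 1/y$, the noise can never fall below $\mu$ whatever the frequency.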

The values of the parameters $\mu$ and $\sigma$ for our various
interferometer configurations are shown in Table II. Notice
the following details of this table: (i) The minimum noise $\mu$
(the optimal amount by which the SQL can be beaten) is
independent of the laser input power $I_0$ in all cases; it depends
only on the level of input squeezing $e^{-2R}$ and the level of
losses $\mathcal{E}_*$. (ii) For our estimated loss level and squeeze level,
the squeezed-input interferometer and variational-output
interferometer achieve the same $\mu \simeq 0.32$, while the
squeezed-variational interferometer achieves a moderately lower
$\mu \simeq 0.18$. (iii) The frequency $\Omega_* = \sigma$ at which the minimum
noise is achieved is proportional to $\sqrt{I_0/I_{\rm SQL}}$. (Recall that
$I_{\rm SQL}$ is the input power required for a conventional
interferometer to reach the SQL at the angular frequency
$\Omega = \gamma \simeq 2\pi \times 100$ Hz, i.e., at $\Omega_* = 1$; to do so, the conventional
interferometer must have $\sigma = 1$.) (iv) For $I_0 = I_{\rm SQL}$, the
squeezed-input interferometer has $\sigma = 1$, but the
variational-output and squeezed-variational interferometers have $\sigma < 1$,
which means that the minimum of the noise curve is at
$\Omega < \gamma \simeq 100$ Hz. To push $\sigma$ up to unity, i.e., to push the
noise-curve minimum up to $\Omega = \gamma$, requires $I_0/I_{\rm SQL} = 1/\sqrt{\mathcal{E}_*} \simeq 10$
in a variational-output interferometer, and
$I_0/I_{\rm SQL} = \sqrt{e^{-2R}/\mathcal{E}_*} \simeq 3.2$ in a squeezed-variational interferometer.

TABLE II. The values of the parameters $\mu$ = (minimum noise)
and $\sigma$ = (frequency of minimum) for various interferometer
(“IFO”) configurations: Conv = Conventional broadband [Eq.
(27)], SI = Squeezed-Input [Eq. (128)], VO = Variational-Output
[Eq. (122)], and SV = Squeezed-Variational [Eq. (120)]. The
numerical values are for $e^{-2R} = 0.1$ and $\mathcal{E}_* = 0.01$.

FIG. 14. Noise curves for SQL interferometers with noise
minima $\mu = 0.18$ and various values of the frequency parameter $\sigma$.
The vertical axis is weighted by $\sqrt{\Omega/\gamma}$ so the curves give an
indication of the relative noise in searches for waves from inspiraling
binaries; see text. The noise curves are labeled by the power
$I_0/I_{\rm SQL}$ required by a squeezed-variational interferometer to
achieve the given $\sigma$.

The importance of pushing $\sigma$ up to unity or higher is
explained in Fig. 14. This figure requires some discussion:

The most promising gravitational waves for LIGO are
those from the last few minutes of inspiral of
black-hole-black-hole binaries, black-hole-neutron-star binaries, and
neutron-star-neutron-star binaries. The amplitude
signal-to-noise ratio $S/N$ produced by these waves is given by

$$\frac{S^2}{N^2} = 4\int_0^\infty \frac{|\tilde h|^2}{S_h}\,\frac{d\Omega}{2\pi} = 4\int_{-\infty}^{\infty} \frac{|\Omega\tilde h|^2}{\Omega S_h}\,\frac{d\ln\Omega}{2\pi}, \tag{131}$$

where $\tilde h$ is the Fourier transform of the waveform $h(t)$. For
the inspiraling binary $|\Omega\tilde h|$ is nearly independent of
frequency throughout the LIGO band [34], so the
signal-to-noise ratio is optimized by making $\Omega S_h(\Omega)$ as small as
possible over as wide a range of $\ln\Omega$ as possible.

Figure 14 plots $\sqrt{\Omega S_h(\Omega)}$ as a function of $\Omega/\gamma$ using
logarithmic scales on both axes, and using the
minimum-noise parameter $\mu = 0.18$ corresponding to our fiducial
squeezed-variational interferometer (though the specific
value of $\mu$ is irrelevant to our present discussion). From the
shapes of the curves it should be evident that the larger is the
frequency of the noise minimum, i.e. the larger is $\sigma$ at fixed
$\mu$, the larger will be the $S/N$ for inspiraling binaries.

A second factor dictates using large $\sigma$, in particular
$\sigma \gtrsim 1$. This is thermal noise in the interferometer’s test-mass
suspension fibers. The thermal noise scales with frequency as
$\sqrt{\Omega S_h^{\rm thermal}(\Omega)} \propto \Omega^{-2}$ or $\propto \Omega^{-5/2}$ depending on the nature of
the dissipation [35]; see the steep dashed curve in Fig. 14. It
seems realistic to expect, in LIGO-III, that this thermal noise
will be at approximately the level shown in the figure, so it
compromises the performance of QND interferometers at
$\Omega \lesssim 0.5\gamma \simeq 50$ Hz [7,36]. Correspondingly, to avoid the
thermal noise significantly debilitating the $S/N$ for inspiraling
binaries, it will be necessary to have $\sigma \gtrsim 1$.

Because $\sigma$ scales as $\sqrt{I_0/I_{\rm SQL}}$ for all interferometer
designs, large $\sigma$ entails large laser power. In particular, $\sigma \gtrsim 1$
requires $I_0 \gtrsim I_{\rm SQL}$; cf. Table II. For our fiducial parameters
(Table I), $I_{\rm SQL} = 10$ kW, which corresponds to an optical
power circulating in each of the interferometer’s arm cavities

$$W_{\rm circ}^{\rm SQL} = \frac{I_{\rm SQL}/2}{\gamma L/c} = \frac{m c L \gamma^3}{8\omega_0} \simeq 0.62\ {\rm MW}. \tag{132}$$
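The arithmetic behind Eq. (132) can be reproduced as follows. This is a sketch under assumptions: the relation is read as $W_{\rm circ}^{\rm SQL} = (I_{\rm SQL}/2)/(\gamma L/c) = m c L \gamma^3/(8\omega_0)$, and the mirror mass, arm length and carrier frequency used below (30 kg, 4 km, $1.8\times10^{15}\ {\rm s^{-1}}$) are assumed stand-ins for the Table I values, which are not reproduced in this section.

```python
import math

C_LIGHT = 2.998e8  # speed of light, m/s


def circulating_power_from_isql(i_sql=1.0e4, gamma=2 * math.pi * 100.0, length=4.0e3):
    """W_circ = (I_SQL / 2) / (gamma * L / c): half the input power per arm,
    multiplied by the arm-cavity power gain c / (gamma * L)."""
    return (i_sql / 2.0) / (gamma * length / C_LIGHT)


def circulating_power_from_mass(mass=30.0, gamma=2 * math.pi * 100.0,
                                length=4.0e3, omega0=1.8e15):
    """W_circ = m * c * L * gamma^3 / (8 * omega0).
    mass, length and omega0 are assumed fiducial values."""
    return mass * C_LIGHT * length * gamma**3 / (8.0 * omega0)


print(circulating_power_from_isql() / 1e6, "MW (from rounded I_SQL = 10 kW)")
print(circulating_power_from_mass() / 1e6, "MW (from assumed m, L, omega0)")
```

The two forms agree to within the rounding of $I_{\rm SQL}$ to 10 kW, and both land at roughly 0.6 MW per arm.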


To construct mirrors capable of handling this huge power
will be an enormous technical challenge (even though this is
approximately the circulating power contemplated for LIGO-II).
To operate with a circulating power much larger than this
might not be possible. Therefore, it may be important in
LIGO-III to achieve $\sigma \gtrsim 1$ while keeping $I_0/I_{\rm SQL}$ not much
larger than unity.

The squeezed-input interferometer, with its $\sigma = \sqrt{I_0/I_{\rm SQL}}$
(Table II) is the most attractive from this point of view [and
also in terms of its required filter and squeeze-phase
accuracies; cf. end of Sec. VI F]; and the variational-output with its
$\sigma = \sqrt{\sqrt{\mathcal{E}_*}\,I_0/I_{\rm SQL}} \simeq \sqrt{0.1\,I_0/I_{\rm SQL}}$ is the least attractive. The
squeezed-variational interferometer, with
$\sigma = \sqrt{\sqrt{\mathcal{E}_*/e^{-2R}}\,I_0/I_{\rm SQL}} \simeq \sqrt{0.32\,I_0/I_{\rm SQL}}$, requires a modestly
higher laser power to reach $\sigma = 1$ than the squeezed-input
[and requires better filter and squeeze-phase accuracies], but
it is capable of a lower noise minimum, $\mu \simeq (e^{-2R}\mathcal{E}_*)^{1/4} \simeq 0.18$
vs $\mu = \sqrt{e^{-2R}} \simeq 0.32$ for squeezed-input.

This suggests a research and development strategy: Focus
on input squeezing as a key foundation for LIGO-III (it is
needed both for squeezed-input and squeezed-variational
interferometers), and in parallel (i) develop the technology and
techniques for the FD homodyne detection required by
squeezed-variational configurations, (ii) work to drive down
optical losses to the levels $\mathcal{E}_{\rm circ} \sim \mathcal{E}_{\rm bs} \sim \mathcal{E}_{\rm mm} \sim \mathcal{E}_{\rm lo} \sim \mathcal{E}_{\rm pd} \sim 0.001$
[Eq. (92)], and (since ponderomotive squeezing,
which underlies all our QND interferometers, has never been
seen) (iii) carry out experiments in a small test apparatus to
demonstrate ponderomotive squeezing and to search for
unexpected obstacles and imperfections in it.

If both input squeezing and FD homodyne detection can
be implemented successfully, then the squeezed-variational
interferometer is likely to achieve better performance than
any other configuration discussed in this paper, despite its
apparent need for higher laser power (e.g., $I_0/I_{\rm SQL} \simeq 3.2$ to
achieve $\sigma = 1$ compared to $I_0/I_{\rm SQL} = 1$ for squeezed input,
with our fiducial parameters). If powers as high as $I_0/I_{\rm SQL} \simeq 3.2$
cannot be handled, then we can operate the squeezed-variational
interferometer with a lower power without much
loss of performance.

Consider, for example, $\sqrt{S_h}/h_{\rm SQL}$ evaluated at $\Omega = \gamma \simeq 2\pi \times 100$ Hz,
as a function of $I_0/I_{\rm SQL}$ in a squeezed-variational
interferometer with our fiducial $e^{-2R} = 0.1$ and
$\mathcal{E}_* \simeq 0.01$. The optimal $I_0/I_{\rm SQL} = 3.2$ produces $\sqrt{S_h(\gamma)}/h_{\rm SQL} = 0.18$;
pushing $I_0/I_{\rm SQL}$ down by a factor 2, to 1.6, increases
the noise at $\Omega = \gamma$ by only 10 percent, to 0.20; pushing down
all the way to $I_0/I_{\rm SQL} = 1$ increases the noise to only
$\sqrt{S_h(\gamma)}/h_{\rm SQL} = 0.23$, which is still significantly lower noise
than the optimized squeezed-input interferometer (0.32 at
$I_0/I_{\rm SQL} = 1$).
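The three noise levels quoted in this example follow directly from Eq. (120). The sketch below assumes, as the definition of $I_{\rm SQL}$ implies, that the coupling constant at $\Omega = \gamma$ equals the power ratio, $\mathcal{K}(\gamma) = I_0/I_{\rm SQL}$; the function name is illustrative.

```python
import math


def sv_noise_at_gamma(power_ratio, squeeze=0.1, loss=0.01):
    """sqrt(S_h)/h_SQL at Omega = gamma for a squeezed-variational
    interferometer, using S_h/h_SQL^2 = (1/2)(e^{-2R}/K + K*E_*)
    with the assumption K(gamma) = I_0/I_SQL."""
    K = power_ratio
    return math.sqrt(0.5 * (squeeze / K + K * loss))


for ratio in (3.2, 1.6, 1.0):
    print(f"I0/I_SQL = {ratio:3.1f}: sqrt(S_h)/h_SQL = {sv_noise_at_gamma(ratio):.2f}")
```

The minimum is shallow: halving the power from its optimum raises the amplitude noise by only about ten percent.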

It is worth recalling that for noncosmological sources
(sources at distance $\ll 3$ Gpc), the volume of the universe
that can be searched for a given type of source scales as the
inverse cube of the amplitude noise, so a noise level
$\sqrt{S_h}/h_{\rm SQL} = 0.18$ corresponds to a search-volume increase of
$1/0.18^3 \simeq 180$ over a SQL-limited interferometer, i.e., over
LIGO-II.
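The inverse-cube scaling is simple to tabulate. In this sketch the function name is illustrative, and the noise levels are the rounded values quoted in the text; note that the result is sensitive to rounding of the noise level.

```python
def volume_gain(amplitude_noise):
    """Search-volume gain over an SQL-limited interferometer, assuming the
    range scales as 1/noise and the volume as range cubed."""
    return 1.0 / amplitude_noise**3


for mu in (0.32, 0.23, 0.18):
    print(f"sqrt(S_h)/h_SQL = {mu:.2f}: volume gain = {volume_gain(mu):.0f}")

# The unrounded optimum (e^{-2R} * E_*)^{1/4} with 0.1 and 0.01 is about 0.178,
# which gives a gain closer to the quoted 180 than the rounded 0.18 does.
print(volume_gain((0.1 * 0.01) ** 0.25))
```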

VIII.  CONCLUSIONS 

In this paper we have explored three candidate ideas for
QND LIGO-III interferometers: squeezed-input,
variational-output, and squeezed-variational. The squeezed-input and
squeezed-variational interferometers both look quite promising.
For our estimated levels of optical loss and levels of
squeezing, and for an input laser power $I_0/I_{\rm SQL} = 1$ (the
LIGO-II level), the squeezed-input interferometer could
achieve a noise $\mu \simeq 0.32$ of the SQL, with a corresponding
increase $V \simeq 1/0.32^3 \simeq 30$ over LIGO-II in the volume of the
universe that could be searched for a given source, at
noncosmological distances. The squeezed-variational
interferometer could achieve $\mu \simeq 0.23$ of the SQL with a
search-volume increase over LIGO-II of $V \simeq 80$. If the optics can
handle a laser power $I_0/I_{\rm SQL} \simeq 3.2$, then the
squeezed-variational interferometer could reach $\mu \simeq 0.18$ of the SQL
and a search-volume increase of $V \simeq 180$. These numbers
scale with the losses, squeezing, and laser power as shown in
Table II.

The  squeezed-input  and  squeezed-variational  designs  are 
therefore  sufficiently  promising  to  merit  serious  further 
study.  Some  of  the  issues  that  need  theoretical  analysis  are: 

(i) How can one incorporate into these interferometer
designs the various light modulations that are required, in a real
gravitational-wave interferometer, to (a) make the
interferometer be shot-noise limited (put the gravitational-wave signal
into ~100 Hz sidebands of a MHz modulation^13), (b)
control the mirror positions and orientations, etc. [37]?


^13 LIGO scientists are currently exploring the possibility of
achieving shot-noise-limited performance in LIGO-II without this
modulation-demodulation. The modulation-demodulation may, in
fact, be replaced in LIGO-II by homodyne detection at the
interferometer output, making it more nearly like our paper’s LIGO-III
designs.

(ii) What accuracies and other characteristics are needed
for the interferometers’ new elements: the circulator, filter
cavities,^14 and input squeezing? How can these be achieved?
For  example,  how  stable  must  be  the  local  oscillator  for  the 
conventional  homodyne  detector,  and  can  it  be  achieved  sim¬ 
ply  by  tapping  some  light  off  the  interferometer’s  output  or 
input  beam? 

(iii)  If  the  filter  cavities  are  placed  in  the  same  long 
vacuum  tubes  as  the  interferometer’s  arm  cavities  (with  their 
enormous  circulating  power),  what  will  be  the  nature  and 
level  of  noise  due  to  scattering  of  light  from  the  test-mass 
cavities to the filter cavities? (We thank Eanna Flanagan for
raising  this  issue.) 

(iv)  Can  the  filter  cavities  be  made  to  serve  multiple  pur¬ 
poses?  For  example,  is  it  possible  to  use  a  single  optical 
cavity  for  both  filters,  e.g.,  with  the  two  filters  based  on  two 
different  polarization  states  (for  which  the  filter  might  be 
made  to  behave  differently  via  birefringence),  or  with  the 
two  filters  based  on  different,  adjacent  longitudinal  modes? 
As  another  example,  could  an  output  filter  cavity  be  used  as 
a  source  of  ponderomotively  squeezed  vacuum  for  input  into 
the interferometer’s dark port?^15

(v)  Signal  recycling  via  resonant-sideband  extraction 
(RSE)  [38]  is  likely  to  be  a  standard  tool  in  LIGO-II  [7]. 
How  can  one  best  implement  RSE  simultaneously  with  the 
FD  homodyne  detection  (and  input  squeezing)  of  a 
variational-output  (or  squeezed-variational)  interferometer? 
[37] How can one best achieve the FD homodyne’s filtration
[which will entail a different frequency dependence $\Phi(\Omega)$
from that in this paper’s non-RSE designs]?

(vi)  In  this  paper’s  analysis  we  have  made  a  number  of 
simplifying approximations [e.g., our approximating the
phase of the coefficient of $f_j$ in Eq. (B24) by $2\beta$, an
approximation that fails by a frequency-dependent amount which
can be nearly as large as one per cent]. At what level of
sensitivity  do  these  approximations  become  problematic 
(e.g.,  for  our  proposed  two-cavity  way  of  achieving  the  nec¬ 
essary  FD  homodyne  detection),  and  how  can  the  resulting 
problems  be  overcome? 

(vii) Our analysis is based on the crucial assumption that
the interferometer’s output is strictly linear in its input [4].
Matsko and Vyatchanin [39] have shown that this is not quite
correct. In the interferometer’s arms the back-action-induced
mirror displacement $X$ produces a phase shift of reflected
light given by $e^{-2i\omega_0 X/c}$, which our linearized analysis
approximates as $1 - 2i\omega_0 X/c$ [cf. Eq. (B10)]; when the better
approximation $1 - 2i\omega_0 X/c - 2(\omega_0 X/c)^2$ is used, the result is
additional, nonlinear noise, which limits the cancellation of

^14 The filter cavities will require a mechanical stability far less than
that of the arm cavities, since the carrier power in the output light is
small and filter mirror displacements of magnitude $\sim hL$ therefore
do not imprint a significant signal on the light.

^15 For ponderomotively squeezed vacuum, the squeeze angle is
frequency dependent, with $d\lambda/d\Omega$ of the opposite sign to that
needed by a squeezed-input interferometer. This must be
compensated by a filtering different from that discussed in Sec. V.

the back-action noise by the shot noise and produces a limit
[39]

on the sensitivity that any of our QND designs can achieve.
Here

is the number of quanta entering a SQL interferometer in
time $\gamma^{-1} \sim 2$ ms. The nonlinear limitation (133) is
sufficiently far below the SQL that we need not be concerned
about  it.  Are  there  any  other,  more  serious  sources  of  non¬ 
linearity  that  might  compromise  the  performance  of  these 
interferometers? 

Experimental studies are also needed as foundations for
any possible implementation of variational-output or
squeezed-variational interferometers [40]. Examples are

(i) Studies of the debilitating effects of very high
circulating powers, $W_{\rm circ} \sim$ a few MW, and how to control them.

(ii)  A  continuation  of  efforts  to  achieve  large  squeezing, 
robustly,  via  nonlinear  optics  [33],  and  exploration  of  the 
possibility  to  do  so  ponderomotively  [41-44]. 

(iii)  A  continuation  of  efforts  to  achieve  low  levels  of 
losses  in  optical  cavities  and  interferometers,  so  as  to  mini¬ 
mize  the  contamination  of  squeezed  light  by  ordinary 
vacuum  [45]. 

(iv)  Prototyping  of  FD  homodyne  detection  by  the  tech¬ 
nique  proposed  in  this  paper:  filtration  followed  by  conven¬ 
tional  homodyne  detection. 

In the meantime, and in parallel with such studies, it is
important to push hard on the effort to find practical QND
designs that entail circulating light powers well below 1 MW
[13], and that might be much less constrained by optical
losses than the designs explored in this paper.

APPENDIX A: ROTATION AND SQUEEZE OPERATORS

In this paper we make extensive use of squeeze operators
and some use of rotation operators. In this appendix we list
properties of these operators that are useful in verifying
statements made in the text. This appendix is based on the
formalism for 2-photon quantum optics developed by Caves
and Schumaker [26,27].

The rotation operator $R(\theta)$, which acts on the Hilbert
space of the modes with frequencies $\omega = \omega_0 \pm \Omega$, is defined
by

$$R(\theta) \equiv \exp\left[-i\theta\left(a_+^\dagger a_+ + a_-^\dagger a_-\right)\right] \tag{A1}$$

[Eq. (4.33) of [26]]; here $a_\pm$ are the annihilation operators,
and $a_\pm^\dagger$ the creation operators for photons in these modes.
This operator is unitary and has the inverse

$$R^{-1}(\theta) = R^\dagger(\theta) = R(-\theta). \tag{A2}$$

The effect of a rotation on the modes’ annihilation operators
is

$$R(\theta)\,a_\pm R^\dagger(\theta) = a_\pm e^{i\theta} \tag{A3}$$

[Eq. (4.35) of [26]], and its effect on the two-photon
quadrature amplitudes [Eqs. (6)] is

$$R(\theta)\,a_1 R^\dagger(\theta) = a_1\cos\theta - a_2\sin\theta, \qquad R(\theta)\,a_2 R^\dagger(\theta) = a_1\sin\theta + a_2\cos\theta \tag{A4}$$

[Eq. (4.36) of [26]].

The squeeze operator also acts on the Hilbert space of
modes with frequencies $\omega = \omega_0 \pm \Omega$, and is defined by

$$S(r,\phi) \equiv \exp\left[r\left(a_+ a_- e^{-2i\phi} - a_+^\dagger a_-^\dagger e^{2i\phi}\right)\right] \tag{A5}$$

[Eq. (4.9) of [26]; Eq. (1.8) of [27]]. This squeeze operator is
unitary and its inverse is

$$S^{-1}(r,\phi) = S^\dagger(r,\phi) = S(-r,\phi) = S(r,\phi + \pi/2) \tag{A6}$$

[Eq. (1.9) of [27]]. The effect of a squeeze on the modes’
annihilation operators is

$$S(r,\phi)\,a_\pm S^\dagger(r,\phi) = a_\pm\cosh r + a_\mp^\dagger e^{2i\phi}\sinh r \tag{A7}$$

[Eq. (4.10) of [26]]. From this equation and the definition (6)
of the quadrature amplitudes, we infer the effect of a squeeze
on those amplitudes

$$S(r,\phi)\,a_1 S^\dagger(r,\phi) = a_1(\cosh r + \sinh r\cos 2\phi) + a_2\sinh r\sin 2\phi,$$
$$S(r,\phi)\,a_2 S^\dagger(r,\phi) = a_2(\cosh r - \sinh r\cos 2\phi) + a_1\sinh r\sin 2\phi. \tag{A8}$$
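The quadrature transformation (A8) can be written as a real 2x2 matrix acting on $(a_1, a_2)$, which makes the inverse relations (A6) easy to check numerically. The sketch below does this with plain Python lists; the matrix representation and helper names are illustrative and not part of the original formalism.

```python
import math


def squeeze_matrix(r, phi):
    """Matrix M with S a_i S^dagger = sum_j M[i][j] a_j, read off from (A8)."""
    c, s = math.cosh(r), math.sinh(r)
    return [[c + s * math.cos(2 * phi), s * math.sin(2 * phi)],
            [s * math.sin(2 * phi), c - s * math.cos(2 * phi)]]


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def close(a, b, tol=1e-12):
    """Element-wise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


r, phi = 0.7, 0.3
M = squeeze_matrix(r, phi)
identity = [[1.0, 0.0], [0.0, 1.0]]

# (A6): squeezing by -r, or by r at angle phi + pi/2, undoes the squeeze.
print(close(matmul(M, squeeze_matrix(-r, phi)), identity))
print(close(squeeze_matrix(-r, phi), squeeze_matrix(r, phi + math.pi / 2)))

# The transformation preserves phase-space area: det M = cosh^2 r - sinh^2 r = 1.
print(M[0][0] * M[1][1] - M[0][1] * M[1][0])
```

For $\phi = 0$ the matrix is diagonal with entries $e^{r}$ and $e^{-r}$, the familiar stretching of one quadrature and squeezing of the other.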



FIG.  15.  Field  amplitudes  entering  and  leaving  the  beam  splitter 
(which  here  is  idealized  as  lossless).  The  various  amplitudes  are 
defined  in  Eqs.  (B1)-(B4). 

APPENDIX  B:  INPUT-OUTPUT  RELATIONS  FOR 
INTERFEROMETERS 


$$I_0 = \frac{\overline{E^2}}{4\pi}\,\mathcal{A}c = \hbar\omega_0 D^2, \tag{B3}$$

where the overbar means time average. (Note that $D^2$ has
dimensions Hz = 1/sec.)

For all other fields the classical amplitude and sideband
amplitudes are as indicated in the figure; for example, the
field going toward the east cavity has classical amplitude
$D/\sqrt{2}$ and quadrature amplitudes $f_1^e$, $f_2^e$.

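With an appropriate choice of conventions [46], the fields’
junction conditions at the splitter are

$$f_j^n = \frac{d_j + a_j}{\sqrt{2}}, \qquad f_j^e = \frac{d_j - a_j}{\sqrt{2}}, \qquad b_j = \frac{g_j^n - g_j^e}{\sqrt{2}}, \qquad e_j = \frac{g_j^n + g_j^e}{\sqrt{2}}. \tag{B4}$$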


In this appendix we shall derive the input-output relations
for the fields $a_j$ and $b_j$ that enter and leave the
interferometer’s dark port. From the outset we shall include optical
losses in our derivation, thereby obtaining the lossy
input-output relations (97) and (101); the lossless input-output
relations (16) then follow by setting $\mathcal{E} = 0$.


1.  Fields  at  beam  splitter 

We describe the field amplitudes entering and leaving the
beam splitter by the notation shown in Fig. 15 (cf. Fig. 3).
We idealize the beam splitter as lossless in this appendix, and
deal with its losses in the body of the paper in the manner
sketched in Fig. 11. The amplitudes $D$ and $d$ of the field
entering the beam splitter from the laser are defined by the
following formulas for the positive-frequency part of the
electric field


$E^{(+)}$
[cf. Eq. (5)] and for the total electric field


Here $j = 1$ or 2.


2.  Arm  cavities  and  fields 

The east and north arm cavities are presumed to be
identical, with power reflection and transmission coefficients $R$
and $T$ for the front mirror, and $\tilde R$ and $\tilde T$ for the back mirror.
The amplitude reflection and transmission coefficients are
chosen to be real, with signs $\{+\sqrt{T}, -\sqrt{R}\}$, $\{+\sqrt{\tilde T}, -\sqrt{\tilde R}\}$
for light that impinges on a mirror from outside the cavity;
and $\{+\sqrt{T}, +\sqrt{R}\}$, $\{+\sqrt{\tilde T}, +\sqrt{\tilde R}\}$ for light that impinges
from inside the cavity.

The dominant optical losses are for light impinging on
mirrors from inside the cavity (cf. Sec. VI B). The influence
of the losses on the interferometer’s signal and noise is
independent of the physical nature of the losses, whether it
is light scattering off a mirror, absorption in the mirror, or
transmission through the end mirror. (We ignore the effects
of mirror heating.) For computational simplicity, we model
all the losses as due to finite transmissivity $\tilde T \neq 0$ of the
end mirror, and correspondingly we set

$$R + T = 1, \qquad \tilde R + \tilde T = 1. \tag{B5}$$

The fractional loss of photons in each round trip in the cavity
is then $\tilde T$, and the net fractional loss of photons in the arm
cavities is


$+\,\sin(\omega_0 t)$
Thus, $D$ is the classical amplitude of the laser light (carrier
with frequency $\omega_0$), $d_\pm$ are the annihilation operators for
the $\omega_0 \pm \Omega$ sidebands, and $d_1$ and $d_2$ are the quadrature
amplitudes for the side bands. [Notice that the factor out front is
a $\sqrt{2\pi}$ in Eq. (B1) but $\sqrt{4\pi}$ in Eq. (B2), and notice the $\sqrt{2}D$
in Eq. (B2).] The light power $I_0$ impinging on the beam
splitter is related to the classical amplitude $D$ by


cf. Eqs. (93) and (94). Recall that $T \simeq 0.033$ and $\mathcal{E} \simeq 0.0012$,
and also that $\Omega \sim \gamma = Tc/4L$ [Eqs. (11), (94)];
correspondingly, we shall make the approximations

$$\tilde T \ll T = 4\gamma L/c \sim \Omega L/c \ll 1 \tag{B7}$$

throughout our analysis.

FIG. 16. Field amplitudes entering and leaving an arm cavity.
The cavity’s front-port input and output amplitudes $D$, $f$, and $g$ are
defined in Eqs. (B3) and (B4) and Fig. 15, and its back-port input $q$
is defined in Eq. (B11).

Figure  16  shows  an  arm  cavity  and  the  amplitudes  of  the 
fields  that  impinge  on  or  depart  from  its  mirrors.  The  ampli¬ 
tudes  are  those  at  the  (front  or  back)  mirror  location,  and  the 
mirrors,  like  the  beam  splitter,  are  idealized  as  infinitesimally 
thin. 

For  pedagogical  simplicity,  the  distance  from  the  beam 
splitter  to  the  front  mirror  of  each  arm  cavity  is  set  to  an 
integral  multiple  of  the  carrier  wavelength  and  is  assumed  to 
be far smaller than $c/\Omega$ (the wavelength associated with the
sidebands). This means that there are no net phase shifts of
the light in traveling between the beam splitter and the cavity's
front mirror; i.e., the field amplitudes $D/\sqrt{2}$ and $f_j$ (or
$D/\sqrt{2}$ and $g_j$) arriving at (or departing from) the mirror are the
same as those departing from (or arriving at) the beam splitter;
cf. Figs. 16 and 15.

The  cavity’s  length  is  adjusted  to  an  integral  number  of 
carrier  wavelengths  so  there  is  no  carrier  phase  shift  from 
one  end  of  the  cavity  to  the  other,  and  inside  the  cavity  the 
carrier amplitude is amplified by the standard resonance factor
$2/\sqrt{T}$. (Losses are small enough to be of little importance
for the carrier.) Because the side bands inside the cavity have
a frequency dependence $j_j e^{-i\Omega t}\cos(\omega_0 t)$ at the front mirror
location [cf. Eq. (B2)], they propagate down the cavity as
$j_j e^{-i\Omega(t - z/c)}\cos[\omega_0(t - z/c)]$ and upon reaching the back mirror
(where $\cos[\omega_0(t - z/c)] = \cos(\omega_0 t)$), they have acquired the phase
shift indicated in the figure, $j_j e^{i\Omega L/c}$; and similarly for the $k_j$
field propagating in the other direction.

The  standard  junction  conditions  at  the  front  mirror  imply 
that 
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Comparing  with  the  standard  expression  for  the  field  at  the 
location  of  the  unperturbed  end  mirror  [Eq.  (B2)  with  the 
amplitude  changes  indicated  in  the  lower  right  of  Fig.  16, 
D^(2/^l2)(D/\j2)  and  we  obtain  the  fol- 

lowing  expression  for  the  field  fed  from  the  carrier  D  into  the 
sideband  amplitudes  kj : 


2  2  0)o 

Skx  =  0,  Sk2  =  ^D - X.  (BIO) 

\jT  C 

This  acts  as  a  source  term  in  the  standard  junction  condition 
for  the  back  mirror: 

$k_j e^{-i\Omega L/c} = \sqrt{\tilde R}\, j_j e^{i\Omega L/c} + \sqrt{\tilde T}\, q_j + \delta k_j$. (B11)

Note  that  qj  is  the  noise-producing  vacuum  fluctuation  that 
leaks  into  the  cavity  as  a  result  of  the  optical  losses. 

3.  Cavity’s  internal  field  and  radiation-pressure  fluctuations 

By combining the front-mirror and back-mirror junction
conditions (B8) and (B11) we obtain for the side-band amplitude
in the cavity

$j_j = \dfrac{\sqrt{T}\, f_j + \sqrt{R}\, e^{i\Omega L/c}\,(\sqrt{\tilde T}\, q_j + \delta k_j)}{1 - \sqrt{R\tilde R}\, e^{2i\Omega L/c}}$. (B12)

Equations (B7) and $\sqrt{R} = \sqrt{1 - T} = \sqrt{1 - 0.033} \simeq 1$ allow us to
make the approximations $\sqrt{R}\, e^{i\Omega L/c} \simeq 1$ in the numerator and
[using Eq. (11)]

$1 - \sqrt{R\tilde R}\, e^{2i\Omega L/c} \simeq (2L/c)(\gamma_* - i\Omega)$, (B13)


$j_j = \sqrt{T}\, f_j + \sqrt{R}\, k_j, \quad g_j = -\sqrt{R}\, f_j + \sqrt{T}\, k_j$. (B8)

We denote by $X(t)$ the change of arm length produced by
radiation pressure and the gravitational waves, and by $\tilde X$ its
Fourier transform. The oscillating $X(t)$ pumps carrier light
into the side bands. More specifically, in traveling from the
front mirror $z = 0$ to the perturbed position $z = L + X(t)$ of
the back mirror, then reflecting and propagating to the unperturbed
location $z = L$, the carrier field acquires the form$^{16}$


$\gamma_* = \gamma(1 + \epsilon/2)$ (B14)


in the denominator (accurate to better than 1 percent for all
$\Omega$ of interest to us), thereby bringing Eq. (B12) into the form

$j_j = \dfrac{\sqrt{T}\,(f_j + \sqrt{\epsilon/2}\, q_j) + \delta k_j}{(2L/c)(\gamma_* - i\Omega)}$, (B15)

where we have used $\tilde T = \frac{1}{2}\epsilon T$. The cavity's internal electric
field $E_{\rm int}$ is expression (B2) with $D \to (2/\sqrt{T})(D/\sqrt{2})$ [Eq.
(B9)] and $d_j \to j_j$ [expression (B15)]; cf. Fig. 16. The power


$^{16}$Here we have neglected the attenuation of the carrier field due to
the arm-cavity losses. This neglect is in the same spirit as our
ignoring attenuation in the input optics, in the beam splitter, and in
mode matching into the arm cavities. Including these attenuations
would simply change $D$ in Eq. (B9) to $D \times (1 - \frac{1}{2}\,\text{power attenuation factor})$,
i.e., $D \times (1 - \frac{1}{2}\epsilon)$ for the effect of arm-cavity losses.
Equivalently, it would dictate replacing $I_0$ by $I_0\,(1 - \text{power attenuation factor})$
in $\mathcal{K}$, $\mathcal{K}_*$, and all our formulas for the
gravitational-wave noise.
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circulating  in  the  cavity  is  this  (£fnt/47r)*4r,  and  consists  of 
two  parts,  a  steady  classical  piece 


W 

rr  pi 


1 41)2  *  _ 2 ,  _  l/2 


(B16) 


and  a  fluctuating  piece 


SW  = 

^  rr  circ 


r^yjl0h(00(fl+  Ve/2<?i)  _.n 

Jo  (L/cXy^-iCl)  2tt 

(B17) 


where  H.c.  means  Hermitian  conjugate  (adjoint)  of  the  pre¬ 
vious  term. 


4.  Mirror  motion 

The  circulating-power  fluctuations  (B17)  produce  a  fluc¬ 
tuating  radiation-pressure  (back-action)  force 

$F_{\rm BA} = 2\,\delta W_{\rm circ}/c$ (B18)


on  each  mirror.  This  force  is  equal  and  opposite  on  the  cavi¬ 
ty’s  two  mirrors  and,  along  with  the  gravitational  waves,  it 
produces  the  following  acceleration  of  the  mirror  separation: 


$\dfrac{d^2 X(t)}{dt^2} = \dfrac{1}{2}\,\eta_{ne}\, L\, \dfrac{d^2 h(t)}{dt^2} + \dfrac{4\,\delta W_{\rm circ}(t)}{mc}$. (B19)


Here $h(t)$ is the gravitational-wave field (projected onto the
interferometer's arms), and $\eta_{ne}$ is $+1$ for the north arm and
$-1$ for the east arm (one arm is stretched while the other is
squeezed).

Below we will need an expression for the (Fourier transform
of the) arm-length difference, $x = X_n - X_e$. It can be
obtained by Fourier transforming the equation of motion
(B19), solving for $\tilde X$ (i.e., $\tilde X_n$ or $\tilde X_e$), inserting expression
(B17) for $\delta W_{\rm circ}$, and then taking the difference of the north
and east arms. The result is

$x = Lh + x_{\rm BA}$ (B20)


[cf.  Eq.  (15)],  where 

—  4\l2I0ho)0(al  +  \fe/2rii) 

X  n  a  =  “ 

mkl2L(  —  ii 1) 

=  -^CJ2LhSQL(al  +  4d2rh)ei^.  (B21) 

Here  we  have  introduced  the  quadrature  amplitude  for  the 
difference  of  the  arms’  noise  fields 


Below  we  shall  also  need  the  following  expression  for  the 
difference  of  the  two  arms’  sideband  fields  produced  by  the 
mirror  motions’  coupling  to  the  carrier: 


8k\  —  Ske2 

~1T~ 


In  M„X 


(B23) 


This follows from Eqs. (B10), (B3) and (12).


5.  Cavity  output 

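The field exiting from the (north or east) cavity is obtained
by combining Eqs. (B8), (B11) and (B12):

$g_j = \dfrac{\sqrt{\tilde R}\, e^{2i\Omega L/c} - \sqrt{R}}{1 - \sqrt{R\tilde R}\, e^{2i\Omega L/c}}\, f_j + \dfrac{(\sqrt{T\tilde T}\, q_j + \sqrt{T}\, \delta k_j)\, e^{i\Omega L/c}}{1 - \sqrt{R\tilde R}\, e^{2i\Omega L/c}}$. (B24)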

Inserting  Eq.  (B13)  for  the  denominator  and  analogous  ex¬ 
pressions  for  the  numerator,  and  discarding  terms  that  are 
higher  order  than  linear  in  the  losses,  we  bring  Eq.  (B24)  into 
the  form 


Si=  (  1  -  2 £)e2ipf]+  'l£eil3qj  + 


0 c/2L)2T 

r*+^ 


'P*8kj , 


(B25) 


where $\beta_*$ is given by Eq. (98) and $\mathcal{E}$ by Eq. (100).


6.  Beam  splitter  output 

By  combining  Eqs.  (B4),  (B25),  and  (B22),  we  obtain  for 
the  dark-port  output  of  the  beam  splitter 


bj=ll  —  —£j  aje2lP+  y\£nxelP 


+ 


I(c/2L)2tI  Sk”-Ske;\ 


eip*. 


(B26) 


Inserting $\delta k_1^{n,e} = 0$ [Eq. (B10)] and our expression (B23) for
the difference of the $\delta k_2$'s, and inserting Eq. (B20) for $x$ and
Eqs. (99), (20), (B14) for $\mathcal{K}_*$, $h_{\rm SQL}$, $\gamma_*$, we obtain for
the output fields:

bx  =  1 1  —  —  £\  ale2lP+  (B27a) 

b2=  1 1  —  —£j  a2e2lf3+  ^Sn2elf3 


(B22) 


and have used Eq. (B4) for $f_1^n$ and $f_1^e$, and Eqs. (99), (98),
(19) and (20) for the coupling constant $\mathcal{K}_*$, the phase $\beta_*$,
the SQL power $I_{\rm SQL}$ and the standard quantum limit $h_{\rm SQL}$.


+  V2  /C* 


^  h  +  xba/L\ 

- r 


h 


SQL 


(B27b) 


By inserting expression (B21) for the back-action-induced
mirror displacement $x_{\rm BA}$, we obtain the input-output relations
quoted in the text: Eqs. (97) and (101) with losses, and
Eqs. (16) in the lossless limit.
APPENDIX C: FILTER PARAMETERS

In  our  discussion  of  FD  homodyne  detection  (Sec.  V),  we 
derived  the  following  requirement  for  the  conventional  ho¬ 
modyne  phase  6  and  the  filter  parameters  and  Sj  (with  J 
=  1  and  II): 


n2(y2+n2)  /  ^i+  +  ^i_+^ii+  +  ^h_\ 

tan  0(12)  = - ^4 - =  tan^  0 - - - j 

(Cl) 

[Eqs.  (81),  (86),  and  (83)],  where 

$\alpha_{J\pm} = \arctan(\xi_J \pm \Omega/\delta_J)$ (C2)

[Eq.  (88)].  In  this  appendix,  we  shall  show  that  this  require¬ 
ment  is  satisfied  by  the  parameter  choices  asserted  in  the 
text:  Eqs.  (85)  and  (89). 

We  initially  regard  the  parameters  6 ,  ijj,  and  Sj  as  un¬ 
known.  By  inserting  Eq.  (C2)  into  Eq.  (Cl)  and  invoking 
some  trigonometric  identities,  we  obtain  the  requirement 

(R0~Io  cot  0)  +  (R2  - 1 2  cot  0)  II2  +  tf4I24 
(R0  cot  0+Io)  +  (R 2  cot  0+I2)fL2  +  R4  cot  6i l4 
y2Il2  +  Il4 


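Here $R_0 + R_2\Omega^2 + R_4\Omega^4$ is the real part and $I_0 + I_2\Omega^2$
is the imaginary part of $(1 + i\tan\alpha_{{\rm I}+})(1 + i\tan\alpha_{{\rm I}-})(1 + i\tan\alpha_{{\rm II}+})(1 + i\tan\alpha_{{\rm II}-})$. More specifically,

$R_0 = 1 - \xi_{\rm I}^2 - \xi_{\rm II}^2 - 4\xi_{\rm I}\xi_{\rm II} + \xi_{\rm I}^2\xi_{\rm II}^2$, (C4a)

$R_2 = (1 - \xi_{\rm I}^2)/\delta_{\rm II}^2 + (1 - \xi_{\rm II}^2)/\delta_{\rm I}^2$, (C4b)

$R_4 = 1/(\delta_{\rm I}^2\delta_{\rm II}^2)$, (C4c)

$I_0 = 2(\xi_{\rm I} + \xi_{\rm II})(1 - \xi_{\rm I}\xi_{\rm II})$, (C4d)

$I_2 = 2\xi_{\rm II}/\delta_{\rm I}^2 + 2\xi_{\rm I}/\delta_{\rm II}^2$. (C4e)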

To get rid of the $\Omega^4$ term in the denominator of Eq. (C3),
we must set


0=  7t/2,  so  cot  0=0.  (C5) 

(We cannot set $R_4 = 0$ since that would require an infinite
bandwidth for one or both of the filters.) To get rid of the $\Omega^2$
term in the denominator and the constant term in the numerator,
and to make the $\Omega^2$ and $\Omega^4$ terms in the numerator have
the correct coefficients, we must set


1 2  =  0, 

(C6a) 

Ro  =  0, 

(C6b) 

R22/(I0R4)=y4/A4=4/P, 

(C6c) 

R2/R4=y2. 

(C6d) 
Here we have used definition (89a) of the constant $P$.

Equations (C6) are four equations for the four unknown
filter parameters: the fractional frequency offsets $\xi_{\rm I}$, $\xi_{\rm II}$
and the half bandwidths $\delta_{\rm I}$, $\delta_{\rm II}$. In the next four paragraphs
we shall explore the consequences of these four equations,
arriving finally at the solution (89) for $\xi_{\rm I}$, $\xi_{\rm II}$, $\delta_{\rm I}$, and
$\delta_{\rm II}$ given in the text.

Equation (C6a) implies that

$\delta_{\rm I}^2/\delta_{\rm II}^2 = -\xi_{\rm II}/\xi_{\rm I}$. (C7)

Equation (C6b) implies that $(1 - \xi_{\rm I}\xi_{\rm II})^2 = (\xi_{\rm I} + \xi_{\rm II})^2$. It
turns out that one of the frequency offsets is positive and the
other is negative (cf. Fig. 10); we choose $\xi_{\rm I}$ to be the positive
one. It also turns out that $\xi_{\rm I} + \xi_{\rm II}$ is positive (cf. Fig. 10).
Consequently, we can take the square root of the above equation
to obtain

$1 - \xi_{\rm I}\xi_{\rm II} = \xi_{\rm I} + \xi_{\rm II}$, (C8)

which enables us to express the frequency offsets in terms of
each other:

$\xi_{\rm I} = \dfrac{1 - \xi_{\rm II}}{1 + \xi_{\rm II}}, \quad \xi_{\rm II} = \dfrac{1 - \xi_{\rm I}}{1 + \xi_{\rm I}}$. (C9)


Equation  (C6c),  when  combined  with  Eqs.  (C7)  and  (C8), 
implies  that 


8 

P 


(ft+£ll)2 


(CIO) 


We  shall  now  combine  this  equation  with  Eqs.  (C9)  to  obtain 
Eqs.  (89)  for  the  frequency  offsets  and  £n  in  terms  of  P 
=  4y4/A4.  Our  first  step  is  to  define  A±  by  Eqs.  (89c)  and 
(89d),  which  are  equivalent  to 


A.-p-.  (Cl  1) 

tf-i  fS-i 

Note  that  the  relation  (C9)  between  ^  and  £n  is  equivalent  to 

4A+A_  =  1.  (Cl  2) 

By  using  Eqs.  (C9),  (Cll)  and  (C12),  we  can  reexpress  the 
right  side  of  Eq.  (CIO)  solely  in  terms  of  A+  : 

8  (4A2-1)2 

P  _  A  +  (4A  +  +  1) ' 

It  is  convenient  to  define  Q  by  Eqs.  (89b),  which  are  equiva¬ 
lent  to 


A  +  +A_  =  2g/E.  (Cl  4) 

Using  Eqs.  (C12)  and  (C14),  we  can  rewrite  Eq.  (C13)  in 
terms  of  Q  instead  of  A  +  : 
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2  2  Q  P 

P~-p-~2Q' 

which  can  be  solved  for  Q  as  a  function  of  P 

i  +  Vi +P1 


By  eliminating  <5n  with  the  aid  of  Eq.  (C7),  we  obtain 

(Cl  5) 

'Mnr-'-lr)-  (C18> 

(C16)  Using  Eqs.  (Cll),  (C12),  and  (89b),  we  can  rewrite  this  as 


This  is  the  relation  asserted  in  the  text,  Eq.  (89a),  and  it 
completes  our  derivation  of  Eqs.  (89a)-(89d)  for  the  fre¬ 
quency  offsets  and  £n  in  terms  of  P. 

Turn, finally, to the consequences of Eq. (C6d), which
says

$\gamma^2 = \delta_{\rm I}^2(1 - \xi_{\rm I}^2) + \delta_{\rm II}^2(1 - \xi_{\rm II}^2)$. (C17)


Si 


7 


(Cl  9) 


which  is  the  formula  for  the  half  bandwidth  given  in  the 
text,  Eq.  (89e).  The  corresponding  formula  for  £n,  Eq.  (89f), 
follows  directly  from  Eqs.  (Cl 9)  and  (C7). 
We present detailed discussions of cooling and trapping mechanisms for an atom in an optical trap inside an
optical cavity, as relevant to recent experiments. The interference pattern of cavity QED and trapping fields in
space makes the trapping wells, in principle, distinguishable from one another. This adds considerable flexibility
to creating effective trapping and cooling conditions and to detection possibilities. Friction and diffusion
coefficients are calculated in and beyond the low excitation limit and full three-dimensional simulations of the
quasiclassical motion of a Cs atom are performed.
I. INTRODUCTION

A  recent  experiment  [1]  succeeded  in  trapping  a  single 
atom  with  single  photons  inside  an  optical  cavity  and  in 
monitoring  the  atomic  motion  with  the  resolution  approach¬ 
ing  the  standard  quantum  limit  for  position  measurements. 
Yet  a  second  experiment  [2]  has  likewise  reported  single¬ 
atom  trapping  at  the  few-photon  level,  although  in  this  case 
the  trapping  potential  and  diffusion  are  in  fact  well  approxi¬ 
mated  by  a  free-space  semiclassical  theory  [3]. 

One  future  objective  for  such  experiments  is  to  use  atoms 
trapped  in  cavities  for  quantum  communication  purposes, 
with  atoms  serving  as  quantum  memories  and  photons  as  the 
transporters  of  quantum  information  [4,5].  While  the  single¬ 
photon  trapping  experiments  provide  a  new  paradigm  for 
quantum  measurement  and  control,  they  are,  nevertheless, 
not  entirely  suitable  for  the  purpose  of  distributed  quantum 
networks  where  qubits  will  be  communicated  among  quan¬ 
tum  nodes.  The  reason  is  the  short  trapping  lifetime  of  the 
atoms  as  well  as  limited  operation  flexibility.  A  better  strat¬ 
egy  might  be  to  use  the  cavity  QED  field  for  quantum  state 
entanglement  and  distribution  while  an  additional  (external) 
trapping  mechanism  provides  the  necessary  confinement  of 
the  atomic  center-of-mass  motion.  For  instance,  in  another 
recent experiment from the Caltech group [6], mean trapping
times of ≈28 ms (as compared to mean trapping times of
<1 ms in the experiments [1,2]) were achieved by employing
a far-off resonant trapping (FORT) beam along the
cavity  axis.  In  that  experiment  the  trapping  lifetime  was  lim¬ 
ited  due  to  intensity  fluctuations  of  the  intracavity  FORT 
beam  [7].  Here  we  consider  the  situation  of  current  improved 
experiments  [8]  in  which  a  single  atom  is  held  inside  an 
optical  cavity  in  a  stable  FORT  beam  of  minimum  intensity 
fluctuations. 

Several  mechanisms  for  cooling  inside  optical  resonators 
have  been  discussed  before  [9-11].  Here  we  discuss  in  detail 
how  the  combination  of  an  external  trapping  potential  and 
the  cavity  QED  field  adds  flexibility  in  predetermining  where 
and to what degree atoms will be trapped and cooled. Moreover,
our calculations go beyond the weak driving limit discussed
in [10]. That is, we allow the "probe" field driving
the cavity to be so strong as to appreciably modify the dynamical
behavior of, rather than merely probe, the atom-cavity
system.

This  paper  is  organized  as  follows.  In  Sec.  II  we  describe 
the  physical  situation  of  an  atom  trapped  in  an  optical  poten¬ 
tial  and  strongly  interacting  with  a  cavity  QED  field.  We 
give  the  evolution  equations  for  both  internal  and  external 
atomic  degrees  of  freedom  and  for  the  quantized  cavity 
mode.  Section  III  contains  an  exposition  on  how  we  calcu¬ 
lated  friction  and  diffusion  coefficients  from  the  forces  act¬ 
ing  on  the  atom.  Section  IV  contains  the  main  results  of  this 
paper:  we  discuss  simple  pictures  for  cooling  mechanisms, 
based  on  the  dressed  state  structure  of  the  atom-cavity  sys¬ 
tem,  and  give  numerical  results  for  the  typical  cooling  and 
diffusion  rates,  and  hence  “temperatures”  for  single  atoms 
under  various  trapping  conditions.  We  also  study  the  satura¬ 
tion  behavior  under  strong  driving  conditions  and  perform 
simulations  of  the  full  three-dimensional  (3D)  motion  of  at¬ 
oms  trapped  in  particular  wells  that  show  how  the  probe  field 
transmission  is  correlated  with  the  atomic  motion  and  how 
trapping  times  can  be  prolonged  by  strong  cooling.  Section 
IV  F  concludes  with  a  brief  discussion  of  a  slightly  different 
trapping  scheme.  The  summary  highlights  the  main  results. 

II.  DESCRIPTION  OF  PROBLEM 

We  consider  a  single  two-level  atom  coupled  to  a  single- 
quantized-cavity  mode  and  coupled  to  a  (classical)  far-off 
resonant trapping beam. In most of the paper we assume that
the FORT shifts the atomic excited state $|e\rangle$ up and the
ground state $|g\rangle$ down by an amount $S_F(\mathbf{r})$ (i.e., the energy
of the ground state is $E_g - S_F$, that of the excited state $E_e + S_F$),
as this is the situation pursued in previous and current
experiments [6,8]. In Sec. IV F, however, we will also study
the different situation where both ground and excited states
are shifted down by $S_F$ (see, e.g., [12]). The FORT beam
coincides with one of the longitudinal modes of the cavity
and its wavelength $\lambda_F$ is longer than that of the main cavity
mode of interest for cavity QED, $\lambda_0$. In fact, in the experiments
[6,8] the cavity length $L$ is $104\lambda_0/2 = 102\lambda_F/2$.

The position-dependent ac-Stark shift due to the FORT
field is of the form

$S_F(\mathbf{r}) = S_0 \sin^2(k_F z)\exp(-2\rho^2/w_0^2)$, (1)
FIG. 1. The FORT-induced shift of the ground state on axis
($\rho = 0$) in the case where $S_0/(2\pi) = 10$ MHz and the cavity-QED
coupling rate $g$ (dashed curve), with $g_0 = 3S_0$, as functions of position
along the cavity axis measured in units of the FORT wavelength
$\lambda_F$, with $z = 0$ at the left cavity boundary. The cavity length
is $L = 15\lambda_F$.

with $S_0 > 0$ the maximum shift, $k_F = 2\pi/\lambda_F$ the wave vector
of the FORT field, $w_0$ the size of the Gaussian mode of the
cavity, while $z$ and $\rho$ give the coordinate along, and the
distance perpendicular to, the cavity axis, respectively. The
quantized cavity mode is assumed to have the same transverse
dimensions$^1$ $w_0$ so that the atom-cavity coupling is
determined by

$g(\mathbf{r}) = g_0 \sin(kz)\exp(-\rho^2/w_0^2)$, (2)

with $g_0$ the maximum coupling rate and $k = 2\pi/\lambda_0$ the wave
vector of the cavity mode. Under conditions where the cavity
is not driven too strongly, the atom will be trapped around
the antinodes of the red-detuned FORT field. Thanks to the
fact that $\lambda_0 \neq \lambda_F$, the atom will experience a different coupling
strength to the cavity mode in each different well. Figure 1
shows the axial pattern arising from the FORT and
cavity fields. For illustrative purposes we choose here (and in
the rest of this paper) a cavity of length $L = 16\lambda_0 = 15\lambda_F$.
This does not influence the basic physics involved: in particular
we note that the precise value of $\lambda_F$ is largely irrelevant
on the time scales considered here, as the FORT field
is detuned far from atomic resonance. The choice of $L = 16\lambda_0 = 15\lambda_F$
just means that only eight wells out of 30 are
qualitatively and quantitatively different.

This is illustrated in Fig. 2 where we plot the value of the
cavity QED coupling $g$ at the antinodes of the FORT (i.e.,
the bottom of the trapping potential). In particular, there are
two antinodes in which $g = 0$, and four in which $|g|$ attains its
maximum.


FIG. 2. The values of $|g|/g_0$ at the locations of the antinodes of
the FORT, i.e., at the points $z_n = (n/2 - 1/4)\lambda_F$ for $n = 1, \ldots, 30$.
There are eight quantitatively different wells.
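The well counting quoted above can be reproduced with a short numerical sketch. It evaluates $|g|/g_0 = |\sin(k z_n)|$ on axis at the FORT antinodes $z_n = (n/2 - 1/4)\lambda_F$, assuming the illustrative geometry $L = 16\lambda_0 = 15\lambda_F$ from the text. The function name and the rounding tolerance are illustrative choices, not part of the original calculation.

```python
import math
from collections import Counter


def coupling_at_antinodes(n_wells=30, lambda_ratio=16 / 15):
    """Return |g|/g0 on axis at the FORT antinodes z_n = (n/2 - 1/4) * lambda_F.

    lambda_ratio is lambda_F / lambda_0 (16/15 for L = 16 lambda_0 = 15 lambda_F).
    """
    values = []
    for n in range(1, n_wells + 1):
        z_over_lambda_f = n / 2 - 1 / 4
        # k z = 2 pi z / lambda_0 = 2 pi (lambda_F / lambda_0) (z / lambda_F)
        kz = 2 * math.pi * lambda_ratio * z_over_lambda_f
        # Round so that wells equal up to floating-point noise compare equal.
        values.append(round(abs(math.sin(kz)), 9))
    return values


values = coupling_at_antinodes()
counts = Counter(values)
print("distinct wells:", len(counts))
print("wells with g = 0:", counts[0.0])
print("wells with maximal |g|:", counts[max(values)])
```

Changing `lambda_ratio` to 104/102 and `n_wells` to 204 would give the corresponding pattern for the experimental cavity length mentioned in Sec. II.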

The cavity is driven by an external classical field $\mathcal{E}(t) = \mathcal{E}_0\exp(i\omega_p t)$,
at a frequency $\omega_p$, which is used to probe the
atom-cavity system and which may cool the atom at the
same time. In the following, the strength of the driving field
is indicated by the number of cavity photons $N_e$ that would
be present if there were no atom in the cavity, rather than by
$\mathcal{E}_0$. This closely follows the experimental procedure for
determining the driving strength. The relation between the two
is

$N_e = \dfrac{\mathcal{E}_0^2}{\kappa^2 + \Delta_c^2}$, (3)

$^1$It is in fact the Rayleigh ranges of the beams that are identical, so
that $w_0^{\rm FORT}/w_0^{\rm cav} = \sqrt{\lambda_0/\lambda_F} \approx 0.99$.


with $\Delta_c = \omega_c - \omega_p$ the detuning of the probe from the cavity
frequency $\omega_c = kc$. The Hamiltonian for the internal atomic
degrees of freedom and the quantized cavity mode is, in a
frame rotating at the probe frequency $\omega_p$, given by

$H = \hbar\Delta_c a^\dagger a + \hbar\Delta_a \sigma^+\sigma^- + 2\hbar S_F(\mathbf{r})(\sigma^+\sigma^- - 1/2) + \hbar\mathcal{E}_0(a^\dagger + a) + \hbar g(\mathbf{r})(a^\dagger\sigma^- + \sigma^+ a)$. (4)

Here $\Delta_a = \omega_a - \omega_p$ is the detuning of the atomic resonance
from the probe frequency. In all numerical examples given
below the cavity frequency is chosen to coincide with the
atomic frequency, so that $\Delta_c = \Delta_a$. The quantity $\Delta_p = -\Delta_a$
is then referred to as the probe detuning. Note here that without
a FORT the optimum cavity and atom detunings are not
equal [9-11]. In our case, however, the FORT effectively
changes the atomic frequency in a position-dependent way
and thus the precise value of the atomic detuning relative to
the cavity detuning is largely irrelevant. Indeed, optimum
cooling conditions will exist in certain wells but not in others,
which is one feature that allows one to distinguish various
wells.
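As a consistency check on Eq. (3), the empty-cavity photon number can be compared with the steady state of the mean-field equation for the cavity amplitude. The sketch below assumes $g = 0$, a field decay rate $\kappa$, and the driving term $\hbar\mathcal{E}_0(a^\dagger + a)$ of Eq. (4), so that $d\langle a\rangle/dt = -(\kappa + i\Delta_c)\langle a\rangle - i\mathcal{E}_0$. The function names, the parameter values (in units of $\kappa$), and the simple Euler integrator are illustrative assumptions.

```python
def empty_cavity_photons(drive, kappa, delta_c):
    """Empty-cavity photon number N_e = E0^2 / (kappa^2 + Delta_c^2), Eq. (3)."""
    return drive**2 / (kappa**2 + delta_c**2)


def integrate_mean_field(drive, kappa, delta_c, t_end=20.0, dt=1e-3):
    """Euler-integrate d<a>/dt = -(kappa + i Delta_c) <a> - i E0 from <a> = 0.

    Returns |<a>|^2 at t_end, which equals the photon number for a driven,
    damped empty cavity (the field stays in a coherent state).
    """
    alpha = 0j
    for _ in range(int(t_end / dt)):
        alpha += dt * (-(kappa + 1j * delta_c) * alpha - 1j * drive)
    return abs(alpha) ** 2


# Illustrative values in units of kappa (not taken from the experiments).
kappa, delta_c, drive = 1.0, 0.5, 0.3
n_formula = empty_cavity_photons(drive, kappa, delta_c)
n_numeric = integrate_mean_field(drive, kappa, delta_c)
print(n_formula, n_numeric)
```

Inverting the same relation gives the drive amplitude needed for a target $N_e$, which is how the driving strength is specified in the text.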

Coupling  the  atom  and  the  cavity  to  the  remaining  modes 
of  the  electromagnetic  field  leads  by  a  standard  procedure  to 
the  master  equation  for  the  density  operator  of  the  coupled 
atom-cavity  system, 
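$\dfrac{d\rho}{dt} = -i[H,\rho]/\hbar - \kappa\{a^\dagger a,\rho\} + 2\kappa a\rho a^\dagger - \dfrac{\Gamma}{2}\{\sigma^+\sigma^-,\rho\} + \dfrac{3\Gamma}{8\pi}\int d^2\hat{\mathbf{k}} \sum_{\hat\epsilon} (\hat{\mathbf{d}}\cdot\hat\epsilon)^2 \exp(-i\mathbf{k}\cdot\mathbf{r})\, \sigma^-\rho\sigma^+ \exp(i\mathbf{k}\cdot\mathbf{r})$, (5)

with $\Gamma$ the spontaneous decay rate and $\kappa$ the cavity decay
rate. We are mainly interested in the strong-coupling regime,
where $g_0 \gg \Gamma, \kappa$.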

We  treat  the  external  (center-of-mass)  degrees  of  freedom 
of  the  atom  classically,  an  approximation  justified  at  the  end 
of  Sec.  IV  C.  For  a  discussion  of  various  interesting  effects 
arising  from  the  quantized  external  motion  of  an  atom  in  a 
cavity  QED  field,  we  refer  the  reader  to  [13]. 

In  the  quasiclassical  approximation  (i.e.,  where  we  retain 
the  full  quantum  character  of  the  internal  degrees  of  freedom 
and  of  the  cavity  mode;  see  [3]  for  a  full  discussion  of  this 
approximation),  the  integral  in  Eq.  (5)  can  be  evaluated  to 
give  the  simpler  result 

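$\dfrac{d\rho}{dt} = -i[H,\rho]/\hbar - \kappa\{a^\dagger a,\rho\} + 2\kappa a\rho a^\dagger - \dfrac{\Gamma}{2}\{\sigma^+\sigma^-,\rho\} + \Gamma\sigma^-\rho\sigma^+$. (6)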

The force acting on the atom consists of two parts, one due
to spontaneous emission, whose mean vanishes,
and the other represented by the operator

$\mathbf{F} = -\nabla H = -2\hbar\nabla S_F(\sigma^+\sigma^- - 1/2) - \hbar\nabla g\,(a^\dagger\sigma^- + \sigma^+ a)$, (7)

which has contributions arising from the FORT potential and
from the interaction with the cavity mode. It was only the
latter part that was considered in [10] and that leads to 1D
cooling to temperatures of the order $k_B T \sim \min(\hbar\kappa, \hbar\Gamma/2)$.
See also Refs. [14] for similar calculations on single atoms
moving in cavity QED field, and Refs. [15,16] for calculations
of diffusion of atoms in optical traps in free space.

It  can  be  shown  [17]  starting  from  a  fully  quantized  de¬ 
scription,  that  the  semiclassical  motion  of  the  atom  is  de¬ 
scribed  by  a  Fokker-Planck  equation  for  the  Wigner  distri¬ 
bution  function  containing  (position-dependent)  friction  and 
diffusion  coefficients.  Equivalently,  we  may  use  stochastic 
equations  for  the  classical  atomic  position  and  velocity  vari¬ 
ables  r  and  v  of  the  form 

dr=vdt , 


with the expression (7) for the force operator, we can calculate
$\beta$ and $D$ by the procedure outlined in the next section.
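To illustrate how stochastic equations of the form (8) are used, the following is a minimal one-dimensional Euler-Maruyama sketch with constant coefficients. It is not the full three-dimensional, position-dependent simulation described in the paper: the constant `force_over_m`, `beta`, and `b` values, the time step, and the function name are all illustrative assumptions. For zero mean force the stationary velocity variance should approach $B^2/(2\beta) = D/\beta$, which gives a simple check.

```python
import math
import random


def mean_square_velocity(force_over_m, beta, b, dt=1e-2, steps=200_000, seed=0):
    """Integrate dv = (F/m) dt - beta v dt + B dW in 1D and return <v^2>.

    Uses the Euler-Maruyama scheme with Wiener increments dW ~ N(0, dt).
    All coefficients are constants here (a simplifying assumption).
    """
    rng = random.Random(seed)
    sqrt_dt = math.sqrt(dt)
    v = 0.0
    sum_v2 = 0.0
    for _ in range(steps):
        d_w = rng.gauss(0.0, sqrt_dt)
        v += force_over_m * dt - beta * v * dt + b * d_w
        sum_v2 += v * v
    return sum_v2 / steps


beta, b = 1.0, 1.0            # dimensionless illustrative values
diffusion = b * b / 2         # D = B B^T / 2 in one dimension
v2 = mean_square_velocity(0.0, beta, b)
print("simulated <v^2>:", v2, " expected D/beta:", diffusion / beta)
```

In a position-dependent simulation the same update would be applied with $\langle F\rangle$, $\beta$, and $B$ re-evaluated at the current atomic position on every step, together with $dr = v\,dt$.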

III.  FRICTION  AND  DIFFUSION 

Reference  [10]  employs  Heisenberg  equations  of  motion 
for  various  field  and  atomic  operators  to  find  friction  and 
diffusion  coefficients.  These  equations  are  not  closed  and, 
consequently,  an  approximation  has  to  be  made  in  order  to 
find solutions. The natural assumption is to consider the
weak driving limit (i.e., $\mathcal{E}_0 \ll \kappa$) and truncate the available
Hilbert space to that part containing no more than a single
cavity photon. This allows one to close the Heisenberg equations
[10]. Here we employ a different method (using the
density matrix equations) to calculate friction and diffusion
coefficients, one that does not require us to stay within the weak
driving limit. In addition, we used the procedure of Ref. [10]
to obtain results in the weak driving limit for verification
purposes. In any case, it is still true that the most interesting
regime is where only one or few photons are involved.
Note  that  given  the  strong  coupling  between  atom  and  cavity 
field,  even  a  single  photon  is  sufficient  to  lead  to  regimes  far 
beyond  the  weak  driving  limit.  In  our  examples  we  truncated 
the  Hilbert  space  to  photon  numbers  of  around  4  or  smaller. 
We  refer  to  [19]  for  an  exposition  on  how  to  represent  op¬ 
erators  in  truncated  Hilbert  spaces  of  precisely  this  form  in  a 
numerically  convenient  manner. 

The master equation (6) is written as

$\dfrac{d\rho}{dt} = \mathcal{L}\rho$. (9)

Numerically, the Liouvillian superoperator $\mathcal{L}$ is converted
into a premultiplication operator by methods explained in
[19]. In order to find friction and diffusion coefficients we
apply a simple procedure, which yields these coefficients at
zero velocity: this is sufficient for our purposes as the atom
we are interested in, Cs, is relatively heavy. More precisely,
the relevant dimensionless parameters determining the velocity
dependence of friction and diffusion coefficients are $kv/\Gamma$
and $kv/\kappa$ (see for instance [20]), and both are very small in
all our simulations. In particular, $\Gamma/k \approx 43$ m/s and $\kappa/k \approx 3.4$ m/s,
while velocities in the trapping regime we are
interested in (where atoms are localized in wells at low temperatures
for times $\gg \kappa^{-1}, \Gamma^{-1}$) are around the Doppler limit
velocity


$\sqrt{\dfrac{\hbar\Gamma/2}{m}} \approx 8.8$ cm/s. (10)
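The estimate in Eq. (10) can be checked with a few lines of arithmetic. The sketch below assumes a Cs D2 linewidth of $\Gamma/2\pi \approx 5.2$ MHz and a Cs mass of about 132.9 u; neither value is stated in the text, so both are labeled as assumptions in the code.

```python
import math

HBAR = 1.054571817e-34        # reduced Planck constant, J s
AMU = 1.66053906660e-27       # atomic mass unit, kg


def doppler_velocity(gamma, mass):
    """Doppler-limit velocity sqrt((hbar * Gamma / 2) / m), as in Eq. (10)."""
    return math.sqrt(HBAR * gamma / 2 / mass)


# Assumed values (not given in the text): Cs D2 linewidth and Cs-133 mass.
gamma_cs = 2 * math.pi * 5.2e6    # rad/s
mass_cs = 132.905 * AMU           # kg

v_doppler = doppler_velocity(gamma_cs, mass_cs)
print(f"Doppler-limit velocity: {100 * v_doppler:.2f} cm/s")
```

With these assumed inputs the result lies close to the 8.8 cm/s quoted in Eq. (10).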


$d\mathbf{v} = \dfrac{\langle\mathbf{F}\rangle}{m}\,dt - \beta\mathbf{v}\,dt + B\,d\mathbf{W}$, (8)

where $\langle . \rangle$ denotes an expectation value, $\beta$ is the friction
tensor (with dimensions of a rate), $m$ the mass of the atom, $B$
is a tensor such that $D = BB^T/2$ is the velocity diffusion tensor
(with dimension m$^2$/s$^3$), and $d\mathbf{W}$ is a three-dimensional
Wiener process that satisfies $dW_i\,dW_j = \delta_{ij}\,dt$ [18]. Starting


Also note that the standard procedure of continued fractions
to calculate the full velocity dependence is not directly
applicable to the present case, as the potential through which
the atom is moving is not periodic ($\lambda_F \neq \lambda_0$).

For an atom moving at velocity $\mathbf{v}$ we write

$\dfrac{d}{dt} = \dfrac{\partial}{\partial t} + \mathbf{v}\cdot\nabla$, (11)
and expand Eq. (9) in powers in $\mathbf{v}$ and solve for the steady
state. The zeroth-order solution is then the steady state $\rho_0$ at
zero velocity:

$\mathcal{L}\rho_0 = 0$, (12)

while the first-order term $\rho_1$ is determined by

$\mathcal{L}\rho_1 = \mathbf{v}\cdot\nabla\rho_0$. (13)

The zeroth-order force is the steady-state force for an atom at
rest, and is given by

$\langle A(t)B(t-\tau)\rangle = {\rm Tr}[A\exp(-iH_{\rm tot}\tau)\,B\rho_{\rm tot}(t)\exp(iH_{\rm tot}\tau)]$, (18)

with $\rho_{\rm tot}$ the density matrix of the total system. This expression
formally contains the evolution of a density matrix over
a time interval $\tau$ starting from an initial density matrix $\rho_{\rm init} = B\rho_{\rm tot}(t)$.
The quantum regression theorem now states that
Eq. (18) is still valid for the reduced density matrix that
evolves under the Liouvillian $\mathcal{L}$. That is, instead of Eq. (18)
we may use

$\langle A(t)B(t-\tau)\rangle = {\rm Tr}[A\exp(\mathcal{L}\tau)\,B\rho(t)]$. (19)


$\mathbf{F}_0 = -{\rm Tr}(\rho_0\nabla H)$. (14)

Similarly, the friction coefficients follow from the first-order
term in the force

$\mathbf{F}_1 = -{\rm Tr}(\rho_1\nabla H)$, (15)

by identifying

$\mathbf{F}_1 = -\beta m\mathbf{v}$, (16)

where $\beta$ is a $3\times 3$ tensor. In our case [6], the gradients along
the cavity axis are larger in magnitude than those in the
transverse directions by roughly a factor $kw_0 \approx 150$ (and
around the cavity axis where the atoms spend most of their
time the radial gradients are even smaller, of course). Since
the friction coefficient scales with the product of two gradients
[cf. Eqs. (13) and (15)], the largest element of the tensor
$\beta$ is the $zz$ component. Next largest in magnitude are the
off-diagonal components such as $\beta_{xz}$ and $\beta_{zx}$. Their effects,
however, can be safely neglected in our case: first, the force
in the $z$ direction proportional to $-\beta_{zx}v_x$ is smaller than the
friction force $-\beta_{zz}v_z$ by roughly a factor $kw_0$. Second, the
force in the $x$ direction $-\beta_{xz}v_z$ is not a friction force (as it is
not proportional to $v_x$), and its contribution is averaged out
because the oscillations in $v_z$ are faster than those in the $x$
direction by another factor $kw_0$. Finally, the purely radial
friction rates such as $\beta_{xx}$ are too small ($\lesssim 1$ s$^{-1}$ on average)
to have any influence on the time scales considered here.
Thus we take only $\beta_{zz}$ into account.

The diffusion coefficient, again at zero velocity, is calculated as follows. The standard method is to use the quantum regression theorem, and a particularly useful (for numerical purposes) interpretation of that theorem is given in [19]. The momentum diffusion tensor $D_p$ is given by


In our case, $\mathcal{L}$ is a time-independent operator and hence the right-hand side of Eq. (19) can be evaluated by expanding $\exp(\mathcal{L}\tau)$ in an exponential time series, as in the methods developed in [19]. This then is the method we use here to evaluate the friction and diffusion tensors, and the results have been checked in the low-intensity limit by applying the different methods from [10] to the same problem.

Diffusion due to spontaneous emission is not obtained this way (as the bath of vacuum modes has been eliminated already), but can be obtained by standard methods and gives an independent additional three components $(D_p)^{\rm SE}_{ii} = N_i \hbar^2 k^2 (\Gamma/2) \langle\sigma^+\sigma^-\rangle_0$ for $i = x, y, z$, with $\langle\cdot\rangle_0$ denoting a steady-state value and with the dimensionless factor $N_i$ depending on polarization. When the two-level system is formed by two Zeeman levels that are connected by circularly polarized light propagating in the z direction, we have $N_z = 2/5$, and $N_x = N_y = 3/10$.

Since the diffusion coefficients, just as the friction coefficients, scale as the square of a gradient, the largest component is $D_{zz}$. Off-diagonal elements such as $D_{xz}$ and $D_{zx}$ are, again, smaller by roughly a factor $kw_0 \approx 150$, while the diagonal radial components such as $D_{xx}$ are in fact largely determined by spontaneous emission, and are of similar or larger magnitude than the off-diagonal elements. The proper way to take into account the off-diagonal elements of the diffusion tensor $D$ is to diagonalize $D$, and consider three independent diffusion processes along the axes of the basis that diagonalizes $D$, with the eigenvalues of $D$ as diffusion coefficients. Using the fact that $D_{zz}$ is large we can calculate both eigenvalues and eigenbasis perturbatively. The eigenvalues to first order are given by


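$$D_p = \lim_{t\to\infty} \mathrm{Re} \int_0^{\infty} d\tau \left[\langle \mathbf{F}(t)\mathbf{F}(t-\tau)\rangle - \langle \mathbf{F}(t)\rangle\langle \mathbf{F}(t-\tau)\rangle\right], \tag{17}$$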

and its relation to the velocity diffusion tensor is $D = D_p/m^2$. Before eliminating any degrees of freedom, the total system in fully quantized form is described by a time-independent Hamiltonian, which we denote by $H_{\rm tot}$. In that case the time evolution of all operators is determined by $\exp(-iH_{\rm tot}t)$, and two-time averages of the form $\langle A(t)B(t-\tau)\rangle$ as appearing in Eq. (17) can be written as


$$D_{z'z'} = D_{zz} + \frac{D_{xz}D_{zx}}{D_{zz}} + \cdots, \tag{20}$$

where the $\cdots$ stands for terms of higher order in $1/(kw_0)$, while the axes change as


$$z' = z + x\,\frac{D_{xz}}{D_{zz}} + \cdots, \qquad x' = x - z\,\frac{D_{zx}}{D_{zz}} + \cdots. \tag{21}$$
FIG. 3. For parameters to be used later, $S_0/(2\pi) = 50$ MHz, $\Delta_p/(2\pi) = -10$ MHz, and $N_e = 0.01$, we plot here the ratio $d = D_{zx}^2/(D_{zz}D_{xx})$ as a function of position.

The fact that $z'$ is slightly tilted toward the x direction implies that a small part of the large diffusion coefficient $D_{z'z'}$ will contribute to diffusion in the x direction. This increase, however, is almost exactly compensated for by the decrease in $D_{x'x'}$. In particular, the velocity in the x direction undergoes the following Wiener process:

$$dv_x = \sqrt{2D_{x'x'} + 2D_{z'z'}\frac{D_{zx}^2}{D_{zz}^2} + \cdots}\; dW. \tag{22}$$

In our case it turns out that $D_{xx}D_{zz} \gg D_{zx}^2$ (see Fig. 3), so that effects due to the off-diagonal elements of the diffusion tensor can in fact be neglected. The figure also shows that the previous considerations about the relative sizes of the various components of $D$ do not just hold on average, but also locally.
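
As a quick numerical check of the perturbative statements in Eqs. (20) and (21), the following sketch compares the leading-order estimates with the exact eigen-decomposition of a symmetric 2x2 block of the diffusion tensor. The matrix entries are illustrative values in arbitrary units, not results from this paper, and the function names are hypothetical.

```python
import math

def exact_large_eigenpair(d_xx, d_zz, d_xz):
    """Exact larger eigenvalue of [[d_xx, d_xz], [d_xz, d_zz]] and the
    x-component of its eigenvector when the z-component is set to 1."""
    mean = 0.5 * (d_xx + d_zz)
    half_gap = 0.5 * (d_zz - d_xx)
    lam = mean + math.hypot(half_gap, d_xz)
    tilt = d_xz / (lam - d_xx)
    return lam, tilt

def perturbative_large_eigenpair(d_zz, d_xz):
    """Leading-order estimates, valid when d_zz dominates (cf. Eqs. (20), (21))."""
    return d_zz + d_xz**2 / d_zz, d_xz / d_zz

# Illustrative values (arbitrary units), chosen so that D_zz dominates.
D_XX, D_ZZ, D_XZ = 1e-3, 1.0, 5e-3

lam_exact, tilt_exact = exact_large_eigenpair(D_XX, D_ZZ, D_XZ)
lam_pert, tilt_pert = perturbative_large_eigenpair(D_ZZ, D_XZ)
ratio_d = D_XZ**2 / (D_ZZ * D_XX)  # the ratio d plotted in Fig. 3

print(f"eigenvalue: exact {lam_exact:.8f}, perturbative {lam_pert:.8f}")
print(f"tilt:       exact {tilt_exact:.6f}, perturbative {tilt_pert:.6f}")
print(f"d = {ratio_d:.3f}")
```

For these assumed numbers the ratio d is small, which is the regime in which the off-diagonal elements can be dropped.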

Thus, friction is appreciable only along the cavity axis, while diffusion has two main contributions: from spontaneous emission in all three directions, and a large diffusion along the cavity axis from fluctuations in the FORT and cavity QED forces.

IV.  NUMERICAL  RESULTS 

The following results pertain to a Cs atom, with the ground state given by $|6S_{1/2}; F=4; m_F=4\rangle$ and the excited state by $|6P_{3/2}; F'=5; m_F=5\rangle$, so that $\lambda_0 = 852.4$ nm and $\Gamma/(2\pi) = 5.2$ MHz. The cavity parameters are $\kappa/(2\pi) = 4$ MHz and $g_0/(2\pi) = 30$ MHz, and $w_0 = 20\ \mu$m, which are typical of the experiments discussed in [6]. Furthermore, the values for $S_0$ examined here are $S_0/(2\pi) = 10, 50$ MHz. Both of these values are close to those explored in the actual experiment [6], and they contrast the behavior of atoms in shallow ($S_0 < g_0$) and deep ($S_0 > g_0$) wells. Typical values for $N_e$ range from $10^{-3}$ to 0.1.
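
The parameter set above can be collected in a few lines of code. The sketch below (variable and function names are hypothetical) converts the quoted values to angular frequencies, evaluates the factor $kw_0$ used in the preceding section, and labels the two FORT depths as shallow or deep according to the comparison with $g_0$.

```python
import math

TWO_PI = 2.0 * math.pi

LAMBDA_0 = 852.4e-9          # atomic transition wavelength (m)
W0 = 20e-6                   # cavity mode waist (m)
GAMMA = TWO_PI * 5.2e6       # atomic linewidth (rad/s)
KAPPA = TWO_PI * 4e6         # cavity decay rate (rad/s)
G0 = TWO_PI * 30e6           # maximum atom-cavity coupling (rad/s)
S0_VALUES = (TWO_PI * 10e6, TWO_PI * 50e6)  # FORT shifts examined (rad/s)

k = TWO_PI / LAMBDA_0        # wave number of the probe transition (1/m)
k_w0 = k * W0                # ratio of axial to transverse gradients

def well_regime(s0, g0=G0):
    """Classify a FORT depth relative to the coupling g0."""
    return "shallow" if s0 < g0 else "deep"

regimes = [well_regime(s0) for s0 in S0_VALUES]

print(f"k*w0 = {k_w0:.1f}")
print("regimes:", regimes)
```

The computed $kw_0$ is close to the value of roughly 150 quoted in the text.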

A.  Dressed-state  structure 

We first focus on the atomic motion along the cavity axis.
The simplest way to get a feeling for the results for $\beta_{zz}$ and
FIG. 4. Transition frequencies $\Delta_\pm$ relative to the bare atomic frequency from the ground state to the lower two excited dressed states as functions of the position of the atom along the cavity axis (i.e., $\rho = 0$). Here $S_0/(2\pi) = 10$ MHz. Also indicated by the dotted line is the probe detuning used in Fig. 9, $\Delta_p/(2\pi) = -28$ MHz.

$D_{zz}$ as a function of the probe detuning $\Delta_p$ is to first consider the eigenenergies of the dressed atom-cavity states. When we neglect dissipation for the moment, and take the limit of no driving ($N_e = 0$), we can easily find the energies of the lower dressed states $|\psi_\pm\rangle$ containing at most one excitation: the state containing no excitation is the ground state with an energy of $E_0 = -\hbar S_F(r)$, while the energies of the two dressed states in the manifold of states containing a single excitation are

$$E_\pm = \hbar\omega_a \pm \hbar\sqrt{g(r)^2 + S_F(r)^2}, \tag{23}$$

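if the atom and cavity are on resonance. The excited dressed states are given by

$$|\psi_-\rangle = (\sin\theta)|g,1\rangle + (\cos\theta)|e,0\rangle, \tag{24}$$

with

$$\sin\theta = \frac{g}{\sqrt{g^2 + \left(\sqrt{g^2 + S_F^2} - S_F\right)^2}}, \qquad \cos\theta = \frac{S_F - \sqrt{g^2 + S_F^2}}{\sqrt{g^2 + \left(\sqrt{g^2 + S_F^2} - S_F\right)^2}}. \tag{25}$$

In Fig. 4 (10 MHz FORT) and Fig. 5 (50 MHz FORT) we plot the transition frequencies (relative to $\omega_a$) from the ground state to these two excited states as functions of position, i.e.,

$$\Delta_\pm = S_F(r) \pm \sqrt{g(r)^2 + S_F(r)^2}. \tag{26}$$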

This expression along with the figures explicitly shows that the main features of the atom-cavity system are determined by the ratio $S_0/g_0$. It furthermore shows an important difference with the situation of trapping with a FORT in free
FIG. 5. Same as previous figure but for $S_0/(2\pi) = 50$ MHz. Also indicated by the dotted line is the probe detuning used in Fig. 10, $\Delta_p/(2\pi) = -10$ MHz.

space. The fact that the excited state shifts up while the ground state shifts down implies that ground and excited states are trapped in different positions in free space. In the presence of the quantized cavity field, however, both the lower excited dressed state and the ground state are now shifted down. This may improve trapping and cooling conditions, as detailed below.
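
The dressed-state expressions of Eqs. (24)-(26) are easy to evaluate numerically. The sketch below is illustrative only: the function name is hypothetical, all frequencies are in MHz (divided by $2\pi$), and a nonzero coupling g is assumed so that the normalization does not vanish. It also checks that the computed amplitudes satisfy the eigenvalue equation of the 2x2 single-excitation Hamiltonian in the basis $\{|g,1\rangle, |e,0\rangle\}$, with energies measured from $\hbar\omega_a$.

```python
import math

def dressed_state(g, s_f):
    """Return (delta_minus, delta_plus, sin_theta, cos_theta) for g != 0.

    delta_pm are the transition frequencies of Eq. (26); sin/cos are the
    amplitudes of the lower dressed state, Eqs. (24) and (25).
    """
    root = math.hypot(g, s_f)          # sqrt(g^2 + S_F^2)
    delta_plus = s_f + root
    delta_minus = s_f - root
    norm = math.hypot(g, root - s_f)
    sin_theta = g / norm
    cos_theta = (s_f - root) / norm
    return delta_minus, delta_plus, sin_theta, cos_theta

# Example: coupling of 30 MHz in the deep (50 MHz) FORT.
G, S_F = 30.0, 50.0
d_minus, d_plus, s, c = dressed_state(G, S_F)

# Eigenvalue check: H = [[-S_F, g], [g, +S_F]] acting on (sin, cos)
# should return -sqrt(g^2 + S_F^2) times the same vector.
root = math.hypot(G, S_F)
residual_1 = (-S_F * s + G * c) - (-root) * s
residual_2 = (G * s + S_F * c) - (-root) * c

print(f"Delta_- = {d_minus:.3f} MHz, Delta_+ = {d_plus:.3f} MHz")
print(f"sin(theta) = {s:.4f}, cos(theta) = {c:.4f}")
```

For these inputs the lower transition lies a few MHz below the bare atomic frequency while the upper one lies above 100 MHz, in line with the asymmetry discussed above.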

B.  Cooling  mechanisms 

We now take a closer look at cooling mechanisms. In the regime of weak driving, we will find that the friction coefficient $\beta_{zz}$ is positive (corresponding to cooling) when the probe field is tuned slightly (by an amount $\sim\kappa, \Gamma/2$) below the transition to the relevant dressed state, while for blue detuning the friction coefficient is negative, leading to exponential heating of the atom's velocity. This can be understood by analogy with Doppler cooling: by tuning below resonance, the process of stimulated absorption followed by spontaneous emission leads to a loss of energy, while the maximum cooling rate is achieved by maximizing the product of excitation rate and detuning. Now looking back to Figs. 4 and 5 one sees that the variation of $\Delta_+$ with position is larger than that of $\Delta_-$, because both the ground state and the lower excited dressed state shift down, while the upper excited dressed state shifts upward. Generally speaking, for cooling purposes it is better to tune to the lower excited state so as to have smaller spatial variations in cooling rates. More importantly, the upper-excited-state energies decrease with increasing radial distance, whereas the lower-excited-state energy increases. Thus, for the upper state the probe detuning changes from red to blue, so that an atom cooled on axis will in fact be heated if it moves away radially. For the lower dressed state the probe detuning becomes more red, so that an atom that is optimally cooled on axis will still be cooled away from the axis, but at a lower rate.

FIG. 6. Decay rate and excitation rate of the lower dressed state as functions of position along the cavity axis ($\rho = 0$). Here and in Figs. 7 and 8, we chose the following parameters: $N_e = 0.001$, $S_0/(2\pi) = 50$ MHz, $\Delta_p/(2\pi) = -10$ MHz.

The most popular explanation for intracavity cooling [10] exploits analogies with Sisyphus cooling [21], although another explanation for cavity-based cooling based on asymmetries in coherent scattering was recently put forward in [11]. Here we illustrate the Sisyphus picture for cooling inside optical wells within an optical resonator, using a very simple dressed-state picture that makes use of only the lower dressed state and the ground state, relevant in the low-excitation limit. We choose one particular well, from $z = 2.0\lambda_F$ to $z = 2.5\lambda_F$, and one particular set of parameters given in the caption of Fig. 6. In that figure we plot the decay rate $\gamma_-$ of the lower dressed state and the excitation rate from ground to the dressed state, $\Omega_-$, as functions of position. In the weak driving limit the decay rate is given by

$$\gamma_- = \langle\psi_-|\kappa a^\dagger a + \Gamma\sigma^+\sigma^-/2|\psi_-\rangle = (\sin^2\theta)\kappa + (\cos^2\theta)\Gamma/2, \tag{27}$$


and  the  excitation  rate  by 

$$\Omega_- = |\langle g,0|\mathcal{E}_0(a^\dagger + a)|\psi_-\rangle| = \mathcal{E}_0|\sin\theta|. \tag{28}$$

These two quantities, together with the detuning of the probe field from (dressed-state) resonance, determine the steady-state population in the lower dressed state, according to

$$n_- = \frac{\Omega_-^2}{(\Delta_- - \Delta_p)^2 + \gamma_-^2}. \tag{29}$$


The population $n_-$ is plotted in Fig. 7, along with the transition frequency $\Delta_-$. These two quantities are sufficient to understand the Sisyphus cooling mechanism.
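
To make Eqs. (27)-(29) concrete, the following sketch evaluates the decay rate, excitation rate, and steady-state population of the lower dressed state for one set of inputs. It is a minimal illustration: the function name is hypothetical, all rates are expressed in MHz (divided by $2\pi$), a nonzero coupling is assumed, and the drive amplitude is an arbitrary assumed value rather than one derived from $N_e$.

```python
import math

def lower_dressed_state_rates(g, s_f, delta_p, kappa, gamma, drive):
    """Return (gamma_minus, omega_minus, n_minus), cf. Eqs. (27)-(29).

    Assumes g != 0 and that all arguments share the same frequency unit.
    """
    root = math.hypot(g, s_f)
    delta_minus = s_f - root                     # Eq. (26), lower branch
    norm = math.hypot(g, root - s_f)
    sin2 = (g / norm) ** 2                       # photon-like weight
    cos2 = 1.0 - sin2                            # atom-like weight
    gamma_minus = sin2 * kappa + cos2 * gamma / 2.0
    omega_minus = drive * math.sqrt(sin2)
    n_minus = omega_minus**2 / ((delta_minus - delta_p) ** 2 + gamma_minus**2)
    return gamma_minus, omega_minus, n_minus

KAPPA, GAMMA = 4.0, 5.2      # MHz, from the parameter list in Sec. IV
G, S_F = 30.0, 50.0          # MHz, example local values
DRIVE = 0.1                  # MHz, assumed drive amplitude (illustrative)

gamma_m, omega_m, n_detuned = lower_dressed_state_rates(
    G, S_F, -10.0, KAPPA, GAMMA, DRIVE)

# For comparison: probe tuned exactly onto the dressed-state resonance.
delta_res = S_F - math.hypot(G, S_F)
_, _, n_resonant = lower_dressed_state_rates(
    G, S_F, delta_res, KAPPA, GAMMA, DRIVE)

print(f"gamma_- = {gamma_m:.3f} MHz, Omega_- = {omega_m:.4f} MHz")
print(f"n_- (Delta_p = -10 MHz) = {n_detuned:.3e}, on resonance = {n_resonant:.3e}")
```

Because $\gamma_-$ interpolates between $\Gamma/2$ and $\kappa$, its value always lies between those two rates, whatever the local mixing angle.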

Since an atom in the ground state is moving in a conservative potential around the equilibrium position $z = 2.25\lambda_F$, the following Sisyphus picture should be taken to apply to the motion of the atom in addition to that conservative motion [see Eq. (30)]. Suppose, for example, that the atom is at position $z = 2.2\lambda_F$ and moving toward the right (cf. Fig. 7). The probability to be in the excited state now decreases (according to the lower part of Fig. 7), while the energy of the excited state relative to the ground state is increasing: in
FIG. 7. Transition frequency to and populations in the lower dressed state as functions of position along the cavity axis. Note that the equilibrium position of the atom is around $z = 2.25\lambda_F$.


other words, an atom in the excited state is climbing uphill (again, in relation to the ground state), but will likely make the down transition to the ground state, thus leading to cooling at that particular position. Similarly, at $z = 2.4\lambda_F$ an atom
moving to the left is going uphill while having an increased chance of decaying to the ground state, again leading to cooling. This picture in fact shows that the cooling rate is expected to be proportional to the gradient of $n_-$ and the gradient of $\Delta_-$. More precisely, the force on the atom at position $z$ is approximately given by


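$$\begin{aligned} F &\approx \hbar\frac{dS_F}{dz} - \hbar n_-(z - v/\gamma_-)\frac{d\Delta_-}{dz} \\ &\approx \hbar\frac{dS_F}{dz} - \hbar n_-(z)\frac{d\Delta_-}{dz} + \frac{\hbar v}{\gamma_-}\frac{dn_-}{dz}\frac{d\Delta_-}{dz}, \end{aligned} \tag{30}$$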


FIG. 8. Cooling rate $\beta_{zz}$ (solid curve) and the product of gradients of dressed-state population and transition frequency $R$ (dashed curve) as functions of position along the cavity axis. The similarity between the two curves confirms the validity of the Sisyphus cooling picture.

and always positive in others. This of course also implies that the temperatures reached by atoms in thermal equilibrium vary with position.

For the case of the shallow FORT we consider weak driving ($N_e = 0.001$), whereas for the deeper FORT the driving field is taken to be stronger by an order of magnitude. The stronger driving field increases cooling rates, while the fact that deeper wells trap the atoms better means that correspondingly larger diffusion rates still can be tolerated.

The stable equilibrium points $z_e^n$ are located around the maxima of $S_F$, i.e., around $z_n = (n - 1/2)\lambda_F/2$ for integer $n$, because it is the FORT that gives the main contribution to the total force (even for the smallest value of $S_0 = 2\pi \times 10$ MHz considered here). The cavity QED field gives

where the argument of $n_-$ indicates the lag between the atom reaching a position $z$ and reaching its steady state, with the lag time scale determined by the inverse decay rate from the dressed state. From the second line we see that the friction coefficient $\beta_{zz}$ is approximated by

$$\beta_{zz} \approx R \equiv -\frac{\hbar}{m\gamma_-}\frac{dn_-}{dz}\frac{d\Delta_-}{dz}. \tag{31}$$

Indeed, Fig. 8 shows the similar behavior of $\beta_{zz}$ and $R$ as functions of position.
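
The gradient product R of Eq. (31) can be estimated from any pair of profiles $n_-(z)$ and $\Delta_-(z)$ by finite differences. The sketch below assumes the sign convention that a positive R corresponds to cooling (population falling where the transition frequency rises); the linear toy profiles and all numerical values are hypothetical and serve only to exercise the formula, not to reproduce Fig. 8.

```python
HBAR = 1.054571817e-34  # J s

def central_diff(f, z, h):
    """Symmetric finite-difference estimate of df/dz at z."""
    return (f(z + h) - f(z - h)) / (2.0 * h)

def sisyphus_friction(n_minus, delta_minus, gamma_minus, mass, z, h=1e-9):
    """Estimate R = -(hbar / (m gamma_-)) (dn_-/dz) (dDelta_-/dz) at z.

    n_minus and delta_minus are callables of position (m); delta_minus
    and gamma_minus are angular frequencies (rad/s); mass is in kg.
    """
    dn = central_diff(n_minus, z, h)
    dd = central_diff(delta_minus, z, h)
    return -HBAR / (mass * gamma_minus) * dn * dd

# Toy profiles (assumed): population decreasing and transition frequency
# increasing with z, as for an atom at z = 2.2 lambda_F moving to the right.
B = 1.0e3      # 1/m, slope of the population profile
C = 1.0e13     # rad/(s m), slope of the transition frequency
n_toy = lambda z: 1.0e-2 - B * z
delta_toy = lambda z: C * z

MASS = 2.2e-25      # kg, roughly one Cs atom
GAMMA_MINUS = 2.0e7 # rad/s, assumed dressed-state decay rate

r_estimate = sisyphus_friction(n_toy, delta_toy, GAMMA_MINUS, MASS, z=1.0e-6)
r_analytic = HBAR * B * C / (MASS * GAMMA_MINUS)

print(f"R (finite differences) = {r_estimate:.4f} 1/s")
print(f"R (analytic)           = {r_analytic:.4f} 1/s")
```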


C.  Friction,  diffusion  and  equilibrium  rms  velocities 

In Figs. 9 and 10 we give examples of friction and diffusion coefficients for both the 10 and 50 MHz FORTs, as functions of the atomic position. They illustrate the point that in the low-excitation limit red (blue) detuning leads to cooling (heating) (cf. Figs. 4 and 5). They, moreover, clearly show how all wells are quantitatively different, with cooling rates and diffusion strengths differing by orders of magnitude over the various wells, and with $\beta_{zz}$ being negative in some wells,


FIG. 9. Friction and velocity diffusion coefficients $\beta_{zz}$ and $D_{zz}$ as functions of the atomic position (in units of $\lambda_F$) along the cavity axis. Here $N_e = 0.001$, $S_0/(2\pi) = 10$ MHz, and $\Delta_p = -28 \times 2\pi$ MHz. Compare Fig. 4.
FIG. 10. Same as previous figure but for $N_e = 0.01$, $S_0/(2\pi) = 50$ MHz, and $\Delta_p = -10 \times 2\pi$ MHz.

only a small correction to the force and hence to the equilibrium position. In each equilibrium point, we can define a measure for the expected rms velocity of the atom along the z axis in thermal equilibrium by considering averages over local wells


FIG. 11. The values of $v_{\rm rms}$ in the eight equilibrium points as defined in Eq. (32) by averaging over the entire well. In all cases $S_0 = 2\pi \times 10$ MHz. Triangles correspond to a detuning $\Delta_p/(2\pi) = -28$ MHz, squares to $\Delta_p/(2\pi) = -23$ MHz, and circles to $\Delta_p/(2\pi) = 15$ MHz. Note the points on the latter curve on the axis indicate that the friction coefficient is negative, so that there is in fact no cooling and $v_{\rm rms}$ is not defined. They do not indicate cooling to $v_{\rm rms} = 0$.


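$$v_{\rm rms} = \sqrt{\frac{\overline{D_{zz}}}{\overline{\beta_{zz}}}} \quad \text{if } \overline{\beta_{zz}} > 0, \tag{32}$$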

in terms of the friction and diffusion coefficients. This averaging procedure gives a sensible measure for the rms velocity only if the atom indeed samples the whole well. This condition is fulfilled for the relatively shallow wells originating from $S_0 = 2\pi \times 10$ MHz, and Fig. 11 uses this averaging procedure. For the 50 MHz FORT, however, we averaged over only part of the well, namely, a region of size $\lambda_F/10$ symmetrically around the equilibrium point. This choice is rather arbitrary, and thus Fig. 12 just gives an indication of what rms velocities to expect of atoms trapped in the corresponding wells, although the simulations in fact do confirm these values.
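
The relation between friction, diffusion, and the equilibrium rms velocity in Eq. (32) can be checked with a short stochastic simulation. The sketch below integrates a velocity Langevin equation with constant friction and diffusion using the Euler-Maruyama scheme and compares the long-time rms velocity with $\sqrt{D/\beta}$. The constants are hypothetical round numbers, not values from Figs. 9 or 10, and position dependence of the coefficients is deliberately ignored.

```python
import math
import random

def simulate_rms_velocity(beta, diffusion, dt, n_steps, n_discard=1000, seed=0):
    """Euler-Maruyama integration of dv = -beta v dt + sqrt(2 D) dW.

    Returns the rms velocity averaged over the steps after n_discard.
    """
    rng = random.Random(seed)
    kick = math.sqrt(2.0 * diffusion * dt)
    v = 0.0
    total = 0.0
    for step in range(n_steps):
        v += -beta * v * dt + kick * rng.gauss(0.0, 1.0)
        if step >= n_discard:
            total += v * v
    return math.sqrt(total / (n_steps - n_discard))

BETA = 1.0e4        # friction rate (1/s), assumed
DIFFUSION = 64.0    # velocity diffusion coefficient (m^2/s^3), assumed
DT = 1.0e-6         # time step (s), so that beta * dt = 0.01

v_expected = math.sqrt(DIFFUSION / BETA)
v_simulated = simulate_rms_velocity(BETA, DIFFUSION, DT, n_steps=200_000)

print(f"expected  v_rms = {100 * v_expected:.2f} cm/s")
print(f"simulated v_rms = {100 * v_simulated:.2f} cm/s")
```

The agreement is statistical, so the simulated value fluctuates by a few percent with the random seed and the run length.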

We  see  here  that  depending  on  the  probe  detuning,  the 
atom  will  be  cooled  to  low  temperatures  either  in  all  wells, 
or  only  in  wells  where  g  is  large  in  the  equilibrium  point,  or 
only  in  wells  where  g  is  small.  This  shows  the  flexibility  that 
a  FORT  beam  adds:  one  can  predetermine  to  a  certain  degree 
in  which  well  the  atom  will  be  trapped  (and  cooled)  for 
longer  times  and  in  which  it  will  not  be. 

Under the current conditions $\kappa > \Gamma/2$ the lowest temperatures achievable are determined by the Doppler velocity $v_D$. More precisely, the lower limit on rms velocities along the cavity axis is expected to be

$$v_D \approx \sqrt{0.7\,\frac{\hbar\Gamma}{2m}}, \tag{33}$$

where the factor $0.7 = (1 + 2/5)/2$ comes from the fact that in our case the diffusion due to spontaneous emission in the z direction is two-fifths of the full 3D value. We tested that for smaller $\kappa$ the rms velocities indeed do become even smaller, now determined by $\sqrt{\hbar\kappa/m}$, thus confirming predictions of [10].
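
For orientation, the Doppler velocity of Eq. (33) can be evaluated directly from the Cs parameters quoted in Sec. IV. The short sketch below does this; the physical constants are standard values and the Cs mass is taken as approximately 132.905 atomic mass units, which is an input not stated in the text.

```python
import math

HBAR = 1.054571817e-34                # J s
ATOMIC_MASS_UNIT = 1.66053906660e-27  # kg
M_CS = 132.905 * ATOMIC_MASS_UNIT     # mass of a Cs atom (kg), approximate
GAMMA = 2.0 * math.pi * 5.2e6         # atomic linewidth (rad/s)

N_Z = 2.0 / 5.0                       # spontaneous-emission factor along z
factor = (1.0 + N_Z) / 2.0            # the factor 0.7 of Eq. (33)

v_doppler = math.sqrt(factor * HBAR * GAMMA / (2.0 * M_CS))

print(f"factor = {factor:.2f}")
print(f"v_D = {100 * v_doppler:.2f} cm/s")
```

The result is a little over 7 cm/s, the same scale as the equilibrium rms velocities reported for the simulations.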

Finally, we note that the quasiclassical approximation used throughout this paper is justified as neither the recoil limit is reached nor the resolved-sideband limit, i.e.,

$$\hbar\Gamma/2 \gg (\hbar k)^2/m,$$


FIG. 12. As previous figure, but for $S_0 = 2\pi \times 50$ MHz and $N_e = 0.01$, and where the average is taken over a region of size $\lambda_F/10$ around the equilibrium point. The probe detunings were $\Delta_p/(2\pi) = -10, -5, 100$ MHz for the squares, triangles, and circles, respectively. Note the points on the axis indicate that the friction coefficient is negative, so that there is in fact no cooling and $v_{\rm rms}$ is not defined. They do not indicate cooling to $v_{\rm rms} = 0$.
FIG. 13. The average values of $D_{zz}$ and $\beta_{zz}$ as functions of the driving field strength $N_e$ in the well extending from $z = 2\lambda_F$ to $z = 2.5\lambda_F$, for the 10 MHz FORT, where $\Delta_p = -28 \times 2\pi$ MHz. In the lower part the corresponding values for the rms velocity $v_{\rm rms}$ are plotted as a function of $N_e$.

$$\hbar\Gamma/2 \gg \hbar\nu_{\rm osc}, \tag{34}$$

with $\nu_{\rm osc}$ the oscillation frequency of the atom in a well (see below), although in some cases the latter condition is only marginally fulfilled, namely, when $\nu_{\rm osc} = 600$ kHz, which is only a factor 4 smaller than $\Gamma/2$.

D.  Saturation  behavior 

We now briefly turn to the question of the nonlinear behavior of the atom-cavity system with increasing excitation. In the absence of saturation effects, both friction and diffusion coefficients would increase linearly with $N_e$. For the same parameters as Fig. 9, Fig. 13 shows nonlinearities setting in around $N_e = 0.01$. The friction coefficient even starts to decrease around $N_e = 0.1$ as a result of the local values of $\beta_{zz}$ becoming negative where they were positive in the weak driving limit. The concomitant effect on $v_{\rm rms}$ is shown as well.

E.  Simulations 

We also performed Monte Carlo simulations of the 3D motion in given wells by solving the Langevin equations (8) for position and velocity (see also [14]). The experimental procedure switches the FORT field on only when an atom has been detected and when it consequently has partly fallen through the cavity already [6]. We accordingly fix initial conditions as follows: We start the atom on the cavity axis, and we fix the downward velocity to be $v_x = 10$ cm/s. Furthermore, we chose $v_z = 0$ cm/s, and the initial position along the z axis to be $\lambda_F/8$ away from the equilibrium point. The initial position and velocity were fixed so that all variations in trapping times and rms velocities are solely due to the random fluctuations of the forces acting on the atom, rather than from random initial conditions. Experimentally these two are mixed, of course.
FIG. 14. In the upper part of the plot each data point gives the trapping time and the average $v_{\rm rms}$ resulting from a single trajectory. Identical initial conditions were chosen for each trajectory: each atom started at $z = 2.125\lambda_F$ with $v_z = 0$ and $v_x = 10$ cm/s (downward). Other parameters were: $N_e = 0.001$, $\Delta_p = -28 \times 2\pi$ MHz, $S_0 = 10 \times 2\pi$ MHz. The lower part gives a histogram of the probability $P(T)$ for an atom to be trapped longer than a time $T$. A fit of the tail of this distribution to an exponential $\propto \exp(-T/\tau)$ gives $\tau = 25 \pm 2$ ms.

Since atoms with these initial conditions do not possess angular momentum around the z axis, this in some sense represents a favorable case (although the atoms are not put in the bottom of the well). However, in the course of their evolution the atoms do acquire angular momentum, so that this is in fact not a severe restriction. For more detail see below (Fig. 22).

In Fig. 14 we plot the results of simulations of 1000 trajectories for an atom in the shallow well of 10 MHz. We plot the average rms velocity along the cavity axis as a function of trapping time for each trajectory. Here we defined the "trapping time" as the time spent by the atom in one particular given well of size $\lambda_F/2$. The actual trapping time inside the cavity may be longer, obviously, as the atom may subsequently get trapped in different wells. For very short trapping times, $v_{\rm rms}$ is determined by the initial condition, but for longer times lower temperatures corresponding to those calculated in Fig. 11 are reached. Note, however, that the simulations were done in 3D, and as such do not necessarily give the same temperatures as predicted for on-axis (1D) motion in Figs. 11 and 12. Nevertheless, the effect of the atoms' radial motion is apparently not strong, and in fact atoms leave the well while still being trapped radially. This is partly due to the fact that all (especially heating) rates in the radial direction are smaller by a factor $kw_0 \approx 150$ than those in the axial direction.

About half of the atoms are basically not trapped at all. The remaining atoms have a probability $P(T)$ to be trapped longer than a time $T$, with $P(T)$ decaying exponentially with $T$. The average trapping time for these parameters is found to be $\tau \approx 25$ ms, as shown in Fig. 14.
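
The extraction of a mean trapping time from the tail of $P(T)$ can be illustrated with synthetic data. The sketch below draws trapping times from an assumed two-component population (half escaping within 1 ms, half decaying exponentially with a 25 ms time constant), builds the empirical survival probability, and fits the logarithm of its tail with a straight line. The data are randomly generated for illustration and are not the simulation results of Fig. 14; the function name and cutoffs are hypothetical.

```python
import math
import random

def survival_tail_fit(times, t_min, p_min=0.05):
    """Fit ln P(T) = a - T/tau by least squares over the tail and return tau.

    P(T) is estimated as the fraction of trajectories trapped at least as
    long as T. Only points with T >= t_min and P >= p_min enter the fit.
    """
    ordered = sorted(times)
    n = len(ordered)
    xs, ys = [], []
    for i, t in enumerate(ordered):
        p = (n - i) / n
        if t >= t_min and p >= p_min:
            xs.append(t)
            ys.append(math.log(p))
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return -sxx / sxy   # slope is sxy / sxx = -1 / tau

rng = random.Random(1)
TAU_TRUE = 25.0  # ms, assumed time constant of the synthetic data
untrapped = [rng.uniform(0.0, 1.0) for _ in range(2000)]          # escape within 1 ms
trapped = [rng.expovariate(1.0 / TAU_TRUE) for _ in range(2000)]  # exponential tail

tau_fit = survival_tail_fit(untrapped + trapped, t_min=5.0)
print(f"fitted tau = {tau_fit:.1f} ms (synthetic data generated with {TAU_TRUE} ms)")
```

Restricting the fit to times above a few milliseconds keeps the quickly escaping atoms from biasing the slope, which mirrors fitting only the tail of the distribution.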

In  Figs.  15  and  16  we  plot  for  the  same  10  MHz  FORT  an 
example  of  a  single  trajectory,  after  the  atom  has  spent  4  ms 
FIG. 15. Snapshot of a single trajectory, with parameters as in Fig. 14. The upper plot gives the z coordinate of the atom as a function of time, the lower plot gives the transmission (in fact the number of photons $\langle a^\dagger a\rangle$ inside the cavity) in that same time interval. Note the time scales here differ by two orders of magnitude from those of Fig. 16.

in the trap. The oscillation frequencies along the z and the radial directions differ by two orders of magnitude (since $kw_0 \approx 150$): in the z direction the oscillation rate is $\approx 200$ kHz, in the radial direction $\approx 2.2$ kHz. The photon transmission follows both these oscillations so that in principle the atomic motion in both axial and radial direction is detectable. Experimentally, though, the oscillations along the cavity axis may be too fast to be accessible. In particular, the average rate at which photons leaking out through one end of the cavity are detected is at most (the efficiency is less than 100%) equal to the cavity decay rate multiplied by the average number of photons inside the cavity. For the parameters of Fig. 15 this amounts to a rate $\approx 0.01 \times \kappa \approx 2.5 \times 10^5$/s,


FIG. 16. For the same trajectory as the previous figure, the upper plot gives the radial distance to the cavity axis, $\rho$ in units of $w_0$, as a function of time, the lower plot gives again the number of photons inside the cavity during that same time interval. The atom has a nonzero angular momentum along z and does not cross the z axis.


FIG. 17. As Fig. 14 but for $N_e = 0.01$, $\Delta_p = -10 \times 2\pi$ MHz, $S_0 = 50 \times 2\pi$ MHz. The mean trapping time is $\tau = 250 \pm 20$ ms.

which  corresponds  to  just  about  one  photon  per  oscillation 
period. 
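
The estimate of about one detected photon per axial oscillation period follows from simple arithmetic, reproduced in the sketch below. The intracavity photon number of 0.01 and the 200 kHz axial frequency are the values quoted in the text; perfect detection efficiency is assumed, so the result is an upper bound.

```python
import math

KAPPA = 2.0 * math.pi * 4e6   # cavity decay rate (rad/s)
N_PHOTONS = 0.01              # mean intracavity photon number quoted in the text
F_AXIAL = 200e3               # axial oscillation frequency (Hz)

detection_rate = N_PHOTONS * KAPPA            # photons per second, at most
photons_per_period = detection_rate / F_AXIAL

print(f"detection rate <= {detection_rate:.3e} photons/s")
print(f"photons per axial period <= {photons_per_period:.2f}")
```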

The  figures  show  that  when  the  atom  is  in  a  position 
where  it  is  not  coupled  to  the  cavity  (g  =  0),  the  number  of 
photons  in  the  cavity  drops  to  Ne  =  0.001.  Similarly,  when 
the  atom  moves  away  radially,  the  transmission  drops. 

To make a direct comparison with the trapping times achieved in the experiment [6], we now turn to the case of a 50 MHz FORT. We plot rms velocities vs trapping times for 300 trajectories for an atom trapped in the well ranging from $z = 2\lambda_F$ to $z = 2.5\lambda_F$.

For the parameters of Fig. 17 the atom is either trapped for long times (> 10 ms) or only for a short time (< 1 ms), both with about 50% probability. In the latter case the rms velocity is determined just by the (arbitrarily chosen) initial condition and is around 30 cm/s, but for longer trapping times the effects of cooling are visible. Thermal equilibrium is reached with $v_{\rm rms} \approx 8$ cm/s, thus confirming the results of Fig. 12. The distribution of trapping times again follows an exponential law, and the average trapping time, as determined from the tail of the distribution, is $\tau \approx 250$ ms, which is ten times longer than for the (fluctuating) 50 MHz FORT used in [6]. This shows the great potential of holding single atoms in the cavity for extended periods of time if the intensity fluctuations of the FORT beam can be minimized. Experimental efforts along this path are currently underway.

Also for this case we plot snapshots for a single trajectory (Fig. 18), taken after the atom has spent 25 ms in the trap. Compared to the 10 MHz FORT, the oscillations of the atom along the cavity axis and in the radial direction become faster by about a factor of 3. The axial oscillation frequency is about 600 kHz, while along the radial direction the oscillations occur at a rate 6.2 kHz, i.e., again slower by two orders of magnitude. In this case, the photon transmission still follows directly the axial oscillations but no longer follows the radial excursions of the atom, as now the fluctuations in the magnitude of $g$ at the atom's position along the cavity axis are in fact larger than those due to the radial excursions of the atom. This is partly due to the fact that in the simulations here the driving field is stronger than for the
FIG. 18. Snapshot of a single trajectory, with parameters as in Fig. 17. The upper plot gives the z coordinate of the atom as a function of time (with the atom released with standard initial conditions at $t = 0$). The lower plot gives the transmission (in fact the number of photons inside the cavity) in that same time interval. Note the time scales here differ by two orders of magnitude from those of Fig. 19.

10 MHz example above so that fluctuations in the atomic motion occur at a shorter time scale, and partly simply because the radial excursions are small. Figure 19(c) shows that it is primarily the axial fluctuations that determine the variations in the numbers of photons inside the cavity.

Generally speaking, the axial excursions determine (local) minimum and maximum transmission levels (as in Fig. 15). When these minima and/or maxima depend on the radial position, then the radial motion could, in principle, be visible in the cavity transmission level. This depends in turn on whether the axial fluctuations on the time scale of the transverse motion are sufficiently small so as not to hide the radial dependence. There seems to be no simple general rule how


FIG. 19. For the same trajectory as the previous figure, (a) the radial distance to the cavity axis, $\rho$ in units of $w_0$, as a function of time, (b) the number of photons inside the cavity during that same time interval, and (c) the position along the cavity axis in units of $\lambda_F$.
FIG. 20. For an atom in the well ranging from $z = 0$ to $z = \lambda_F/2$, for $N_e = 0.01$, $\Delta_p = -10 \times 2\pi$ MHz, and $S_0 = 50 \times 2\pi$ MHz, the upper plot gives the radial distance to the cavity axis, $\rho$ in units of $w_0$, as a function of time, the lower plot gives the number of photons inside the cavity during that same time interval.

this  interplay  between  radial  and  axial  motions  depends  on 
detunings,  driving  strength,  and  the  particular  well. 

In contrast, in a different well, the one ranging from z
= 0.0 to z = 0.5λ_F, the photon number in the cavity does
follow the radial motion, as the radial excursions become
larger (Fig. 20). Perhaps more importantly, the average transmission
level is higher by more than a factor of 2 compared to
the previous case, as a result of g being larger in this well (cf.
Fig. 2). This shows how, in principle, different wells may be
experimentally distinguished via the transmission of the
probe field through the cavity.

We also simulated the motion of an atom trapped under
more adverse conditions, namely for an atom in the well [z
= λ_F → 1.5λ_F] at a probe detuning Δ_p/(2π) = −5 MHz.
According to Fig. 12, the atom is not cooled on axis under
these conditions (i.e., the average friction coefficient around
the equilibrium point on the z axis is negative). This is confirmed
by Fig. 21: the mean trapping time for an atom starting
at z = 1.125λ_F is now very short, about 1.6 ms, while the
average rms velocity is v_rms ≈ 28 cm/s, as determined essentially
by the initial condition.

Finally, we consider the influence of different initial conditions
on trapping and cooling. All the results so far were
obtained by considering atoms that initially are moving on
axis. Thus, they have no angular momentum along the z axis,
nor any radial potential energy. Figure 22 shows a plot of
rms velocities vs trapping times for atoms trapped under the
same conditions as for Fig. 17 (i.e., in the well from z
= 2λ_F to z = 2.5λ_F, for Δ_p = −10 × 2π MHz, N_e = 0.01, and
S_0 = 50 × 2π MHz), but with different (nonzero) values for
the initial angular momentum.

Obviously, the more initial potential energy the atom has,
the less likely it is to be trapped. In fact, the angular momentum
does not play any role here, as confirmed by similar
calculations with initial conditions chosen such that the atoms
have no initial angular momentum but have the same
potential energy. The results are the same in that case. For
FIG. 21. As Fig. 14 but for N_e = 0.01, Δ_p = −5 × 2π MHz, S_0
= 50 × 2π MHz. The initial position of the atom is z = 1.125λ_F.
The mean trapping time is τ = 1.6 ± 0.1 ms.

atoms starting at y = 0.2w_0 the trapping times and rms velocities
are basically not affected, and the trapping time is
still around 250 ms. But for atoms starting at y = 0.5w_0 the
effect of their increased potential energy leads to clearly
shorter trapping times (by roughly a factor of 2), and for
atoms starting at y = w_0 this effect is even more pronounced
with a decrease in trapping time of about a factor of 10.


F.  A  different  trapping  structure 

We now consider a different case where the atomic excited
state is assumed to be shifted down by the FORT field,
just as the ground state is (see, for instance, [12]). This can be
achieved by using a FORT that is (red) detuned in such a
way that the excited atomic state is relatively closer to resonance
with a higher-lying excited state than with the ground


FIG. 22. rms velocities vs trapping times for atoms trapped under
the same conditions as for Fig. 17 but with different initial
radial conditions for y. In particular, for plot (a) the initial condition
on y is y = 0.2w_0, for (b) y = 0.5w_0, and for (c) y = w_0. Since
v_x = −10 cm/s, the atoms have different angular momenta along z
in these cases, and different initial potential energies.


FIG. 23. Transition frequencies Δ_± for the case where the
atomic excited state is assumed to be shifted down by the FORT
field by the same amount S_F as is the ground state, for the 50 MHz
FORT. A detuning of −35 MHz is indicated by the dotted line.

state.  This  situation  at  first  sight  looks  even  more  appealing 
for  trapping  purposes,  as  now  both  excited  and  ground  state 
will  be  trapped  in  the  same  positions.  Moreover,  fluctuations 
in  the  force  due  to  the  FORT  are  diminished. 

We consider only the 50 MHz FORT here, and compare
this case to the previous 50 MHz FORT case, and in particular
we refer the reader back to Figs. 5, 10, and 17. For ease of
comparison we keep λ_F the same, and assume for simplicity
that the excited state is shifted down by an amount S_F, so
that the shifts of the ground and excited state are in fact
identical.

The fact that ground and excited states have the same
potential implies that the transition frequencies to the
dressed states are simply periodic in space with period λ_0, as
shown in Fig. 23, rather than aperiodic as in Fig. 5.

Similarly, the fluctuations in the force due to the FORT
now vanish, as both ground and excited state undergo the
same shift, so that the diffusion coefficient is periodic with
period λ_0. Also the friction force arises only from the cavity
QED part and is periodic. Yet, the different wells are not
equivalent. The forces are, just as before, driven by both
cavity QED field and the FORT, and the value of g at the
antinode of the FORT still varies over the different wells.
This is illustrated in Fig. 24 where the rms velocities in the
eight different wells are shown, along with friction and diffusion
coefficients. Since in this example the probe field is
detuned below the lower dressed state, one has cooling everywhere
in space.

The simulations show that the mean trapping time is
smaller, although the rms velocities are just as small as before.
The reason is the less favorable cooling condition away
from the cavity axis. In particular, for the parameters used
here the expected rms velocity v_rms steadily increases to 90
cm/s at a radial distance ρ = 2w_0, while for the simulations
of Fig. 17, v_rms increases only slowly to 12 cm/s. This
large difference can be understood by noting the difference
in dressed state structures between the two cases. For the
FIG. 24. Friction and diffusion coefficients, and the resulting
rms velocity as functions of position along the cavity axis for the
trapping structure of Sec. IV F. Here Δ_p/(2π) = −35 MHz, and
N_e = 0.01.

case of Fig. 5, the transition frequency to the lower dressed
state around the equilibrium position z ≈ 2.25λ_F does not
change much with increasing radial distance, so that the
probe field in that trapping region is always detuned below
resonance by an amount that stays more or less constant. For
the dressed state structure of Fig. 23, however, the probe
detuning increases from ≈5 MHz to ≈35 MHz below resonance,
thus leading to much worse cooling conditions. In
other words, the presence of opposite level shifts due to the
FORT makes the spatial variation of the transition frequency
to the lower dressed state smaller; compare Δ_− = S_F
− √(S_F² + g²) to Δ_− = −g, especially when g < S_F.
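The comparison of the two lower dressed-state transition frequencies can be made concrete with a few numbers. The following is only a minimal sketch, not the simulation code behind the figures: the function names are hypothetical, S_F is set to 50 (in MHz, as for the 50 MHz FORT), and the values of g are illustrative assumptions chosen to span the range discussed.

```python
import math


def lower_dressed_opposite(g, s_f):
    """Delta_- = S_F - sqrt(S_F^2 + g^2): ground and excited states shifted oppositely."""
    return s_f - math.sqrt(s_f**2 + g**2)


def lower_dressed_equal(g):
    """Delta_- = -g: ground and excited states shifted by the same amount."""
    return -g


S_F = 50.0  # MHz; the 50 MHz FORT considered in the text
for g in (5.0, 15.0, 35.0):  # illustrative coupling strengths in MHz (assumed values)
    opposite = lower_dressed_opposite(g, S_F)
    equal = lower_dressed_equal(g)
    print(f"g = {g:5.1f}  opposite shifts: {opposite:8.3f}  equal shifts: {equal:8.3f}")
```

For every g in this range the magnitude of the first expression is smaller than that of the second, which is the sense in which opposite level shifts reduce the spatial variation of the transition frequency as g changes with position.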

The alternative trapping potential is, therefore, not necessarily
more favorable for trapping purposes. On the other
hand, all atoms are captured now and are trapped for at least
10 ms. This can be understood from the simple fact that here
the friction coefficient is positive in the entire well (Fig. 25).

V.  SUMMARY 

We analyzed cooling limits and trapping mechanisms for
atoms trapped in optical traps inside optical cavities. The
main distinguishing feature from previous discussions on
cooling of atoms inside cavities is the presence of the external
trapping potential with a different spatial periodicity as
compared to the cavity QED field. This not only provides
better cooling and trapping conditions but the different spatial
period makes the various potential wells qualitatively
FIG. 25. As Fig. 14, but for the different trapping structure of
Sec. IV F. The initial position was z = 0.125λ_F, and further parameters
were N_e = 0.01, Δ_p = −35 × 2π MHz, S_0 = 50 × 2π MHz.
The mean trapping time is τ = 28 ± 2 ms.

different. Atoms can be trapped in regions of space where
the coupling to the cavity QED field is maximum, minimum
or somewhere in between. Depending on the laser detuning,
cooling may take place only in wells where the atom is minimally
coupled to the cavity QED field, or where it is maximally
coupled. This allows one, in principle, to distinguish to
a certain degree the different atomic positions along the cavity
axis, namely, by comparing

(1) the average transmission level,

(2) the fluctuations of the cavity transmission,

(3) the total trapping time

which reflect, respectively, the average atom-cavity coupling,
the temperature of the atom and under certain conditions
the radial motion, and the overall cooling and trapping
conditions. This is an important additional tool useful for
eventual control of coherent evolution of the atomic center-of-mass
degrees of freedom, as relevant to performing quantum
logic operations.
Fidelity F_classical = 1/2 has been established as setting the boundary between classical and quantum
domains in the teleportation of coherent states of the electromagnetic field (S. L. Braunstein, C. A.
Fuchs, and H. J. Kimble, J. Mod. Opt. 47, 267 (2000)). Two recent papers by P. Grangier and F.
Grosshans (quant-ph/0009079 and quant-ph/0010107) introduce alternate criteria for setting this
boundary and as a result claim that the appropriate boundary should be F = 2/3. Although larger
fidelities would lead to enhanced teleportation capabilities, we show that the new conditions of
Grangier and Grosshans are largely unrelated to the questions of entanglement and Bell-inequality
violations that they take to be their primary concern. With regard to the quantum-classical boundary,
we demonstrate that fidelity F_classical = 1/2 remains the appropriate point of demarcation. The
claims of Grangier and Grosshans to the contrary are simply wrong, as we show by an analysis of the
conditions for nonseparability (that complements our earlier treatment) and by explicit examples of
Bell-inequality violations.


I.  INTRODUCTION 

As proposed by Bennett et al. [1], the protocol for
achieving quantum teleportation is the following. Alice
is to transfer an unknown quantum state |ψ⟩ to Bob, using
as the sole resources some previously shared quantum
entanglement and a classical channel capable of communicating
measurement results. Physical transport of |ψ⟩
from Alice to Bob is excluded at the outset. Ideal teleportation
occurs when the state |ψ⟩ enters Alice’s sending
station and the same state |ψ⟩ emerges from Bob’s
receiving station.

Of course, in actual experiments [2-5], the ideal case
is unattainable as a matter of principle. The question
of operational criteria for gauging success in an experimental
setting, therefore, cannot be avoided. We have
proposed previously that a minimal set of conditions for
claiming success in the laboratory are the following [6].

1.  An  unknown  quantum  state  (supplied  by  a  third 
party  Victor)  is  input  physically  into  Alice’s  station 
from  an  outside  source. 

2.  The  “recreation”  of  this  quantum  state  emerges 
from  Bob’s  receiving  terminal  available  for  Victor’s 
independent  examination. 

3. There should be a quantitative measure for the
quality of the teleportation and, based upon this
measure, it should be clear that shared entanglement
enables the output state to be “closer” to the
input state than could have been achieved if Alice
and Bob had utilized a classical communication
channel alone.

In Ref. [6], it was shown that the fidelity F between
input and output states is an appropriate measure of the
degree of similarity in Criterion 3. For an input state
|ψ_in⟩ and output state described by the density operator
ρ_out, the fidelity is given by [7]

$$ F = \langle \psi_{\rm in} | \rho_{\rm out} | \psi_{\rm in} \rangle . \tag{1} $$
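As an illustration of how Eq. (1) is evaluated in practice, the following minimal sketch computes F in a truncated Fock basis. It is not taken from the paper: the helper names, the truncation dimension, and the test states (a coherent input compared with a vacuum output and with a perfect copy of itself) are assumptions chosen for illustration only.

```python
import math
import numpy as np


def coherent_state(alpha, dim):
    """Fock-basis amplitudes of the coherent state |alpha>, truncated to `dim` levels."""
    alpha = complex(alpha)
    n = np.arange(dim)
    # sqrt(n!) computed via lgamma to avoid overflow for larger n
    sqrt_fact = np.exp(0.5 * np.array([math.lgamma(k + 1) for k in n]))
    return np.exp(-abs(alpha) ** 2 / 2) * alpha ** n / sqrt_fact


def fidelity(psi_in, rho_out):
    """F = <psi_in| rho_out |psi_in> for a pure input state and an output density matrix."""
    return float(np.real(np.vdot(psi_in, rho_out @ psi_in)))


dim = 40  # truncation; adequate for |alpha| of order one
psi = coherent_state(1.0, dim)

# Output that ignores the input entirely (vacuum): F = exp(-|alpha|^2)
vacuum = np.zeros((dim, dim), dtype=complex)
vacuum[0, 0] = 1.0
print("vacuum output: ", fidelity(psi, vacuum))

# Ideal output (a perfect copy of the input): F is one up to truncation error
print("perfect output:", fidelity(psi, np.outer(psi, psi.conj())))
```

Averaging such fidelities over a distribution of input amplitudes is what the benchmark discussed in the text refers to; the sketch only shows the single-input evaluation.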

To date only the experiment of Furusawa et al. [4] has
achieved unconditional experimental teleportation as defined
by the three criteria above [6,8,9]. This experiment
was carried out in the setting of continuous quantum variables
with input states |ψ_in⟩ consisting of coherent states
of the electromagnetic field, with an observed fidelity
F_exp = 0.58 ± 0.02 having been attained. This benchmark
is significant because it can be demonstrated [4,6]
that quantum entanglement is the critical ingredient in
achieving an average fidelity greater than F_classical = 1/2
when the input is an absolutely random coherent state
[10].

Against this backdrop, Grangier and Grosshans [11,12]
have recently suggested that the appropriate boundary
between the classical and quantum domains in the teleportation
of coherent states should be a fidelity F = 2/3.
Their principal concern is the distinction between “entanglement”
and “non-separability,” where by the latter
term, they mean “the physical properties associated
with non-locality and the violation of Bell’s inequalities
(BI).”* They claim that “due to imperfect transmissions,


* Since the terms “entanglement” and “nonseparability” are
used interchangeably in the quantum information community,
we will treat them as synonyms to eliminate further confusion.
We will refer to violations of Bell’s inequalities explicitly
whenever a distinction must be made between entanglement
and local realism per se. The only exceptions will be when
we quote directly from Grangier and Grosshans [11,12].
... it becomes possible to violate the classical boundary
(i.e., F = 1/2) of teleportation without any violation of
BI.” [11] However, rather than addressing the issue in
a direct manner, they then propose the violation of a
certain “Heisenberg-type inequality (HI)” as “a more effective
- and in some sense ‘necessary’ - way to characterize
shared entanglement.” It is this that leads to
their condition F > 2/3 as being necessary for the declaration
of successful teleportation. In support of this
threshold, they further relate their criterion based on the
HI to ones previously introduced in the quantum nondemolition
measurement (QND) literature. Finally, in Ref.
[12], Grangier and Grosshans find that F > 2/3 is also required
by a criterion they introduce having to do with a
certain notion of reliable “information exchange” [12].

The purpose of the present paper is to demonstrate
that the conclusions of Grangier and Grosshans concerning
the proposed quantum-classical boundary F = 2/3 are
unwarranted and, by explicit counterexample, incorrect.
Our approach will be to investigate questions of nonseparability
and violations of Bell inequalities for the particular
entangled state employed in the teleportation protocol
of Ref. [13]. Of significant interest will be the case
with losses, so that the relevant quantum states will be
mixed quantum states. Our analysis supports the following
conclusions.

1. Although the argument of Grangier and Grosshans
is claimed to be based upon “EPR non-separability
of the entanglement resource” [12] [by which they
mean a potential violation of a BI], they offer no
quantitative connection (by constructive proof or
otherwise) between the criteria they introduce (including
the threshold F = 2/3) [11,12] and the actual
violation of any Bell inequality. Nothing in their
analysis provides a warranty that F > 2/3 would
preclude a description in terms of a local hidden-variables
theory. They offer only the suggestion
that “F > 2/3 would be much safer” [11].

2. By application of the work of Duan et al. [14], Simon
[15], and Tan [16], we investigate the question
of entanglement. We show that the states employed
in the experiment of Ref. [4] are nonseparable,
as was operationally confirmed in the experiment.
Moreover, we study the issue of nonseparability
for mixed states over a broad range in the
degree of squeezing for the initial EPR state, in the
overall system loss, and in the presence of thermal
noise. This analysis reveals that EPR mixed states
that are nonseparable do indeed lead to a fidelity of
F > F_classical = 1/2 for the teleportation of coherent
states. Hence, in keeping with Criterion 3 above,
the threshold fidelity for employing entanglement
as a quantum resource is precisely the same as was
deduced in the previous analysis of Ref. [6]. Within
the setting of Quantum Optics, this threshold coincides
with the standard benchmark for manifestly
quantum or nonclassical behavior, namely that the
Glauber-Sudarshan phase-space function becomes
nonpositive-definite, here for any bipartite nonseparable
state [17]. By contrast, the value F = 2/3
championed by Grangier and Grosshans is essentially
unrelated to the threshold for entanglement
(nonseparability) in the teleportation protocol, as
well as to the boundary for the nonclassical character
of the EPR state.

3. By application of the work of Banaszek and Wodkiewicz
[18,19], we explore the possibility of violations
of Bell inequalities for the EPR (mixed)
states employed in the teleportation of continuous
quantum-variables states. We find direct violations
of a CHSH inequality [20] over a large domain.
Significant relative to the claims of Grangier and
Grosshans is a regime both of entanglement (nonseparability)
and of violation of a CHSH inequality
for which the teleportation fidelity F < 2/3 and
for which the criterion of the Heisenberg inequalities
of Ref. [11] fails. Hence, teleportation with
1/2 < F < 2/3 is possible with EPR (mixed) states
which do not admit a local hidden variables description.
In contradistinction to the claim of Grangier
and Grosshans, F > 2/3 does not provide a relevant
criterion for delineating the quantum and classical
domains with respect to violations of Bell’s inequalities.

4. By adopting a protocol analogous to that employed
in all previous experimental demonstrations of violations
of Bell’s inequalities [21-23], scaled correlation
functions can be introduced for continuous
quantum variables. In terms of these scaled correlations,
the EPR mixed state used for teleportation
violates a generalized version of the CHSH inequality,
though non-ideal detector efficiencies require a
“fair sampling” assumption for this. These violations
set in for F > 1/2 and have recently been observed
in a setting of low detection efficiency [24].
This experimental verification of a violation of a
CHSH inequality (with a fair sampling assumption)
again refutes the purported significance of
the threshold F = 2/3 promoted by Grangier and
Grosshans.

Overall, we find no support for the claims of Grangier
and Grosshans giving special significance to the threshold
fidelity F = 2/3 in connection to issues of separability
and Bell-inequality violations. Instead, as we will show,
it is actually the value F_classical = 1/2 that heralds entrance
into the quantum domain with respect to the very
same issues. Their claims based upon a Heisenberg-type
inequality and a criterion for “information exchange” are
essentially unrelated to the issue of a quantum-classical
boundary.

All this is not to say that teleportation of coherent
states with increasing degrees of fidelity beyond
F_classical = 1/2 to F > 2/3 is without significance. In
fact, as tasks of ever increasing complexity are to be accomplished,
there will be corresponding requirements to
improve the fidelity of teleportation yet further. Moreover,
there are clearly diverse quantum states other than
coherent states that one might desire to teleport, including
squeezed states, quantum superpositions, entangled
states [16], and so on. The connection between the “intricacy”
of such states and the requisite resources for
achieving high fidelity teleportation has been discussed
in Ref. [13], including the example of the superposition
of two coherent states,

$$ |\alpha\rangle + |{-\alpha}\rangle , \tag{2} $$

which for |α| ≫ 1 requires an EPR state with an extreme
degree of quantum correlation.

Similarly, Heisenberg-type inequalities are in fact
quite important for the inference of the properties of
a system given the outcomes of measurements made
on a meter following a system-meter interaction. Such
quantities are gainfully employed in Quantum Optics in
many settings, including realizations of the original EPR
gedanken experiment [25-27] and of back-action evading
measurement and quantum non-demolition detection
[28].

Our only point is that the claim of Grangier and
Grosshans that F = 2/3 is required for the “successful
quantum teleportation of a coherent state” [12] is incorrect.
They simply offer no quantitative analysis directly
relevant to either entanglement or Bell-inequality violation
issues. In contrast, the prior treatment of Ref. [6]
demonstrates that in the absence of shared entanglement
between Alice and Bob, there is an upper limit for the fidelity
for the teleportation of randomly chosen coherent
states given by F_classical = 1/2. Nothing in the work of
Grangier and Grosshans calls this analysis into question.

This, however, leads to something we would like to
stress apart from the details of any particular teleportation
criterion. There appears to be a growing confusion in
the community that equates quantum teleportation experiments
with fundamental tests of quantum mechanics.
The purpose of such tests is generally to compare quantum
mechanics to other potential theories, such as locally
realistic hidden-variable theories [11,29,30]. In our
view, experiments in teleportation have nothing to do
with this. They instead represent investigations within
quantum mechanics, demonstrating only that a particular
task can be accomplished with the resource of quantum
entanglement and cannot be accomplished without
it. This means that violations of Bell’s inequalities are
largely irrelevant as far as the original proposal of Bennett
et al. [1] is concerned, as well as for experimental
implementations of that protocol. In a theory which allows
states to be cloned, there would be no need to discuss
teleportation at all - unknown states could be cloned
and transmitted with fidelity arbitrarily close to one.

These comments notwithstanding, Grangier and
Grosshans did nevertheless attempt to link the idea of
Bell-inequality violations with the fidelity of teleportation.
It is to the details of that linkage that we now
turn. The remainder of the paper is organized as follows.
In Section II, we extend the prior work of Ref. [6] to a
direct treatment of the consequences of shared entanglement
between Alice and Bob, beginning with an explicit
model for the mixed EPR states used for teleportation of
continuous quantum variables. In Section III we review
the criteria Grangier and Grosshans introduced in preparation
for showing their inappropriateness as tools for the
questions at hand. In Section IV, we demonstrate explicitly
the relationship between entanglement and fidelity,
and find the same threshold F_classical = 1/2 as in our prior
analysis [6]. The value F = 2/3 is shown to have no particular
distinction in this context. In Sections V and VI, we
further explore the role of entanglement with regard to
violations of a CHSH inequality and provide a quantitative
boundary for such violations. Again, F_classical = 1/2
appears as the point of entry into the quantum domain,
with the point F = 2/3 having no particular distinction.
Our conclusions are collected in Section VII. Of particular
significance, we point out that the teleportation experiment
of Ref. [4] did indeed cross from the classical to
the quantum domain, just as advertised previously.

II.  THE  EPR  STATE 

The teleportation protocol we consider is that of
Braunstein and Kimble [13], for which the relevant entangled
state is the so-called two-mode squeezed state.
This state is given explicitly in terms of a Fock-state expansion
for two modes (1,2) by [31,32]

$$ |{\rm EPR}\rangle_{1,2} = \frac{1}{\cosh r} \sum_{n=0}^{\infty} (\tanh r)^n \, |n\rangle_1 |n\rangle_2 , \tag{3} $$

where r measures the amount of squeezing required to
produce the entangled state. Note that for simplicity we
consider the case of two single modes for the electromagnetic
field; the extension to the multimode case for fields
of finite bandwidth can be found in Ref. [33].
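A quick numerical check of the Fock-state expansion in Eq. (3) is sketched below. It is an illustration only, with hypothetical function names and an assumed truncation n_max: it verifies that the coefficients (tanh r)^n / cosh r are normalized and evaluates the mean photon number per mode, which for a two-mode squeezed state is the standard result sinh² r.

```python
import math


def epr_coefficients(r, n_max):
    """Coefficients c_n = tanh(r)^n / cosh(r) of |n>_1 |n>_2, for n = 0..n_max."""
    return [math.tanh(r) ** n / math.cosh(r) for n in range(n_max + 1)]


def norm_and_mean_photons(r, n_max):
    """Return (sum of |c_n|^2, mean photon number per mode) for the truncated state."""
    probs = [c * c for c in epr_coefficients(r, n_max)]
    norm = sum(probs)
    mean_n = sum(n * p for n, p in enumerate(probs))
    return norm, mean_n


r = 1.0        # squeezing parameter (illustrative value)
n_max = 200    # truncation of the Fock sum (assumed; ample for r of order one)
norm, mean_n = norm_and_mean_photons(r, n_max)
print("norm:", norm)
print("mean photons per mode:", mean_n, " sinh^2 r:", math.sinh(r) ** 2)
```

The required truncation grows quickly with r, since the photon-number distribution falls off only as tanh²ⁿ r.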

The pure state of Eq. (3) can be equivalently described
by the corresponding Wigner distribution W_EPR over the
two modes (1,2),

$$ W_{\rm EPR}(x_1,p_1;x_2,p_2) = \frac{4}{\pi^2} \frac{1}{\sigma_+^2 \sigma_-^2} \exp\Big( -\big[(x_1+x_2)^2 + (p_1-p_2)^2\big]/\sigma_+^2 - \big[(x_1-x_2)^2 + (p_1+p_2)^2\big]/\sigma_-^2 \Big) , \tag{4} $$
where σ_± are expressed in terms of the squeezing parameter
by

$$ \sigma_+^2 = e^{+2r} , \qquad \sigma_-^2 = e^{-2r} , \tag{5} $$

with σ_+² σ_−² = 1. Here, the canonical variables (x_j, p_j)
are related to the complex field amplitude α_j for mode
j = (1,2) by

$$ \alpha_j = x_j + i p_j . \tag{6} $$

In the limit of r → ∞, Eq. (4) becomes

$$ C \, \delta(x_1 - x_2) \, \delta(p_1 + p_2) , \tag{7} $$

which makes a connection to the original EPR state of
Einstein, Podolsky, and Rosen [25].

Of course, W_EPR as given above is for the ideal, lossless
case. Of particular interest with respect to experiments
is the inclusion of losses, as arise from, for example, finite
propagation and detection efficiencies. Rather than deal
with any detailed setup (e.g., as treated in explicit detail
in Ref. [26]), here we adopt a generic model of the following
form. Consider two identical beam splitters each
with a transmission coefficient η, one for each of the two
EPR modes. We take 0 < η ≤ 1, with η = 1 for the
ideal, lossless case. The input modes to beam splitter
1 are taken to be (1', a'), while for beam splitter 2, the
modes are labeled by (2', b'). Here, the modes (1', 2') are
assumed to be in the state specified by the ideal W_EPR
as given in Eq. (4) above, while the modes (a', b') are
taken to be independent thermal (mixed) states each with
Wigner distribution

$$ W(x,p) = \frac{1}{\pi} \frac{1}{(\bar{n} + \frac{1}{2})} \exp\{ -(x^2 + p^2)/(\bar{n} + 1/2) \} , \tag{8} $$

where n̄ is the mean thermal photon number for each of
the modes (a', b').
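The thermal Wigner distribution of Eq. (8) can be checked numerically. The sketch below is an illustration under stated assumptions (grid half-width, number of grid points, and the function names are all hypothetical choices): it integrates W(x,p) on a grid to confirm unit normalization and evaluates the variance of x, which for a Gaussian of this form is (n̄ + 1/2)/2 and reduces to the vacuum value 1/4 at n̄ = 0.

```python
import numpy as np


def thermal_wigner(x, p, nbar):
    """W(x,p) = exp(-(x^2 + p^2)/(nbar + 1/2)) / (pi (nbar + 1/2)), as in Eq. (8)."""
    a = nbar + 0.5
    return np.exp(-(x ** 2 + p ** 2) / a) / (np.pi * a)


def thermal_moments(nbar, half_width=12.0, points=801):
    """Return (normalization, variance of x) from a simple Riemann sum on a square grid."""
    grid = np.linspace(-half_width, half_width, points)
    x, p = np.meshgrid(grid, grid, indexing="ij")
    w = thermal_wigner(x, p, nbar)
    cell = (grid[1] - grid[0]) ** 2
    norm = w.sum() * cell
    var_x = (x ** 2 * w).sum() * cell / norm
    return norm, var_x


for nbar in (0.0, 1.0):  # illustrative thermal occupations
    norm, var_x = thermal_moments(nbar)
    print(f"nbar = {nbar}: norm = {norm:.6f}, var(x) = {var_x:.6f}, "
          f"expected (nbar + 1/2)/2 = {(nbar + 0.5) / 2:.6f}")
```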

The overall Wigner distribution for the initial set of
input modes (1', 2'), (a', b') is then just the product

$$ W_{\rm EPR}(x_{1'},p_{1'};x_{2'},p_{2'}) \, W(x_{a'},p_{a'}) \, W(x_{b'},p_{b'}) . \tag{9} $$

The standard beam-splitter transformations lead in a
straightforward fashion to the Wigner distribution for the
output set of modes (1, 2), (a, b), where, for example,

$$ x_1 = \sqrt{\eta}\, x_{1'} - \sqrt{1-\eta}\, x_{a'} , \qquad x_a = \sqrt{\eta}\, x_{a'} + \sqrt{1-\eta}\, x_{1'} . \tag{10} $$

We require W̄_EPR for the (1,2) modes alone, which is
obtained by integrating over the (a, b) modes. A straightforward
calculation results in the following distribution
for the mixed output state:

$$ \bar{W}_{\rm EPR}(x_1,p_1;x_2,p_2) = \frac{4}{\pi^2} \frac{1}{\bar{\sigma}_+^2 \bar{\sigma}_-^2} \exp\Big( -\big[(x_1+x_2)^2 + (p_1-p_2)^2\big]/\bar{\sigma}_+^2 - \big[(x_1-x_2)^2 + (p_1+p_2)^2\big]/\bar{\sigma}_-^2 \Big) , \tag{11} $$

where σ̄_± are given by

$$ \bar{\sigma}_+^2 = \eta e^{+2r} + (1-\eta)(1+2\bar{n}) , \qquad \bar{\sigma}_-^2 = \eta e^{-2r} + (1-\eta)(1+2\bar{n}) . \tag{12} $$

Note that W̄_EPR as above follows directly from W_EPR in
Eq. (4) via the simple replacements σ_± → σ̄_±. Relevant
to the discussion of Bell inequalities in Sections V and VI
is the fact that σ̄_+² σ̄_−² > 1 for any r > 0 and η < 1.
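The behavior of the loss-modified variances in Eq. (12) can be tabulated directly. The following minimal sketch (hypothetical function names; the parameter values are illustrative assumptions) evaluates σ̄_+² and σ̄_−² and their product for several values of the squeezing r, transmission η, and thermal occupation n̄, confirming that the product equals one only in the lossless case and exceeds one once loss is present.

```python
import math


def sigma_bar_sq(r, eta, nbar):
    """Return (sigma_bar_plus^2, sigma_bar_minus^2) following Eq. (12)."""
    added_noise = (1.0 - eta) * (1.0 + 2.0 * nbar)
    return (eta * math.exp(2.0 * r) + added_noise,
            eta * math.exp(-2.0 * r) + added_noise)


def variance_product(r, eta, nbar):
    """Product sigma_bar_plus^2 * sigma_bar_minus^2."""
    plus, minus = sigma_bar_sq(r, eta, nbar)
    return plus * minus


# Illustrative parameter values (assumptions, not taken from any experiment)
for r in (0.5, 1.0):
    for eta in (1.0, 0.9, 0.5):
        for nbar in (0.0, 0.1):
            print(f"r = {r}, eta = {eta}, nbar = {nbar}: "
                  f"product = {variance_product(r, eta, nbar):.4f}")
```

For η = 1 the thermal term drops out and the pure-state values of Eq. (5) are recovered, regardless of n̄.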


III. THE CRITERIA OF GRANGIER AND
GROSSHANS


The two recent papers of Grangier and Grosshans argue
that “fidelity value larger than 2/3 is actually required
for successful teleportation” [11,12]. In this section, we
recapitulate the critical elements of their analysis and
state their criteria in the present notation. In subsequent
sections we proceed further with our own analysis of entanglement
and possible violations of Bell’s inequalities
for the EPR state of Eq. (11).

Beginning  with  Ref.  [11],  Eq.  (21),  Grangier  and 
Grosshans  state  the  following: 

“As  a  criteria  for  non-separability  [by  which 
they  mean  violations  of  Bell’s  inequalities], 
we  will  use  the  EPR  argument:  two  different 
measurements  prepare  two  different  states,  in 
such  a  way  that  the  product  of  conditional 
variances  (with  different  conditions)  violates 
the  Heisenberg  principle.” 


This statement takes a quantitative form in terms of
the following conditional variances expressed in the
notation of the preceding section for EPR beams (1,2):

$$V_{x_i|x_j} = \langle\Delta x_i^2\rangle - \frac{\langle x_i x_j\rangle^2}{\langle\Delta x_j^2\rangle}, \qquad V_{p_i|p_j} = \langle\Delta p_i^2\rangle - \frac{\langle p_i p_j\rangle^2}{\langle\Delta p_j^2\rangle}, \tag{13}$$

with $(i,j) = (1,2)$ and $i \neq j$. Note that, for example,
$V_{x_2|x_1}$ gives the error in the knowledge of the canonical
variable $x_2$ based upon an estimate of $x_2$ from a measurement
of $x_1$, and likewise for the other conditional variances.
These variances were introduced in Refs. [26,27]
in connection with an optical realization of the original
gedanken experiment of Einstein, Podolsky, and Rosen
[25]. An apparent violation of the uncertainty principle
arises if the product of inference errors is below the
uncertainty product for one beam alone. For example,
$V_{x_2|x_1}V_{p_2|p_1} < \frac{1}{16}$ represents such an apparent violation
since $\Delta x_2^2\,\Delta p_2^2 \ge \frac{1}{16}$ is demanded by the canonical
commutation relation between $x_2$ and $p_2$, with here
$\Delta x_{1,2}^2 = \frac{1}{4} = \Delta p_{1,2}^2$ for the vacuum state [26,27].

Grangier and Grosshans elevate this concept of inference
at a distance from the EPR analysis to “a criteria
for non-separability [i.e., violation of Bell’s inequalities].”
Specifically, they state that “the classical limit of no
apparent violation of HI” [and hence the domain of local
realism] is determined by the conditions

$$V_{x_2|x_1}V_{p_2|p_1} \ge \frac{1}{16} \quad\text{and}\quad V_{x_1|x_2}V_{p_1|p_2} \ge \frac{1}{16}. \tag{14}$$

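As shown in Refs. [26,27] for the states under consideration,
the conditional variances of Eq. (13) are simply
related to the following (unconditional) variances

$$\Delta x^2_{\mu_{ij}} = \langle(x_i - \mu_{ij}x_j)^2\rangle, \qquad \Delta p^2_{\nu_{ij}} = \langle(p_i - \nu_{ij}p_j)^2\rangle. \tag{15}$$

If we use a measurement of $x_j$ to estimate $x_i$, then $\Delta x^2_{\mu_{ij}}$
is the variance of the error when the estimator is chosen
to be $\mu_{ij}x_j$, and likewise for $\Delta p^2_{\nu_{ij}}$. For an optimal
estimate, the parameters are given by [26,27]

$$\mu^{\rm opt}_{ij} = \frac{\langle x_i x_j\rangle}{\langle\Delta x_j^2\rangle}, \qquad \nu^{\rm opt}_{ij} = \frac{\langle p_i p_j\rangle}{\langle\Delta p_j^2\rangle}, \tag{16}$$

and in this case,

$$V_{x_i|x_j} = \Delta x^2_{\mu^{\rm opt}_{ij}} \quad\text{and}\quad V_{p_i|p_j} = \Delta p^2_{\nu^{\rm opt}_{ij}}. \tag{17}$$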

The “non-separability” condition of Grangier and
Grosshans in Eq. (14) can then be re-expressed as

$$\Delta x^2_{\mu_{21}}\,\Delta p^2_{\nu_{21}} \ge \frac{1}{16} \quad\text{and}\quad \Delta x^2_{\mu_{12}}\,\Delta p^2_{\nu_{12}} \ge \frac{1}{16}, \tag{18}$$

where we assume the optimized choice and drop the
superscript ‘opt’. Again, Grangier and Grosshans take this
condition of “no apparent violation of HI” as the
operational signature of “nonseparability criteria” [violations
of Bell inequalities], and hence, by their logic, to
delineate the classical boundary for teleportation [11].

To make apparent the critical elements of the discussion,
we next assume symmetric fluctuations as appropriate
to the EPR state of Eq. (11), $\mu_{ij} = \mu_{ji} = \mu$ and
$\nu_{ij} = \nu_{ji} = \nu$, with $\mu = -\nu$. Note that within the
context of our simple model of the losses, the optimal value
of $\mu$ is given by

$$\mu = \frac{\eta\sinh 2r}{(1-\eta) + \eta\cosh 2r}, \tag{19}$$

where in the limit $r \gg 1$, $\mu \to 1$. For this case of
symmetric fluctuations, the HI of Eq. (18) becomes

$$\Delta x^2_\mu\,\Delta p^2_\mu \ge \frac{1}{16}, \tag{20}$$

where

$$\Delta x^2_\mu = \langle(x_1 - \mu x_2)^2\rangle = \langle(x_2 - \mu x_1)^2\rangle, \qquad \Delta p^2_\mu = \langle(p_1 + \mu p_2)^2\rangle = \langle(p_2 + \mu p_1)^2\rangle. \tag{21}$$
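
The quantities in Eqs. (19)-(21) can be evaluated numerically with a short Python sketch. The second moments used below are not quoted from the references; they are our own reading of the Gaussian in Eq. (11), namely that $x_1 + x_2$ has variance $\bar\sigma_+^2/2$ and $x_1 - x_2$ has variance $\bar\sigma_-^2/2$. All function names are hypothetical.

```python
import math

def second_moments(r, eta, n_bar=0.0):
    """Return (<x1^2>, <x1 x2>) for the Gaussian of Eq. (11).

    Assumption of this sketch: var(x1 + x2) = sigma_plus^2 / 2 and
    var(x1 - x2) = sigma_minus^2 / 2, as read off from Eq. (11).
    """
    noise = (1.0 - eta) * (1.0 + 2.0 * n_bar)
    s_plus = eta * math.exp(2.0 * r) + noise
    s_minus = eta * math.exp(-2.0 * r) + noise
    return (s_plus + s_minus) / 8.0, (s_plus - s_minus) / 8.0

def mu_opt(r, eta, n_bar=0.0):
    """Optimal gain <x1 x2> / <x2^2>; reduces to Eq. (19) for n_bar = 0."""
    var, cov = second_moments(r, eta, n_bar)
    return cov / var

def inference_variance(mu, r, eta, n_bar=0.0):
    """<(x1 - mu x2)^2> of Eq. (21); by symmetry equal to <(p1 + mu p2)^2>."""
    var, cov = second_moments(r, eta, n_bar)
    return var - 2.0 * mu * cov + mu * mu * var

def variance_product(r, eta, n_bar=0.0):
    """Left-hand side of Eq. (20), evaluated at the optimal mu."""
    v = inference_variance(mu_opt(r, eta, n_bar), r, eta, n_bar)
    return v * v

if __name__ == "__main__":
    for eta in (0.4, 0.6, 0.9):
        print(eta, mu_opt(1.0, eta), variance_product(1.0, eta), 1.0 / 16.0)
```

Comparing the printed product with 1/16 for different efficiencies shows where the inequality of Eq. (20) holds within this model.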

Note that in general the inequality

$$V_1V_2 \ge \frac{a^2}{4} \tag{22}$$

implies that

$$V_1 + V_2 \ge V_1 + \frac{a^2}{4V_1} \ge a, \tag{23}$$

so that the criterion of Eq. (20) for classical teleportation
leads to

$$\Delta x^2_\mu + \Delta p^2_\mu \ge \frac{1}{2}. \tag{24}$$

Hence, the requirement of Grangier and Grosshans for
quantum teleportation is that

$$\Delta x^2_\mu + \Delta p^2_\mu < \frac{1}{2}, \tag{25}$$

which for $r \gg 1$ becomes

$$\Delta x^2 + \Delta p^2 < \frac{1}{2}. \tag{26}$$

Here, $(\Delta x^2, \Delta p^2)$ are as defined in Eq. (21), now with
$\mu = 1$:

$$\Delta x^2 = \langle(x_1 - x_2)^2\rangle, \qquad \Delta p^2 = \langle(p_1 + p_2)^2\rangle, \tag{27}$$

where from Eq. (11), we have that $\Delta x^2 + \Delta p^2 = \bar\sigma_-^2$
for the EPR beams (1,2). The claim of Grangier and
Grosshans [11] is that the inequality of Eq. (20) serves
as “the condition for no useful entanglement between the
two beams,” where by ‘useful’ they refer explicitly to
“the existence of quantum non-separability [violation of
Bell’s inequalities].” The inequalities of Eqs. (18) and
(20) are also related to criteria developed within the
setting of quantum nondemolition detection (QND) [28], as
discussed in the next section.

In a second paper [12], Grangier and Grosshans introduce
an alternative criterion for the successful teleportation
of coherent states, namely that

“the  information  content  of  the  teleported 
quantum  state  is  higher  than  the  information 
content  of  any  (classical  or  quantum)  copy 
of  the  input  state,  that  may  be  broadcasted 
classically.” 
To quantify the concept of “information content” they
introduce a “generalized fidelity” describing not the overlap
of quantum states as is standard in the quantum information
community, but rather the conditional probability
$P(\alpha|I)$ that a particular coherent state $|\alpha\rangle$ was actually
sent given “the available information $I$.” In effect,
Grangier and Grosshans consider the following protocol.
Victor sends to Alice some unknown coherent state $|\alpha_0\rangle$,
with Alice making her best attempt to determine this
state [34], and sending the resulting measurement outcome
to Bob as in the standard protocol. Bob then does
one of two things. In the first instance, he forwards only
this classical message with Alice’s measurement outcome
to Victor without reconstructing a quantum state. In the
second case, he actually generates a quantum state conditioned
upon Alice’s message and sends this state to Victor,
who must then make his own measurement to deduce
whether the teleported state corresponds to the one that
he initially sent. For successful teleportation, Grangier
and Grosshans demand that the information gained by
Victor should be greater in the latter case where quantum
states are actually generated by Bob than in the former
case where only Alice’s classical measurement outcome is
distributed. It is straightforward to show that Eq. (26)
given above is sufficient to ensure that this second criterion
is likewise satisfied for the teleportation of a coherent
state $|\alpha\rangle$, albeit with the same caveat expressed in [10],
namely that neither the set $S$ of initial states $\{|\psi_{\rm in}\rangle\}$ nor
the distribution $P(|\psi_{\rm in}\rangle)$ over these states is specified.

We now turn to an evaluation of these criteria of
Grangier and Grosshans, placing special emphasis on the
issues of entanglement and violations of Bell’s inequalities,
specifically because these are the concepts Grangier
and Grosshans emphasize in their work [11,12].


IV.  ENTANGLEMENT  AND  FIDELITY 
A.  Nonseparability  of  the  EPR  beams 

To address the question of the nonseparability of the
EPR beams, we refer to the papers of Duan et al. and
of Simon [14,15], as well as related work by Tan [16].
For the definitions of $(x_i, p_i)$ that we have chosen for the
EPR beams (1,2), a sufficient condition for nonseparability
(without an assumption of Gaussian statistics) is
that

$$\Delta x^2 + \Delta p^2 < 1, \tag{28}$$

where $\Delta x^2$ and $\Delta p^2$ are defined in Eq. (27). This result
follows from Eq. (3) of Duan et al. with $a = 1$ (and from
a similar more general equation in Simon) [35]. Note that
Duan et al. have $\Delta x_i^2 = \frac{1}{2} = \Delta p_i^2$ for the vacuum state,
while our definitions lead to $\Delta x_i^2 = \frac{1}{4} = \Delta p_i^2$ for the
vacuum state, where for example, $\Delta x_i^2 = \langle x_i^2\rangle$, and that
all fields considered have zero mean.

Given the Wigner distribution $W^{\rm out}_{\rm EPR}$ as in Eq. (11),
we find immediately that

$$\Delta x^2 + \Delta p^2 = \bar\sigma_-^2 = \eta e^{-2r} + (1-\eta)(1+2\bar n). \tag{29}$$

For the case $\bar n = 0$, the resulting state is always entangled
for any $r > 0$ even for $\eta < 1$, in agreement with the
discussion in Duan et al. [14]. For nonzero $\bar n$, the state is
entangled so long as

$$\bar n < \frac{\eta\,[1 - \exp(-2r)]}{2(1-\eta)}. \tag{30}$$

We emphasize that in the experiment of Furusawa et
al. [4], for which $\bar n = 0$ is the relevant case, the above
inequality guarantees that teleportation was carried out
with entangled (i.e., nonseparable) states for the EPR
beams, independent of any assumption about whether
these beams were Gaussian or pure states [36].
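
The entanglement test of Eqs. (28)-(30) is simple enough to check directly. The sketch below is illustrative only; the function names are ours, and the model is exactly the loss-plus-thermal-noise model of Eq. (12).

```python
import math

def sum_of_variances(r, eta, n_bar=0.0):
    """Delta x^2 + Delta p^2 = sigma_minus^2, as in Eq. (29)."""
    return eta * math.exp(-2.0 * r) + (1.0 - eta) * (1.0 + 2.0 * n_bar)

def is_nonseparable(r, eta, n_bar=0.0):
    """Sufficient criterion of Eq. (28): the sum of variances is below 1."""
    return sum_of_variances(r, eta, n_bar) < 1.0

def n_bar_threshold(r, eta):
    """Right-hand side of Eq. (30); only meaningful for eta < 1."""
    return eta * (1.0 - math.exp(-2.0 * r)) / (2.0 * (1.0 - eta))

if __name__ == "__main__":
    r, eta = 0.5, 0.7
    limit = n_bar_threshold(r, eta)
    print("thermal photon number must stay below", limit)
    print(is_nonseparable(r, eta, 0.9 * limit), is_nonseparable(r, eta, 1.1 * limit))
```

With $\bar n = 0$ the criterion is met for any $r > 0$ and any nonzero efficiency, which is the situation emphasized in the text.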

By contrast to the condition for entanglement given
in Eq. (28), Grangier and Grosshans require instead the
more stringent condition of Eq. (25) for successful
teleportation. Although they would admit that the EPR
beams are indeed entangled whenever Eq. (28) is
satisfied,* they would term entanglement in the domain

$$\frac{1}{2} \le \Delta x^2 + \Delta p^2 < 1$$

as not “useful” [11].

With regard to the QND-like conditions introduced by
Grangier and Grosshans [11], we note that more general
forms for the nonseparability condition of Eq. (28) are
given in Refs. [14,15]. Of particular relevance is a condition
for the variances of Eq. (15) for the case of symmetric
fluctuations as for the EPR state in Eq. (11), $\mu_{ij} = \mu_{ji} = \mu$
and $\nu_{ij} = \nu_{ji} = \nu$, with $\mu = -\nu$. Consider for example
the first set of variances in Eq. (21), namely

$$\Delta x^2_\mu = \langle(x_2 - \mu x_1)^2\rangle \quad\text{and}\quad \Delta p^2_\mu = \langle(p_2 + \mu p_1)^2\rangle, \tag{31}$$

as would be appropriate for an inference of $(x_2, p_2)$ from
a measurement (at a distance) of $(x_1, p_1)$. In this case, a
sufficient condition for entanglement of the EPR beams
(1,2) may be obtained using Eq. (11) of Ref. [15], yielding

$$\Delta x^2_\mu + \Delta p^2_\mu < \frac{1+\mu^2}{2}, \tag{32}$$

which reproduces Eq. (28) for $\mu = 1$. This equation for
nonseparability implies that

$$\Delta x^2_\mu\,\Delta p^2_\mu < \frac{(1+\mu^2)^2}{16}, \tag{33}$$

which is in the form of a Heisenberg-type inequality.
Note that this inequality is satisfied for any $r > 0$ and
$0 < \eta \le 1$ for $\bar n = 0$. As discussed in Refs. [26,27], $\mu$
must be chosen in correspondence to the degree of
correlation between the EPR beams, with $0 \le \mu \le 1$. An
explicit expression for our current model is given in Eq.
(19). By contrast, in applying their QND-like conditions,
Grangier and Grosshans demand instead
the Heisenberg-type inequality

$$\Delta x^2_\mu\,\Delta p^2_\mu < \frac{1}{16}. \tag{34}$$

Within the setting of our current model, this condition can
only be satisfied for efficiency $\eta > \frac{1}{2}$ [37]. Although this
criterion has been found to be useful in the analysis of
back-action evading measurement for quantum nondemolition
detection, it apparently has no direct relevance to
the question of entanglement, for $\mu = 1$ or otherwise.

*Grangier was in fact unaware of Refs. [14,15] when Ref. [11]
was originally posted, having had this work pointed out by
us.

Certainly, $\mu = 1$ is the case relevant to the actual
teleportation protocol of Ref. [13]. However, Alice and Bob
are surely free to explore the degree of correlation
between their EPR beams and to test for entanglement by
any means at their disposal, including simple
measurements with $\mu \neq 1$.

Although the boundary expressed by the nonseparability
conditions of Eqs. (28) and (32) is perhaps not so
familiar in Quantum Optics, we stress that these criteria
are associated quite directly with the standard condition
for nonclassical behavior adopted by this community.
Whenever Eqs. (28) and (32) are satisfied, the
Glauber-Sudarshan phase-space function becomes
non-positive [17], which for almost forty years has heralded
entrance into a manifestly quantum or nonclassical domain.
It is difficult to understand how Grangier and
Grosshans propose to move from $\Delta x^2 + \Delta p^2 = 1$ to
$\Delta x^2 + \Delta p^2 = \frac{1}{2}$ without employing quantum resources
in the teleportation protocol (as is required when the
Glauber-Sudarshan P-function is not positive definite).
Their own work offers no suggestion of how this is to be
accomplished.


B. Fidelity

Turning next to the question of the relationship of
entanglement of the EPR beams [as quantified in Eq. (28)]
to the fidelity attainable for teleportation with these
beams, we recall from Eq. (2) of Ref. [4] that

$$F = \frac{1}{1 + \bar\sigma_-^2}, \tag{35}$$

where this result applies to teleportation of coherent
states [38,39]. When combined with Eq. (29), we find
that

$$F = \frac{1}{1 + (\Delta x^2 + \Delta p^2)}. \tag{36}$$

The criterion of Eq. (28) for nonseparability then
guarantees that nonseparable EPR states as in Eqs. (4,11) (be
they mixed or pure) are sufficient to achieve

$$F > F_{\rm classical} = \frac{1}{2}, \tag{37}$$

whereas separable states must have $F \le F_{\rm classical} = \frac{1}{2}$,
although we emphasize that this bound applies for the
average fidelity for coherent states distributed over the
entire complex plane [6,39].

We thereby demonstrate that the condition
$F > F_{\rm classical} = \frac{1}{2}$ for quantum teleportation as established
in Ref. [6] coincides with that for nonseparability (i.e.,
entanglement) of Refs. [14,15] for the EPR state of
Eq. (11). Note that for $\bar n = 0$, we have

$$F = \frac{1}{2 - \eta\,(1 - e^{-2r})}, \tag{38}$$

so that the entangled EPR beams considered here (as
well as in Refs. [11,12]) provide a sufficient resource for
beating the limit set by a classical channel alone for any
$r > 0$, so long as $\eta > 0$. In fact, the quantities $(\Delta x^2, \Delta p^2)$
are readily measured experimentally, so that the
entanglement of the EPR beams can be operationally verified,
as was first accomplished in Ref. [26], and subsequently
in Ref. [4]. We stress that independently of any further
assumption, the condition of Eq. (28) is sufficient to
ensure entanglement for pure or mixed states [40,41].

The dependence of fidelity $F$ on the degree of squeezing
$r$ and efficiency $\eta$ as expressed in Eq. (38) is illustrated in
Figure 1. Here, in correspondence to an experiment with
fixed overall losses and variable parametric gain in the
generation of the EPR entangled state, we show a family
of curves in Figure 1, each of which is drawn for constant $\eta$
as a function of $r$. Clearly, $F > F_{\rm classical} = \frac{1}{2}$ and hence
nonseparability results in each case. The only apparent
significance of $F = \frac{2}{3}$ as championed by Grangier and
Grosshans (and which results for $\Delta x^2 + \Delta p^2 = \frac{1}{2}$) is to
bound $F$ for $\eta = 0.5$.
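
The fidelity expressions above are easy to tabulate. The following Python sketch (function name ours) combines Eq. (35) with the loss model of Eq. (29), so that for $\bar n = 0$ it reproduces Eq. (38).

```python
import math

def fidelity(r, eta, n_bar=0.0):
    """Coherent-state teleportation fidelity F = 1 / (1 + sigma_minus^2).

    sigma_minus^2 is taken from Eq. (29); for n_bar = 0 this is Eq. (38).
    """
    sigma_minus_sq = eta * math.exp(-2.0 * r) + (1.0 - eta) * (1.0 + 2.0 * n_bar)
    return 1.0 / (1.0 + sigma_minus_sq)

if __name__ == "__main__":
    # One row per efficiency, fidelity as a function of squeezing r.
    for eta in (0.99, 0.90, 0.70, 0.50):
        print(eta, [round(fidelity(r, eta), 4) for r in (0.0, 0.5, 1.0, 2.0)])
```

At $r = 0$ every curve starts at $F = 1/2$, and for $\eta = 0.5$ the fidelity approaches but never reaches 2/3 as $r$ grows.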
FIG. 1. [...] of squeezing $r$ for fixed efficiency $\eta$. From top to bottom, the
curves are drawn with $\eta = \{0.99, 0.90, 0.70, 0.50\}$ in
correspondence to increasing loss $(1 - \eta)$. Note that $F_{\rm classical} = \frac{1}{2}$
provides a demarcation between separable and nonseparable
states (mixed or otherwise), while $F = \frac{2}{3}$ is apparently of
no particular significance, the contrary claims of Refs. [11,12]
notwithstanding. Note that for $\eta = 1$, $r = \frac{\ln 2}{2} = 0.3466$ gives
$F = \frac{2}{3}$, corresponding to $-3$ dB of squeezing. In all cases,
$\bar n = 0$.

As for the criterion of “information content” introduced
by Grangier and Grosshans [12], we note it can
be easily understood from the current analysis and the
original discussion in Ref. [13]. Each of the interventions
by Alice and Bob represents one unit of added vacuum
noise that will be convolved with the initial input
state in the teleportation protocol (the so-called
quduties). Grangier and Grosshans compare the following two
situations: (i) Bob passes directly the classical information
that he receives to Victor and (ii) Bob generates a
quantum state in the usual fashion that is then passed
to Victor. Grangier and Grosshans would demand that
Victor should receive the same information in these two
cases, which requires that $\bar\sigma_-^2 = \Delta x^2 + \Delta p^2 < \frac{1}{2}$ and
hence $F > \frac{2}{3}$. That is, as the degree of correlation
between the EPR beams is increased, there comes a point
for which $\Delta x^2 + \Delta p^2 = \frac{1}{2}$ and for which each of Alice and
Bob’s excess noise has been reduced from 1 quduty each
to $\frac{1}{2}$ quduty each. At this point, Grangier and Grosshans
would (arbitrarily) assign the entire resulting noise of
$\frac{1}{2} + \frac{1}{2} = 1$ quduty to Alice, with then the perspective
that Bob’s state recreation adds no noise. Of course one
could equally well make the complementary assignment,
namely 1 quduty to Bob and none to Alice (again in the
case with $\bar\sigma_-^2 = \frac{1}{2}$). The point that seems to be missed
by Grangier and Grosshans is that key to quantum
teleportation is the transport of quantum states. They
correctly state that “there is no extra noise associated
to the reconstruction: given a measured $\beta$, one
can exactly reconstruct the coherent state $|\beta\rangle$, by using
a deterministic translation of the vacuum.” Bob can
certainly make such a state deterministically, but it is an
altogether different matter for Victor to receive a classical
number from Bob in case (i) as opposed to the actual
quantum state in (ii). In this latter case, apart from having
a physical state instead of a number, Victor must
actually make his own measurement with the attendant
uncertainties inherent in $|\beta\rangle$ then entering. Analogously,
transferring measurement results about a qubit, without
recreating a state at the output (i.e., without sending an
actual quantum state to Victor), is not what is normally
considered to constitute quantum teleportation relative
to the original protocol of Bennett et al. [1].

Turning next to the actual experiment of Ref. [4], we
note that a somewhat subtle issue is that the detection
efficiency for Alice of the unknown state was not 100%,
but rather was $\eta_A^2 = 0.97$. Because of this, the fidelity
for classical teleportation (i.e., with vacuum states in
place of the EPR beams) did not actually reach $\frac{1}{2}$ but
was instead $F_0 = 0.48$. This should not be a surprise,
since there is nothing to ensure that a given classical
scheme will be optimal and actually reach the bound
$F_{\rm classical} = \frac{1}{2}$. Hence, the starting point in the experiment
with $r = 0$ had $F_0 < F_{\rm classical}$; the EPR beams
with $r > 0$ (which were in any event entangled by the
above inequality) then led to increases in fidelity from $F_0$
upward, exceeding the classical bound $F_{\rm classical} = \frac{1}{2}$ for
a small (but not infinitesimal) degree of squeezing. Note
that the whole effect of the offset $F_0 = 0.48 < \frac{1}{2}$ can be
attributed to the lack of perfect (homodyne) efficiency
at Alice’s detector for the unknown state. If, in the current
discussion for determining the classical bound in the
optimal case, we set Alice’s detection efficiency instead to
$\eta_A^2 = 1$, then as shown above, classical teleportation will
achieve $F = \frac{1}{2}$.

Independent of such considerations, we reiterate that
the nonseparability condition of Refs. [14,15] applied to
the EPR state of Eqs. (4) and (11) leads to the same result
$F_{\rm classical} = \frac{1}{2}$ [Eqs. (36) and (37)] as did our previous
analysis based upon teleportation with only a classical
communication channel linking Alice and Bob [6]. This
convergence further supports $F_{\rm classical} = \frac{1}{2}$ as the
appropriate quantum-classical boundary for the teleportation
of coherent states, the claims of Grangier and Grosshans
notwithstanding. Relative to the original work of Bennett
et al. [1], exceeding the bound $F_{\rm classical} = \frac{1}{2}$ for
the teleportation of coherent states can be accomplished with a
classical channel and entangled (i.e., nonseparable) EPR
states, be they mixed or pure, as is made clear by the
above analysis and as has been operationally confirmed
[4].

We should however emphasize that the above conclusions
concerning nonseparability and teleportation fidelity
apply to the specific case of the EPR state as in
Eq. (11), for which inequality Eq. (28) represents both a
necessary and sufficient criterion for nonseparability
according to Refs. [14,15]. More generally, for arbitrary
entangled states, nonseparability does not necessarily lead
to $F > \frac{1}{2}$ in coherent-state teleportation [40,41].


V.  BELL’S  INEQUALITIES 

The papers by Banaszek and Wodkiewicz [18,19] provide
our point of reference for a discussion of Bell’s
inequalities. In these papers, the authors introduce an
appropriate set of measurements that lead to a Bell inequality
of the CHSH type. More explicitly, Eq. (4) of Ref.
[18] gives the operator $\hat\Pi(\alpha;\beta)$ whose expectation values
are to be measured. Banaszek and Wodkiewicz point out
that the expectation value of $\hat\Pi(\alpha;\beta)$ is closely related to the
Wigner function of the field being investigated, namely

$$W(\alpha;\beta) = \frac{4}{\pi^2}\,\Pi(\alpha;\beta), \tag{39}$$

where $\Pi(\alpha;\beta) = \langle\hat\Pi(\alpha;\beta)\rangle$.

For the entangled state shared by Alice and Bob in the
teleportation protocol, we identify $W^{\rm out}_{\rm EPR}$ as the relevant
Wigner distribution for the modes (1,2) of interest, so
that

$$\Pi^{\rm out}_{\rm EPR}(x_1,p_1;x_2,p_2) = \frac{1}{\bar\sigma_+^2\bar\sigma_-^2}\,\exp\left\{-\frac{(x_1+x_2)^2+(p_1-p_2)^2}{\bar\sigma_+^2}-\frac{(x_1-x_2)^2+(p_1+p_2)^2}{\bar\sigma_-^2}\right\}. \tag{40}$$

Banaszek and Wodkiewicz show that $\Pi^{\rm out}_{\rm EPR}(x_1,p_1;x_2,p_2)$
gives directly the correlation function that would otherwise
be obtained from a particular set of observations
over an ensemble representing the field with density
operator $\hat\rho$, where the actual measurements to be
made are as described in Refs. [18,19]. In simple terms,
$\hat\Pi^{\rm out}_{\rm EPR}(0,0;0,0)$ is the parity operator for separate
measurements of photon number on modes (1,2), with then
nonzero $(x_i, p_i)$ corresponding to a “rotation” on the
individual mode $i$ that precedes its parity measurement.

FIG. 2. The function $B(\mathcal{J})$ from Eq. (41) as a function of $\mathcal{J}$
for various values of $(r, \eta)$. Recall that $B > 2$ heralds a direct
violation of the CHSH inequality, with the dashed line $B = 2$
shown. In each of the plots (a)-(d) a family of curves is drawn
for fixed efficiency $\eta$ and four values of $r = \{0.1, 1.0, 2.0\}$.
(a) $\eta = 0.99$, (b) $\eta = 0.90$, (c) $\eta = 0.70$, (d) $\eta = 0.50$; in all
cases, $\bar n = 0$.
The function constructed by Banaszek and Wodkiewicz
to test for local hidden variable theories is denoted
by $B$ and is defined by

$$B(\mathcal{J}) = \Pi^{\rm out}_{\rm EPR}(0,0;0,0) + \Pi^{\rm out}_{\rm EPR}(\sqrt{\mathcal{J}},0;0,0) + \Pi^{\rm out}_{\rm EPR}(0,0;-\sqrt{\mathcal{J}},0) - \Pi^{\rm out}_{\rm EPR}(\sqrt{\mathcal{J}},0;-\sqrt{\mathcal{J}},0), \tag{41}$$

where $\mathcal{J}$ is a positive (real) constant. As shown in Refs.
[18,19], any local theory must satisfy

$$-2 \le B \le 2. \tag{42}$$

As emphasized by Banaszek and Wodkiewicz for the lossless
case, $\Pi^{\rm out}_{\rm EPR}(0,0;0,0) = 1$ “describes perfect correlations
... as a manifestation of ... photons always generated
in pairs.”

There are several important points to be made about
this result. In the first place, in the ideal case with no
loss ($\eta = 1$), there is a violation of the Bell inequality
of Eq. (42) for any $r > 0$. Further, this threshold for
the onset of violations of the CHSH inequality coincides
with the threshold for entanglement as given in Eq. (28),
which likewise is the point for surpassing $F_{\rm classical} = \frac{1}{2}$
as in Eqs. (36,37) and as shown in our prior analysis
of Ref. [6], which is notably based upon a quite different
approach.

Significantly, there is absolutely nothing special about
the point $r = \frac{\ln 2}{2} \approx 0.3466$ (i.e., the point for which
$\exp[-2r] = 0.5$ and for which $F = \frac{2}{3}$ for the teleportation
of coherent states), in contradistinction to the claims of
Grangier and Grosshans [11,12]. Instead,
any $r > 0$ leads to a nonseparable EPR state, to a
violation of a Bell inequality, and to $F > F_{\rm classical} = \frac{1}{2}$ for
the teleportation of coherent states. There is certainly
no surprise here since we are dealing with pure states for
$\eta = 1$ [42].

We next examine the case with $\eta < 1$, which is clearly
of interest for any experiment. Figure 2 illustrates the
behavior of $B$ as a function of $\mathcal{J}$ for various values of
the squeezing parameter $r$ and of the efficiency $\eta$. Note
that throughout our analysis in this section, we make
no attempt to search for optimal violations, but instead
follow dutifully the protocol of Banaszek and Wodkiewicz
as expressed in Eq. (41) for the case with losses as well.

From Figure 2 we see that for any particular set of
parameters $(r, \eta)$, there is an optimum value $\mathcal{J}_{\max}$ that
leads to a maximum value for $B(\mathcal{J}_{\max})$, which is a
situation analogous to that found in the discrete variable
case. By determining the corresponding value $\mathcal{J}_{\max}$ at
each $(r, \eta)$, in Figure 3 we construct a plot that displays
the dependence of $B$ on the squeezing parameter $r$ for
various values of efficiency $\eta$. Note that all cases shown
in the figure lead to fidelity $F > F_{\rm classical}$.
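
The maximization over $\mathcal{J}$ described above can be sketched numerically. The Python code below evaluates Eq. (40) and Eq. (41) and then performs a plain grid search; the grid search, its range `j_upper`, and all function names are our own illustrative choices and are not taken from Refs. [18,19].

```python
import math

def parity_correlation(x1, p1, x2, p2, r, eta, n_bar=0.0):
    """Pi_EPR^out(x1, p1; x2, p2) as written in Eq. (40)."""
    noise = (1.0 - eta) * (1.0 + 2.0 * n_bar)
    s_plus = eta * math.exp(2.0 * r) + noise     # sigma_plus^2, Eq. (12)
    s_minus = eta * math.exp(-2.0 * r) + noise   # sigma_minus^2, Eq. (12)
    exponent = (-((x1 + x2) ** 2 + (p1 - p2) ** 2) / s_plus
                - ((x1 - x2) ** 2 + (p1 + p2) ** 2) / s_minus)
    return math.exp(exponent) / (s_plus * s_minus)

def bell_b(j, r, eta, n_bar=0.0):
    """CHSH combination B(J) of Eq. (41), displacements along x only."""
    d = math.sqrt(j)

    def pi(a, b):
        return parity_correlation(a, 0.0, b, 0.0, r, eta, n_bar)

    return pi(0.0, 0.0) + pi(d, 0.0) + pi(0.0, -d) - pi(d, -d)

def bell_b_max(r, eta, n_bar=0.0, j_upper=2.0, steps=20000):
    """Grid search over 0 <= J <= j_upper; returns (B_max, J_max)."""
    grid = (k * j_upper / steps for k in range(steps + 1))
    return max((bell_b(j, r, eta, n_bar), j) for j in grid)

if __name__ == "__main__":
    for eta in (0.99, 0.90, 0.70, 0.50):
        b_max, j_max = bell_b_max(0.5 * math.log(2.0), eta)
        print(eta, round(b_max, 4), round(j_max, 4))
```

Values of `b_max` above 2 indicate a violation of Eq. (42) for that pair $(r, \eta)$ under this particular measurement protocol; no attempt is made to optimize beyond the single parameter $\mathcal{J}$.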


FIG. 3. (a) The quantity $B$ from Eq. (41) as a function of
$r$ for various values of efficiency $\eta = \{0.99, 0.90, 0.70, 0.50\}$
as indicated. At each point in $(r, \eta)$, the value of $\mathcal{J}$ that
maximizes $B$ has been chosen. Recall that $B > 2$ heralds a
direct violation of the CHSH inequality, with the dashed line
$B = 2$ shown. Also note that $F > \frac{1}{2}$ for all $r > 0$. (b) An
expanded view of $B$ in the small $r$ region $r < 0.1$. Note that
in the case $\eta = 0.70$, $B > 2$ for small $r$. In all cases, $\bar n = 0$.

For $\frac{2}{3} < \eta \le 1$ there are regions in $r$ that produce
direct violations of the Bell inequality considered here,
namely $B > 2$ [43]. In general, these domains with $B > 2$
contract toward smaller $r$ with increasing loss $(1 - \eta)$. In
fact as $r$ increases, $\eta$ must become very close to unity in
order to preserve the condition $B > 2$, where for $r \gg 1$,

$$2(1 - \eta)\cosh(2r) \ll 1. \tag{43}$$

This requirement is presumably associated with the EPR
state becoming more “nonclassical” with increasing $r$ and
hence more sensitive to dissipation [44]. Stated somewhat
more quantitatively, recall that the original state
$|{\rm EPR}\rangle$ of Eq. (3) is expressed as a sum over correlated
photon numbers for each of the two EPR beams (1,2).
The determination of $B$ derives from (displaced) parity
measurements on the beams (1,2) (i.e., projections onto
odd and even photon number), so that $B$ should be
sensitive to the loss of a single photon. The mean photon
number $\bar n_i$ for either EPR beam goes as $\sinh^2 r$, with
then the probability of losing no photons after encountering
the beam-splitter with transmission $\eta$ scaling as
roughly $p_0 \sim [\eta]^{\bar n_i}$. We require the total probability
for the loss of one or more photons to be small, so that

$$(1 - p_0) \ll 1, \tag{44}$$

and hence for $(1 - \eta) \ll 1$ and $r \gg 1$ that

$$(1 - \eta)\,\bar n_i \sim (1 - \eta)\exp(2r) \ll 1, \tag{45}$$

in correspondence to Eq. (43) [45].
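
The rough scaling argument of Eqs. (44) and (45) can be illustrated with a few lines of Python. This is only a sketch of the heuristic stated in the text (mean photon number $\sinh^2 r$, no-loss probability roughly $\eta^{\bar n_i}$); the function names are ours.

```python
import math

def no_loss_probability(r, eta):
    """Heuristic p0 ~ eta ** n_mean, with n_mean = sinh(r)^2 per beam."""
    n_mean = math.sinh(r) ** 2
    return eta ** n_mean

def loss_probability_linearized(r, eta):
    """Small-loss estimate (1 - eta) * n_mean, cf. Eq. (45)."""
    return (1.0 - eta) * math.sinh(r) ** 2

if __name__ == "__main__":
    for r, eta in ((1.0, 0.999), (2.0, 0.99), (2.0, 0.90)):
        exact = 1.0 - no_loss_probability(r, eta)
        print(r, eta, exact, loss_probability_linearized(r, eta))
```

The two estimates agree when $(1-\eta)\,\bar n_i \ll 1$ and part ways once the loss of at least one photon becomes likely.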

On the other hand, note that small values of $r$ in Figure
3 lead to direct violations of the CHSH inequality $B > 2$
with much more modest efficiencies [44]. In particular,
note that for $r = \frac{\ln 2}{2} \approx 0.3466$ and $\eta = 0.90$, $F < \frac{2}{3}$
[from Eq. (38)]. This case and others like it provide
examples for which mixed states are nonseparable and yet
directly violate a Bell inequality, but for which $F < \frac{2}{3}$.
Such mixed states do not satisfy the criteria of Grangier
and Grosshans (neither with respect to their Heisenberg-type
inequality nor with respect to their information
exchange), yet they are states for which $\frac{1}{2} < F < \frac{2}{3}$ and
$B > 2$, which in and of itself calls the claims of Grangier
and Grosshans into question. There remains the possibility
that $F > \frac{2}{3}$ might be sufficient to guarantee that
mixed states in this domain would satisfy $B > 2$,
and hence to exclude a description of the EPR state in
terms of a local hidden variables theory.

To demonstrate that this is emphatically not the case,
we examine further the relationship between the quantity
$B$ relevant to the CHSH inequality and the fidelity
$F$. Figure 4 shows a parametric plot of $B$ versus $F$ for
various values of the efficiency $\eta$. The curves in this figure
are obtained from plots as in Figures 1 and 3 by
eliminating the common dependence on $r$. From Figure
4, we are hard pressed to find any indication that the
value $F = \frac{2}{3}$ is in any fashion noteworthy with respect
to violations of the CHSH inequality. In particular, for
efficiency $\eta \approx 0.90$ most relevant to current experimental
capabilities, the domain $F > \frac{2}{3}$ is one largely devoid of
instances with $B > 2$, in contradistinction to the claim
of Grangier and Grosshans that this domain is somehow
“safer” [11] with respect to violations of Bell’s inequalities.
Moreover, contrary to their dismissal of the domain
$\frac{1}{2} < F < \frac{2}{3}$ as not being manifestly quantum, we see
from Figure 4 that there are in fact regions with $B > 2$.
Overall, the conclusions of Grangier and Grosshans [11]
related to the issues of violation of a Bell inequality and
of teleportation fidelity are simply not supported by an
actual quantitative analysis.


FIG. 4. (a) CHSH quantity B [Eq. (41)] versus fidelity F [Eq. (38)]. The
curves are constructed from Figures 1 and 3 by eliminating the r
dependence, now over the range 0 < r < 5, with r increasing from left to
right for each trace. The efficiency η takes on the values
η = {0.99, 0.90, 0.70, 0.50} as indicated; in all cases, n̄ = 0. Recall that
B > 2 heralds a direct violation of the CHSH inequality, with the dashed
line B = 2 shown. (b) An expanded view around B = 2. Note that B > 2 is
impossible for F < F_classical = 1/2, but that B > 2 for F > F_classical in
various domains (including for η = 0.70 at small r). The purported boundary
F = 2/3 proposed by Grangier and Grosshans [11,12] is seen to have no
particular significance. Contrary to their claims, F = 2/3 provides
absolutely no warranty that B > 2 for F > 2/3, nor does it preclude B > 2
for F < 2/3.

While the above results follow from the particular form of the CHSH
inequality introduced by Banaszek and Wodkiewicz [18,19], we should note
that another quite different path to a demonstration of the inadequacy of
local realism for continuous quantum variables has recently been proposed
by Ralph, Munro, and Polkinghorne [46]. These authors consider a novel
scheme involving measurements of quadrature-phase amplitudes for two
entangled beams (A, B). These beams are formed by combining two EPR states
(i.e., a total of four modes, two for each beam). Relevant to our
discussion is that maximal violations of a CHSH inequality (i.e., B = 2√2)
are predicted for r ≪ 1, with then a decreasing maximum value of B for
increasing r. Once again, the threshold for onset of the violation of a
Bell’s inequality coincides with the threshold for entanglement of the
relevant fields [i.e., Eq. (28)], with no apparent significance to the
boundary set by the Heisenberg-type inequality Eq. (26) of Grangier and
Grosshans.

To conclude this section, we would like to inject a note of caution
concerning any discussion involving issues of testing Bell’s inequalities
and performing quantum teleportation. We have placed them in juxtaposition
here to refute the claims of Grangier and Grosshans related to a possible
connection between the bound F = 2/3 and violation of Bell’s inequalities
(here, via the behavior of the CHSH quantity B). However, in our view there
is a conflict between these concepts, with an illustration of this point
provided by the plot of the CHSH quantity B [Eq. (42)] versus fidelity F
[Eq. (38)] in Figure 4. For example, for η = 0.90, B > 2 over the range
0.50 < F < 0.66, while B < 2 for larger values of F. Hence, local hidden
variables theories are excluded for modest values of fidelity
0.50 < F < 0.66, but not for larger values F > 0.66. This leads to the
strange conclusion that quantum resources are required for smaller values
of fidelity but not for larger ones. The point is that the nonseparable
states that can enable quantum teleportation can in a different context
also be used to demonstrate a violation of local realism. Again, the
juxtaposition of these concepts in this section is in response to the work
of Ref. [11], which in any event offers no quantitative evidence in support
of their association.

VI.  BELL’S  INEQUALITIES  FOR  SCALED 
CORRELATIONS 

The conclusions reached in the preceding section about violations of the
CHSH inequality by the EPR (mixed) state for modes (1,2) follow directly
from the analysis of Banaszek and Wodkiewicz [18,19] as extended to account
for losses in propagation. Towards the end of making these results more
amenable to experimental investigation, recall that the more traditional
versions of the Bell inequalities formulated for spin-1/2 particles or
photon polarizations are based upon an analysis of the expectation value

E(a,  b)  (46) 

for detection events at locations (1,2) with analyzer settings along
directions (a, b). As emphasized by Clauser and Shimony, actual experiments
do not measure directly E(a, b) but rather record a reduced version due to
“imperfections in the analyzers, detectors, and state preparation [20].”
Even after more than thirty years of experiments, no direct violation of
the CHSH inequality has been recorded, where by direct we mean without the
need for post-selection to compensate for propagation and detection
efficiencies (also called strong violations) [22,23]. Rather, only subsets
of events that give rise to coincidences are included for various
polarization settings. This “problem” is the so-called detector efficiency
loophole that several groups are actively working to close.

Motivated by these considerations, we point out that an observation of
violation of a Bell-type inequality has recently been reported [24], based
in large measure upon the earlier proposal of Ref. [47], as well as that of
Refs. [18,19]. This experiment was carried out in a pulsed mode, and
utilized a source that generates an EPR state of the form given by Eq. (11)
in the limit r ≪ 1. Here, the probability $P(\alpha_1, \alpha_2)$ of
detecting a coincidence event between detectors $(D_1, D_2)$ for the EPR
beams (1,2) is given by

$$ P(\alpha_1, \alpha_2) = M [1 + V \cos(\theta_1 - \theta_2 + \phi)], \qquad (47) $$

with then the correlation function E relevant to the construction of a CHSH
inequality $-2 \le S \le 2$ given by

$$ E(\alpha_1, \alpha_2) = V \cos(\theta_1 - \theta_2 + \phi), \qquad (48) $$

where the various quantities are as defined in association with Eqs. (2,3)
in Ref. [24]. Note that the quantity M represents an overall scaling that
incorporates losses in propagation and detection. Significantly, Kuzmich
et al. demonstrated a violation of a CHSH inequality
($S_{exp} = 2.46 \pm 0.06$) in the limit r ≪ 1 and with inefficient
propagation and detection η ≪ 1, albeit with the so-called “detection” or
“fair-sampling” loophole.
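
As an illustration of how a correlation of the form V cos(θ_1 − θ_2 + φ) enters a CHSH combination, the following minimal Python sketch evaluates S for one standard choice of four analyzer settings. The settings, the function names, and the choice φ = 0 are assumptions made for illustration only; they are not taken from Ref. [24]. For these settings S = 2√2 V, so a violation requires V > 1/√2, and a value S = 2.46 would correspond to V of roughly 0.87.

```python
import math


def correlation(theta1, theta2, visibility, phi=0.0):
    """Correlation of the form E = V cos(theta1 - theta2 + phi)."""
    return visibility * math.cos(theta1 - theta2 + phi)


def chsh_s(visibility, phi=0.0):
    """CHSH combination S for one standard (assumed) set of four settings."""
    a, a_prime = 0.0, math.pi / 2            # settings on side 1
    b, b_prime = math.pi / 4, 3 * math.pi / 4  # settings on side 2

    def e(x, y):
        return correlation(x, y, visibility, phi)

    return e(a, b) - e(a, b_prime) + e(a_prime, b) + e(a_prime, b_prime)


def visibility_for(s_value):
    """Visibility that yields a given S when S = 2*sqrt(2)*V."""
    return s_value / (2.0 * math.sqrt(2.0))


if __name__ == "__main__":
    print("S at V = 1:", chsh_s(1.0))
    print("V implied by S = 2.46:", visibility_for(2.46))
```

Because the overall scaling M drops out of the correlation, the sketch depends only on the visibility V, which is why such a violation remains subject to the fair-sampling caveat noted above.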

In terms of our current discussion, this experimental violation of a CHSH
inequality is only just within the nonseparability domain Δx² + Δp² < 1 (by
an amount that goes as ηr ≪ 1), yet it generates a large violation of a
CHSH inequality. If this same EPR state were employed for the teleportation
of coherent states, the fidelity obtained would likewise be only slightly
beyond the quantum-classical boundary F_classical = 1/2. It would be far
from the boundary F = 2/3 offered by Grangier and Grosshans as the point
for “useful entanglement,” yet it would nonetheless provide an example of
teleportation with fidelity F > 1/2 and of a violation of a CHSH
inequality. Of course, the caveat would be the aforementioned
“fair-sampling” loophole, but this same restriction accompanies all
previous experimental demonstrations of violations of Bell’s inequalities.
Once again, we find no support for the purported significance of the
criteria offered by Grangier and Grosshans [11,12].

VII. CONCLUSIONS

Beyond the initial analysis of Ref. [6], we have examined further the
question of the appropriate point of demarcation between the classical and
quantum domains for the teleportation of coherent states. In support of our
previous result that fidelity F_classical = 1/2 represents the bound
attainable by Alice and Bob if they make use only of a classical channel,
we have shown that the nonseparability criteria introduced in Refs. [14,15]
are sufficient to ensure fidelity beyond this bound for teleportation with
the EPR state of Eq. (11), which is in general a mixed state.
Significantly, the threshold for entanglement for the EPR beams as
quantified by these nonseparability criteria coincides with the standard
boundary between classical and quantum domains employed in Quantum Optics,
namely that the Glauber-Sudarshan phase-space function becomes non-positive
definite [17].

Furthermore, we have investigated possible violations of Bell’s
inequalities and have shown that the threshold for the onset of such
violations again corresponds to F_classical = 1/2. For thermal photon
number n̄ = 0 as appropriate to current experiments, direct violations of a
CHSH inequality are obtained over a large domain in the degree of squeezing
r and overall efficiency η. Significant relative to the claims of Grangier
and Grosshans [11,12] is that there is a regime for nonseparability and
violation of the CHSH inequality for which F < 2/3 and for which their
Heisenberg inequalities are not satisfied. Moreover, the experiment of Ref.
[24] has demonstrated a violation of the CHSH inequality in this domain for
(r, η) ≪ 1 (i.e., F would be only slightly beyond 1/2), albeit with the
caveat of the “fair-sampling” loophole. We conclude that fidelity F > 2/3
offers absolutely no warranty or “safety” relative to the issue of
violation of a Bell inequality (as might be desirable, for example, in
quantum cryptography), in direct disagreement with the assertions by
Grangier and Grosshans. Quite the contrary, larger r (and hence larger F)
leads to an exponentially decreasing domain in allowed loss (1 − η) for
violation of the CHSH inequality, as expressed by Eq. (43) [45].

Moreover, beyond the analysis that we have presented here, there are
several other results that support F_classical = 1/2 as being the
appropriate boundary between quantum and classical domains. In particular,
we note that any nonseparable state and hence also our mixed EPR state is
always capable of teleporting perfect entanglement, i.e., one half of a
pure maximally entangled state. This applies also to those nonseparable
states which lead to fidelities 1/2 < F < 2/3 in coherent-state
teleportation. According to Refs. [11,12], this would force the conclusion
that there is entanglement that is capable of teleporting truly
nonclassical features (i.e., entanglement), but which is not “useful” for
teleporting rather more classical states such as coherent states. Further,
in Ref. [48] it has been shown that entanglement swapping can be achieved
with two pure EPR states for any nonzero squeezing in both initial states.
Neither of the initial states has to exceed a certain amount of squeezing
in order to enable successful entanglement swapping. This is another
indication that F = 2/3, which is exceedable in coherent-state
teleportation only with more than 3 dB squeezing, is of no particular
significance.

We also point out that Giedke et al. have shown that for all bipartite
Gaussian states, nonseparability implies distillability [49,50]. This
result applies to those nonseparable states for which 1/2 < F < 2/3 in
coherent state teleportation, which are otherwise dismissed by Grangier and
Grosshans as not “useful.” To the contrary, entanglement distillation could
be applied to the mixed EPR states employed for teleportation in this
domain (and in general for F > 1/2) [51], leading to enhanced teleportation
fidelities and to expanded regions for violations of Bell’s inequalities
for the distilled subensemble.

By contrast, there appears to be no support for the claims of Grangier and
Grosshans [11,12] that their so-called Heisenberg inequality and
information exchange are somehow “special” with respect to the issues of
separability and violations of Bell’s inequalities. They have neither found
fault in the prior analysis of Ref. [6], nor with the application of the
work on nonseparability [14-16] to the current problem. They have likewise
provided no analysis that directly supports their assertion that their
Heisenberg inequality is in any way significant to the possibility that
“the behavior of the observed quantities can be mimicked by a classical and
local model.” [11] Rather, they attempt to set aside by fiat a substantial
body of evidence in favor of the boundary F_classical = 1/2 for the
teleportation of coherent states with a lack of rigor indicated by their
claim that “F = 2/3 would be much safer.” [11]

However, having said this, we emphasize that there is no criterion for
quantum teleportation that is sufficient to all tasks. For the special case
of teleportation of coherent states, the boundary between classical and
quantum teleportation is fidelity F_classical = 1/2, as should by now be
firmly established. Fidelity F > 2/3 will indeed enable certain tasks to be
accomplished that could not otherwise be done with 1/2 < F < 2/3. However,
F = 2/3 is in no sense an important point of demarcation for entrance into
the quantum domain. There is instead a hierarchy of fidelity thresholds
that enable ever more remarkable tasks to be accomplished via teleportation
within the quantum domain, with no one value being sufficient for all
possible purposes. For example, if the state to be teleported were some
intermediate result from a large-scale quantum computation as for Shor’s
algorithm, then surely the relevant fidelity threshold would be well beyond
any value currently accessible to experiment, F ~ 1 − ε, with ε < 10^-4 to
be compatible with current work in fault tolerant architectures. We have
never claimed that F = 1/2 endows special powers for all tasks such as
these, only that it provides an unambiguous point of entry into the quantum
realm for the teleportation of coherent states.

An extensive characterization of high finesse optical cavities used in cavity QED experiments is
described.  Different  techniques  in  the  measurement  of  the  loss  and  phase  shifts  associated  with 
the  mirror  coatings  are  discussed  and  their  agreement  shown.  Issues  of  cavity  field  mode  structure 
supported  by  the  dielectric  coatings  are  related  to  our  effort  to  achieve  the  strongest  possible  coupling 
between  an  atom  and  the  cavity. 


I.  INTRODUCTION 

For many contemporary physics experiments, measurement enhancement via an
optical cavity is a useful tool. Indeed, an optical cavity allows one to
extend the interaction length between matter and field, to build up the
optical power, to maintain a well-defined mode structure, and to study the
extreme nonlinear optics and quantum mechanics associated with the large
field of a single photon for small cavity volumes [1]. In most situations,
a better understanding of cavity and mirror properties is important for
achieving improved sensitivity and for elimination of systematic errors.
For example, in cavity QED, one needs to know the mode structure of the
intracavity field in order to develop the optimum strategy of atom-cavity
coupling; for frequency metrology, accurate determination of phase shifts
of the resonant fields can provide precision frequency markers; and in
quantitative spectroscopy, knowledge of the mirror loss sets the accuracy
scale of absorption measurement. On the technology development side, the
knowledge gained from careful mirror characterization could provide
guidelines for the optical coating community to develop in situ measurement
and control capabilities of the coating process.

The work presented in this paper is motivated by the ever-increasing demand
for a high coherent coupling rate between an atom and the field, as well as
for a decreasing cavity loss rate. The aim is to have coherent (reversible)
evolution dominating over dissipative processes, and thereby to explore
manifestly quantum dynamics in real time, which in turn should lead
eventually to the investigation of the strong conditioning of system
evolution on measurement results and the realization of quantum feedback
control. An important feature associated with strong coupling is that
system dynamics are readily influenced by single quanta. Thus single-atom
and single-photon cavity QED provides an ideal stage where the dynamical
processes of individual quantum systems can be isolated and manipulated. A
collection of such coherent systems could help to realize a distributed
quantum network for computation and communication [2]. At each node the
quantum information is stored by one or a collection of entangled atoms.
Photons serve as the communication link, which in turn entangles the whole
network. Within this context, technical advances in optical cavity quantum
electrodynamics have become increasingly important. Some significant
developments down this road have been achieved by the group at Caltech
[3, 4, 5, 6, 7, 8], where in Ref. [8] the one-photon Rabi frequency is
$\Omega_1/2\pi$ = 220 MHz, in comparison with the atomic decay rate
$\gamma_\perp/2\pi$ = 2.6 MHz and the cavity decay rate $\kappa/2\pi$ =
14.2 MHz.

The strong coupling condition $\Omega_1 \gg (\gamma_\perp, \kappa)$ is
achieved by using a small cavity length, of the order of 10 μm. Precise
measurement of the length of a short optical cavity facilitates the
determination of mirror coating characteristics. A 10 μm cavity length
translates to a free-spectral-range (FSR) of 15 THz, or a wavelength
difference of a few tens of nanometers (for example, it is 36 nm for a
center wavelength of 852 nm) for neighboring cavity modes. Therefore a
straightforward 6-digit measurement of the wavelengths (Burleigh wavemeter)
of the cavity modes acquires a precision of the order of $5 \times 10^{-5}$
for accurate determination of the equivalent optical length of the cavity,
from which details of the index of refraction and layer thickness of
materials in the mirror stack can be inferred.
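
The numbers quoted above can be checked with a short Python sketch. It assumes an ideal two-mirror cavity of nominal length L, ignoring the penetration of the field into the coatings, and uses the first-order estimate λ²/(2L) for the wavelength spacing of neighboring modes; the function names are illustrative only.

```python
C = 299_792_458.0  # speed of light in vacuum, m/s


def free_spectral_range(length_m):
    """FSR = c / (2 L) for an ideal two-mirror cavity of length L."""
    return C / (2.0 * length_m)


def mode_spacing_wavelength(center_wavelength_m, length_m):
    """First-order wavelength spacing of adjacent modes, lambda^2 / (2 L)."""
    return center_wavelength_m ** 2 / (2.0 * length_m)


if __name__ == "__main__":
    length = 10e-6        # 10 um cavity
    wavelength = 852e-9   # center wavelength
    print("FSR (THz):", free_spectral_range(length) / 1e12)
    print("mode spacing (nm):", mode_spacing_wavelength(wavelength, length) * 1e9)
```

For L = 10 μm this gives an FSR close to 15 THz and a mode spacing of about 36 nm at 852 nm, consistent with the values stated in the text.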

The low loss rate of the cavity field is made possible by high quality
mirror coatings that lead to scatter and absorption losses in the
$10^{-6}$ range [9, 10]. The cavity finesse and overall cavity transmission
can be measured directly to determine the mirror losses, $l$, and
transmission, $T$. This information can be combined with the FSR
measurement in two useful ways: Firstly, the FSR measurement is sensitive
to the difference in refractive index $n_H - n_L$ of the materials making
up the multilayer mirror stack, whereas the transmission $T$ depends on the
ratio $n_H/n_L$, as will be shown later. As a result, a precise measurement
of both the FSR and $T$ can be used to determine the values of $n_H$ and
$n_L$ independently. Moreover, by mapping out the wavelength dependence of
the FSR, the thickness of layers in the mirror stack can be determined.
Secondly, if one of the refractive indices (here $n_L$) is well known, then
the FSR measurement determines $n_H$, and an independent value for the
mirror transmission $T$ can then be calculated from $n_H$ and $n_L$, and
compared to the experimentally measured result. Indeed, the work presented
in this paper shows that we are able to make complementary and mutually
confirming measurements of the cavity properties by the two approaches,
i.e., measurements of the direct cavity loss and the dispersion of the
cavity modes.

Coming back to the cavity QED experiments, we note that knowledge of the
cavity properties is of importance in two particular ways: 1. Mirror
absorption/scatter losses are a critical limiting factor in the loss rate
from our cavity QED system; for our current cavities the loss rate from
photon scattering due to mirror imperfections is similar in size to the
atomic spontaneous emission rate. To build robust quantum
computing/communications devices from cavity QED components, it is
necessary to improve the ratio of mirror transmission to mirror losses.
2. The standing-wave light field inside the cavity penetrates into the
mirror coatings, giving a larger mode volume $V_{mode}$ than would be
expected naively from the physical distance between the mirror surfaces.
Since $\Omega_1 \propto 1/\sqrt{V_{mode}}$, as our micro-cavities are
pushed to shorter lengths, this leakage field will have a non-negligible
effect on the achievable coupling strength $g_0 = \Omega_1/2$.


II.  DIRECT  TRANSMISSION  AND  LOSS 
MEASUREMENTS 

All of the mirrors described in this paper were fabricated by Research
Electro-Optics in Boulder, Colorado [10]. More specifically, the
measurements were made for the particular coating run REO #T95 and involved
mirrors with radius of curvature R = 10 and 20 cm. The coating run had a
design transmission of $T_{th}$ = 7 ppm at a center wavelength of 852 nm,
from which a cavity finesse of $\mathcal{F}$ = 370,000 was expected. It was
somewhat surprising therefore to measure a finesse of $\mathcal{F}$ =
480,000 at the targeted wavelength, and this prompted us to make more
detailed measurements of the mirror properties, and design a model to match
these measurements.

Firstly, losses were measured directly with a 40 μm length cavity of 20 cm
radius of curvature mirrors in the usual way by recording resonant cavity
transmission, reflection and finesse. If we denote the transmission of
mirrors 1 and 2 by $T_1$ and $T_2$, respectively, and the (absorption +
scatter) loss per mirror as $l_i = (A + S)_i$, then the total cavity losses
$\mathcal{L} = T_1 + T_2 + l_1 + l_2$ can be determined from the cavity
finesse $\mathcal{F} = FSR/2\kappa$, with FSR as the cavity free spectral
range and $\kappa$ as the HWHM for the TEM00 mode of the cavity;
equivalently, $\mathcal{F} = 2\pi/\mathcal{L}$. The cavity linewidth
$\beta = 2\kappa$ can be determined from a ring-down measurement or using a
modulation sideband as a frequency marker with the cavity length scanned,
which is the technique employed here. The cavity transmission
$I_{trans} = 4 T_1 T_2 / (T_1 + T_2 + l_1 + l_2)^2$ can then be used to
determine $l_1 + l_2$, if $T_1$ and $T_2$ are known independently. In
practice this is a difficult measurement to make, because the overall
transmission $I_{trans}$ depends on the mode-matching into the cavity being
perfect. A variation of this protocol that does not require perfect
mode-matching can be derived, by comparing the cavity reflection and
transmission values with the cavity locked on resonance and off resonance.

The rudiments of this protocol are as follows. First of all, the total loss
($\mathcal{L} = T_1 + T_2 + l_1 + l_2$) is always measured first with the
determination of the cavity FSR and linewidth. Now let us denote the input
power as $P_{in}$, the reflected power $P_r$, and the transmitted power
$P_t$. There is also a mode matching factor $\epsilon$, meaning that of the
input power of $P_{in}$, only $\epsilon P_{in}$ is useful for coupling to
the cavity TEM00 mode, and $(1 - \epsilon) P_{in}$ is wasted. We have the
following equations (the assumption of two equal mirrors is reasonable
since the two mirrors are produced in the same coating run)

$$ \mathcal{F} = \frac{2\pi}{T_1 + T_2 + l_1 + l_2} = \frac{\pi}{T + l}, \qquad (1) $$

$$ \frac{P_t}{\epsilon P_{in}} = 4 T_1 T_2 \left(\frac{\mathcal{F}}{2\pi}\right)^2 = T^2 \left(\frac{\mathcal{F}}{\pi}\right)^2, \qquad (2) $$

$$ \frac{P_r - (1 - \epsilon) P_{in}}{\epsilon P_{in}} = (l_1 + l_2 + T_1 - T_2)^2 \left(\frac{\mathcal{F}}{2\pi}\right)^2 = l^2 \left(\frac{\mathcal{F}}{\pi}\right)^2. \qquad (3) $$


Remember that $(1 - \epsilon) P_{in}$ is the “useless” power that is
reflected directly off of the input mirror, and must be subtracted from
$P_r$ to leave the reflected power we wish to measure, that is, the sum of
the field leaked from the cavity storage and the field (mode-matched)
directly reflected off the input mirror. This cavity contrast is a direct
result of the mirror properties. Division of equation 2 by 3 gives

$$ \frac{P_t}{P_r - P_{in}} = \frac{T^2 (\mathcal{F}/\pi)^2}{l^2 (\mathcal{F}/\pi)^2 - 1}. \qquad (4) $$

Equation 4, combined with 1, will determine completely $T$ and $l$.

In the actual experiment, this direct measurement approach found that (from
finesse we have $l + T$ = 7.2 ppm) $P_{in}$ = 54 μW, $P_r$ = 42.6 μW and
$P_t$ = 4.82 μW and therefore $l$ = 2.9 ppm and $T$ = 4.3 ppm, with
measurement uncertainties below 5%.
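
The partition between T and l can be reproduced numerically from the quoted powers. The sketch below is one way to solve Eqs. (1) and (4) under the equal-mirror assumption: writing x = T/(T + l), Eq. (1) gives T(F/π) = x and l(F/π) = 1 − x, so Eq. (4) reduces to P_t/(P_r − P_in) = x/(x − 2). The function name and the returned mode-matching estimate are illustrative and not part of the original analysis.

```python
def split_transmission_and_loss(total_ppm, p_in, p_r, p_t):
    """Split the per-mirror total (T + l) into T and l using Eqs. (1) and (4).

    total_ppm       : T + l per mirror, from the finesse (Eq. 1), in ppm
    p_in, p_r, p_t  : input, reflected and transmitted powers (same units)
    Returns (T_ppm, l_ppm, mode_matching_epsilon).
    """
    ratio = p_t / (p_r - p_in)           # left-hand side of Eq. (4)
    x = -2.0 * ratio / (1.0 - ratio)     # x = T / (T + l), from ratio = x / (x - 2)
    t_ppm = x * total_ppm
    l_ppm = total_ppm - t_ppm
    epsilon = p_t / (p_in * x ** 2)      # Eq. (2) with T * F / pi = x
    return t_ppm, l_ppm, epsilon


if __name__ == "__main__":
    t_ppm, l_ppm, eps = split_transmission_and_loss(7.2, 54.0, 42.6, 4.82)
    print(f"T = {t_ppm:.2f} ppm, l = {l_ppm:.2f} ppm, epsilon = {eps:.2f}")
```

With the quoted values this yields T close to 4.3 ppm and l close to 2.9 ppm, in agreement with the numbers in the text.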

Another way to measure (T, l) is by sweeping out all the high order spatial
modes and carefully noting the transmission and reflection powers at each
spatial mode. One measures the total input power and also sums together the
powers of every matched mode for transmission and reflection. These three
powers can be used in Eqs. 2 and 3 to calculate the partition between T and
l. That measurement produced l = 3 ppm and T = 4.2 ppm. The value of T
should be a bit lower in this case because it is not possible to include
all higher order modes in the measurement; some of them are too weak to
resolve.

Other cavities measured with mirrors from the same coating run had higher
finesse, very likely due to a lower density of surface defects. To
construct a cavity of minimal mode volume for the intended maximal coherent
coupling rate, we need to have the distance between two mirrors (radius of
curvature R = 10 cm) on the order of 10 μm or below. To avoid contact
between the outer edges of the two mirrors, the mirrors were fabricated
with cone-shaped fronts, reducing the substrate radius from 3 mm to 1 mm.
We note that this extra machining step might have introduced some
additional surface defects on some mirrors. However, the highest finesse
achieved with cone-shaped mirrors was comparable to unmodified pieces, at
$\mathcal{F}$ = 480,000 ± 10,000, corresponding to losses l = 2.2 ppm if
mirror transmission T = 4.3 ppm as determined from the above measurements.
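
The relation between finesse and per-mirror loss used here follows from Eq. (1) with two equal mirrors, l = π/F − T. A minimal Python sketch of this arithmetic (function names are illustrative) is:

```python
import math


def per_mirror_total_ppm(finesse):
    """T + l per mirror, in ppm, from F = pi / (T + l) (two equal mirrors)."""
    return math.pi / finesse * 1e6


def per_mirror_loss_ppm(finesse, transmission_ppm):
    """Absorption + scatter loss l per mirror, in ppm, given T in ppm."""
    return per_mirror_total_ppm(finesse) - transmission_ppm


if __name__ == "__main__":
    for finesse in (470_000, 480_000, 490_000):
        print(finesse, round(per_mirror_loss_ppm(finesse, 4.3), 2))
```

For a finesse of 480,000 and T = 4.3 ppm this gives l of about 2.2 ppm, and the quoted ±10,000 uncertainty in finesse corresponds to roughly ±0.14 ppm in l.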


III.  TECHNICAL  DETAILS  OF  THE  MODEL 


In this section we derive a model for the coating properties. A
transfer-matrix formalism was used to calculate the input-output
propagation of a plane-wave field through the 37 layer stack of alternating
high index (Ta2O5, $n_H$ = 2.0411) and low index (SiO2, $n_L$ = 1.455)
dielectric layers (these refractive indices are assumed to be constant with
wavelength). The substrate refractive index (supplied by REO) used was
$n_s$ = 1.5098. That is, the transfer of the field through each λ/4 layer
is represented by a matrix, and the response of the entire mirror (or
cavity) is determined by the product of these individual matrices.

Following the treatment of Hecht [11] for normal incidence, we take the
matrix representing layer j to be given by

$$ M_j = \begin{pmatrix} \cos(k h_j) & i \sin(k h_j)/Y_j \\ i Y_j \sin(k h_j) & \cos(k h_j) \end{pmatrix}. $$

Here $M_j$ relates the electric and magnetic fields (E, H) of the input and
output via

$$ \begin{pmatrix} E_{out} \\ H_{out} \end{pmatrix} = M_j \begin{pmatrix} E_{in} \\ H_{in} \end{pmatrix}, \qquad (5) $$

$k = 2\pi/\lambda$ is the free-space wavevector of the incident light,
$h_j = n_j \times$ (layer thickness) with $n_j$ the refractive index, and
$Y_j = \sqrt{\epsilon_0/\mu_0}\, n_j$ with $(\epsilon_0, \mu_0)$ the
electric and magnetic constants in SI units. For an exact λ/4 layer (and
for light at the design wavelength of the coating), this simplifies to

$$ M_j = \begin{pmatrix} 0 & i/Y_j \\ i Y_j & 0 \end{pmatrix}. $$

A multilayer stack is represented by multiplying the matrices of the
individual layers: for light incident on layer 1, the matrix for the entire
structure of q layers is defined as the product $M = M_1 M_2 \ldots M_q$.
For our mirror stack, this gives
$M = (M_{Ta_2O_5} M_{SiO_2})^{18} M_{Ta_2O_5}$. Note that at the coating
center (where there is an exact λ/4 layer),

$$ M_{Ta_2O_5} M_{SiO_2} = \begin{pmatrix} -n_L/n_H & 0 \\ 0 & -n_H/n_L \end{pmatrix}, $$

so the system matrix has the simple form

$$ M = \begin{pmatrix} 0 & (i/Y_H)(n_L/n_H)^{18} \\ i Y_H (n_H/n_L)^{18} & 0 \end{pmatrix}. $$


For a field incident from material with index $n_0$ and
exiting into material with index $n_s$, the resulting transmission
coefficient is given by

$$t = \frac{2Y_0}{Y_0 M_{11} + Y_0 Y_s M_{12} + M_{21} + Y_s M_{22}}, \qquad (6)$$

with transmission $T = \frac{n_s}{n_0}|t|^2$ (the factor $n_s/n_0$ accounts for
the change in the amplitude of the electric field in the
dielectric, thereby conserving the net energy flux). At
the center wavelength of the coating then,

$$T = \frac{n_s}{n_0}\left|\frac{-2i}{(n_s/n_H)(n_L/n_H)^{18} + (n_H/n_0)(n_H/n_L)^{18}}\right|^2. \qquad (7)$$

We can make a further simplification: as $(n_L/n_H)^{18} = 0.0018$
and $(n_H/n_L)^{18} = 557$, the first term in the denominator
of the above equation is only a $10^{-6}$ correction, so
the final result for $T$ at the coating center becomes

$$T = 4 n_s n_0 (n_L)^{36}/(n_H)^{38}, \qquad (8)$$

and the transmission is determined by the ratio of the
refractive indices.
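
The chain from the layer matrix to Eq. (8) can be checked numerically. The following is a minimal sketch, not the authors' code: it builds the $(HL)^{18}H$ stack from the layer matrices, applies Eq. (6), and compares against the closed form of Eq. (8). The function names are hypothetical, admittances are expressed in units of $\sqrt{\epsilon_0/\mu_0}$ (which cancels in Eq. (6)), and the substrate index `n_s = 1.5` is an assumption, since the text does not quote it.

```python
import numpy as np

def layer_matrix(n, thickness, wavelength):
    """Matrix of one layer; admittance Y = n in units of sqrt(eps0/mu0)."""
    kh = 2 * np.pi / wavelength * n * thickness  # k * h_j, h_j = optical thickness
    Y = n
    return np.array([[np.cos(kh), 1j * np.sin(kh) / Y],
                     [1j * Y * np.sin(kh), np.cos(kh)]])

def stack_transmission(n_H, n_L, pairs, n_0, n_s, wavelength, center):
    """Power transmission of an (H L)^pairs H quarter-wave stack via Eq. (6)."""
    M_H = layer_matrix(n_H, center / (4 * n_H), wavelength)
    M_L = layer_matrix(n_L, center / (4 * n_L), wavelength)
    M = np.linalg.matrix_power(M_H @ M_L, pairs) @ M_H
    Y_0, Y_s = n_0, n_s
    t = 2 * Y_0 / (Y_0 * M[0, 0] + Y_0 * Y_s * M[0, 1] + M[1, 0] + Y_s * M[1, 1])
    return (n_s / n_0) * abs(t) ** 2

def closed_form(n_H, n_L, pairs, n_0, n_s):
    """Eq. (8) written for a general number of pairs p: 4 n_s n_0 n_L^(2p) / n_H^(2p+2)."""
    return 4 * n_s * n_0 * n_L ** (2 * pairs) / n_H ** (2 * pairs + 2)

# Nominal indices from the text; n_s = 1.5 is an assumed substrate index.
T_full = stack_transmission(2.0411, 1.455, 18, 1.0, 1.5, 850e-9, 850e-9)
T_approx = closed_form(2.0411, 1.455, 18, 1.0, 1.5)
print(T_full, T_approx)
```

Under these assumptions the two values agree to a few parts in $10^6$, which is the size of the neglected first term in the denominator of Eq. (7).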

This calculation reproduced the target reflectivity
of $T_{th}$ = 7.3 ppm for the coating run #T95, and
$T_{th}$ = 14.6 ppm for another REO coating run #D1306
where the number of layers was reduced to 35. The model
and measured (REO spectrophotometer data) “coating
curves” are shown in Fig. 1 for the #D1306 coating run.

For  a  fixed  cavity  length  the  resonance  wavelengths 
of  the  cavity  can  be  calculated  simply  with  the  same 
transfer-matrix  formalism,  using  a  matrix  for  the  entire 
system, $M_{total} = M M_{gap} M$ (a product of two mirrors
plus  a  fixed- length  vacuum  gap  in  between).  The  calcu¬ 
lation  steps  through  a  series  of  wavelengths  calculating 
the  cavity  transmission  T  at  each,  and  by  finding  places 
of  maximum  transmission  finds  the  vacuum  wavelengths 
of  the  cavity  resonances. 
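
The wavelength scan described above can be sketched as follows. This is an illustrative toy, not the authors' calculation: to keep the resonances wide enough for a coarse 0.01 nm grid it uses only 4 layer pairs per mirror (an assumption made purely for speed), a 10 μm vacuum gap, and an assumed substrate index of 1.5 on both sides. All function names are hypothetical.

```python
import numpy as np

def layer_matrix(n, thickness, wavelength):
    """Layer matrix with admittance Y = n (units of sqrt(eps0/mu0))."""
    kh = 2 * np.pi / wavelength * n * thickness
    return np.array([[np.cos(kh), 1j * np.sin(kh) / n],
                     [1j * n * np.sin(kh), np.cos(kh)]])

def mirror_matrix(n_H, n_L, pairs, center, wavelength):
    """(H L)^pairs H quarter-wave stack; the layer sequence is palindromic."""
    M_H = layer_matrix(n_H, center / (4 * n_H), wavelength)
    M_L = layer_matrix(n_L, center / (4 * n_L), wavelength)
    return np.linalg.matrix_power(M_H @ M_L, pairs) @ M_H

def cavity_transmission(wavelength, gap, n_H, n_L, pairs, center, n_s):
    """Transmission of mirror + vacuum gap + mirror between identical substrates."""
    M = mirror_matrix(n_H, n_L, pairs, center, wavelength)
    M_gap = layer_matrix(1.0, gap, wavelength)
    M_total = M @ M_gap @ M
    t = 2 * n_s / (n_s * M_total[0, 0] + n_s * n_s * M_total[0, 1]
                   + M_total[1, 0] + n_s * M_total[1, 1])
    return abs(t) ** 2  # same index on both sides, so no n_s/n_0 prefactor

def find_resonances(wavelengths, T, threshold=0.5):
    """Return wavelengths of local transmission maxima above the threshold."""
    peaks = []
    for i in range(1, len(T) - 1):
        if T[i] > threshold and T[i] >= T[i - 1] and T[i] > T[i + 1]:
            peaks.append(wavelengths[i])
    return peaks

gap = 10e-6  # physical mirror separation L (m)
wavelengths = np.arange(800e-9, 900e-9, 0.01e-9)
T = np.array([cavity_transmission(w, gap, 2.0411, 1.455, 4, 850e-9, 1.5)
              for w in wavelengths])
peaks = find_resonances(wavelengths, T)
print([round(p * 1e9, 2) for p in peaks])
```

For mirrors with ppm-level transmission the resonances are far narrower than this grid, so a realistic scan would need a much finer step or a root-finding step around each coarse maximum.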

Conversely,  for  a  given  set  of  measured  cavity- 
resonance  wavelengths,  it  is  possible  to  determine  the 
effective  cavity  length  precisely.  With  a  commercial 
wavelength-meter  that  gives  6-digit  wavelength  measure¬ 
ment,  we  typically  measure  the  cavity  resonance  within 
an  uncertainty  of  0.01  nm.  Error  propagation  analysis 
gives  an  uncertainty  for  the  determination  of  the  effective 
cavity length (tens of microns) on the order of 0.05 to 0.1
nm. The parameters of the model (index contrast, layer
thickness) are set by comparison to such measurements.
Hence,  armed  with  the  detailed  knowledge  of  the  mirrors 
provided  from  the  model,  the  physical  cavity  length  can 
be  determined  precisely  from  a  single  measurement  of  res¬ 
onance,  for  example,  when  the  cavity  is  locked  to  a  laser 
of known frequency (in our case a cesium transition at
852.359 nm). Close to the center of the design wavelength
of the coating, the effective cavity length (on resonance)
is roughly $L_{eff} = L + 1.633\,\lambda/2$ with $L$ (the physical
distance between the mirror surfaces) an integer number of
$\lambda/2$. The physical cavity length can therefore be
determined, with an uncertainty of ~0.5 nm, limited by the
overall  parameter-fitting  in  the  model.  Further  details  of 
the  wavelength-dependence  are  provided  by  reference  to 
the  model. 


IV.  FREE-SPECTRAL  RANGE  (FSR) 
MEASUREMENTS 


To determine the parameters of the model (index contrast,
layer thickness), a series of precise measurements
of the cavity FSR (frequency between successive cavity
resonances) was made [12, 13, 14]. At fixed cavity
length a Ti:Sapphire laser was tuned to find successive
resonant wavelengths ($\lambda_1$, $\lambda_2$) of the cavity, and an
experimentally determined length was then defined by

$$L_{expt} = \frac{\lambda_1\lambda_2}{2(\lambda_1 - \lambda_2)}.$$

FIG. 1: (a) Calculated and (b) measured transmission of coating
as a function of wavelength, for a 35 layer $\lambda/4$ stack with
$n_H$ = 2.0411, $n_L$ = 1.455, and center wavelength 850 nm.


This length comprises the actual physical length between
the two mirror surfaces, $L$, plus a contribution
from leakage of the mode into the mirror stack, which
gives rise to an additional phase shift at the coatings,
to give a length $L_{eff} > L$. In addition, the leakage into
the coatings increases with wavelength as ($\lambda_1$, $\lambda_2$) move
away from the coating design wavelength, so this gives
another additional contribution to the round-trip phase
and hence to the measured length $L_{expt}$.

As discussed in Ref. [12], if $\lambda_1$ and $\lambda_2$ were closely
spaced compared to the scale on which the coating properties
vary (so that coating dispersion could be neglected)
then near the design wavelength of the coating, we would
have

$$L_{expt} = L_{eff} = L + \frac{1}{n_H - n_L} \times \frac{\lambda_c}{2},$$

where $n_H$ and $n_L$ are the refractive indices of the high and
low index materials of the stack, and $\lambda_c = 2\lambda_1\lambda_2/(\lambda_1 + \lambda_2)$
is the average (in frequency) of wavelengths $\lambda_1$ and $\lambda_2$.
We thereby have a dependence of the free spectral range on
$1/(n_H - n_L)$, which combined with the transmission (which depends
on $n_L/n_H$) can fix $n_H$ and $n_L$. For these materials, this
gives $L_{eff} = L + 1.633\,\lambda_c/2$. However, for our measurements
with short cavities, $\lambda_1$ and $\lambda_2$ are separated by
~30 nm, so $L_{expt} > L_{eff}$. But we can still use the complete
model to fit to the measured values ($\lambda_1$, $\lambda_2$) and
determine parameters of the coating. Finally, by mapping
out this wavelength dependence of the free-spectral
range to find min($L_{expt}$), we find the center wavelength
of the coating.

In  the  model,  the  refractive  indices  used  are  adjusted 
to  obtain  the  same  pairs  (Ai,  A2)  as  measured.  Then,  the 
layer  thickness  in  the  model  is  adjusted  to  agree  with  the 
measured  coating  center  wavelength.  By  using  the  addi¬ 
tional  information  of  the  measured  mirror  transmission 
T  from  Section  II,  we  can  now  either: 

1.  Derive  independent  values  for  the  refractive  indices 
and  layer  thickness,  or 

2.  Assuming  one  index  is  known,  use  the  refractive 
indices  and  layer  thickness  information  to  give  an  inde¬ 
pendent  value  for  the  mirror  transmission,  which  can  be 
compared  to  the  measurement  of  Section  II. 

That the dispersion (FSR) measurement alone is sufficient
to determine the loss-less part of the mirror properties
represents some useful information for the mirror
coating technician: the index difference $n_H - n_L$ and
the optical thickness of the coating layers can be simply
measured in this way without interference from
absorption/scatter losses. And, if $n_L$ is known, this also gives a
simple way of finding the mirror transmission. Adding in
a direct measurement of mirror transmission yields values
for $n_H$ and $n_L$ separately.

Data obtained from these measurements are shown in
Figure 2, where $L_{expt}$ is plotted as a function of wavelength,
for a 10 μm cavity with 10 cm radius of curvature
mirrors. The circles are measured data, and the curves
the calculation from the model, with parameters chosen
to best fit the data. This data was taken by setting the
cavity to a series of different lengths, and recording a pair
of resonant wavelengths ($\lambda_1$, $\lambda_2$) at each length. The x
axis is center wavelength $\lambda_c = 2\lambda_1\lambda_2/(\lambda_1 + \lambda_2)$, the y
axis the measured cavity length $L_{expt} = \lambda_1\lambda_2/2(\lambda_1 - \lambda_2)$
shown in units of $\lambda_1/2$: for each pair ($\lambda_1$, $\lambda_2$), the
length is such that $L_{expt}/(\lambda_1/2)$ = 24.xx. Dividing
by $\lambda_2/2$ instead would exactly give 23.xx, since by
rearranging the formula for $L_{expt}$ we see that
$L_{expt}/(\lambda_1/2) = L_{expt}/(\lambda_2/2) + 1$. Due to a finite drift in the cavity length,
each measurement of $\lambda$ was made to only 5 digits resolution
(e.g. 852.59 ± 0.01 nm), leading to the uncertainty in
$L_{expt}$ shown. Uncertainty in $\lambda_c$ is ±0.03 nm and cannot
be seen on this scale.
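
As a worked illustration of these definitions, the sketch below evaluates $L_{expt}$, $\lambda_c$, and the two mode-order values for one pair of adjacent resonances. The pair (818.651 nm, 853.255 nm) is taken from the experimental resonance list given later in this section; whether it belongs to the same data set as Figure 2 is not stated, so treat it only as an example. The helper names are hypothetical, and the absolute value in the denominator is an added convenience so that the length is positive for either ordering of the pair.

```python
def l_expt(lam_a, lam_b):
    """Length inferred from two adjacent resonances (same units as the inputs)."""
    return lam_a * lam_b / (2 * abs(lam_a - lam_b))

def center_wavelength(lam_a, lam_b):
    """Average of the two wavelengths taken in frequency."""
    return 2 * lam_a * lam_b / (lam_a + lam_b)

lam_short, lam_long = 818.651, 853.255  # nm, adjacent measured resonances

L = l_expt(lam_short, lam_long)
orders_short = L / (lam_short / 2)  # length in units of the shorter half-wavelength
orders_long = L / (lam_long / 2)    # length in units of the longer half-wavelength

print(L, center_wavelength(lam_short, lam_long))
print(orders_short, orders_long)
```

The two order counts differ by exactly one, which is the rearrangement noted in the text; for this pair they fall in the 24.xx and 23.xx ranges.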

Two theory curves are shown. The solid curve shows a
model with $n_L$ assumed to be fixed at its nominal value
of $n_L$ = 1.455. To best fit the data, $n_H$ was increased
to $n_H$ = 2.0676 (an increase of 1.3%). In addition, the center
wavelength was shifted to 847 nm (by reducing the
thickness of each $\lambda/4$ layer by 0.6%). Discussions with
REO confirmed that 1.3% is a known offset in $n_H$ for
the particular coating machine which produced this run,
and also that a few nm uncertainty in the center wavelength
is typical. With these parameters, the inferred
mirror transmission is $T_{inf}$ = 4.6 ± 0.2 ppm, agreeing well
with the measured value $T_{exp}$ = 4.3 ppm from Section
II. The dotted curve (which overlaps the solid curve)
shows the model when both $n_L$ and $n_H$ are allowed to
vary. Their values are chosen to match both the FSR
measurement shown, and to give a mirror transmission
to match exactly the experimentally determined value
$T_{exp}$ = 4.3 ppm. Parameters which satisfy these criteria
are $n_H$ = 2.0564 (0.75% increase) and $n_L$ = 1.4440 (0.76%
decrease). Our direct measurement of $T$ in Section II had
a large uncertainty, which limits the absolute determination
of $n_H$ and $n_L$ to about this 1% level. However, a
more precise measurement could in principle determine
the indices at the 0.1% level. One application might be to
measure $T$ and the FSR as a function of position across
a mirror substrate, thereby mapping out stress induced
variations in the refractive indices at the 0.1% level with
a spatial resolution of ~10 μm.

In this data set the correction for the Gaussian phase
difference between the actual resonator mode and the
plane-wave of the model has been neglected. After the
propagation distance from the mode waist to the mirror
surfaces, a Gaussian beam will have acquired less phase
than a plane wave traveling the same distance. For a 10
μm cavity with 10 cm radius of curvature mirrors, this
gives a 2% correction, corresponding to a shift in $L_{expt}$
by ~0.0045 cavity orders (that is, $\Delta L \approx (\lambda/2) \times 0.0045$).
Lowering the refractive index contrast of the model to
shift the calculated curve by this amount would increase
the inferred mirror transmission by < 0.1 ppm. For our
second cavity (44 μm, 20 cm radius of curvature mirrors),
the correction is 0.0066 cavity orders.
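
The size of this Gaussian-phase correction can be estimated with the textbook Gouy-phase result for a symmetric two-mirror resonator, in which the fundamental mode is shifted by $\arccos(1 - L/R)/\pi$ mode orders. That formula is not given in the text; it is used here as an assumption, and the function name is hypothetical.

```python
import math

def gouy_shift_in_orders(length, radius):
    """Mode-order shift of the fundamental mode for two identical mirrors.

    Uses the standard symmetric-resonator expression arccos(g)/pi,
    with g = 1 - length/radius (length and radius in the same units).
    """
    g = 1.0 - length / radius
    return math.acos(g) / math.pi

print(gouy_shift_in_orders(10e-6, 0.10))  # 10 um cavity, 10 cm mirrors
print(gouy_shift_in_orders(44e-6, 0.20))  # 44 um cavity, 20 cm mirrors
```

Under this assumption both cavities come out within about $10^{-4}$ orders of the corrections quoted above.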

The mirror phase shift (FSR measurement) is only
sensitive to the transmission (index contrast) and center
wavelength (layer thickness). Therefore, if
absorption/scatter losses are added to the model (by introducing
an imaginary component to the refractive index)
the cavity resonance wavelengths do not change. More
precisely, adding a scattering loss at the mirror surfaces
has exactly zero effect on the FSR and mirror transmission.
Adding losses within the coatings has a small effect:
increasing the mirror absorption from 0.5 ppm to 2 ppm
(an experimentally reasonable range) changes the mirror
transmission by a factor of ~$10^{-5}\,T$, clearly negligible,
and again there is no effect on the FSR measurement. As
a result, this measurement (with $n_L$ assumed fixed) provided
a very simple and sensitive inference of the mirror
transmission of $T_{inf}$ = 4.6 ± 0.2 ppm, which is unaffected
by absorption/scatter losses.

The same measurement and fitting procedure was used
on another cavity with mirrors from the same coating
run. This 44 μm cavity made from 20 cm radius of curvature
mirrors gave a transmission of $T_{inf}$ = 4.5 ± 0.2 ppm,
with a center wavelength of 848 nm. (This was the cavity
used for the direct measurements of Section II which
gave $T$ = 4.3 ppm.)

One other factor which has been ignored so far is the effect
of fluctuations in the $\lambda/4$ layer thickness. Discussions
with REO suggested that a 1% variation in thickness was
reasonable, so a Gaussian-distributed variation (of standard
deviation 1%) was added to the layer thicknesses
of the model. For cavity calculations, identical mirrors
were used for both sides of the cavity. The principal effect
of this variation is to shift the center wavelength of
the coating: over several realizations of random coatings,
this resulted in an rms shift of the center wavelength by
±1.2 nm. So, the measured shift of center wavelength
in the coating (from 852 nm to 847 nm) is probably due
partly to a systematic offset, and partly to fluctuations.
The mirror transmission is also affected: the value of the
transmission is on average increased slightly, by 0.6% in
the case studied, from 4.55 ppm to 4.58 ppm at the center
of the coating. At the level of our current measurements
this is another negligible effect, but with a more precise
measurement aimed at determining $n_H$ and $n_L$, the possibility
of a systematic offset from this mechanism should
be considered. Lastly, the FSR measurement is mostly
affected via the change in center wavelength of the coating:
the value of min($L_{expt}$(simulated)) has a mean the
same as without the added fluctuations, and varies by
only 0.0014 mode orders rms, again negligible for our
purposes.

FIG. 2 (plot title: Measured and Calculated Cavity Length vs Wavelength):
The cavity length $L_{expt}$ measured from the Free-Spectral
Range (FSR) varies about the design wavelength of
the coating. Fitting a model to these data points gives a
measure of mirror transmission (from fitting of the difference
$n_H - n_L$) and center wavelength (from fitting layer thickness).

Another  useful  result  of  these  calculations  is  that  the 
free  spectral  range  of  the  cavity  is  well  known,  so  that 
resonant  wavelengths  of  the  cavity  can  be  accurately  pre¬ 
dicted.  This  is  important  for  choosing  a  diode  laser  of 
correct center wavelength to match the mode, for applications
such as cavity locking or dipole-force traps.
With the idea of using a laser of >920 nm wavelength
to form an intracavity dipole-force trap [15], this knowledge
was particularly important: our Ti:Sapphire laser
tuned only as high as 890 nm so cavity resonances in this
wavelength range could not be measured, only predicted.
With the parameters chosen above for the model, the
following theoretical and experimental resonance wavelengths
resulted:

theory (nm):      787.208   818.659   853.255   890.798   930.683

experiment (nm):  787.170   818.651   853.255   890.800   N/A

The  experimental  value  for  the  cavity  resonance  can 
then  confidently  be  predicted  to  be  930.7±0.05nm,  and 
a  diode  laser  chosen  accordingly. 


V.  LIMITATIONS  TO  MODE  VOLUME 

In a similar calculation to the one described above, it is
possible to calculate the field distribution of light inside
the resonant cavity, by describing each layer separately
with a left and right travelling plane wave, then matching
electromagnetic boundary conditions between layers. An
example of this kind of calculation is shown in Figure 3,
where refractive index and field distribution (modulus of
the electric field) are plotted as a function of distance for
a cavity with length $L_{eff} = 3\lambda/2$. The coupling strength
$g_0$ of an atom placed in the center of the cavity mode is
proportional to $1/\sqrt{V_m}$, where the cavity mode volume $V_m$
is found by integrating the field ($D \cdot E$) over the standing
wave and Gaussian transverse mode profile. Large coupling
is achieved by making a short cavity with a small
mode waist (short radius of curvature mirrors).

For a cavity of physical length $L$, the “leakage” of the
mode into the $\lambda/4$ mirror stack (see the tails of the
mode in Fig. 3) that increases $L$ to $L_{eff}$ also increases
the cavity mode volume. For our materials at 852 nm,
$L_{eff} = L + 1.633\,\lambda/2$, so for a cavity with physical
distance between mirror surfaces $L = \lambda/2$, the cavity
mode volume ends up being 2.63 times larger than might
otherwise have been expected, and hence the atom-cavity
coupling $g_0$ is reduced to 0.6 times the naive estimate
based on the physical separation of the mirror surfaces.
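
The two numbers quoted here follow from simple arithmetic, sketched below under the assumption that the transverse mode profile is unchanged, so that the mode volume scales only with the effective length. The labels "real" and "ideal" are introduced for illustration.

```latex
\frac{V_{\mathrm{real}}}{V_{\mathrm{ideal}}}
  \approx \frac{L_{\mathrm{eff}}}{L}
  = \frac{\lambda/2 + 1.633\,\lambda/2}{\lambda/2}
  = 2.633,
\qquad
\frac{g_0^{\mathrm{real}}}{g_0^{\mathrm{ideal}}}
  = \sqrt{\frac{V_{\mathrm{ideal}}}{V_{\mathrm{real}}}}
  = \frac{1}{\sqrt{2.633}}
  \approx 0.62 .
```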

This  effect  is  proportionately  larger  as  the  cavity 
length  gets  shorter.  In  Figure  4,  the  expected  go  is  plot¬ 
ted  for  a  cavity  formed  with  two  20cm  radius  of  curvature 
mirrors,  as  a  function  of  the  physical  distance  L  between 
the  mirrors.  The  two  curves  show  a  real  mirror  (with  go 
reduced  by  leakage  into  the  coatings)  and  an  idealized 
mirror with no leakage (perfect reflectors at $\pm L/2$). The
transverse  (Gaussian  waist)  dimension  is  calculated  by 
simple  Gaussian  beam  propagation,  which  is  not  strictly 
accurate  for  length  scales  less  than  a  few  microns;  how¬ 
ever  any  error  in  this  should  be  roughly  the  same  for 
both  the  ideal  and  actual  mirror  cases,  so  the  ratio  of 
these  should  remain  sensibly  correct.  The  cavity  is  as¬ 
sumed  resonant  at  an  integer  number  of  half- wavelengths 
of  light  at  the  852nm  Cs  D2  transition;  that  is,  each  A/2 
is  a  distance  of  0.426  microns. 

The discrepancy between the expected and achieved
coupling $g$ is large even for our longer cavities: 5% for
a 10 μm cavity. However, in the lab this is largely compensated
by the fact that we never measure the actual
physical distance $L$ between mirror surfaces, but instead
$L_{expt} = \lambda_1\lambda_2/2(\lambda_1 - \lambda_2)$, which is close to $L_{eff}$, and so
incorporates the same offset of mirror penetration that
determines $g_0$. This method of length measurement breaks
down eventually due to the dispersion of the mirror coatings:
eventually if $\lambda_1$ is at the center of the coating, $\lambda_2$
will be so far separated in wavelength that it reaches the
edges of the mirror coating stopband, and the observed
round-trip phase has then more to do with the structure
of the dielectric coatings than it does with the vacuum
gap between the surfaces of the cavity mirrors. That is
to say, our measured $L_{expt}$ becomes increasingly different
from $L_{eff}$ and introduces an offset in estimating the
mode volume as the cavity length approaches the scale
of the wavelength.


FIG. 3 (plot title: Refractive Index, $3\lambda/2$ Optical Cavity):
Mirror refractive index stack design, and resulting
electric field distribution for a resonant $3\lambda/2$ cavity made from
dielectric mirrors.

At $L = 20\lambda/2$ physical length (the regime of our
present cavities) the difference between the coupling coefficient
$g_0$ inferred from $L_{expt}$ and that found by integrating
$D \cdot E$ over the mode volume is <0.1%. At $L = 10\lambda/2$
(4.26 μm) it would be a 1% error, at $5\lambda/2$ an 8% error.
Note however that knowledge of these offsets means that
when calculating $g_0$ from $L_{expt}$ we can compensate for
this effect. Measurements of $L_{expt}$ for cavities any shorter
than $5\lambda/2$ would be impossible since $\lambda_2$ has reached the
edge of the mirror stopband. To align shorter cavities
a new method for length measurement will need to be
developed, such as measuring the frequency spacing of
transverse modes.

We are now in a position to estimate parameters for the
best Fabry-Perot cavity that will be experimentally feasible
in the near future using this type of mirrors. First
consider a $L = \lambda/2$ cavity with 20 cm radius of curvature
mirrors. If the mirror transmission and losses were
each reduced to $T = l$ = 0.5 ppm to yield a cavity finesse
of $\mathcal{F} = 3.14 \times 10^6$, then this cavity has parameters
$(g_0, \kappa, \gamma_\perp)/2\pi$ = (647, 56, 2.6) MHz, which gives critical
photon number $n_0 = \gamma_\perp^2/2g_0^2 = 8.1 \times 10^{-6}$ and critical
atom number $N_0 = 2\kappa\gamma_\perp/g_0^2 = 7.0 \times 10^{-4}$. To make
a cavity of this length the 20 cm mirrors would have to
be reduced to a diameter of 0.5 mm rather than 1 mm.
At this size there would still be a 0.11 μm gap between
the mirror edges for the $L = \lambda/2$ (0.426 micron) cavity
length, which should make it possible to still get atoms
into and out of the cavity (as in Ref. [3, 4, 5, 6, 7, 8]),
and to align the mirrors.
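
The edge-gap figure can be reproduced with a paraxial sagitta estimate: each spherical mirror surface protrudes toward the other by roughly $r^2/2R$ at a rim of radius $r$. This geometric approximation and the function name are assumptions made for illustration, not something stated in the text.

```python
def edge_gap(length, diameter, radius):
    """Gap between the rims of two identical spherical mirrors.

    length   : on-axis separation of the mirror surfaces
    diameter : mirror diameter
    radius   : radius of curvature (all in the same units)
    Uses the paraxial sagitta (diameter/2)**2 / (2*radius) per mirror.
    """
    sagitta = (diameter / 2) ** 2 / (2 * radius)
    return length - 2 * sagitta

half_wave = 0.426e-6  # lambda/2 at 852 nm, in metres

print(edge_gap(half_wave, 0.5e-3, 0.20))  # 0.5 mm diameter, 20 cm mirrors
print(edge_gap(half_wave, 1.0e-3, 0.20))  # 1 mm diameter, 20 cm mirrors
```

With this estimate the 0.5 mm mirrors leave a gap of about 0.11 μm, while the 1 mm mirrors give a negative value, meaning the rims would touch before the on-axis spacing reached $\lambda/2$.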

If the mirror diameter could be reduced to 350 μm
(without adversely affecting the cavity losses), then
10 cm radius of curvature mirrors could be used, with
a 0.12 μm gap at the edges. Due to the tighter radius
of curvature, $g_0/2\pi$ would be increased to 770 MHz in
this case. Now speculating that “dream” mirrors of
$T$ = 0.2 ppm transmission, $l$ = 0.2 ppm loss might be possible
($\mathcal{F} = 7.85 \times 10^6$), we could aim for the ultimate
goal of $(g_0, \kappa, \gamma_\perp)/2\pi$ = (770, 22, 2.6) MHz, in which case
$n_0 = 5.7 \times 10^{-6}$ photons, and $N_0 = 1.9 \times 10^{-4}$ atoms.

FIG. 4 (plot title: Achievable coupling strength with mirror leakage):
Coupling coefficient $g_0$ (expressed in cycles per second)
versus the physical separation $L$ of the surfaces of two
mirrors forming a Fabry-Perot resonator. Due to penetration
of the standing-wave mode into the mirror coatings, the cavity
mode volume achieved with real mirrors is larger (and hence
the coupling strength smaller) than for an ideal mirror with
the same spacing between mirror surfaces but no penetration.
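
The figures of merit quoted for both mirror scenarios can be checked with a few lines. The critical photon and atom numbers use the expressions given in the text; the finesse relation $\mathcal{F} = \pi/(T + l)$ is the usual high-reflectivity approximation and is assumed here rather than taken from the text. Function names are hypothetical.

```python
import math

def finesse(T, l):
    """High-finesse approximation for two identical mirrors: pi / (T + l)."""
    return math.pi / (T + l)

def critical_photon_number(g0, gamma_perp):
    """n0 = gamma_perp^2 / (2 g0^2); rates in the same units."""
    return gamma_perp ** 2 / (2 * g0 ** 2)

def critical_atom_number(g0, kappa, gamma_perp):
    """N0 = 2 kappa gamma_perp / g0^2; rates in the same units."""
    return 2 * kappa * gamma_perp / g0 ** 2

# (g0, kappa, gamma_perp)/2pi in MHz for the two cases discussed in the text
for label, T, l, g0, kappa, gamma in [("0.5 ppm mirrors", 0.5e-6, 0.5e-6, 647, 56, 2.6),
                                      ("dream mirrors", 0.2e-6, 0.2e-6, 770, 22, 2.6)]:
    print(label, finesse(T, l),
          critical_photon_number(g0, gamma),
          critical_atom_number(g0, kappa, gamma))
```

Only ratios of rates enter $n_0$ and $N_0$, so the common factor of $2\pi$ drops out and the MHz values can be used directly.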


In  conclusion,  we  have  presented  two  measurement  ap¬ 
proaches,  one  based  upon  direct  loss  and  the  other  on 
cavity  dispersion,  that  produce  the  same  quantitative 
determination  of  the  mirror  coating  properties.  The  dis¬ 
persion  measurement  is  more  informative,  as  it  has  the 
potential  to  determine  the  complete  characteristics  of  a 
mirror.  A  model  has  been  derived  to  link  the  mirror 
properties  to  the  physical  parameters  of  coating  layers. 
Issues  relevant  to  optical  cavity  QED,  such  as  the  cavity 
field  mode  structure,  have  been  discussed. 
I. INTRODUCTION

A  variety  of  passive  and  active  optical  devices  can  be 
constructed  by  introducing  point  or  line  defects  into  a  peri¬ 
odic  array  of  holes  perforating  an  optically  thin  semiconduc¬ 
tor  slab.  In  such  structures,  light  is  confined  within  the  defect 
regions  by  the  combined  action  of  distributed  Bragg  reflec¬ 
tion  and  internal  reflection.  This  technique  has  been  em¬ 
ployed  in  making  a  microcavity  semiconductor  (In-Ga-As-P) 
laser [1] (emitting at $\lambda$ = 1.55 μm), and for demonstrating
Si  optical  waveguides  with  sharp  bends  [2].  One  can  thus 
easily  envision  the  fabrication  of  integrated  optical  “net¬ 
works”  on  a  single  chip,  with  numerous  microcavity-based 
active  devices  linked  by  passive  waveguide  interconnects. 

The  combination  of  high  quality  factor  and  extremely  low 
mode  volume  that  should  be  obtainable  in  point-defect  mi¬ 
crocavities  makes  the  photonic  crystal  (PC)  paradigm  espe¬ 
cially  attractive  for  experiments  in  cavity  quantum  electro¬ 
dynamics  (cavity  QED)  [3],  with  potential  applications  in 
quantum  information  technology.  The  primary  focus  of  this 
paper  will  be  to  discuss  the  optimization  of  microcavity  de¬ 
signs  for  cavity  QED  with  strong  coupling  between  defect 
modes and gas-phase neutral atoms. This new paradigm for
cavity  QED  poses  formidable  technical  challenges  with  re¬ 
gard  to  atom  trapping  and  the  characterization  of  surface 
effects,  so  we  include  some  discussion  of  these  issues  and  of 
our  current  approaches  to  addressing  them. 

In  Sec.  II  of  this  paper  we  begin  by  analyzing  elementary 
microcavities  formed  by  changing  either  the  refractive  index 
or  radius  of  a  single  defect  hole.  We  discuss  in  detail  some 
problems  in  previous  finite-difference  time-domain  (FDTD) 
calculations  done  by  our  group  [4],  as  intuitions  gained  from 
the resolution of these problems are important in the microcavity
design optimization for cavity QED. In Sec. III we
present our designs of microcavities optimized for strong
coupling between the cavity field and an atom trapped within
a hole of the PC. Based on numerical analysis using the
three-dimensional (3D) finite-difference time-domain
(FDTD) method, we predict that quality factors over
$3 \times 10^4$ can be achieved in these structures. We also discuss
in this section the difficulty of estimating surface interactions
between a trapped atom and the semiconductor substrate, and
propose a technical strategy for confining gas-phase atoms
within a defect microcavity. Finally, in Sec. IV we describe a
procedure that we have developed for microcavity fabrication.


II.  SINGLE  DEFECT  HOLE 

The  microcavities  analyzed  in  this  paper  are  formed  by 
introducing  point  defects  into  a  dielectric  slab  patterned  with 
a  hexagonal  array  of  air  holes.  The  thickness  of  the  slab  is  d 
and  its  refractive  index  is  equal  to  3.4.  The  spacing  between 
holes  is  denoted  by  a  and  the  hole  radius  by  r,  as  shown  in 
Fig. 1. We use $\lambda$ to specify the optical wavelength in air. In
our calculations of microcavity quality factors, the boundary
for separation of vertical from lateral loss (i.e., the vertical
quality factor $Q_\perp$ from the lateral quality factor $Q_\parallel$) was
positioned approximately at $\lambda/2$ from the surface of the
membrane. As the number of PC layers around a defect is
increased, $Q_\parallel$ increases and the total quality factor $Q$
approaches $Q_\perp$. We will adopt a coordinate convention in
which x = 0, y = 0, z = 0 denotes the center of the cavity and
z = 0 is the middle plane of the slab.

The  simplest  method  of  forming  a  microcavity  within  the 
structure  shown  in  Fig.  1  is  to  change  the  radius  or  index  of 
refraction  of  a  single  hole.  The  former  method  is  more  inter¬ 
esting  from  the  perspective  of  fabrication,  since  the  litho¬ 
graphic  tuning  of  geometric  parameters  of  individual  holes  is 
a  simple  process,  but  in  this  section  we  will  consider  both 
methods.  By  increasing  the  radius  of  a  single  hole  an  accep¬ 
tor  defect  state  is  excited,  i.e.,  pulled  into  the  band  gap  from 
the  dielectric  band.  On  the  other  hand,  by  decreasing  the 
radius  of  an  individual  hole  (or  by  tuning  its  refractive  index 
between  1  and  the  refractive  index  of  the  slab)  a  donor  defect 
state is pulled into the band gap from the air band [5].
Acceptors tend to concentrate their electric field energy density
in regions where the larger (semiconductor) refractive index
was located in the unperturbed PC, while the electric field
energy density of donors is concentrated in regions where
there was air in the unperturbed PC. Since the electric field
intensity in air regions is small in the case of acceptor defects,
these are not good candidates for strong coupling with
a single gas-phase atom that would be trapped within a hole.
In this paper we will thus focus on donor states. For a discussion
of acceptor states excited in an optically thin slab
perforated with a hexagonal PC array, readers are referred to
Ref. [6].

FIG. 1. Optically thin slab patterned with a hexagonal array of
air holes.

FIG. 2. Electric field intensity pattern of the x-dipole mode.
Parameters of the structure are r/a = 0.3, d/a = 0.6, and in the used
discretization a = 15. The refractive index of the slab is 3.4 and of
the defect is 2.4. The plotted intensity patterns are for the x-y plane
at the middle of the slab.

A.  Changing  the  refractive  index  of  a  single  hole 

Microcavity  formation  by  alteration  of  the  refractive  in¬ 
dex  of  a  single  defect  hole  in  a  hexagonal  PC  has  been  ana¬ 
lyzed  previously  by  our  group  [4].  In  that  analysis  we  pre¬ 
dicted  that  dipole-like  donor  states  (such  as  the  x-dipole 
mode shown in Fig. 2) with quality factors up to $3 \times 10^4$
should  exist.  We  now  believe  that  the  quality  factors  of  such 
microcavities  are  limited  to  several  thousand,  for  reasons  dis¬ 
cussed  below.  This  discussion  reveals  the  extreme  sensitivity 
of  microcavity  quality  factors  to  small  distortions  to  the  local 
PC  geometry,  which  will  later  be  used  as  a  powerful  design 
tool  in  optimization  for  cavity  QED. 

FIG. 3. Deformation of the analyzed structure introduced by the
application of mirror boundary conditions along the x and y axes
and through the center of the defect. The applied mirror boundary
conditions select the x-dipole mode, whose electric field pattern is
shown in Fig. 2. Holes on the x axis (in the central row, containing
the defect) are elongated by one point in the y direction. The
refractive index of the defect is n_defect = 2.4. Parameters of the
structure are r/a = 0.3, d/a = 0.6, and in the used discretization
a = 15.

In our previous work, mirror boundary conditions were applied in
the x, y, and z directions to achieve an eightfold reduction in the
computational grid size. We have since realized that the manner in
which even (symmetric) mirror boundary conditions are implemented in
our finite-difference time-domain (FDTD) code yields numerical output
that, strictly speaking, corresponds to a structure slightly deformed
relative to the intended one. For example, the set of mirror boundary
conditions used to select the x-dipole mode in a defect cavity leads
to a deformation of the structure as shown in Fig. 3. Holes on the x
axis are elongated in the y direction by one point, but the distance
between holes in the x and y directions is preserved. Because
hole-to-hole distances are preserved under this deformation, the
half-spaces y > 1/2 and y < -1/2 actually maintain the unperturbed PC
geometry when holes in the central row are elongated by 1/2 point in
both the ±y directions. The symmetry of the PC surrounding the defect
is therefore broken, and this contributes to artificially high quality
factors for x-dipole modes. An even mirror boundary condition (BC)
was also applied in the z direction in our previous analysis, causing
a slight error in the thickness of the slab. The correct d/a ratios of
the structures analyzed in Ref. [4] would be 0.6, 0.46, and 1, instead
of the values 0.53, 0.4, and 0.93, as noted there.
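
The corrected and previously quoted d/a values differ by roughly 1/15 in each case, which is what a one-grid-point error in slab thickness would give at a = 15 points. The short check below illustrates that arithmetic. The thicknesses in grid points (8, 6, and 14) and the use of a = 15 for all three structures are assumptions inferred from the quoted ratios, not values stated in the text.

```python
# Illustrative check: effect of a one-grid-point thickness error at a = 15 points.
# The slab thicknesses in grid points are inferred from the quoted ratios (assumption).
A_POINTS = 15


def d_over_a(thickness_points, a_points=A_POINTS):
    """Slab thickness normalized to the lattice constant a."""
    return thickness_points / a_points


def corrected_ratio(quoted_points, error_points=1, a_points=A_POINTS):
    """d/a after adding the thickness error (in grid points) to the quoted thickness."""
    return d_over_a(quoted_points + error_points, a_points)


if __name__ == "__main__":
    for pts in (8, 6, 14):
        print(f"quoted d/a = {d_over_a(pts):.3f} -> corrected d/a = {corrected_ratio(pts):.3f}")
```

Under these assumptions the middle structure comes out at 7/15, about 0.467, which the text quotes as 0.46.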

1.  Perfectly  symmetric  cavity 

We reanalyzed the structure with r/a = 0.3, d/a = 0.6,
n_defect = 2.4, n_slab = 3.4, five layers of holes around the defect,
and a = 15 grid points. For this set of parameters, the predicted Q⊥
in Ref. [4] for the x-dipole mode was 3 × 10^4. In the present
analysis, even mirror boundary conditions were applied to the lower
boundary in the z direction only, to reduce the computation size by
one-half and to eliminate TM-like modes. Absorbing boundary conditions
were applied to all boundaries in the x and y directions and to the
upper boundary in the z direction. Under these BC’s the intended
symmetry of the defect structure was achieved. The initial field
distribution was chosen in such a way as to excite x- or y-dipole
modes selectively. For the x-dipole mode we now calculate Q∥ = 2260,
Q⊥ = 1730, and a/λ = 0.3137, and for the y-dipole mode we calculate
Q∥ = 1867, Q⊥ = 1007, and a/λ = 0.3182.

The difference between parameters computed for the x- and y-dipole
modes comes partly from the asymmetry of the structure introduced by
imperfect discretization. In a two-dimensional (2D) PC with infinite
slab thickness, these two modes should be degenerate [4]. In the thin
slab, however, the y-dipole mode suffers more vertical scattering at
the edges of holes and, therefore, has a lower Q⊥. For a nonsymmetric
applied initial field distribution leading to excitation of both x-
and y-dipole modes, we calculated Q∥ = 2070, Q⊥ = 1290, and
a/λ = 0.316. This mode can be represented as a superposition of the x
and y dipoles, with weighting factors depending on the initial field.
Its quality factor and resonant frequency depend on these weighting
factors.

2.  Asymmetric  cavity 

Our explanation for the discrepancy between our current and previous
results leads us to ask whether a “real” (intentional) elongation of
the central row of holes along the y axis may actually improve the
quality factor (Q) of the x-dipole mode. A possible disadvantage of
this approach is the excitation of acceptor states caused by enlarging
holes in the central row. We have therefore analyzed how a/λ and Q of
the x-dipole mode in this structure change as a function of the
elongation parameter p. Holes on the x axis (including the defect) are
elongated in both the ±y directions by p/2 points, in such a way that
hole-to-hole distances are preserved and the half-spaces y > p/2 and
y < -p/2 maintain the unperturbed PC geometry. The structure
parameters are r/a = 0.3, d/a = 0.6, n = 3.4, n_defect = 2.4, and five
layers of holes surround the defect. The periodicity a used in these
calculations is equal to 15 grid points, and the elongation step
Δp = 1 corresponds to a/15. The results are shown in Fig. 4. It is
interesting that the frequency of the mode decreases as p increases,
even though the amount of the low refractive index material increases.
However, the net amount of low refractive index material does not
matter; what does matter is where the low refractive index is
positioned relative to the unperturbed PC. The explanation of the
decrease in frequency is very simple if we recall the x-dipole mode
pattern shown in Fig. 2. This is a donor-type defect mode, which
concentrates its electric field energy density in low refractive index
regions of the unperturbed PC. As p increases, layers of PC holes are
moved away from the defect in the y direction. For example, the nth
layer of holes parallel to the x axis will be positioned at
y = ±na√3/2 ± p/2, instead of y = ±na√3/2. Therefore a large
refractive index material will be positioned at places where the mode
expects to “see” air, leading to a decrease in the mode’s frequency.
By tuning the mode’s frequency across the band gap, we can also tune
its Q factor, as noted previously [6]. For p = 3, Q⊥ reaches the value
of 20 000. The analyzed x-dipole mode mostly resonates in the
direction of the y axis, i.e., in the ΓX direction of the photonic
crystal. The tuning of the elongation parameter p is, therefore,
analogous to tuning of a spacer in the micropost cavity, which leads
to tuning of the mode’s frequency and Q factor. A more detailed
explanation of the effect of elongations on Q factors is given in the
Appendix.
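
The row displacement described above can be sketched numerically. The helper below is only an illustration of the stated formula y = na√3/2 + p/2 for rows on the positive-y side; the function name and the choice of printing three rows are ours, not part of the original analysis.

```python
import math


def row_positions(n_rows, a, p=0.0):
    """Positive-y positions (in grid points) of hole rows n = 1..n_rows
    parallel to the x axis, for a fractional edge dislocation of order p."""
    spacing = a * math.sqrt(3) / 2  # row-to-row distance in the triangular lattice
    return [n * spacing + p / 2 for n in range(1, n_rows + 1)]


if __name__ == "__main__":
    a = 15  # lattice constant in grid points, as in the text
    for p in (0, 1, 3):
        rows = row_positions(3, a, p)
        print(f"p = {p}: " + ", ".join(f"{y:.2f}" for y in rows))
```

Every row on the positive-y side shifts outward by the same p/2 points, so the row-to-row spacing a√3/2 is unchanged; only the gap across the central row grows.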


FIG. 4. (a) Q factors and (b) frequencies of x-dipole modes in
the structure shown in Fig. 3 as a function of the elongation
parameter p. The structure parameters are r/a = 0.3, d/a = 0.6,
n = 3.4, n_defect = 2.4, a = 15, and five layers of holes surround
the defect.

By increasing the number of PC periods around the defect, the total
quality factor Q approaches Q⊥, as shown in Fig. 5. It is important
to note that Q∥ does not increase exponentially with the number of PC
layers around the defect, as noted previously in Ref. [4]. Instead, it
saturates at a large number of PC layers. The reason lies in the
choice of a boundary for separation of Q⊥ from Q∥, positioned at
approximately λ/2 from the surface of the membrane. From the radiation
pattern of the x-dipole mode we can see that some portion of the
out-of-plane loss, mostly in the x direction, still gets collected in
Q∥. This loss cannot be suppressed by increasing the number of PC
layers around the defect, and it determines a value at which Q∥
saturates. However, much larger out-of-plane loss is collected in Q⊥,
which ultimately determines the total quality factor Q.

Therefore, the dramatic improvement in Q factors of dipole modes over
single defect microcavities can be obtained by introducing a novel
type of PC lattice defect, consisting of the elongation of holes along
the symmetry axes. We call it fractional edge dislocation, by analogy
with edge dislocations in solid state physics. Edge dislocations are
formed by introducing extra atomic planes into the crystal lattice. On
the other hand, we here insert only fractions of atomic planes along
the symmetry axes of a photonic crystal.

FIG. 5. Q factor for p = 3, in the structure shown in Fig. 3, as a
function of the number of PC periods around the defect. The structure
parameters are r/a = 0.3, d/a = 0.6, n = 3.4, n_defect = 2.4, and
a = 15.

Unfortunately, at this time, we do not know how to control the
refractive index of a single PC hole during the fabrication. For that
reason in the next section we will consider alternative methods of
forming single defect microcavities, which are much easier to
construct by microfabrication.

Even after the elongation of holes on the x axis by 1 point, Q
factors of 30 000 were not obtained. This means that the application
of mirror BC’s at the x = 0 and y = 0 planes causes additional effects
that lead to the overestimation of quality factors. One of the reasons
may be that the excited dipole mode does not have the symmetry
described by the applied discretized mirror BC’s. This may be partly
due to the structure imperfection caused by discretization. In order
to avoid problems caused by BC’s, all the calculations in this paper
were done by applying the even mirror symmetry to the z = 0 plane
only, in order to select TE-like modes and to reduce the computation
size by one-half. Absorbing boundary conditions are applied to all
boundaries in the x and y directions and to the upper boundary in the
z direction. To confirm that the application of mirror BC’s at the
lower z boundary does not change Q, we also analyzed entire structures
with absorbing BC’s applied to all boundaries and obtained the same
results as in the analysis of one-half of the structure.

B.  Reducing  the  radius  of  a  single  hole 

Microcavities analyzed in this section are formed by reducing the
radius of a single hole to r_def. We calculate parameters of excited
dipole modes for a range of microcavity parameters and the results are
shown in Table I. For all tabulated results, five layers of holes
surround the defect and a = 20. Calculated Q’s are not very
impressive, but they do provide us with a good starting point for
further optimization. Why did we decide to use a relatively small r/a
ratio for the PC? According to our calculations, increasing r/a within
the analyzed range leads to increases in the band gap and reduction of
lateral losses. However, vertical scattering at the edges of holes
also increases and Q⊥ drops. It is therefore important to find an
optimum r/a which leads to small vertical losses but preserves good
lateral confinement, in order not to increase the mode volume too
much.

TABLE I. Q factors of dipole modes excited in microcavities
formed by decreasing the radius of a single PC hole.

r/a      r_def/a   d/a     a/λ      Q∥     Q⊥
0.275    0.15      0.75    0.286    778    920
0.275    0.2       0.75    0.297    470    2078
0.25     0.15      0.75    0.277    230    1840
0.25     0.2       0.75    0.284    116    3190
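
To see how the two loss channels in Table I combine, the sketch below evaluates a total Q for each row. It assumes the usual decomposition 1/Q = 1/Q∥ + 1/Q⊥, which this section uses implicitly (Q approaches Q⊥ as Q∥ grows) but does not write out; the function and variable names are ours.

```python
def total_q(q_parallel, q_perp):
    """Total quality factor, assuming in-plane and out-of-plane losses add:
    1/Q = 1/Q_parallel + 1/Q_perp (assumed decomposition)."""
    return 1.0 / (1.0 / q_parallel + 1.0 / q_perp)


# Rows of Table I: (r/a, r_def/a, Q_parallel, Q_perp)
TABLE_I = [
    (0.275, 0.15, 778, 920),
    (0.275, 0.20, 470, 2078),
    (0.25, 0.15, 230, 1840),
    (0.25, 0.20, 116, 3190),
]

if __name__ == "__main__":
    for r, r_def, q_par, q_perp in TABLE_I:
        q = total_q(q_par, q_perp)
        print(f"r/a = {r}, r_def/a = {r_def}: Q = {q:.0f}")
```

With only five layers of holes the total Q is dominated by the smaller Q∥ in most rows, which is why adding PC layers moves Q toward Q⊥.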

The band diagram for TE-like modes of a thin slab (n = 3.4,
d/a = 0.75) surrounded by air on both sides and patterned with a
hexagonal array of air holes (r/a = 0.275) is shown in Fig. 6. These
PC parameters are used in most of the calculations in the next
section. From the comparison of the dipole mode frequencies tabulated
in Table I and the band diagram shown in Fig. 6, it can be confirmed
that as the cavity mode’s frequency approaches the bottom of the air
band, vertical losses decrease, but lateral losses increase, which in
turn leads to an increase in the mode volume. Our goal in the next
section will be to reduce vertical losses and improve Q factors even
further, while preserving small mode volumes. For this purpose, we
will explore fractional edge dislocations. In Fig. 4, one can observe
that an increase in the elongation parameter p can be used to tune the
Q⊥ factor of a mode, but also leads to a decrease in the dipole mode’s
frequency. This implies that by increasing p, the mode is pulled
deeper into the band gap, away from the air band edge, which leads to
its better lateral confinement. Therefore, we can simultaneously
achieve a reduction in vertical losses and an improvement in lateral
confinement (i.e., an increase in both Q⊥ and Q∥, and a reduction in
the mode volume).

III.  CAVITIES  FOR  STRONG  COUPLING 

In this section we consider the design of PC microcavities to
achieve strong coupling between the cavity field and a single
gas-phase atom, that is, an atom located in free space rather than
contained as an impurity in the dielectric slab. Our long-term goal is
to investigate photonic band-gap structures for single-atom cavity
quantum electrodynamics in the strong coupling regime [3]. For this
purpose the microcavity mode quality factor (Q) has to be as large as
possible and the mode volume (V_mode) as small as possible. These two
design rules are also followed when designing PC microcavities for
semiconductor lasers. However, in a cavity for strong coupling, an
atom must be trapped at the point where it interacts most strongly
with the cavity field. Therefore an additional design goal is imposed
in this case: the cavity mode should have the E-field intensity as
high as possible in the air region. When designing a laser cavity, the
problem is the opposite: one tends to maximize the overlap between the
gain region and the cavity field and, therefore, wants to have the
strongest E-field in the semiconductor region.

FIG. 6. Band diagram for TE-like modes of a thin slab (n = 3.4,
d/a = 0.75) surrounded by air on both sides and patterned with a
hexagonal array of air holes (r/a = 0.275).

Mode volume (V_mode), critical atom (N0) and photon (m0) numbers are
defined as follows:

$$V_{\mathrm{mode}} = \frac{\iiint \epsilon(\mathbf{r})\,|E|^2\,dV}{\max[\epsilon(\mathbf{r})\,|E|^2]}, \qquad (1)$$

$$N_0 = \frac{2\kappa\gamma_\perp}{g^2}, \qquad (2)$$

$$m_0 = \frac{1}{2}\left(\frac{\gamma_\perp}{g}\right)^2, \qquad (3)$$

where κ is the cavity field decay rate, proportional to the ratio of
the angular frequency of the mode (ω0) and the mode quality factor
(Q):

[Eq. (4): expression for κ illegible in the source]

γ⊥ is the atomic dipole decay rate (2.6 MHz for cesium) and g is the
coupling parameter at the point where we want to put an atom:

[Eq. (5): expression for g illegible in the source]

g0 denotes the vacuum Rabi frequency:

$$g_0 = \gamma_\perp \sqrt{\frac{V_0}{V_{\mathrm{mode}}}}, \qquad (6)$$

$$V_0 = \frac{c\lambda^2}{8\pi\gamma_\perp}. \qquad (7)$$

Strong coupling is possible if both N0 and m0 are smaller than 1.
Therefore, in order to predict whether the strong coupling can occur,
we must calculate upper limits of N0 and m0 and compare them to 1. In
other words, it is acceptable if calculated critical numbers are
overestimated. As the number of PC layers around the defect increases,
the total quality factor Q approaches Q⊥ and V_mode drops due to the
better lateral confinement. Hence we can calculate N0 and m0 by
assuming Q = Q⊥ and using V_mode calculated for five PC layers around
the defect.

For all calculations in this section, the refractive index of the
slab is n = 3.4, five layers of holes surround the central hole, and
a = 20. The elongation step Δp = 1 corresponds to a/20, i.e., 5% of
the lattice periodicity a. The material and PC properties are chosen
in such a way that cavities operate at λ = 852 nm (the wavelength
corresponding to the D2 atomic transition in 133Cs).

A.  Single  defect  with  fractional  edge  dislocations 

Let us study microcavities formed by reducing the radius of a single
hole and simultaneously applying the fractional edge dislocation (of
order p) along the x axis. We will calculate the dependence of the
x-dipole mode properties on parameter p. Parameters of the unperturbed
PC are r/a = 0.275, d/a = 0.75, a = 20, and the defect hole radius is
r_def/a = 0.2. The critical atom (N0) and photon (m0) numbers are
shown as a function of the elongation parameter p for the x-dipole
mode in Fig. 7. V_mode did not change significantly with p and it was
approximately equal to 0.1(λ/2)^3 for all structures. When p
increases, the frequency of the mode moves away from the band edge,
towards the center of the band gap, Q∥ increases, and Q⊥ peaks at the
value of 1 × 10^4 for p = 2.

FIG. 7. Parameters of the x-dipole mode in the single defect
structure (r/a = 0.275, d/a = 0.75, r_def/a = 0.2, n = 3.4, and
a = 20) as a function of the elongation parameter p: (a) m0 and (b)
N0.

From the electric field intensity pattern of the x-dipole mode shown
in Fig. 8, one can see that the electric field intensity is very
strong within the defect hole. Therefore, an atom trapped there should
interact very strongly with the cavity field. From the calculated
critical atom and photon numbers, it then should be possible to
achieve very strong coupling. At λ = 852 nm the parameters of such a
cavity are r = 70 nm, d = 190 nm, a = 250 nm, and r_def = 50 nm. Due
to extremely small mode volumes in these cavities, strong coupling is
possible even for moderate values of Q, as Q⊥ did not exceed
~1 × 10^4 in the parameter range of Fig. 7. Furthermore, m0 is much
smaller than N0, which means that we can try to improve Q factors
further at the expense of increasing V_mode. Q factors above 1 × 10^4
and similar values of m0 and N0 can also be obtained for the cavity
consisting of a single defect with r_def/a = 0.2 and a fractional edge
dislocation of order p, produced in the photonic crystal whose
parameters are r/a = 0.3, d/a = 0.65, and n = 3.4.

FIG. 8. Electric field intensity pattern of the x-dipole mode
excited in the microcavity formed by reducing the radius of a single
hole and simultaneously elongating holes on the x axis by two points.
(a) Slice through the middle of the membrane, in the x-y plane; (b)
x-z half-plane (z > 0).

B.  Tuning  holes  around  the  defect 

Dipole modes are particularly sensitive to the geometry of holes
closest to the defect. By tuning these holes we can induce frequency
splitting of dipole modes and dramatically influence their Q factors.
In Ref. [4], the variation of two nearest neighbor holes along the x
axis was analyzed. Here we will test the influence of changing four
holes closest to the defect in the ΓJ directions. The analyzed
structure is shown in Fig. 9. The radius of the central hole is
reduced to r2 and the radii of the four closest holes in the ΓJ
directions are reduced to r1. These four holes are simultaneously
moved away from the defect, by r - r1 in the ΓJ directions, which
preserves the distance between them and the next nearest neighbors in
the same directions. This design will improve the Q factor of the
y-dipole mode and spoil the Q of the x-dipole mode. We analyzed
structures with various parameters, but our best result was obtained
for r/a = 0.275, d/a = 0.75, r2/a = 0.2, r1/a = 0.225, and a = 20. The
electric field intensity pattern of the excited y-dipole mode is shown
in Fig. 10, and its calculated parameters are a/λ = 0.289 and
Q⊥ = 4890. From Table I we can see that a dipole mode excited in a
single defect microcavity with this r2/a, r/a, and d/a had Q⊥ = 2078.
Therefore the tuning of four holes can lead to a substantial increase
in Q of the y-dipole mode. The disadvantages of this design include
the excitation of defect modes other than dipoles (coming from
variation of several holes).

FIG. 9. Tuning four holes closest to the defect in the ΓJ
directions. Their radii are reduced to r1 and they are simultaneously
moved away from the defect in the ΓJ directions by r - r1. The radius
of the central hole is r2.

Let us try to improve Q of this cavity even further by also
employing the idea of elongation of holes along desired directions.
Our mode of choice is the y dipole and we will elongate holes sitting
on the y axis by p points in the x direction in such a way that the
half-spaces x > p/2 and x < -p/2 retain the unperturbed PC geometry.
The dependence of N0 (which decreases with Q) on parameter p is shown
in Fig. 11. V_mode did not change significantly with p and was in the
range between 0.09(λ/2)^3 and 0.12(λ/2)^3. The calculated m0 was
around 5 × 10^-8 for all structures. Again, a very strong coupling is
achievable by this design, and we note that Q⊥ at the point p = 2
achieves a value of ~3.3 × 10^4.

FIG. 10. Electric field intensity pattern of the y-dipole mode
excited in the cavity where four holes closest to the defect in the
ΓJ directions are tuned. Their radii are reduced to r1/a = 0.225 and
they are simultaneously moved away from the defect in the ΓJ
directions by r - r1. The radius of the central hole is r2/a = 0.2. PC
parameters are r/a = 0.275, d/a = 0.75, a = 20, and n = 3.4.

FIG. 11. N0 of the y-dipole mode in the structure where four holes
in the ΓJ directions are tuned (as shown in Figs. 9 and 10) as a
function of the elongation parameter p. Holes on the y axis are
elongated by p points in the x direction in such a way that the
half-spaces x > p/2 and x < -p/2 retain the unperturbed PC geometry.

C.  Atomic  physics 

Given  that  microcavities  with  strong  coupling  parameters 
can  be  designed  and  fabricated,  two  further  technical  issues 
must be addressed in order to establish the feasibility of cavity
QED with neutral atoms in PC’s. First, we must identify
a  method  for  stably  trapping  an  individual  atom  within  a  hole 
of  the  PC.  Second,  accurate  estimates  or  measurements  must 
be  made  of  the  surface  interaction  between  such  a  trapped 
atom  and  the  semiconductor  substrate.  Although  definitive 
solutions  to  these  challenges  are  the  subject  of  future  work, 
we  include  in  this  section  a  brief  discussion  of  each  topic. 

In addition to the creation of photonic crystals, modern
microfabrication techniques enable the patterning of either conductive
wires or ferromagnetic materials at the micron scale and below. As a
result, it should be possible to construct magnetic microtraps with
field curvatures ~10^8 G/cm^2 [7-10]. With this magnitude of field
curvature an Ioffe trap could hold a Cs atom with Lamb-Dicke parameter
η ~ 0.035 in the radial direction, yielding Δx ~ 10 nm in the ground
state of the trapping potential and enabling resolved-sideband laser
cooling as a means of putting single atoms in the ground state.

We are currently investigating fabrication and laser cooling
techniques for an atom-trapping scheme in which micron-scale wires
would be deposited on the surface of the PC semiconductor substrate,
such that the circular wire patterns of an Ioffe microtrap are
arranged concentrically around a defect microcavity. The trap designs
discussed in Ref. [7] have a sufficiently large inner diameter not to
disturb the photonic band-gap structure of the defect cavity. Such a
wire arrangement would project a magnetic field with a stable minimum
at the geometric center of the microcavity, such that one or more
atoms could be confined within the defect hole and would therefore
experience strong coupling. A similar geometry could be envisioned for
microtraps based on permanent magnets rather than current-carrying
wires, which would have significant advantages in terms of heat load
to the PC substrate.

A  neutral  atom  trapped  within  a  hole  of  a  PC  structure 
will  experience  surface  (van  der  Waals)  interactions  that  are 
quite difficult to estimate. The significance of such interactions
is twofold. The sensitive dependence of the ground
state energy shift on an atom’s position will lead to mechanical
forces that must be compensated by the trap design. Differential
shifting of the atomic ground and excited states on
the  cavity  QED  transition  must  also  be  accounted  for,  as  this 
will  introduce  a  position-dependent  detuning  relative  to  the 
fixed  microcavity  resonance.  Although  van  der  Waals  shifts 
can  be  computed  for  alkali  atoms  near  dielectric  or  metallic 
boundaries  with  simple  symmetry  [11,12],  the  case  of  an 
atom  in  a  PC  hole  is  far  more  complex.  The  local  geometry 
seen  by  a  trapped  atom  will  be  that  of  a  cylindrical  hole  with 
finite  extent,  and  a  proper  calculation  must  take  into  account 
the overall modification of vacuum modes due to the extended
photonic crystal. Furthermore, the resonant frequencies
of many atomic transitions that connect to low-lying
states,  and  therefore  contribute  strongly  to  their  van  der 
Waals  shifts,  are  above  the  band  gap  of  the  semiconductor 
substrate.  Such  transitions  will  “see”  an  absorptive  surface 
while  those  below  the  band  gap  will  see  a  dielectric  surface. 
This set of factors brings the complexity of the desired calculation
well beyond that of existing analytic results in the
literature.  It  should  be  noted  that  experimental  measurements 
of  the  van  der  Waals  shifts  in  our  proposed  system  would  be 
of  significant  interest  for  the  general  subject  of  quantum 
electrodynamics  of  semiconductors. 

We  are  pursuing  a  numerical  strategy  for  estimating  the 
magnitude of surface interactions. In a linear response approximation,
it should be possible to compute leading-order
contributions to the van der Waals shifts from FDTD simulations
of the electromagnetic field created by an oscillating
dipole  source  in  the  photonic  crystal  structure.  While  the 
nature  of  the  code  does  not  allow  us  to  compute  directly  the 
backaction  of  the  scattered  field  on  the  source  dipole,  we 
believe  that  elementary  field  theory  can  be  used  to  relate  the 
simulated  field  to  van  der  Waals  shifts.  Our  findings  will  be 
reported  in  a  forthcoming  publication. 

D.  Coupled  dipole  defect  modes 

The significance of surface effects that could perturb atomic
radiative structure within the small defect hole is still unknown. For
that reason we will investigate ways of increasing the radius of the
hole where the coupling between the atom and the cavity field should
occur. Let us now analyze the cavity design where the strong E-field
intensity can be achieved in the center of an unperturbed hole. The
idea is to use coupling of two dipole defect states.

FIG. 12. Electric field intensity patterns of the coupled dipole
modes: (a) constructively and (b) destructively coupled defect
states.

Resonant modes of the microcavity formed by coupling two single
defects are presented in Fig. 12. Based on the resultant electric
field intensity in the central, unperturbed hole, we call them
constructively or destructively coupled defect states. They have
different frequencies as well as Q factors. We will analyze properties
of the constructively coupled state, since the central, unperturbed
hole would be a good place for an atom.

FIG. 13. Electric field intensity patterns of the constructively
coupled dipole modes in the structure with the following parameters:
r_def/a = 0.2, d/a = 0.75, r/a = 0.275, n = 3.4, and a = 20. Holes in
columns containing defects are elongated by two points in the x
direction.

We analyzed a series of structures with different parameters. The
best results were obtained for two coupled defects with r_def/a = 0.2
in a PC with the following parameters: d/a = 0.75, r/a = 0.275,
n = 3.4, and a = 20. Holes in the ΓX direction, in columns containing
defects, are elongated by two points in the x direction. The mode
pattern of the constructively coupled defect state is shown in
Fig. 13. Parameters of the mode are a/λ = 0.29, Q⊥ = 6100,
V_mode = 0.19(λ/2)^3, m0 = 1.5 × 10^-7, and N0 = 0.0135. An atom can
now be trapped in the central hole of an unperturbed radius. For
λ = 852 nm, this radius is r = 68 nm, which is a significant
improvement over the previous design, where an atom must be trapped
within a 50 nm radius hole. Again, a strong coupling is achievable in
this cavity.
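
The physical dimensions quoted above follow from scaling the normalized parameters by the lattice constant a = (a/λ)·λ. The sketch below reproduces that conversion for the coupled-defect cavity; the helper names are ours, and the only inputs are values given in the text (λ = 852 nm, a/λ = 0.29, r/a = 0.275, r_def/a = 0.2, d/a = 0.75, and 20 grid points per period).

```python
def physical_dimensions(wavelength_nm, a_over_lambda, ratios):
    """Convert normalized PC parameters (given as multiples of a) to nanometers."""
    a_nm = a_over_lambda * wavelength_nm
    dims = {"a": a_nm}
    for name, ratio in ratios.items():
        dims[name] = ratio * a_nm
    return dims


def elongation_step_nm(a_nm, points_per_period):
    """Physical size of one grid point, i.e., of an elongation step of 1."""
    return a_nm / points_per_period


if __name__ == "__main__":
    dims = physical_dimensions(852.0, 0.29, {"r": 0.275, "r_def": 0.2, "d": 0.75})
    for name, value in dims.items():
        print(f"{name} = {value:.1f} nm")
    print(f"one grid point = {elongation_step_nm(dims['a'], 20):.1f} nm")
```

The unperturbed hole radius comes out close to the 68 nm quoted in the text, and one elongation step corresponds to roughly 12 nm, which gives a sense of the lithographic resolution such a design would need.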

An alternative way of forming the coupled defects state is
represented in Fig. 14. We used the same PC parameters as previously:
r_def/a = 0.2, d/a = 0.75, r/a = 0.275, n = 3.4, and a = 20. Holes in
rows containing defects are elongated by two points in the y
direction. The mode pattern of the constructively coupled defect state
is shown in Fig. 14. Parameters of the mode are a/λ = 0.288,
Q⊥ = 12 120, V_mode = 0.14(λ/2)^3, m0 = 1.4 × 10^-7, and N0 = 0.0063.
Strong coupling is achievable for an atom trapped in either of the two
central holes of the unperturbed radius (positioned between the
defects). For λ = 852 nm this radius is again r = 68 nm.

FIG. 14. Electric field intensity patterns of the constructively
coupled dipole modes in the structure with the following parameters:
r_def/a = 0.2, d/a = 0.75, r/a = 0.275, n = 3.4, and a = 20. Holes in
rows containing defects are elongated by two points in the y
direction.


IV. FABRICATION


FIG. 15. SEM pictures showing the top views of the fabricated
structures.


We have recently developed the fabrication procedure for
making these cavities in Al_xGa_{1-x}As. The material and PC
properties are chosen in such a way that the cavities operate at
λ = 852 nm (the wavelength corresponding to the atomic D2
transition in ^133Cs).
The fabrication process starts with spinning a 100 nm thick layer of
high molecular weight PMMA (polymethylmethacrylate) on top of the
wafer. The PMMA layer is subsequently baked on a hot plate at 150 °C
for 20 min. The desired 2D PC pattern is written on the PMMA by
electron beam lithography in a Hitachi S-4500 electron microscope.
The exposed PMMA is developed in a 3:7 solution of
2-ethoxyethanol:methanol for 30 s. The pattern is then transferred
into the Al_xGa_{1-x}As layer using Cl2-assisted ion beam etching.
After that, the sacrificial AlAs layer is dissolved in hydrofluoric
acid (HF) diluted in water. HF attacks AlAs very selectively over
Al_xGa_{1-x}As for x < 0.4 [13]. Therefore the percentage of Al in
our Al_xGa_{1-x}As layer is around 30%. Finally, the remaining PMMA
may be dissolved in acetone.

Three  scanning  electron  microscopy  (SEM)  pictures 
showing  top  views  of  fabricated  microcavity  structures  are 
shown  in  Fig.  15.  We  are  currently  working  to  measure  the 
passive  optical  properties  of  such  microcavities  in  order  to 
validate  our  theoretical  predictions. 


V.  CONCLUSION 


FIG. 16. Q factor computed using the FDTD method and from
the expression (A1), for the structure analyzed in Sec. II A 2.

In conclusion, we have theoretically demonstrated that PC
cavities  can  be  designed  for  strong  interaction  with  atoms 
trapped  in  one  of  the  PC  holes.  At  present  we  are  working  on 
further  optimization  of  the  design  and  the  characterization  of 
fabricated  structures. 

Critical  issues  for  further  investigation  include  efficient 
coupling  of  light  in  and  out  of  the  PC  microcavity,  as  well  as 
accurate  estimation  of  surface  effects  that  could  perturb 
atomic  radiative  structure  within  the  small  defect  hole.  The 
extremely  small  mode  volume  in  these  structures  also  poses 
an  interesting  theoretical  question  of  how  standard  cavity 
QED  models  must  be  modified  when  the  single-photon  Rabi 
frequency  exceeds  the  atomic  hyperfine  spacing. 
APPENDIX: THE EFFECT OF FRACTIONAL EDGE
DISLOCATIONS

Any wavefront can be considered as a source of secondary
waves that combine to produce distant wavefronts, according
to the Huygens principle. Let us assume that we
know the field distribution across the plane S, positioned in
the near field, above the free-standing membrane and parallel
to the membrane surface. The far fields can be considered as
arising from the equivalent current sheets on this plane.
Therefore, we can calculate the far-field distribution and the
total averaged radiated power into the half-space above the
plane S [14]:

$$P = \frac{\eta}{8\lambda^2} \int_0^{\pi/2}\!\!\int_0^{2\pi} d\theta\, d\phi\, \sin\theta \left( \left| N_\theta + \frac{L_\phi}{\eta} \right|^2 + \left| N_\phi - \frac{L_\theta}{\eta} \right|^2 \right)$$ (A1)


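where N and L represent radiation vectors, whose components
in the rectangular coordinate system are proportional to
Fourier transforms of tangential field components at the
plane S [15]:

$$N_x = -FT_2(H_y)\big|_{\vec{k}_\parallel}$$ (A2)

$$N_y = FT_2(H_x)\big|_{\vec{k}_\parallel}$$ (A3)

$$L_x = FT_2(E_y)\big|_{\vec{k}_\parallel}$$ (A4)

$$L_y = -FT_2(E_x)\big|_{\vec{k}_\parallel}$$ (A5)

$$\vec{k}_\parallel = \frac{2\pi}{\lambda} \sin\theta\, (\hat{x} \cos\phi + \hat{y} \sin\phi)$$ (A6)

$$FT_2(f(x,y)) = \iint dx\, dy\, f(x,y)\, e^{i(k_x x + k_y y)}$$ (A7)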
Therefore, just by knowing the Fourier transforms of the
tangential field components at the plane S, we can evaluate
the time-averaged radiated power P. From the previous
expressions it is clear that the wave vector of interest k_∥ lies
within the light cone for any values of the angles θ and φ in the
circular polar coordinate system (i.e., |k_∥| ≤ 2π/λ). This
implies that the radiated power P depends only on the wave-vector
components located within the light cone. It is also
clear that by suppressing the Fourier components within the
light cone, one can reduce P. Since for the x-dipole mode the
E_y and H_x fields are odd with respect to both the x and y
symmetry axes, N_y and L_x do not contribute significantly to the
integral in expression (A1) (they are both equal to zero at any
point in k space with either k_x or k_y equal to zero). On the
other hand, the E_x and H_y field components are even with respect
to both the x and y axes and their Fourier transforms are generally
nonzero at small wave-vector values. However, by tuning the
elongation factor p, one can balance the energy in the positive and
negative field lobes and minimize the Fourier components of E_x and
H_y within the light cone. This also leads to a decrease in the
radiated power P. We can conclude that the improvement in
the Q factor after the application of fractional edge dislocations
is due to the suppression of the wave-vector components
composing the defect mode that are located within
the light cone. A more detailed explanation of this phenomenon
and how it can be used to improve Q factors of other
types of modes will be presented in our forthcoming publications
[15].

The Q factor of a mode can be expressed as Q = ω(W/P),
where W is the total energy of the mode in the half-space z ≥ 0.
The comparison between the Q factor calculated using the method
presented in this appendix and Q_⊥ previously estimated using the
FDTD, for the structure from Sec. II A 2, is shown in Fig. 16.
The plane S is positioned directly above the surface of the
membrane in this case. It follows from Fig. 5 that the total Q
factor saturates at about 17 000 as the number of PC layers around
the defect increases in the structure with p = 3. This is very
close to the maximum Q value estimated from expression (A1).
Optimization of the Q Factor in Photonic
Crystal Microcavities

Abstract: We express the quality factor of a mode in terms of the
Fourier transforms of its field components and prove that the
reduction in radiation loss can be achieved by suppressing the mode's
wavevector components within the light cone. Although this is
intuitively clear, our analytical proof gives us insight into how to
achieve the Q factor optimization without mode delocalization.
We focus on the dipole defect mode in free-standing membranes
and achieve Q > 10^4, while preserving a mode volume of the
order of one half of the cubic wavelength of light in the material.
The derived expressions and conclusions can be used in the
optimization of the Q factor for any type of defect in planar photonic
crystals.


Index Terms: FDTD methods, Fourier transforms, integrated
optics, optical resonators, optics, optoelectronic devices, Q factor.

Fig. 1. Estimating the radiation field at the observation point O from the
known near field at the surface S.


I.  Introduction 

ONE OF the greatest challenges in photonic crystal
research is the construction of optical microcavities with
small mode volumes and large quality factors, for efficient
localization of light. Besides the standard applications of these
structures (such as lasers or filters), they can potentially be
used for cavity QED experiments, or as building blocks for
quantum networks. Although three-dimensional (3-D) photonic
crystals offer the opportunity to manipulate light in all three
dimensions in space, many research groups have focused their
efforts on planar photonic crystals (i.e., two-dimensional (2-D)
photonic crystals of finite depth) in recent years [1]-[13].
The fabrication procedures of planar photonic crystals are
much simpler than those of their 3-D counterparts, but their
light confinement is only “quasi-3D,” resulting from the
combined action of the 2-D photonic crystal and internal
reflection. The imperfect confinement in the third dimension
produces some unwanted out-of-plane loss (radiation loss),
which is usually a limiting factor in the performance of these
structures. The problem of Q factor optimization in planar
photonic crystal microcavities has been addressed recently by
several groups [2], [14]-[17]. The cavities that we proposed
have the potential to achieve Q > 10^4 together with a mode
volume of the order of one half of the cubic wavelength of light
in the material [2], [16]. We have also recently demonstrated
an experimental Q factor of 2800 in this type of structure,
for which the theoretically predicted Q was around 4000
[18]. In our earlier work [2], we only briefly addressed
the mechanism behind the reduction of radiation loss in our
structures: the suppression of wavevector components of the
defect mode that are positioned within the light cone. In this
paper, we discuss this phenomenon in detail and derive the
analytical expression relating the Q factor of a mode to the
Fourier transform of the mode pattern. We also show how to
suppress the wavevector components within the light cone
without delocalizing the mode. Although our study focuses only
on the dipole mode, the derived relations are universal and the
conclusions can be used in the optimization of the Q factor for
any type of mode and defect in planar photonic crystals.

II.  Relation  Between  the  Q  Factor  and  the 
Fourier  Transform  of  a  Mode 

The 3-D finite difference time-domain (FDTD) analysis can
provide us with the near-field distribution of the analyzed
microcavity. FDTD analysis of the far field would require large
amounts of computer memory and would be computationally
inefficient. However, we can compute the far field starting from
the known near-field distribution. Any wavefront can be considered
as a source of secondary waves that add to produce distant
wavefronts, according to the Huygens principle. Let us assume that
we know the field distribution across the surface S, positioned
in the near field and above the free-standing membrane, as in
Fig. 1. Our goal is to estimate the far field at the observation
point O. The far fields can be considered as arising from the
equivalent current sheets at the surface S [19]. For example, let
S be the plane positioned at z = 0, parallel to the surface of the
membrane, and at a small distance Δz from it. This choice of
surface S will allow us to relate the Q factor of a mode to the
Fourier transform of its field pattern. The equivalent sources in


0018-9197/02$17.00 © 2002 IEEE


VUCKOVIC et al.: OPTIMIZATION OF THE Q FACTOR IN MICROCAVITIES


the plane S can be represented in terms of the surface electric
(J_s) and magnetic (M_s) currents

$$\vec{J}_s = \hat{n} \times \vec{H} = -\hat{x} H_y + \hat{y} H_x$$ (1)

$$\vec{M}_s = -\hat{n} \times \vec{E} = \hat{x} E_y - \hat{y} E_x$$ (2)

where n is a normal to the surface S, and E and H = B/μ_0
are the electric and magnetic fields, respectively. In a homogeneous,
isotropic medium above S, a retarded potential A and
a second retarded potential F can be estimated from the previously
introduced surface currents


$$\vec{A} = \mu_0 \int_S \vec{J}_s\, \frac{e^{-ikr}}{4\pi r}\, dS$$ (3)

$$\vec{F} = \varepsilon_0 \int_S \vec{M}_s\, \frac{e^{-ikr}}{4\pi r}\, dS$$ (4)

where k is defined as k = 2π/λ = ω/c (λ is the mode wavelength
measured in air) and r is the distance between the point
where the potentials are evaluated and the surface element dS
(i.e., between the points O and P).

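From Fig. 1, it follows that r ≈ r_0 − r′ cos(ψ). Let us now
introduce the radiation vectors N and L

$$\vec{N} = \int_S \vec{J}_s\, e^{ikr' \cos\psi}\, dS$$ (5)

$$\vec{L} = \int_S \vec{M}_s\, e^{ikr' \cos\psi}\, dS.$$ (6)

Then we have

$$\vec{A} = \mu_0\, \frac{e^{-ikr_0}}{4\pi r_0}\, \vec{N}$$ (7)

$$\vec{F} = \varepsilon_0\, \frac{e^{-ikr_0}}{4\pi r_0}\, \vec{L}.$$ (8)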

From Fig. 1, we also have

$$k r' \cos\psi = \frac{k}{r_0}\,(x x' + y y')$$ (9)

where (x′, y′, 0) are the coordinates of the point P in the plane
S, and (x, y, z) are the coordinates of the observation point O.

From (5) and (6), it follows that the radiation vectors N and
L represent the 2-D Fourier transforms of the surface currents
J_s and M_s, evaluated at the value of the wavevector
k_∥ = k((x/r_0) x̂ + (y/r_0) ŷ) (in rectangular coordinates), i.e.,
k_∥ = k sin θ (cos φ x̂ + sin φ ŷ) in circular polar coordinates

$$\vec{N} = FT_2(\vec{J}_s)\Big|_{\vec{k}_\parallel = k(x/r_0,\; y/r_0)}$$ (10)

$$\vec{L} = FT_2(\vec{M}_s)\Big|_{\vec{k}_\parallel = k(x/r_0,\; y/r_0)}$$ (11)

$$FT_2(f(x,y)) = \iint dx\, dy\, f(x,y)\, e^{i \vec{k}_\parallel \cdot (x,\, y)} = \iint dx\, dy\, f(x,y)\, e^{i(k_x x + k_y y)}.$$ (12)


Fig. 2. Microcavity structure consisting of a single defect (produced by
reducing the radius of the central hole to r_def/a = 0.2 from r/a = 0.275)
and a fractional edge dislocation of order p = 4 along the x axis. The applied
discretization is 20 pixels per periodicity a.


Components of radiation vectors can, therefore, be expressed in
terms of the Fourier transforms of the field components at the
surface S

$$N_x = -FT_2(H_y)\big|_{\vec{k}_\parallel}$$ (13)

$$N_y = FT_2(H_x)\big|_{\vec{k}_\parallel}$$ (14)

$$L_x = FT_2(E_y)\big|_{\vec{k}_\parallel}$$ (15)

$$L_y = -FT_2(E_x)\big|_{\vec{k}_\parallel}$$ (16)

$$\vec{k}_\parallel = k \left( \frac{x}{r_0},\, \frac{y}{r_0} \right) = k \sin\theta\, (\cos\phi\, \hat{x} + \sin\phi\, \hat{y}).$$ (17)

It is important to note that, for any observation point O, the
previously introduced wavevector k_∥ lies within the light cone
(i.e., |k_∥| ≤ k, where k = 2π/λ). Therefore, radiation vectors
are purely determined by Fourier components located within the
light cone.
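The light-cone statement follows directly from |k_∥| = k sin θ. A minimal numerical sketch of that bound is given below; the function name `k_parallel` and the unit wavelength are illustrative choices, not part of the original paper.

```python
import numpy as np


def k_parallel(theta, phi, wavelength):
    """In-plane wavevector k_par = k sin(theta) (cos(phi), sin(phi)) for a far-field direction."""
    k = 2.0 * np.pi / wavelength
    return k * np.sin(theta) * np.array([np.cos(phi), np.sin(phi)])


wavelength = 1.0                      # arbitrary units (assumed for illustration)
k = 2.0 * np.pi / wavelength
thetas = np.linspace(0.0, np.pi / 2, 91)    # upper half-space only
phis = np.linspace(0.0, 2.0 * np.pi, 181)

# Largest in-plane wavevector magnitude over all sampled observation directions.
max_norm = max(np.linalg.norm(k_parallel(t, p, wavelength)) for t in thetas for p in phis)
print(max_norm <= k * (1 + 1e-12))
```

The maximum is reached at grazing observation angles (θ = π/2), which is why only Fourier components with |k_∥| ≤ k can reach the far field.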

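Far fields can be expressed in terms of retarded potentials as

$$\vec{E} = -i\omega \vec{A} - \frac{i\omega}{k^2} \nabla(\nabla \cdot \vec{A}) - \frac{1}{\varepsilon_0} \nabla \times \vec{F}$$ (18)

$$\vec{H} = -i\omega \vec{F} - \frac{i\omega}{k^2} \nabla(\nabla \cdot \vec{F}) + \frac{1}{\mu_0} \nabla \times \vec{A}.$$ (19)

Under the assumption that all terms in fields decaying faster
than 1/r_0 can be neglected, the electric field components at an
arbitrary point O are

$$E_\theta = \eta H_\phi = -i\, \frac{e^{-ikr_0}}{2\lambda r_0}\, (\eta N_\theta + L_\phi)$$ (20)

$$E_\phi = -\eta H_\theta = i\, \frac{e^{-ikr_0}}{2\lambda r_0}\, (-\eta N_\phi + L_\theta)$$ (21)

$$\eta = \sqrt{\frac{\mu_0}{\varepsilon_0}}$$ (22)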

IEEE JOURNAL OF QUANTUM ELECTRONICS, VOL. 38, NO. 7, JULY 2002


Fig. 3. Field components of the x-dipole mode at the surface S positioned at approximately d/4 from the surface of the membrane. The analyzed structure is
shown in Fig. 2.


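where (r_0, θ, φ) represent the coordinates of the point O in
the spherical polar coordinate system. The radiation intensity
(power per unit solid angle) is then equal to [19]

$$K(\theta, \phi) = \frac{\eta}{8\lambda^2} \left( \left| N_\theta + \frac{L_\phi}{\eta} \right|^2 + \left| N_\phi - \frac{L_\theta}{\eta} \right|^2 \right)$$ (23)

and the total averaged power radiated into the half-space z > 0
is given by

$$P_1 = \int_0^{\pi/2}\!\!\int_0^{2\pi} d\theta\, d\phi\, \sin(\theta)\, K(\theta, \phi).$$ (24)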

The radiation vectors in spherical polar coordinates can be
expressed from their components in rectangular coordinates

$$N_\theta = (N_x \cos\phi + N_y \sin\phi) \cos\theta$$ (25)

$$N_\phi = -N_x \sin\phi + N_y \cos\phi$$ (26)

where N_x, N_y, L_x, and L_y were previously given as the 2-D
Fourier transforms of the appropriate field components tangential
to the surface S. Hence, just by knowing the Fourier transforms
of the tangential field components at the plane S, we can
evaluate the total averaged power radiated and the far-field
distribution. Furthermore, the radiated power depends only on the
wavevector components located within the light cone. Therefore,
the reduction in radiation loss and the improvement in the
Q factor can be achieved by suppressing the Fourier components
within the light cone or by redistributing them outside the light
cone.

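In the case when most of the radiated power is collected at
vertical incidence (i.e., at small θ), (24) can be simplified as
follows:

$$P_2 = \frac{\eta}{8\lambda^2 k^2} \iint_{|\vec{k}_\parallel| \le k} d\vec{k} \left( \left| N_x + \frac{L_y}{\eta} \right|^2 + \left| N_y - \frac{L_x}{\eta} \right|^2 \right) = \frac{\eta}{8\lambda^2 k^2} \iint_{|\vec{k}_\parallel| \le k} d\vec{k} \left( \left| FT_2(H_y) + \frac{1}{\eta} FT_2(E_x) \right|^2 + \left| FT_2(H_x) - \frac{1}{\eta} FT_2(E_y) \right|^2 \right).$$ (27)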
The integral of the cross terms in (27) gives approximately one
half of the radiated power. This can be proved easily by starting
from the expansion of fields in terms of the Fourier components
and the expression for the radiated power as the integral
of the z component of the Poynting vector T over the surface S.
This leads to the following expression for the averaged radiated
power:

$$P_3 = \frac{\eta}{4\lambda^2 k^2} \iint_{|\vec{k}_\parallel| \le k} d\vec{k} \left( |FT_2(H_x)|^2 + |FT_2(H_y)|^2 + \frac{1}{\eta^2} |FT_2(E_x)|^2 + \frac{1}{\eta^2} |FT_2(E_y)|^2 \right).$$ (28)
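The light-cone integral in (28) can be approximated on a sampled near-field grid with a discrete Fourier transform. The sketch below is not the authors' implementation: the function name `light_cone_power`, the pixel units, and the omission of the constant prefactor are assumptions made for illustration, and the Gaussian test fields are synthetic rather than FDTD data.

```python
import numpy as np

ETA0 = 376.730313668  # free-space impedance sqrt(mu0/eps0) in ohms


def light_cone_power(Ex, Ey, Hx, Hy, wavelength_px):
    """Sum |FT2|^2 of the tangential fields over |k_par| <= k (constant prefactor omitted).

    Fields are 2-D arrays sampled on a square pixel grid; wavelength_px is the
    free-space wavelength in pixels. np.fft uses the opposite exponent sign to
    (12), which does not affect the magnitudes used here.
    """
    ny, nx = Ex.shape
    FX, FY = np.meshgrid(np.fft.fftfreq(nx), np.fft.fftfreq(ny))  # cycles per pixel
    inside = FX**2 + FY**2 <= (1.0 / wavelength_px) ** 2          # light-cone disk
    total = 0.0
    for field, weight in ((Hx, 1.0), (Hy, 1.0), (Ex, 1 / ETA0**2), (Ey, 1 / ETA0**2)):
        spectrum = np.fft.fft2(field)
        total += weight * np.sum(np.abs(spectrum[inside]) ** 2)
    return total


# Synthetic check: a smooth profile radiates, a rapidly oscillating one does not.
n = 128
x = np.arange(n) - n // 2
X, Y = np.meshgrid(x, x)
smooth = np.exp(-(X**2 + Y**2) / (2 * 10.0**2))
modulated = smooth * np.cos(2 * np.pi * X / 4.0)   # spectrum centred far outside the light cone
zeros = np.zeros_like(smooth)

p_smooth = light_cone_power(zeros, zeros, zeros, smooth, wavelength_px=69.0)
p_modulated = light_cone_power(zeros, zeros, zeros, modulated, wavelength_px=69.0)
print(p_modulated < 1e-6 * p_smooth)
```

The comparison illustrates the point of this section: moving spectral weight out of the disk |k_∥| ≤ k suppresses the estimated radiated power even though the real-space envelope is unchanged.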


It is important to note that, if some field component
u(x, y) is odd with respect to the x coordinate [i.e.,
u(x, y) = −u(−x, y)], then its Fourier transform must
be equal to zero for any point in the Fourier space with k_x = 0.
Similarly, any field component which is odd with respect to
the y coordinate has a Fourier transform which is zero for any
point with k_y = 0.
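This parity property is easy to confirm numerically. The following sketch uses a synthetic Gaussian-envelope field (an assumption for illustration, not a computed cavity mode) and compares the k_x = 0 line of the discrete Fourier transform for an odd and an even profile.

```python
import numpy as np

n = 64
x = np.arange(n) - (n - 1) / 2.0      # sample points symmetric about x = 0
X, Y = np.meshgrid(x, x)              # X varies along axis 1, Y along axis 0
envelope = np.exp(-(X**2 + Y**2) / (2 * 8.0**2))

u_odd = X * envelope                  # odd in x: u(x, y) = -u(-x, y)
u_even = envelope                     # even in both x and y

F_odd = np.fft.fft2(u_odd)
F_even = np.fft.fft2(u_even)

# Axis 1 corresponds to x, so column 0 of the transform is the k_x = 0 line.
odd_on_axis = np.max(np.abs(F_odd[:, 0]))
even_on_axis = np.max(np.abs(F_even[:, 0]))
print(odd_on_axis, even_on_axis)
```

The odd profile gives values at the level of floating-point round-off along k_x = 0, while the even profile has its largest Fourier component there.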

Let us introduce the radiation factor RF which is directly
proportional to the radiated power P

$$RF_i = \frac{P_i}{W}, \quad i = 1, 2, 3, \ldots$$ (29)

where W represents the total energy of a mode in the half-space
above the middle of the membrane. The radiation Q factor of
a mode (which is a measure of the radiation, out-of-plane loss)
can be expressed as

$$Q = \omega\, \frac{W}{P} = \frac{\omega}{RF}.$$ (30)


III.  Effect  of  Fractional  Edge  Dislocations 
on  the  Q  Factor  of  the  Dipole  Mode 
in  Free-Standing  Membranes 

We have recently proposed the design and fabrication of optical
microcavities in free-standing membranes with Q > 10^4
for the dipole mode, and mode volumes of the order of one
half of the cubic wavelength of light (measured in the material)
[2], [16]. The dramatic improvement in Q factors over
single defect microcavities (without a significant increase in
the mode volume) was obtained by introducing a novel type of
photonic crystal lattice defect, consisting of the elongation of
holes along the symmetry axes. We call this type of defect a
fractional edge dislocation, by analogy with edge dislocations
in solid state physics. Edge dislocations are formed by introducing
extra atomic planes into the crystal lattice. Here, by contrast,
we insert only fractions of the atomic planes along
the symmetry axes of the photonic crystal, as shown in Fig. 2.
Hole-to-hole distances are preserved under this deformation,
and the half-spaces y > p/2 and y < −p/2 maintain the unperturbed
photonic crystal geometry.

Fig. 4. Fourier components of the x-dipole mode in the structure from Fig. 2. The
fractional edge dislocation is of order p = 0 in this case. The light cone can
be represented as a disk with radius approximately equal to 0.015 located in
the center of each square. The horizontal and vertical axes correspond to k_x/2π
and k_y/2π, respectively.

Fig. 5. The 2-D Fourier transforms of the even field components of the x-dipole mode in the structure shown in Fig. 2, as a function of the elongation parameter
p. The light cone can be represented as a disk inscribed into the square. Clearly, the intensities of the Fourier transforms within the light cone are minimized for
p = 2, where the Q factor reaches its maximum.

We consider again some of the microcavities that we proposed
in [2]. The unperturbed photonic crystal parameters are
r/a = 0.275, d/a = 0.75, and n = 3.4, where r, a, d, and n
represent the hole radius, the periodicity of the triangular lattice,
the thickness of the slab, and the refractive index of the
semiconductor material, respectively. The choice of photonic crystal
parameters is discussed in more detail in our previous work [2],
[20]. Briefly, we limit the r/a ratio to rather modest values of
around 0.3, in order to minimize the out-of-plane losses produced
by the vertical scattering at the edges of holes. Since the
reduction in r/a leads to a decrease in the size of the bandgap, it
is important not to reduce the hole radius too much, in order to
preserve the lateral confinement and small mode volume (e.g.,
we do not use r/a below 0.275). The d/a ratio of our structures
is usually between 0.65 and 0.75, and we were able to design
microcavities with very high Q factors at both ends of this range,
without a significant change in the mode volume [2], [20]. The
reasons for choosing this thickness range are the following: if
the slab is too thin, the mode is not confined well within it
vertically, and it interacts more strongly with the substrate
(positioned at around λ/2 underneath the bottom membrane surface
in our structures [2]), which reduces its Q factor. Furthermore,
the fabrication of thin suspended membranes is difficult, and
these structures are not robust. On the other hand, if d/a is
too large, the structure is multimode in the vertical direction,
which is also undesirable.

In the FDTD method, we apply a discretization of 20 pixels per
periodicity a. Therefore, a fractional edge dislocation of order
p = 1 corresponds to the insertion of extra material whose
thickness is equal to (1/20)a. In the microcavity of interest,
the central hole radius is decreased to r_def/a = 0.2 and a
fractional edge dislocation of order p is applied along the x
axis, as shown in Fig. 2. The dipole mode's frequency decreases
as a function of the elongation parameter p [2], [20], and it is
desirable to start the elongation process with a mode whose
frequency is close to the edge of the air band, allowing enough
space to achieve the optimum Q_⊥ within the bandgap when the
structure is tuned. In that case, the lateral confinement is
preserved and Q_∥ can be improved by increasing
the number of photonic crystal layers around the defect. This is
one of the reasons for reducing the defect hole radius to only
0.2a. The other reason is our long-term goal, photonic crystal
cavity QED with neutral atoms [2], for which we need a strong
field intensity within an air hole large enough to place a neutral
atom, without significant surface effects. Field components of
the x-dipole mode in the analyzed structure are shown in Fig. 3,
as a function of the elongation parameter p. For the x-dipole
mode, the E_x and B_y components are even, while the E_y and
B_x components are odd with respect to both symmetry axes x
and y. Therefore, it is expected that E_y and B_x (i.e., L_x and
N_y) do not contribute significantly to the radiated power in this
case, since their Fourier transforms are equal to zero along both
the k_x and k_y axes. This is also illustrated in Fig. 4. Therefore,
in the case of the analyzed x-dipole mode we can approximate
the expression (28) even further as

$$P_4 = \frac{\eta}{4\lambda^2 k^2} \iint_{|\vec{k}_\parallel| \le k} d\vec{k} \left( |FT_2(H_y)|^2 + \frac{1}{\eta^2} |FT_2(E_x)|^2 \right).$$ (31)

In order to minimize the radiated power, it is necessary to
minimize (within the light cone) the Fourier transforms of the even
field components E_x and B_y. In the general case, these Fourier
transforms are nonzero at small values of |k_∥| (i.e., in the light
cone). However, they can be minimized by balancing the
intensities of positive and negative field lobes. Indeed, we can
observe in Fig. 3 that, by varying the elongation parameter p,
we also tune the sizes of the central (negative) lobes in E_x and
B_y, as well as the intensity distribution between the positive and
negative lobes. Therefore, tuning p is expected to tune
the Fourier transforms of the even field components,
and subsequently the radiated power.

The Fourier components of the x-dipole mode in the structure
with p = 0 are shown in Fig. 4. When the elongation parameter
p changes in the analyzed range from 0 to 4, peaks in
the Fourier space preserve their position, but their intensities are
tuned. This can be observed in Fig. 5. Clearly, Fourier components
within the light cone are minimized for p = 2, where the Q
factor reaches its maximum. Therefore, the optimization of the
Q factor of the dipole mode (after the application of fractional
edge dislocations) is a result of suppression of the wavevector
components within the light cone. This suppression is a product
of balancing the energies of the positive and negative
field lobes of the even field components. The balancing is
obtained by tuning the sizes of the negative lobes through insertion
of extra material along the symmetry axis. The Q factor
optimization is achieved in this case without a significant mode
delocalization.
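The lobe-balancing argument can be illustrated with a toy even field built from two Gaussians: a broad positive lobe and a narrower central lobe of opposite sign whose weight c is tunable. This is only a sketch under stated assumptions; the Gaussian profile, the grid size, and the names `toy_field` and `radiation_factor` are hypothetical and do not reproduce the FDTD mode. The light-cone radius of 0.015 cycles per pixel is taken from the Fig. 4 caption; with 20 pixels per period it corresponds to a/λ of about 0.3.

```python
import numpy as np

N = 256
LIGHT_CONE_RADIUS = 0.015             # cycles per pixel, from the Fig. 4 caption

x = np.arange(N) - N // 2
X, Y = np.meshgrid(x, x)
R2 = X**2 + Y**2


def toy_field(c, sigma_out=12.0, sigma_in=6.0):
    """Even toy field: broad positive Gaussian minus c times a narrow central Gaussian."""
    return np.exp(-R2 / (2 * sigma_out**2)) - c * np.exp(-R2 / (2 * sigma_in**2))


def radiation_factor(u):
    """Spectral power inside the light cone divided by the total field energy (arbitrary units)."""
    f = np.fft.fftfreq(N)
    FX, FY = np.meshgrid(f, f)
    inside = FX**2 + FY**2 <= LIGHT_CONE_RADIUS**2
    spectrum = np.fft.fft2(u)
    return np.sum(np.abs(spectrum[inside]) ** 2) / np.sum(u**2)


# The k = 0 Fourier component (the integral of the field) vanishes when the
# positive and negative lobes carry equal weight, i.e. c = (sigma_out / sigma_in)^2.
c_balanced = (12.0 / 6.0) ** 2
rf_unbalanced = radiation_factor(toy_field(0.0))
rf_balanced = radiation_factor(toy_field(c_balanced))
print(rf_balanced / rf_unbalanced)
```

In this toy model the balanced field has a much smaller light-cone fraction than the single positive lobe, although its real-space extent is set by the same broad Gaussian, which mirrors the claim that Q can be improved without significant delocalization.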

In our FDTD calculations [2], the total Q factor is separated
into the lateral (Q_∥) and vertical (Q_⊥) quality factors. Q_⊥ is
a measure of radiation loss, while Q_∥ corresponds to the loss
through the mirrors in the lateral directions, which can be
reduced by adding more layers of photonic crystal. The boundary
for separation of vertical from lateral loss (i.e., Q_⊥ from Q_∥) is
positioned approximately at λ/2 from the surface of the membrane,
as suggested in our early work [21]. We have discussed in
our recent publication [2] that this choice of boundary excludes
some small portion of radiation loss from Q_⊥, and the total Q
factor of the analyzed dipole mode achievable by increasing the
number of the PC layers around the defect (also referred to as
the limit of total Q factor) is somewhat smaller than Q_⊥. For
this reason, we now believe that a better choice of boundary for
separation of Q_⊥ from Q_∥ would be the one positioned directly
above the surface of the membrane. However, for the purpose
of comparing our new results [2] with our earlier work [21], we
preserve this boundary positioned at λ/2 from the surface of the
membrane.

Fig. 6. Q factors estimated from the FDTD, or from the Fourier transforms of
the tangential field components. The plane S is positioned (a) directly above the
surface of the membrane, at a distance equal to d/4 from it, and (b) at a distance
equal to λ/2 from the surface of the membrane.

The radiation Q factors are evaluated using the method presented
in this paper, and results are shown in Fig. 6. The plane
S (above which we integrate the radiated power) is positioned
directly above and at λ/2 above the surface of the membrane,
in Fig. 6(a) and (b), respectively. Therefore, in the latter case,
we expect a better agreement between the radiation Q factors
(ω/RF) estimated using our new method and Q_⊥ estimated
from the FDTD calculations, but in the former case the newly
calculated Q factors are a better approximation of the limit of the
total Q factors. In Fig. 6(a), Q_⊥ factors are larger than Q factors
calculated as ω/RF, because ω/RF includes practically all
radiation (out-of-plane) loss, while Q_⊥ includes only the radiation
loss above λ/2 from the surface of the membrane. In Fig. 6(b),
both the plane S and the boundary for separation of Q_⊥ from Q_∥
are positioned at λ/2 from the membrane surface, and a better
agreement with Q_⊥ from the FDTD simulations is observed.
However, Q_⊥ is somewhat smaller, due to numerical inaccuracy.
Radiation factors RF_2, RF_3, and RF_4 are estimated under
the assumption that most of the radiation is collected at vertical
incidence. This is not really true in the case of the x-dipole,
which is why there is an offset between the Q factors evaluated
from RF_i, i = 2, 3, 4, and Q estimated from RF_1, which does
not make any assumptions regarding the direction of radiation.

IV.  Conclusion 

We have presented a method for estimating the Q factor of
a mode and its radiation loss from the known Fourier transform
of the near-field distribution. By applying this approach to
high Q structures that we have proposed recently [2], we have
proven that the optimization of the Q factor of the dipole defect
mode (after the application of fractional edge dislocations)
results from the suppression of the wavevector components within
the light cone. This suppression is a result of balancing between
the positive and negative lobes in the even field components.
The balancing is obtained by tuning the sizes of the negative
lobes with insertion of extra material along the symmetry axis.
Although our analysis focuses on the dipole mode only, a similar
approach can be applied to any type of microcavity formed
in planar photonic crystals.
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We  show  how  to  design  families  of  operational  criteria  that  distinguish  entangled  from  separable 
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are  tractable  due  to  powerful  computational  and  theoretical  methods  for  the  class  of  convex  optimization 
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Entanglement  is  one  of  the  most  striking  features  of 
quantum  mechanics.  Not  only  is  it  at  the  heart  of  the  vio¬ 
lation  of  Bell  inequalities  [1],  but  it  has  lately  been  rec¬ 
ognized  as  a  very  useful  resource  in  the  field  of  quantum 
information.  Entanglement  can  be  used  to  perform  several 
important  tasks  such  as  teleportation,  quantum  key  distri¬ 
bution,  and  quantum  computation  [2].  Despite  its  wide¬ 
spread  importance,  there  is  no  procedure  that  can  tell  us 
whether  a  given  state  is  entangled  or  not,  and  considerable 
effort  has  been  dedicated  to  this  problem  [3,4].  In  this 
Letter  we  apply  powerful  tools  of  optimization  theory  for 
problems  known  as  semidefinite  programs  to  construct  a 
hierarchy  of  tests  for  entanglement. 

A bipartite mixed state $\rho$ is said to be separable [5] (not
entangled) if it can be written as a convex combination of
pure product states

$$\rho = \sum_i p_i\, |\psi_i\rangle\langle\psi_i| \otimes |\phi_i\rangle\langle\phi_i|, \tag{1}$$

where $|\psi_i\rangle$ and $|\phi_i\rangle$ are state vectors on the spaces $\mathcal{H}_A$
and $\mathcal{H}_B$ of subsystems A and B, respectively, and $p_i > 0$,
$\sum_i p_i = 1$. If a state admits such a decomposition, it can
be created by local operations and classical communication,
and hence it cannot be an entangled state.

Several operational criteria have been proposed to
identify entangled states. Typically these are based on
simple properties obeyed by all separable states and are
thus necessary but not sufficient conditions for separability
(although some sufficient conditions for separability
are known [6]). The most famous of these criteria is
based on the partial transposition and was first introduced
by Peres [7]. It was shown by Horodecki et al. [8]
to be both necessary and sufficient for separability in
$\mathcal{H}_2 \otimes \mathcal{H}_2$ and $\mathcal{H}_2 \otimes \mathcal{H}_3$. If $\rho$ has matrix elements
$\rho_{ik,jl} = \langle i| \otimes \langle k|\, \rho\, |j\rangle \otimes |l\rangle$, then the partial transpose $\rho^{T_A}$
is defined by $\rho^{T_A}_{ik,jl} = \rho_{jk,il}$. If a state is separable, then it
must have a positive partial transpose (PPT). To see this,
consider the decomposition (1) for $\rho$. Partial transposition
takes $|\psi_i\rangle\langle\psi_i|$ to $|\psi_i^*\rangle\langle\psi_i^*|$, so the result of this operation
is another valid density matrix and must be positive. Thus
any state for which $\rho^{T_A}$ is not positive semidefinite is
necessarily entangled. This criterion has the advantage of
being very easy to check, but there are PPT states that are
nonetheless entangled, as was first demonstrated in [9].
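
As a minimal numerical sketch of the PPT test (an illustration added here, not part of the original analysis), the partial transpose can be formed by reshaping the density matrix into a four-index tensor and exchanging the two indices of party A. The function names and the two-qubit test family (a singlet mixed with white noise) are assumptions chosen for illustration only.

```python
import numpy as np

def partial_transpose(rho, dims, party):
    """Partial transpose of a bipartite matrix with respect to party 0 (A) or 1 (B)."""
    dA, dB = dims
    # Tensor indices: (i, k) label the row and (j, l) the column, as in rho_{ik,jl}.
    t = rho.reshape(dA, dB, dA, dB)
    axes = (2, 1, 0, 3) if party == 0 else (0, 3, 2, 1)
    return t.transpose(axes).reshape(dA * dB, dA * dB)

def min_pt_eigenvalue(rho, dims):
    """Smallest eigenvalue of rho^{T_A}; a negative value certifies entanglement."""
    return np.linalg.eigvalsh(partial_transpose(rho, dims, 0)).min()

# Illustrative two-qubit family (an assumption, not taken from the text):
# the singlet state mixed with white noise.
singlet = np.array([0.0, 1.0, -1.0, 0.0]) / np.sqrt(2)

def noisy_singlet(p):
    return p * np.outer(singlet, singlet) + (1 - p) * np.eye(4) / 4

for p in (0.2, 0.5, 1.0):
    print(p, min_pt_eigenvalue(noisy_singlet(p), (2, 2)))
```

For this family the smallest eigenvalue of the partial transpose is $(1 - 3p)/4$, so the test flags entanglement for $p > 1/3$. A nonnegative result is inconclusive in higher dimensions, which is the gap the extension-based criteria address.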

Our separability criteria will also be based on simple
computationally checkable properties of separable states.
Let $\tilde\rho$, defined on $\mathcal{H}_A \otimes \mathcal{H}_B \otimes \mathcal{H}_A$, be given by

$$\tilde\rho = \sum_i p_i\, |\psi_i\rangle\langle\psi_i| \otimes |\phi_i\rangle\langle\phi_i| \otimes |\psi_i\rangle\langle\psi_i|. \tag{2}$$

First, $\tilde\rho$ is an extension of $\rho$ (that is, the partial trace over
the third party C is equal to $\rho$, $\mathrm{Tr}_C[\tilde\rho] = \rho$). Second,
the state is symmetric under interchanging the two copies
of $\mathcal{H}_A$. To put this more formally we define the swap
operator $P$ such that $P|i\rangle \otimes |k\rangle \otimes |j\rangle = |j\rangle \otimes |k\rangle \otimes |i\rangle$.
We have $P^2 = I$, and $\pi = (I + P)/2$ is a projector onto
the symmetric subspace. Since $\pi\tilde\rho\pi = \tilde\rho$, the extension
$\tilde\rho$ has support only on this subspace. Finally, the extension
$\tilde\rho$ is a tripartite separable state. This means that it will
have positive partial transposes with respect to any of the
parties, and in particular we have $\tilde\rho^{T_A} \ge 0$ and $\tilde\rho^{T_B} \ge 0$.
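
The three properties of the extension can be checked numerically for a randomly drawn separable state. The following is a sketch under stated assumptions: the dimensions, the number of product terms, the random seed, and all helper names are illustrative choices, not part of the original work.

```python
import numpy as np

rng = np.random.default_rng(0)
dA, dB, n_terms = 2, 3, 4          # illustrative sizes
dims = (dA, dB, dA)                # parties A, B, and the copy C of A
N = dA * dB * dA

def random_ket(d):
    v = rng.normal(size=d) + 1j * rng.normal(size=d)
    return v / np.linalg.norm(v)

def proj(v):
    return np.outer(v, v.conj())

def partial_transpose(rho, dims, party):
    """Transpose one tensor factor of an operator on a multipartite space."""
    n = len(dims)
    axes = list(range(2 * n))
    axes[party], axes[party + n] = axes[party + n], axes[party]
    return rho.reshape(dims + dims).transpose(axes).reshape(rho.shape)

# Random separable state of the form (1) and its extension of the form (2).
p = rng.random(n_terms)
p /= p.sum()
psis = [random_ket(dA) for _ in range(n_terms)]
phis = [random_ket(dB) for _ in range(n_terms)]
rho = sum(w * np.kron(proj(a), proj(b)) for w, a, b in zip(p, psis, phis))
rho_ext = sum(w * np.kron(np.kron(proj(a), proj(b)), proj(a))
              for w, a, b in zip(p, psis, phis))

# Partial trace over the third party C.
rho_back = np.einsum('abcdec->abde',
                     rho_ext.reshape(dims + dims)).reshape(dA * dB, dA * dB)

# Swap operator P|i>|k>|j> = |j>|k>|i> as a permutation matrix, and pi = (I + P)/2.
P = np.eye(N).reshape(dims + dims).transpose(2, 1, 0, 3, 4, 5).reshape(N, N)
pi_sym = (np.eye(N) + P) / 2

is_extension = np.allclose(rho_back, rho)
is_symmetric = np.allclose(pi_sym @ rho_ext @ pi_sym, rho_ext)
min_eig_TA = np.linalg.eigvalsh(partial_transpose(rho_ext, dims, 0)).min()
min_eig_TB = np.linalg.eigvalsh(partial_transpose(rho_ext, dims, 1)).min()
print(is_extension, is_symmetric, min_eig_TA, min_eig_TB)
```

Both minimum eigenvalues should be nonnegative up to rounding error. The separability test runs this logic in reverse: given only $\rho$, it asks whether any operator with these properties exists.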

We may now formulate an explicit separability criterion
based on the existence of the extension discussed above.
If the state $\rho$ on $\mathcal{H}_A \otimes \mathcal{H}_B$ is separable then there is
an extension $\tilde\rho$ on $\mathcal{H}_A \otimes \mathcal{H}_B \otimes \mathcal{H}_A$ such that $\pi\tilde\rho\pi = \tilde\rho$,
$\tilde\rho^{T_A} \ge 0$, and $\tilde\rho^{T_B} \ge 0$. Note that the symmetry of
the extension means that if $\tilde\rho^{T_A} \ge 0$ then $\tilde\rho^{T_C} \ge 0$, so
including this would not make a stronger test. We may
generalize this criterion to an arbitrary number of copies
of both $\mathcal{H}_A$ and $\mathcal{H}_B$. If the state $\rho$ on $\mathcal{H}_A \otimes \mathcal{H}_B$ is
separable then there is an extension $\tilde\rho$ with support only on
the symmetric subspace of $\mathcal{H}_A^{\otimes k} \otimes \mathcal{H}_B^{\otimes l}$ such that $\tilde\rho$ has
a positive partial transpose for all partitions of the $k + l$
parties into two groups. Since the extensions are required
to be symmetric, it is only necessary to test the possible
partitions into two groups that are not related by permuting
copies of $\mathcal{H}_A$ and $\mathcal{H}_B$. Including testing for positivity of
the extension itself, there are $\lceil (k + 1)(l + 1)/2 \rceil$ distinct
positivity checks to be satisfied by $\tilde\rho$.

These results generate a hierarchy of necessary conditions
for separability. The first is the usual PPT test for
a bipartite density matrix $\rho$. If the test fails, the state is
entangled; if the test is passed, the state could be separable
or entangled. In the latter case we look for an extension $\tilde\rho$
of $\rho$ to three parties such that $\pi\tilde\rho\pi = \tilde\rho$ and $\tilde\rho$ satisfies the PPT
test for all possible partial transposes. If no such extension
exists, then $\rho$ must be entangled. If such an extension is
possible, the state could be separable or entangled, and we
need to consider an extension to four parties, and so on.

Each test in this sequence is at least as powerful as
the previous one. We can see this by showing that, if
there is a PPT extension $\tilde\rho_n$ to $n$ parties, then there must
be a PPT extension $\tilde\rho_{n-1}$ to $n - 1$ parties. Let $\tilde\rho_{n-1} =
\mathrm{Tr}_X[\tilde\rho_n]$, $X$ being one of the copies of A or B. Then $\tilde\rho_{n-1}$
will inherit from $\tilde\rho_n$ the property of having its support on
the symmetric subspace. Let’s assume that it is not PPT.
Then there is a subset $I$ of the parties such that $\tilde\rho_{n-1}^{T_I}$
has a negative eigenvalue, where $T_I$ represents the partial
transpose with respect to all the parties in subset $I$. Let $|e\rangle$
be the corresponding eigenvector and let $\{|i\rangle\}$ be a basis of
the system $X$ over which the partial trace was performed.
Since $\tilde\rho_n$ is PPT, then $\langle e|\langle i|\, \tilde\rho_n^{T_I}\, |e\rangle|i\rangle \ge 0$, for all $i$. Then

$$\sum_i \langle e|\langle i|\, \tilde\rho_n^{T_I}\, |e\rangle|i\rangle = \langle e|\, \mathrm{Tr}_X[\tilde\rho_n^{T_I}]\, |e\rangle \ge 0. \tag{3}$$

Since $X \notin I$, we can commute the trace and the partial
transpose, and, using $\tilde\rho_{n-1} = \mathrm{Tr}_X[\tilde\rho_n]$, we have
$\langle e|\, \tilde\rho_{n-1}^{T_I}\, |e\rangle \ge 0$, which contradicts the fact that $|e\rangle$ is an
eigenvector of $\tilde\rho_{n-1}^{T_I}$ with negative eigenvalue.

The problem of searching for the extension can be
solved efficiently, since it can be stated as a particular
case of the class of convex optimizations known as
semidefinite programs (SDP) [10]. A SDP corresponds to
the optimization of a linear function, subject to a linear
matrix inequality (LMI). A typical SDP will be

$$\begin{aligned} &\text{minimize} && c^T x, \\ &\text{subject to} && F(x) \ge 0, \end{aligned} \tag{4}$$

where $c$ is a given vector, $x = (x_1, \ldots, x_m)$, and $F(x) =
F_0 + \sum_i x_i F_i$, for some fixed $n$-by-$n$ Hermitian matrices
$F_j$. The inequality in the second line of (4) means that the
matrix $F(x)$ is positive semidefinite. The vector $x$ is the
variable over which the minimization is performed. In the
particular instance in which $c = 0$, there is no function to
minimize and the problem reduces to whether or not it is
possible to find $x$ such that $F(x)$ is positive semidefinite.
This is termed a feasibility problem. The convexity of
SDPs has made it possible to develop sophisticated and
reliable analytical and numerical methods for them [10].

The separability criteria we introduced above may all
be formulated as semidefinite programs. For brevity we
will explicitly consider only the problem of searching for
an extension of $\rho$ to three parties. We will also relax the
symmetry requirements on the extension $\tilde\rho$, and we will
ask only $P\tilde\rho P = \tilde\rho$. This increases the size of the SDP,
but simplifies the setup. Let $\{\sigma_i^A\}_{i=1,\ldots,d_A^2}$ and $\{\sigma_j^B\}_{j=1,\ldots,d_B^2}$
be bases for the space of Hermitian matrices that operate
on $\mathcal{H}_A$ and $\mathcal{H}_B$, respectively, such that they satisfy

$$\mathrm{Tr}(\sigma_i^X \sigma_j^X) = \alpha\, \delta_{ij} \quad\text{and}\quad \mathrm{Tr}(\sigma_i^X) = \delta_{i1}, \tag{5}$$

where $X$ stands for A or B, and $\alpha$ is some constant; the
generators of SU(n) could be used to form such a basis.
We can then expand $\rho$ in the basis $\{\sigma_i^A \otimes \sigma_j^B\}$, and
write $\rho = \sum_{ij} \rho_{ij}\, \sigma_i^A \otimes \sigma_j^B$, with $\rho_{ij} = \alpha^{-2}\, \mathrm{Tr}[\rho\, \sigma_i^A \otimes
\sigma_j^B]$. We can write the extension $\tilde\rho$ in a similar way

$$\tilde\rho = \sum_{\substack{ijk \\ i<k}} \tilde\rho_{kji}\, \{\sigma_i^A \otimes \sigma_j^B \otimes \sigma_k^A + \sigma_k^A \otimes \sigma_j^B \otimes \sigma_i^A\} + \sum_{kj} \tilde\rho_{kjk}\, \sigma_k^A \otimes \sigma_j^B \otimes \sigma_k^A, \tag{6}$$

where we have explicitly used the symmetry between the
first and third party. We also need to satisfy $\mathrm{Tr}_C(\tilde\rho) = \rho$.
Using (5), and the fact that the $\sigma_i^A \otimes \sigma_j^B$ form a basis of
the space of Hermitian matrices on $\mathcal{H}_A \otimes \mathcal{H}_B$, we obtain
$\tilde\rho_{ij1} = \rho_{ij}$. The remaining components of $\tilde\rho$ will be the
variables in our SDP. The LMIs come from requiring that
the state $\tilde\rho$ and its partial transposes be positive semidefinite.
For example, the condition $\tilde\rho \ge 0$ will take the form
$F(x) = F_0 + \sum_i x_i F_i \ge 0$ if we define


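$$F_0 = \sum_j \rho_{1j}\, \sigma_1^A \otimes \sigma_j^B \otimes \sigma_1^A + \sum_{i=2,\, j=1} \rho_{ij}\, \{\sigma_i^A \otimes \sigma_j^B \otimes \sigma_1^A + \sigma_1^A \otimes \sigma_j^B \otimes \sigma_i^A\},$$

$$F_{iji} = \sigma_i^A \otimes \sigma_j^B \otimes \sigma_i^A, \quad i \ge 2,$$

$$F_{ijk} = (\sigma_i^A \otimes \sigma_j^B \otimes \sigma_k^A + \sigma_k^A \otimes \sigma_j^B \otimes \sigma_i^A), \quad k > i \ge 2.$$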

The coefficients $\tilde\rho_{ijk}$ ($k \ne 1$, $k \ge i$) play the role of the variable $x$. There are $m = (d_A^4 d_B^2 - d_A^2 d_B^2)/2$ components of
$x$, where $d_i$ is the dimension of $\mathcal{H}_i$. Each $F$ is a square matrix of dimension $n = d_A^2 d_B$. Positivity of the partial
transposes $T_A$ and $T_B$ leads to two more LMIs, $\tilde\rho^{T_A} \ge 0$ and $\tilde\rho^{T_B} \ge 0$. The $F$ matrices for these two LMIs are related
to the matrices $F_{ijk}$ by the appropriate partial transposition. We can write these three LMIs as one, if we define the
matrix $G = \tilde\rho \oplus \tilde\rho^{T_A} \oplus \tilde\rho^{T_B}$, so for example, $G_0 = F_0 \oplus F_0^{T_A} \oplus F_0^{T_B}$ (a block-diagonal matrix $C = A \oplus B$ is positive
semidefinite if, and only if, both $A$ and $B$ are positive semidefinite). So the feasibility problem reduces to attempting
to find $\tilde\rho_{ijk}$ ($k \ne 1$, $k \ge i$) with $G \ge 0$. In fact, the SDP corresponding to minimizing $t$ subject to $t I_{ABA} + G \ge 0$
is always feasible and performs better numerically. A positive optimum gives a value of $p^*$ such that $(1 - p)\rho +
p I_{AB}/(d_A d_B)$ is entangled for all $0 \le p < p^*$. Looking for an extension on $\mathcal{H}_A^{\otimes k} \otimes \mathcal{H}_B^{\otimes l}$ is a semidefinite program
with $m = \binom{d_A + k - 1}{k}^2 \binom{d_B + l - 1}{l}^2 - d_A^2 d_B^2$ variables and a
matrix $G$ with $\lceil (k + 1)(l + 1)/2 \rceil$ blocks of dimension at
most $\binom{d_A + \lceil k/2 \rceil - 1}{\lceil k/2 \rceil}^2 \binom{d_B + \lceil l/2 \rceil - 1}{\lceil l/2 \rceil}^2$.
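
As a quick sanity check on the problem size for the relaxed three-party case, the free coefficients can be enumerated directly and compared with the closed form $m = (d_A^4 d_B^2 - d_A^2 d_B^2)/2$. This is an illustrative sketch: the helper names are hypothetical, and the index ranges ($k \ge i \ge 2$, any $j$) are read off from the definitions of the matrices $F_{iji}$ and $F_{ijk}$.

```python
from itertools import product

def count_free_coefficients(dA, dB):
    """Count index triples (i, j, k) with k >= i >= 2 and any j (hypothetical helper)."""
    nA, nB = dA ** 2, dB ** 2   # number of basis matrices sigma^A and sigma^B
    triples = product(range(2, nA + 1), range(1, nB + 1), range(2, nA + 1))
    return sum(1 for i, j, k in triples if k >= i)

def closed_form(dA, dB):
    """m = (dA^4 dB^2 - dA^2 dB^2) / 2, as stated in the text."""
    return (dA ** 4 * dB ** 2 - dA ** 2 * dB ** 2) // 2

for dA, dB in [(2, 4), (3, 3)]:
    m = count_free_coefficients(dA, dB)
    n = dA ** 2 * dB            # dimension of each F matrix
    print(f"dA={dA}, dB={dB}: m={m}, closed form={closed_form(dA, dB)}, "
          f"n={n}, G is {3 * n} x {3 * n}")
```

For the two cases tested numerically in this Letter the count gives 96 variables with $n = 16$ for $d_A = 2$, $d_B = 4$, and 324 variables with $n = 27$ for $d_A = 3$, $d_B = 3$.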

Numerical SDP solvers are described in detail in [10].
Typically they involve the solution of a series of least
squares problems, each requiring a number of operations
scaling with the problem size as $O(m^2 n^2)$. For SDPs with
a block structure these break into independent parts, each
with a value of $n$ determined by the block size. The number
of iterations required is known to scale no worse than
$O(n^{1/2})$. Thus for any fixed value of $(k, l)$ the computation
involved in checking our criteria scales no worse than
$O(d_A^{13k/2} d_B^{13l/2})$, which is polynomial in the system size.

Using the SDP solver SeDuMi [11], we applied the first
criterion ($k = 2$, $l = 1$) to several examples of PPT entangled
states with $d_A = 2$, $d_B = 4$ or $d_A = 3$, $d_B = 3$.
On a 500 MHz desktop computer a single state could be
tested in under a second for $d_A = 2$, $d_B = 4$ and in about
eight seconds for $d_A = 3$, $d_B = 3$. For the one and two
parameter families of PPT entangled states described in
[3,9,12] we performed a systematic search of the parameter
space, in each case testing hundreds or thousands of
different states. We checked 4000 randomly chosen examples
of the seven parameter family of PPT entangled
states in [13]. We also checked the PPT entangled states
constructed from unextendable product bases in [14]. We
did not find any PPT entangled state with an extension of
the required form, thus verifying the entanglement of all
these states. Very close to the separable states the test was
inconclusive due to numerical uncertainties. Uncertainties
and one example are discussed more fully below.

A very useful property of a SDP is the existence of the
dual problem. If a problem can be stated as a SDP like (4),
usually called the primal problem, then the dual problem
corresponds to another SDP that can be written as

$$\begin{aligned} &\text{maximize} && -\mathrm{Tr}[F_0 Z] \\ &\text{subject to} && Z \ge 0, \\ & && \mathrm{Tr}[F_i Z] = c_i, \end{aligned} \tag{7}$$

where the matrix $Z$ is Hermitian and is the variable over
which the maximization is performed. For any feasible
solutions of the primal and dual problems we have

$$c^T x + \mathrm{Tr}[F_0 Z] = \mathrm{Tr}[F(x) Z] \ge 0, \tag{8}$$

where the last inequality follows from the fact that both
$F(x)$ and $Z$ are positive semidefinite. Then, for the particular
case of a feasibility problem ($c = 0$), Eq. (8) will
read $\mathrm{Tr}[F_0 Z] \ge 0$. This result can be used to give a certificate
of infeasibility for the primal problem: if there exists $Z$
such that $Z \ge 0$, $\mathrm{Tr}[F_i Z] = 0$, that satisfies $\mathrm{Tr}[F_0 Z] <
0$, then the primal problem must be infeasible.
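
The duality relation and the infeasibility certificate can be illustrated on a toy problem. In the sketch below all matrices are small made-up examples chosen by hand (they are assumptions and have no connection to the separability SDP), and the helper names are hypothetical. The first part checks the identity in Eq. (8) for a primal-feasible $x$ and a dual-feasible $Z$; the second part exhibits a certificate $Z$ for a feasibility problem that has no solution.

```python
import numpy as np

def F(x, F0, Fs):
    """Affine matrix function F(x) = F0 + sum_i x_i F_i."""
    return F0 + sum(xi * Fi for xi, Fi in zip(x, Fs))

def is_psd(M, tol=1e-12):
    return np.linalg.eigvalsh(M).min() >= -tol

# Part 1: weak duality, Eq. (8), on a toy instance.
F0 = np.array([[1.0, 0.0], [0.0, -0.5]])
Fs = [np.array([[0.0, 1.0], [1.0, 0.0]]),
      np.array([[0.0, 0.0], [0.0, 1.0]])]
Z = np.array([[1.0, 0.5], [0.5, 1.0]])           # positive semidefinite
c = np.array([np.trace(Fi @ Z) for Fi in Fs])    # chosen so that Z is dual feasible
x = np.array([0.2, 1.0])                         # primal feasible: F(x) >= 0
lhs = c @ x + np.trace(F0 @ Z)
rhs = np.trace(F(x, F0, Fs) @ Z)
print("c^T x + Tr[F0 Z] =", lhs, " Tr[F(x) Z] =", rhs)

# Part 2: infeasibility certificate for a feasibility problem (c = 0).
H0 = np.diag([1.0, -1.0])
Hs = [np.diag([1.0, 0.0])]                       # H(x) = diag(1 + x, -1) is never PSD
Z_cert = np.diag([0.0, 1.0])
certificate_ok = (is_psd(Z_cert)
                  and abs(np.trace(Hs[0] @ Z_cert)) < 1e-12
                  and np.trace(H0 @ Z_cert) < 0)
print("certificate valid:", certificate_ok)
```

In the second part, $\mathrm{Tr}[H(x) Z_{\mathrm{cert}}] = -1$ for every $x$, which is incompatible with $H(x) \ge 0$ and $Z_{\mathrm{cert}} \ge 0$. This is the mechanism by which a dual solution proves that no extension exists.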

In the context of entanglement, the role of the “certificate”
is played by observables known as entanglement witnesses
(EW) [8,15]. An EW for a state $\rho$ satisfies

$$\mathrm{Tr}[\rho_{\mathrm{sep}} W] \ge 0 \quad\text{and}\quad \mathrm{Tr}[\rho W] < 0, \tag{9}$$

where $\rho_{\mathrm{sep}}$ is any separable state. If our primal SDP is infeasible
(which means that the state $\rho$ must be entangled),
the dual problem provides a certificate of that infeasibility
that can be used to construct an EW for $\rho$.

First, we note that, due to the block diagonal structure
of the LMI, we can restrict any feasible dual solution $Z$ to
have the same structure, i.e., $Z = Z_0 \oplus Z_1^{T_A} \oplus Z_2^{T_B}$, where
the $Z_i$ are operators on $\mathcal{H}_A \otimes \mathcal{H}_B \otimes \mathcal{H}_A$. Second, we
have $\mathrm{Tr}[G_0 Z] = \mathrm{Tr}[F_0 (Z_0 + Z_1 + Z_2)]$. We defined $F_0$
as a linear function of $\rho$ so that $F_0 = \Lambda(\rho)$, where $\Lambda$ is
a linear map from $\mathcal{H}_A \otimes \mathcal{H}_B$ to $\mathcal{H}_A \otimes \mathcal{H}_B \otimes \mathcal{H}_A$. We
can now define an operator $\tilde Z$ on $\mathcal{H}_A \otimes \mathcal{H}_B$ through the
adjoint map $\Lambda^*$ such that $\tilde Z = \Lambda^*(Z_0 + Z_1 + Z_2)$ and

$$\mathrm{Tr}[\rho \tilde Z] = \mathrm{Tr}[\Lambda(\rho)(Z_0 + Z_1 + Z_2)] = \mathrm{Tr}[G_0 Z]. \tag{10}$$

If $\rho_{\mathrm{sep}}$ is any separable state, we know that the primal
problem is feasible (the extension $\tilde\rho$ exists). Then, using
$\mathrm{Tr}[G_0 Z] \ge 0$ and (10), we have $\mathrm{Tr}[\rho_{\mathrm{sep}} \tilde Z] \ge 0$ for any $\tilde Z$
obtained from a dual feasible solution. For this particular
problem, if the primal is not feasible (which means $\rho$ is an
entangled state), a feasible dual solution $Z_{EW}$ that satisfies
$\mathrm{Tr}[G_0 Z_{EW}] < 0$ always exists. Using (10) we can see that
the corresponding operator $\tilde Z_{EW}$ satisfies $\mathrm{Tr}[\rho \tilde Z_{EW}] < 0$,
which together with $\mathrm{Tr}[\rho_{\mathrm{sep}} \tilde Z_{EW}] \ge 0$ means that $\tilde Z_{EW}$ is
an entanglement witness for $\rho$.

In numerical work, if the SDP solver cannot find an
extension $\tilde\rho$ it constructs the matrices $Z_i$. Evaluating
$\mathrm{Tr}[\rho \tilde Z_{EW}]$ and verifying the three positivity conditions
provide an independent check of the result. Unless this check
is not conclusive (for example, if $\mathrm{Tr}[\rho \tilde Z_{EW}]$ is not significantly
different from zero), we are able to definitively
conclude that no $\tilde\rho$ exists.

If $W$ is an EW, then for any product state $|xy\rangle$ we have

$$E(x, y) = \langle xy| W |xy\rangle = \sum_{ijkl} W_{ijkl}\, x_i^* y_j^* x_k y_l \ge 0, \tag{11}$$

where $x_i$, $y_i$ are the components of $|x\rangle$, $|y\rangle$ in some basis,
and $W_{ijkl}$ are the matrix elements of $W$ in the same basis.
Equation (11) states that the biquadratic Hermitian form
$E$ associated with $W$ must be positive semidefinite (PSD).
It is not hard to show that all of the EWs generated by
Eq. (10) satisfy the relation

$$\begin{aligned} \langle xyx|\, \tilde Z_{EW} \otimes I\, |xyx\rangle &= \langle xyx|\, (Z_0 + Z_1 + Z_2)\, |xyx\rangle \\ &= \langle xyx| Z_0 |xyx\rangle + \langle x^* y x| Z_1^{T_A} |x^* y x\rangle \\ &\quad + \langle x y^* x| Z_2^{T_B} |x y^* x\rangle. \end{aligned} \tag{12}$$

Since $Z_0$, $Z_1^{T_A}$, and $Z_2^{T_B}$ are positive by construction, the
biquadratic Hermitian form $E(x, y)\langle x|x\rangle$ has a decomposition
as a sum of squared (SOS) magnitudes. This
guarantees that $E(x, y)$ is PSD. It can be shown that our
first separability criterion detects all entangled states that
possess an EW such that $E$ may be written in this form.
The dual program to our initial SDP may be interpreted as
a search for an entanglement witness of this type. Equally,
the Peres-Horodecki criterion detects the entanglement
of those states which possess entanglement witnesses
for which (11) may be written directly as a SOS, namely the
decomposable entanglement witnesses [16] such that
$W = P + Q^{T_A}$ for some PSD $P$ and $Q$. In general, if
there is no EW $W$ such that (11) is a SOS, we can search
over $W$ for which (11) is a SOS when multiplied by
$\langle x|x\rangle^{k-1} \langle y|y\rangle^{l-1}$ for some $k, l \ge 1$. By duality, this
corresponds to our $(k, l)$ separability criterion.

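As an example illustrating the methodology, consider
the state described in [3], Section 4.6, given by

$$\rho_\alpha = \frac{2}{7}\, |\psi_+\rangle\langle\psi_+| + \frac{\alpha}{7}\, \sigma_+ + \frac{5 - \alpha}{7}\, P \sigma_+ P, \tag{13}$$

with $0 \le \alpha \le 5$, $|\psi_+\rangle = \frac{1}{\sqrt{3}} \sum_{i=0}^{2} |ii\rangle$, and $\sigma_+ =
\frac{1}{3}(|01\rangle\langle 01| + |12\rangle\langle 12| + |20\rangle\langle 20|)$. Notice that $\rho_\alpha$ is
invariant under the simultaneous change of $\alpha \to 5 - \alpha$
and interchange of the parties. The state is separable for
$2 \le \alpha \le 3$ and not PPT for $\alpha > 4$ and $\alpha < 1$. Numerically,
entanglement witnesses could be constructed for
$\rho_\alpha$ in the range $3 + \epsilon \le \alpha \le 4$ (and $1 \le \alpha \le 2 - \epsilon$)
with $\epsilon \ge 10^{-8}$. A witness for $\alpha > 3$ can be extracted
from these by inspection:

$$\tilde Z_{EW} = 2(|00\rangle\langle 00| + |11\rangle\langle 11| + |22\rangle\langle 22|) + |02\rangle\langle 02| + |10\rangle\langle 10| + |21\rangle\langle 21| - 3|\psi_+\rangle\langle\psi_+|.$$

This observable is non-negative on separable states: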


2<xy|ZEW|xy)<*  |x)  =  \lx0X\y2  ~  x2x0 y*  -  x^yol2  +  |2 xoXqJo  “  2xiXo;yi  +  “  ^o^l2 

+  |2v0Voy2  “  2viV2yi  +  x2x2y2  ~  x0x2y0\2  +  |2x0X| v0  -  2x2x2y!  +  x2x\y2  -  x\x\yi\2 
+  3\x2x0y\  -  x\x2yl\2  +  3\x\x\y0  -  x2x^y2 12  +  3|x2X2y2  ~  x0x2yo\2  +  3|x2XiJ2  -  xiXjjil2 


>  0. 

The expected value on the original state is $\mathrm{Tr}[\tilde Z_{EW} \rho_\alpha] =
\frac{1}{7}(3 - \alpha)$, demonstrating entanglement for all $\alpha > 3$.
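
The properties of this example can be checked numerically. The sketch below builds $\rho_\alpha$ and the witness in the computational basis of two qutrits, evaluates the expectation value, tests the PPT property, and samples random product states. It assumes that $P$ in Eq. (13) is the operator exchanging the two parties; the helper names, sample size, and random seed are illustrative choices. Sampling product states is only a spot check of non-negativity, not a proof.

```python
import numpy as np

d = 3

def ket(i, j):
    """Basis vector |ij> of two qutrits."""
    v = np.zeros(d * d)
    v[d * i + j] = 1.0
    return v

def proj(v):
    return np.outer(v, v.conj())

def partial_transpose_A(rho):
    return rho.reshape(d, d, d, d).transpose(2, 1, 0, 3).reshape(d * d, d * d)

psi_plus = sum(ket(i, i) for i in range(d)) / np.sqrt(d)
sigma_plus = (proj(ket(0, 1)) + proj(ket(1, 2)) + proj(ket(2, 0))) / 3
# Assumption: P in Eq. (13) exchanges the two parties, P|ij> = |ji>.
swap = sum(np.outer(ket(j, i), ket(i, j)) for i in range(d) for j in range(d))

def rho(alpha):
    return ((2 / 7) * proj(psi_plus) + (alpha / 7) * sigma_plus
            + ((5 - alpha) / 7) * (swap @ sigma_plus @ swap))

Z_ew = (2 * (proj(ket(0, 0)) + proj(ket(1, 1)) + proj(ket(2, 2)))
        + proj(ket(0, 2)) + proj(ket(1, 0)) + proj(ket(2, 1))
        - 3 * proj(psi_plus))

def witness_value(alpha):
    return np.trace(Z_ew @ rho(alpha))

def min_pt_eigenvalue(alpha):
    return np.linalg.eigvalsh(partial_transpose_A(rho(alpha))).min()

rng = np.random.default_rng(0)

def random_product_expectation():
    x = rng.normal(size=d) + 1j * rng.normal(size=d)
    y = rng.normal(size=d) + 1j * rng.normal(size=d)
    v = np.kron(x, y)
    return np.real(v.conj() @ Z_ew @ v) / np.real(v.conj() @ v)

min_on_products = min(random_product_expectation() for _ in range(2000))

for alpha in (2.5, 3.5, 4.5):
    print(alpha, witness_value(alpha), (3 - alpha) / 7, min_pt_eigenvalue(alpha))
print("smallest sampled product-state expectation:", min_on_products)
```

At $\alpha = 3.5$ the partial transpose is positive while the witness value is negative, so the state is PPT yet detected as entangled, which is the case the PPT test alone cannot resolve.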

The reformulation of our separability tests as a search
for SOS decompositions of the forms $E(x, y)$ provides connections
with existing results in real algebra (see [17] for
a discussion of the SDP-based approach in a general setting).
By Artin’s positive solution to Hilbert’s 17th problem,
for any real PSD form $f(x)$ there exists a SOS form
$h(x)$, such that the product $f(x)h(x)$ is SOS [18]. Finding
such an $h(x)$ and SOS decomposition proves that $f$ is
PSD. For a fixed SOS form $h(x, y)$, we may write a SDP
that attempts to find EWs such that $h(x, y)E(x, y)$ is SOS.
In our hierarchy of criteria the form $h$ is restricted to be
$\langle x|x\rangle^{k-1} \langle y|y\rangle^{l-1}$. While it is conceivable that every PSD
bi-Hermitian form is SOS when multiplied by appropriate
factors of this type, currently we do not have a proof. Deciding
whether a form is positive is computationally hard
and so this connection to positive forms also promises to
shed light on the computational complexity of the separability
problem.

In this Letter we introduced a hierarchy of separability
tests that are computationally tractable and strictly stronger
than the PPT criterion. Only the second step in this sequence
of tests was required to detect the entanglement of
a wide class of known PPT entangled states. The method is
based on the application of semidefinite programs. By exploiting
the duality property of these problems, we showed
how to construct entanglement witnesses for states that fail
any separability test in the sequence. Finally, the wide
range of applications of semidefinite programming, along
with the work reported here and in [19], suggests that it
may become a useful tool in quantum information and in
quantum theory in general.

Atomic ensembles have been shown to be a promising candidate for implementations of quantum information
processing by many recently discovered schemes. All these schemes are based on the interaction between
optical beams and atomic ensembles. For description of these interactions, one assumed either a cavity-QED
model or a one-dimensional light propagation model, which is still inadequate for a full prediction and
understanding of most of the current experimental efforts that are actually taken in the three-dimensional free
space. Here, we propose a perturbative theory to describe the three-dimensional effects in interaction between
atomic ensembles and free-space light with a level configuration important for several applications. The
calculations reveal some significant effects that were not known before from the other approaches, such as the
inherent mode-mismatching noise and the optimal mode-matching conditions. The three-dimensional theory
confirms the collective enhancement of the signal-to-noise ratio which is believed to be one of the main
advantages of the ensemble-based quantum information processing schemes; however, it also shows that this
enhancement needs to be understood in a more subtle way with an appropriate mode-matching method.

I. INTRODUCTION

Recently, many interesting schemes have been proposed
that use atomic ensembles with a large number of identical
atoms as the basic system for quantum state engineering and
for quantum information processing. For instance, one can
use atomic ensembles for generation of substantial spin
squeezing [1-3] and continuous variable entanglement
[4-6], for storage of quantum light [7-11], for realization of
scalable long-distance quantum communication [12], and for
efficient preparation of many-party entanglement [13]. The
experimental candidates of atomic ensembles can be either
some cold or ultracold atoms in a trap [2,10], or a cloud of
room-temperature atomic gas contained in a glass cell with
coated walls [3,6,11]. The schemes based on atomic ensembles
have some special advantages compared with the
quantum information schemes based on the control of single
particles: first, laser manipulation of atomic ensembles without
separately addressing the individual atoms is normally
easier than the coherent control of single particles; second,
and more important, atomic ensembles with suitable level
configurations could have some kinds of collectively enhanced
coupling to a certain optical mode (called the signal
light mode) due to the many-atom interference effects.
Thanks to this enhanced coupling, we can obtain collective
enhancement of the signal-to-noise ratio (that is, the ratio
between the controllable coherent interaction and the uncontrollable
noisy interactions) compared with the single-particle
case if we choose to manipulate the appropriate
atomic and optical modes. This collective enhancement plays
an important role in all the recent schemes based on the
atomic ensembles.

To describe the interaction between atomic ensembles and
optical beams, in particular, to understand the collective enhancement
of the signal-to-noise ratio, normally one assumes
a simple cavity-QED model [7,8,12,1] or a one-dimensional
light propagation model [9,4] with an independent spontaneous
emission rate for each atom. In contrast to this, most of
the current experiments are done with free-space atomic ensembles
coupling directly to the three-dimensional optical
beams [3,6,10,11]. It is not obvious that the predictions from
the simple models will always be valid for these real, much
more complicated, experimental situations. To fully predict
and understand the real experiments, one needs to answer
various questions associated with the three-dimensional interaction
effects. For instance, what is the inherent mode
structure of the signal light when we have a definite geometry
of the atomic ensemble? Can we achieve a good mode
matching with the matching efficiency in principle arbitrarily
close to one? What is the noise magnitude associated with
density fluctuation (induced by the random initial distribution
of the atom positions and the random atomic motion) of
the atomic ensemble? In the three-dimensional configuration,
is there still the collective enhancement of the signal-to-noise
ratio?

In this paper, we will try to provide answers to the above
important questions for a Raman-type $\Lambda$-level configuration
that is useful for scalable long-distance quantum communication
[12] and for many-party entanglement generation
[13]. In general, it is very challenging to build a full quantum
theory from the first principles to describe the interaction
between the many-atom ensemble and the infinite-mode optical
field and to give definite answers to the above-listed
questions. In this paper, we will use a perturbative approach
by assuming that the Raman pumping laser is very weak.
There are several motivations to use a perturbative approach:
first, the schemes in Refs. [12,13] for scalable quantum communication
and for many-party entanglement generation can
work in the weak-pumping limit, so the perturbative approach
describes a useful realistic situation. Second, the perturbative
calculations allow us to investigate the three-dimensional
effects and to give definite answers to the
questions listed above from the first principles without any
doubtful approximation. Finally, we expect that the three-dimensional
effects should be at least to some extent independent
of the power of the pumping laser, so the perturbative
calculations could give at least some indications for the
three-dimensional interaction picture in other regions.

The calculations reveal some significant results that are
unexpected from the simple models. For instance, it turns out
that due to the density fluctuation of the atomic ensemble,
there will be two sources of noise for the light-atomic-ensemble
interaction: one is the spontaneous emission loss
and the other is the inherent mode-mismatching noise (here,
by “inherent,” we mean that this mode mismatching is not
from any technical imperfection). Thus, we need to use two
quantities to describe the signal-to-noise ratios, and need to
keep a balance between these two sources of noise by appropriate
mode-matching methods to optimize the setup for applications.
The intuitive mode-matching method will result in
a quite large inherent mode-mismatching noise. It is better to
use some other mode-matching methods that reduce the
mode-mismatching noise at the cost of increasing the spontaneous
emission loss. This can optimize the setup for some applications,
such as the ones in Refs. [12,13], since there the
spontaneous emission loss has a far less important influence.
The calculations in this paper also demonstrate that in the
realistic three-dimensional configuration, one can still obtain
large collective enhancement of the signal-to-noise ratio for
atomic ensembles compared with the single-atom case,
which is an important feature of this kind of system.

It is also helpful to compare the collective enhancement of the signal-to-noise ratio considered here with other collective optical effects, such as super-radiance. The similarity is that both phenomena involve many-atom interference. However, there are also important differences. Super-radiance is a stimulated emission effect, and the enhancement is in the emission speed. At the initial stage, coherence is built up between different atoms from the spontaneous emissions. After the coherence has been built up, super-radiance can be understood even from a classical interference picture. For the light-atomic-ensemble interaction considered here, we are still completely in the spontaneous emission regime, which needs a full quantum description, and the enhancement is not in the emission speed but in the signal-to-noise ratio when we only manipulate and measure a definite atomic mode (called the symmetric collective atomic mode) and a definite signal light mode (the optical mode collinear with the Raman pumping light). The coupling between these atomic and optical modes is coherent for different atoms (that is, for this coupling there is a definite phase relation between different atoms), while the coupling of these two modes to other atomic and other optical modes (called the noise modes) is incoherent, with random phase relations between different atoms (the randomness comes from the random atom positions). We have collective enhancement of the signal-to-noise ratio since the coherent coupling interferes constructively for different atoms. Of course, in three-dimensional free space there are infinitely many optical modes, and the signal light mode changes continuously into other noisy optical modes when we continuously vary the solid angle. Because of this continuous change from the coherent coupling to the incoherent coupling, we have a nontrivial mode-matching problem, and we get some inherent mode-mismatching noise.
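The contrast between coherent and random-phase coupling can be illustrated with a toy calculation. The sketch below is not the model of this paper: it assumes each atom contributes a unit-amplitude phasor, and the function names are hypothetical. With identical phases the summed intensity grows as the square of the atom number, while with independent random phases it grows only linearly on average.

```python
import cmath
import random


def coherent_intensity(n_atoms):
    """|sum of n identical unit phasors|^2: every atom adds with the same phase."""
    total = sum(cmath.exp(0j) for _ in range(n_atoms))
    return abs(total) ** 2


def incoherent_intensity(n_atoms, trials=2000, seed=1):
    """Average of |sum of n unit phasors with independent random phases|^2."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    acc = 0.0
    for _ in range(trials):
        total = sum(
            cmath.exp(1j * rng.uniform(0.0, 2.0 * cmath.pi))
            for _ in range(n_atoms)
        )
        acc += abs(total) ** 2
    return acc / trials
```

In this toy picture the ratio of the two intensities is of the order of the atom number, which is the sense in which the coherent coupling is collectively enhanced relative to the noise.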

We would also like to mention that there are some important early works on the transverse effects of the Raman interaction that use three-dimensional light propagation equations [14]. However, it is not clear to us how to use this approach to describe the collective enhancement of the signal-to-noise ratio and, in particular, how to give definite answers to the questions listed above. There is also some unpublished effort to work out the three-dimensional effects in another light-atom interaction configuration [15], but the interaction picture there is not yet clear to us.

This paper is arranged as follows. In Sec. II, we explain the interaction scheme and the basic ideas of its applications in quantum information processing. With the applications in mind, we can focus our efforts on the most relevant quantities that we need to calculate. We also explain in that section the basic interaction picture between the light and the atomic ensemble that emerges from our calculations. In Sec. III, we describe the theoretical model for this light-atom interaction from first principles, and solve this model by using a perturbative approach. In Sec. IV, we discuss the properties of this solution; in particular, we define and calculate the mode structure of the signal light, the spontaneous emission inefficiency, and the inherent mode-mismatching noise. Section V is devoted to the discussion of the appropriate mode-matching methods for real experiments. We show that by appropriate mode-matching methods, we can keep a balance between the two sources of noise mentioned above in order to optimize this setup for applications. Finally, in Sec. VI we summarize the results.

II.  THE  INTERACTION  SCHEME  AND  ITS 
APPLICATIONS 

The basic element of our system is an atomic ensemble that consists of a cloud of $N_a$ identical atoms with the relevant level structure shown in Fig. 1. The pair of metastable lower states $|g\rangle$ and $|s\rangle$ can correspond, for instance, to hyperfine or Zeeman sublevels of the electronic ground state of alkali atoms. The relevant coherence between the levels $|g\rangle$ and $|s\rangle$ can be maintained for a long time, as has been demonstrated both experimentally [6,10,11] and theoretically [7-9]. All the atoms are initially prepared in the ground state $|g\rangle$ through optical pumping.

The ensemble is then illuminated by a weak pumping laser pulse that couples the transition $|g\rangle \rightarrow |e\rangle$ with a large detuning $\Delta$, and we look at the spontaneous emission light from the transition $|e\rangle \rightarrow |s\rangle$, whose polarization and/or frequency are assumed to be different from those of the pumping laser. There are two interaction configurations. In the first configuration, the pumping laser illuminates all the atoms, so that each atom has an equal small probability to be excited into the state $|s\rangle$ through the Raman transition. In the second configuration, as reported in the experiment [6], the pumping laser is focused to a transverse area smaller than the transverse area of the atomic ensemble, so only part of the atoms are illuminated by the laser at each instant. However, during the light-atom interaction period, all the atoms in the ensemble are moving fast, and they frequently enter and leave the interaction region. As a result, each atom still has an equal small probability to be excited into the state $|s\rangle$.

FIG. 1. The relevant atomic level structure with $|g\rangle$, $|s\rangle$, a pair of ground or metastable states, and $|e\rangle$, the excited state.

As we will show in the next section, after the atomic gas interacts with a weak pumping laser, there will be a special atomic mode $s_s$, called the symmetric collective atomic mode, and a special optical spontaneous emission mode $a_s$ (which couples to the transition $|e\rangle \rightarrow |s\rangle$), called the signal light mode. The symmetric collective atomic mode $s_s$ and the signal light mode $a_s$ are defined, respectively, by

$$ s_s \equiv \frac{1}{\sqrt{N_a}} \sum_{i=1}^{N_a} |g\rangle_i \langle s| , \qquad (1) $$

$$ a_s \equiv \int f_k^{*} a_k \, d^3k , \qquad (2) $$

where $a_k$ represents the plane-wave mode with the wave vector $\mathbf{k}$ (we have used the plane-wave modes as the eigenmodes for the expansion of the optical spontaneous emission field). The operators $a_k$ satisfy the standard commutation relations $[a_k, a_{k'}^\dagger] = \delta(\mathbf{k}-\mathbf{k}')$, and $f_k$ is the normalized signal mode function whose explicit form will be specified in Sec. IV. We just need to mention here that $a_s$ represents the spontaneous emission light that is basically collinear with the pumping laser, with a distribution only over a very small solid angle. The modes $s_s$ and $a_s$ are special because they are dominantly correlated with each other: if an atom is excited to the symmetric collective mode $s_s$, the accompanying spontaneous emission photon will most probably go to the signal light mode $a_s$, and vice versa. There are still many other atomic modes in the ensemble and infinitely many other optical modes in the spontaneous emission field, and these atomic and optical modes are correlated with each other in a complicated way. However, the correlation between the good modes $s_s$ and $a_s$ is quite “pure,” and these two modes are only weakly correlated with the other atomic and optical modes that contribute to noise. The correlation between the atomic mode $s_s$ and other optical modes contributes to the spontaneous emission loss, and the correlation between the signal mode $a_s$ and other atomic modes contributes to the inherent mode-mismatching noise. The interaction picture for this system is schematically shown in Fig. 2.

FIG. 2. The intuitive interaction picture between the atomic modes and the free-space optical modes for a many-atom ensemble, where c1 represents the “good” correlation between the collective atomic mode and the signal light mode, c2 represents the spontaneous emission loss, and c3 represents the inherent mode-mismatching noise. Broader connections stand for stronger correlations between the excitations in the corresponding modes.

The applications of this system for quantum information processing come exactly from the almost pure correlation between the modes $s_s$ and $a_s$. If we neglect the weak correlations between the good modes $s_s$, $a_s$ and the other noisy atomic and optical modes, that is, if we neglect the two sources of noise illustrated in Fig. 2, then all the noisy modes can be traced over, and we effectively get a two-mode problem. The excitations in the modes $s_s$ and $a_s$ can both be measured separately. For the signal mode $a_s$, this is done through a single-photon detector with appropriate mode matching; for the atomic mode $s_s$, this can be done by first mapping the atomic excitation to an excitation in the signal mode through a repumping laser pulse [12], and then detecting it again through a single-photon detector. Using the almost pure correlation between the modes $s_s$ and $a_s$, one can generate some preliminary entanglement between two distant atomic ensembles by linear optics means only, which forms the important first step for applications of this setup to the different kinds of quantum information processing tasks detailed in Refs. [12,13]. Here, we briefly explain the basic ideas of how to entangle atomic ensembles using the correlation between the modes $s_s$ and $a_s$. This explanation helps us to define the most relevant quantities for the applications that we need to calculate.

The setup for entanglement generation between the two distant atomic ensembles L and R is shown in Fig. 3. We apply simultaneously two short Raman pumping pulses on the ensembles L and R, respectively, so that for each ensemble the light scattered to the signal mode $a_s$ has a mean photon number much smaller than 1. The signal modes are then coupled to optical channels (such as fibers) through mode matching after the filters, which are polarization and frequency selective to filter out the pumping light. The signal pulses after the transmission channels interfere at a 50%-50% beam splitter (BS), with the outputs detected respectively by two single-photon detectors D1 and D2. If either D1 or D2 registers one photon, the process is finished. Otherwise, we first apply a repumping laser pulse to the transition $|s\rangle \rightarrow |e\rangle$ on the ensembles L and R to set the atoms back to the ground state $|g\rangle$; then the same Raman driving pulses as in the first round are applied to the transition $|g\rangle \rightarrow |e\rangle$, and we detect again the photon number of the signal modes after the beam splitter. This process is repeated until finally we have a click in either the D1 or the D2 detector.

FIG. 3. The schematic setup for generating entanglement between two distant atomic ensembles L and R.

We show now how the entanglement is generated between the ensembles L and R if we successfully register one photon in the D1 or D2 detector. To understand this, let us first look at one ensemble. Before the Raman pumping pulse, the collective atomic mode $s_s$ and the signal light mode $a_s$ are in vacuum states, which are denoted respectively by $|0_a\rangle$ and $|0_p\rangle$. If we neglect the small noisy correlations, after the weak pumping, the state of the modes $s_s$ and $a_s$ has the following form (see the next section for the detailed derivation):

$$ |\phi\rangle = |0_a\rangle |0_p\rangle + \sqrt{p_c}\, s_s^\dagger a_s^\dagger |0_a\rangle |0_p\rangle + o(p_c) , \qquad (3) $$

where $p_c \ll 1$ is the probability for one atom to be excited to the collective atomic mode $s_s$, and $o(p_c)$ represents the higher-order terms with more excitations, whose probabilities are equal to or smaller than $p_c^2$.

In Fig. 3, the pumping pulses excite both ensembles simultaneously, so the whole system is described by the state $|\phi\rangle_L \otimes |\phi\rangle_R$, where $|\phi\rangle_L$ and $|\phi\rangle_R$ are given by Eq. (3) with all the operators and states distinguished by the subscript L or R, respectively. The two signal light modes are superposed at the beam splitter, and a photodetector click in either D1 or D2 measures, respectively, the operators $a_+^\dagger a_+$ or $a_-^\dagger a_-$, with $a_\pm = (a_{sL} \pm e^{i\varphi} a_{sR})/\sqrt{2}$. Here, $\varphi$ denotes the difference of the phase shifts in the two optical channels. Conditional on the detector click, we should apply the projection operator $a_+$ or $a_-$ onto the whole state $|\phi\rangle_L \otimes |\phi\rangle_R$. If we neglect the higher-order corrections in Eq. (3), the projected state of the ensembles L and R thus has the form

$$ |\Psi_\varphi^\pm\rangle_{LR} = \frac{1}{\sqrt{2}} \left( s_{sL}^\dagger \pm e^{i\varphi} s_{sR}^\dagger \right) |0_a\rangle_L |0_a\rangle_R , \qquad (4) $$

which is maximally entangled in the excitation number basis. The generation of this kind of state forms the basis of further applications in quantum information processing [12,13]. If we take into account the higher-order terms in Eq. (3), the fidelity between the actually generated state and the ideal state (4) decreases by an amount proportional to $p_c$, and this decrease contributes to the final fidelity imperfection of the schemes in Refs. [12,13]. For the applications described by these schemes, we need to fix the fidelity imperfection to be small, which means that we should keep the excitation probability $p_c$ small by controlling the intensity of the Raman pumping laser.
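The projection step leading to Eq. (4) can be checked with a few lines of arithmetic. The following is only a sketch under stated assumptions: it keeps the first-order terms of Eq. (3) for each ensemble, ignores all noise modes, and uses a hypothetical function name. It applies $a_\pm$ to the two single-excitation terms and returns the normalized atomic amplitudes together with the click probability.

```python
import cmath
import math


def heralded_atomic_state(p_c, phi, sign):
    """Project the first-order two-ensemble state on a click in D1 (sign=+1) or D2 (sign=-1).

    Returns ((c_L, c_R), click_probability), where c_L and c_R are the normalized
    amplitudes of an excitation in the collective mode of ensemble L or R.
    """
    amp = math.sqrt(p_c)  # amplitude of the single-excitation term in Eq. (3)
    # a_+/- = (a_sL +/- e^{i phi} a_sR) / sqrt(2) removes the photon and leaves:
    c_left = amp / math.sqrt(2.0)
    c_right = sign * cmath.exp(1j * phi) * amp / math.sqrt(2.0)
    click_probability = abs(c_left) ** 2 + abs(c_right) ** 2
    norm = math.sqrt(click_probability)
    return (c_left / norm, c_right / norm), click_probability
```

Within this lowest-order picture each detector clicks with probability $p_c$, so a click in either detector occurs with probability $2p_c$, and the two heralded states differ only in the sign of the relative phase.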

Now we look at the influence of the two types of noisy correlations illustrated in Fig. 2. The spontaneous emission loss can be quantified by the probability $p_{\mathrm{spon}}$ of the event that the scattered photon goes to some other optical mode instead of the signal mode $a_s$ while the accompanying atom is excited to the collective atomic mode $s_s$. Without the spontaneous emission loss, for each round of Raman pumping, we succeed with a probability $2p_c$ to get a detector click, which prepares the entangled state (4). However, the spontaneous emission loss means that even if an atom is excited to the mode $s_s$ (which, without the loss, has a probability $p_c$ for each ensemble), the accompanying photon, with a probability $p_{\mathrm{spon}}$, cannot be registered. For entanglement generation, the fidelity imperfection relative to the ideal state (4) is given by the excitation probability of the atomic mode $s_s$, which is now $p_c/(1-p_{\mathrm{spon}})$ instead of $p_c$ [we always use $p_c$ to denote the probability of the good event described by Eq. (3)]. To keep the fidelity imperfection small in the presence of the spontaneous emission loss, we need to further decrease the excitation probability $p_c$ by a factor of $1-p_{\mathrm{spon}}$, which means that we have a smaller probability to register one signal photon. So, as a result of this noise, the preparation efficiency (the success probability of the scheme) is decreased by a factor of $1-p_{\mathrm{spon}}$ when we fix the fidelity imperfection.
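The trade-off just described amounts to simple arithmetic, sketched below. The function and parameter names are hypothetical, and the sketch assumes that the fidelity imperfection is held fixed by fixing the atomic excitation probability $p_c/(1-p_{\mathrm{spon}})$ at a chosen target value, with the success probability taken to lowest order as $2p_c$.

```python
def excitation_budget(target_excitation, p_spon):
    """Return (p_c, success_probability) for a fixed atomic excitation probability.

    target_excitation: allowed excitation probability of the collective atomic mode
    p_spon: probability that the accompanying photon misses the signal mode
    """
    if not 0.0 <= p_spon < 1.0:
        raise ValueError("p_spon must lie in [0, 1)")
    p_c = target_excitation * (1.0 - p_spon)  # so that p_c / (1 - p_spon) equals the target
    success_probability = 2.0 * p_c           # click in either detector, lowest order in p_c
    return p_c, success_probability
```

For example, with a target of 0.01 and $p_{\mathrm{spon}} = 0.2$, the good-event probability drops to 0.008 and the success probability per round drops from 0.02 to 0.016, that is, by the factor $1-p_{\mathrm{spon}}$.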

The inherent mode-mismatching noise can be quantified by the probability $p_{\mathrm{mode}}$ of the event that an atom is excited to some other atomic mode instead of the collective mode $s_s$ while the accompanying photon goes to the right signal mode $a_s$. In this case, we can register a photon with the single-photon detectors, but the atomic mode $s_s$ will actually still be in the initial vacuum state $|0_a\rangle$. So, this noise adds some vacuum component to the generated state between the ensembles L and R. In the presence of this noise, the generated state between the ensembles L and R is mixed, with the form

$$ \rho_{LR}(c_0,\varphi) = \frac{1}{c_0+1} \left( c_0 |0_a 0_a\rangle_{LR}\langle 0_a 0_a| + |\Psi_\varphi^\pm\rangle_{LR}\langle \Psi_\varphi^\pm| \right) , \qquad (5) $$

where the vacuum coefficient $c_0$ is basically given by the conditional probability $p_{\mathrm{mode}}$ for this noise contribution. Actually, as has been shown in Ref. [12], there are other sources of noise that can contribute to the vacuum coefficient $c_0$, such as the detector dark counts and the detector inefficiency in the succeeding entanglement connection scheme detailed in Ref. [12]. It has also been shown there that the vacuum component noise will finally be automatically purified, and thus has no influence on the final communication fidelity. However, it has a more significant influence on the efficiency than the spontaneous emission loss. For the applications in Refs. [12,13], to get a better overall efficiency, it is better to keep a balance between the mode-mismatching noise $p_{\mathrm{mode}}$ and the spontaneous emission inefficiency $p_{\mathrm{spon}}$, with $p_{\mathrm{mode}}$ significantly smaller than $p_{\mathrm{spon}}$.

It is helpful to make a comparison here with the single-atom case. For a single atom interacting with the free-space light with the same level configuration as shown in Fig. 1, there is only one atomic mode, given by $s = |g\rangle\langle s|$, but there are still infinitely many optical modes. We can still identify the forward-scattered optical mode $a_s$ as the signal mode. The interaction picture is then shown intuitively in Fig. 4. For a single atom, there is no inherent mode-mismatching noise, but the spontaneous emission inefficiency becomes much larger. If the atom is excited to the level $|s\rangle$ (that is, to the mode $s$) through the Raman laser pumping, the accompanying photon can go in all possible directions [16], and thus has only a very small probability to go to the signal mode $a_s$. We will calculate in the following sections the spontaneous emission inefficiencies for the single-atom case as well as for the atomic ensemble case, and compare them for the different mode-matching methods to see whether there is collective enhancement of the signal-to-noise ratio for the many-atom ensemble.

FIG. 4. The intuitive interaction picture for the single-atom case, with the corresponding symbols having the same meaning as in Fig. 2. There is only one atomic mode, but the spontaneous emission loss represented by c2 becomes much larger compared with the good correlation c1.

THREE-DIMENSIONAL THEORY FOR INTERACTION . . .  PHYSICAL REVIEW A 66, 023818 (2002)

III.  THE  THEORETICAL  MODEL  FOR  THE
LIGHT-ATOMIC-ENSEMBLE  INTERACTION  AND  ITS
SOLUTION

We now turn to the detailed description of the light-atomic-ensemble interaction. From this description, we will calculate the signal-light mode structure $f_k$, the spontaneous emission inefficiency $p_{\mathrm{spon}}$, and the inherent mode-mismatching noise $p_{\mathrm{mode}}$. In the interaction configuration shown in Fig. 1, the Raman pumping laser can be described classically by neglecting its small quantum variance, and is assumed to propagate basically along the $z$ direction. So, we can write its amplitude as $\mathcal{E}_{ge}(\mathbf{r},t) = u(\mathbf{r},t) e^{i(k_0 z - \omega_0 t)}$, where $\omega_0 = k_0 c = 2\pi c/\lambda_0$ is the carrier frequency and $u(\mathbf{r},t)$ is a slowly varying function of the coordinate $\mathbf{r}$ and the time $t$. The field $\mathcal{E}_{se}$ coupling to the transition $|e\rangle \rightarrow |s\rangle$ should be described quantum mechanically, and we expand it into plane-wave modes, $\mathcal{E}_{se}(\mathbf{r},t) \propto \int a_k e^{i(\mathbf{k}\cdot\mathbf{r} - \omega_k t)} d^3k$, with $\omega_k = |\mathbf{k}| c$ and $a_k$ the corresponding annihilation operator. Then, the Hamiltonian describing the light-atom interaction has the following form in the interaction picture (setting $\hbar = 1$):

$$ H(t) = \Delta \sum_{i=1}^{N_a} \sigma_{ee}^i + \left\{ \sum_{i=1}^{N_a} \left[ g_{eg} \sigma_{eg}^i u(\mathbf{r}_i,t) e^{i k_0 z_i} + \int \sigma_{se}^i g_{ke} a_k^\dagger e^{-i[\mathbf{k}\cdot\mathbf{r}_i - (\omega_k - \omega_0 + \omega_{sg}) t]} d^3k \right] + \mathrm{H.c.} \right\} , \qquad (6) $$


where the detuning $\Delta = \omega_{eg} - \omega_0$, with $\omega_{eg} = \omega_e - \omega_g$ the frequency difference between the atomic levels; $\sigma_{\mu\nu}^i = |\mu\rangle_i\langle\nu|$ ($\mu,\nu = g, e, s$) are the transition operators of the $i$th atom; $\mathbf{r}_i$ is the coordinate of the $i$th atom; and $g_{eg}$, $g_{ke}$ are the coupling coefficients, which are proportional to the dipole moments of the corresponding transitions. The coefficient $g_{ke}$ depends in general on the direction of the wave vector $\mathbf{k}$ through the dipole pattern. In Eq. (6), the carrier frequency of the spontaneous emission field $\mathcal{E}_{se}(\mathbf{r},t)$ is given by $\omega_0 - \omega_{sg}$, and its relevant frequency width can be estimated by the natural width $\Gamma$ of the excited level $|e\rangle$, which means that the modes $a_k$ whose frequency $\omega_k$ differs from $\omega_0 - \omega_{sg}$ by much more than the width $\Gamma$ will have negligible influence on the system dynamics. We neglect in Eq. (6) the spontaneous emission back to the level $|g\rangle$, since it is not important for our purpose (with the detection method specified in Sec. II) and has no influence on any of our results.

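If both the natural width $\Gamma$ and the frequency spread of the pumping laser $u(\mathbf{r},t)$ are significantly smaller than the detuning $\Delta$, we can adiabatically eliminate the upper level $|e\rangle$ through the standard technique, and the resulting adiabatic Hamiltonian is given by

$$ H = -\left\{ \sum_{i=1}^{N_a} \sigma_{sg}^i u(\mathbf{r}_i,t) \int \frac{g_{eg} g_{ke}}{\Delta} a_k^\dagger e^{i(\Delta\omega_k t - \Delta\mathbf{k}\cdot\mathbf{r}_i)} d^3k + \mathrm{H.c.} \right\} - \frac{|g_{eg}|^2}{\Delta} \sum_{i=1}^{N_a} \sigma_{gg}^i |u(\mathbf{r}_i,t)|^2 , \qquad (7) $$

where $\Delta\omega_k = \omega_k - (\omega_0 - \omega_{sg})$ and $\Delta\mathbf{k} = \mathbf{k} - k_0 \mathbf{z}_0$, with $\mathbf{z}_0$ the unit vector in the $z$ direction. We have neglected in Eq. (7) the Stark shift of the level $|s\rangle$, since it is much smaller than the other terms (the spontaneous emission field is much weaker than the pumping field). The last term in Eq. (7) [the Stark shift of the level $|g\rangle$] can be eliminated if we make a phase rotation of the basis $\{|g\rangle, |s\rangle\}$, which transforms $\sigma_{sg}^i$ to $\sigma_{sg}^i e^{i(|g_{eg}|^2/\Delta)\int_0^t |u(\mathbf{r}_i,\tau)|^2 d\tau}$. Thus, we can simply drop the last term and replace $u(\mathbf{r}_i,t)$ by $u'(\mathbf{r}_i,t) = u(\mathbf{r}_i,t) e^{i(|g_{eg}|^2/\Delta)\int_0^t |u(\mathbf{r}_i,\tau)|^2 d\tau}$. In the following, for simplicity of notation, we will still denote $u'(\mathbf{r}_i,t)$ by $u(\mathbf{r}_i,t)$, and denote $-g_{eg} g_{ke}/\Delta$ by a single coefficient $g_k$.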
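The adiabatic elimination can be illustrated numerically on a much simpler system than the one treated here. The sketch below is an assumption-laden toy, not the multimode Hamiltonian (6): a single three-level atom with equal, constant, real couplings `omega` on both legs and detuning `delta`, integrated with a hand-written fourth-order Runge-Kutta step. For this toy the effective two-level coupling is `omega**2 / delta`, so the population should move from $|g\rangle$ to $|s\rangle$ after a time `pi * delta / (2 * omega**2)` while the excited level stays almost empty.

```python
import math


def raman_transfer(omega=1.0, delta=50.0, dt=0.005):
    """Integrate i dc/dt = H c for H = delta|e><e| + omega(|e><g| + |e><s| + H.c.).

    Returns (population in s, population in e) at the effective pi-pulse time.
    """
    t_pi = math.pi * delta / (2.0 * omega ** 2)  # effective pi-pulse duration
    steps = int(round(t_pi / dt))
    dt = t_pi / steps                            # adjust the step to land exactly on t_pi

    def deriv(c):
        g, e, s = c
        return (-1j * omega * e,
                -1j * (delta * e + omega * (g + s)),
                -1j * omega * e)

    c = (1.0 + 0j, 0j, 0j)                       # start in the ground state g
    for _ in range(steps):
        k1 = deriv(c)
        k2 = deriv(tuple(ci + 0.5 * dt * ki for ci, ki in zip(c, k1)))
        k3 = deriv(tuple(ci + 0.5 * dt * ki for ci, ki in zip(c, k2)))
        k4 = deriv(tuple(ci + dt * ki for ci, ki in zip(c, k3)))
        c = tuple(ci + dt * (a + 2.0 * b + 2.0 * d + f) / 6.0
                  for ci, a, b, d, f in zip(c, k1, k2, k3, k4))
    return abs(c[2]) ** 2, abs(c[1]) ** 2
```

With the default values the ratio `omega / delta` is 0.02, which is the regime in which the eliminated level is expected to remain nearly unpopulated.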

At the beginning, all the atoms are in the ground state $|g\rangle$, and all the optical modes $a_k$ are in the vacuum state. We denote this initial state of all the atomic and optical modes by $|\mathrm{vac}\rangle$. Then, if the Raman pumping laser $u(\mathbf{r},t)$ is short and weak enough, we can expand the final state $|\Psi_f\rangle$ in a perturbative series. To the second order of the perturbation, the final state has the form

$$ |\Psi_f\rangle = \left\{ 1 - i\int_0^{t_0} H(\tau)\, d\tau - \frac{1}{2} T\left[ \int_0^{t_0}\int_0^{t_0} H(\tau_1) H(\tau_2)\, d\tau_1 d\tau_2 \right] \right\} |\mathrm{vac}\rangle , \qquad (8) $$

where $t_0$ is the duration of the Raman pulse, and $T[\cdots]$ denotes the time-ordered product.

At this point, we should note that for an atomic vapor, the atom positions $\mathbf{r}_i$ are randomly distributed, and during the light-atom interaction, the atoms are moving fast at room temperature [17]. Therefore, we should treat $\mathbf{r}_i$ in the Hamiltonian (7) as stochastic variables. We make the following two assumptions about the properties of these variables: (i) for different atoms $i$ and $j$, $\mathbf{r}_i$ and $\mathbf{r}_j$ are not correlated with each other; (ii) the different stochastic variables $\mathbf{r}_i$ obey the same probability distribution, which is determined by the geometry of the atomic cell. Note that these two assumptions are well satisfied by a room-temperature atomic gas. If the atom positions $\mathbf{r}_i$ behave as classical random variables, we should take an average of the final state $|\Psi_f\rangle$ over the joint probability distribution of these variables, and the resulting state becomes mixed, with

$$ \rho_f = \left\langle |\Psi_f\rangle\langle\Psi_f| \right\rangle_{\{\mathbf{r}_i\}} , \qquad (9) $$

where the symbol $\langle\cdots\rangle_{\{\mathbf{r}_i\}}$ denotes the average over all the variables $\mathbf{r}_i$.

To take the average in Eq. (9), first we note that the Hamiltonian (7) can be written as

$$ H(t) = \sum_{i=1}^{N_a} H_i(\mathbf{r}_i,t) , \qquad (10) $$

with

$$ H_i(\mathbf{r}_i,t) = \sigma_{sg}^i \int g_k a_k^\dagger e^{i\Delta\omega_k t} \left[ u(\mathbf{r}_i,t) e^{-i\Delta\mathbf{k}\cdot\mathbf{r}_i} \right] d^3k + \mathrm{H.c.} \qquad (11) $$

Only the term in the large bracket of the Hamiltonian $H_i$ depends on the variable $\mathbf{r}_i$. By the property (i) of the variables $\mathbf{r}_i$, we have

$$ \langle H_i H_j \rangle_{\{\mathbf{r}_i\}} = \langle H_i\rangle_{\{\mathbf{r}_i\}} \langle H_j\rangle_{\{\mathbf{r}_j\}} + \delta_{ij}\left( \langle H_i^2\rangle_{\{\mathbf{r}_i\}} - \langle H_i\rangle_{\{\mathbf{r}_i\}}^2 \right) . \qquad (12) $$

By the property (ii) of the variables $\mathbf{r}_i$, we know that $\langle u(\mathbf{r}_i,t) e^{-i\Delta\mathbf{k}\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}}$ becomes independent of the atom index $i$, so the average of the Hamiltonian $H(t)$ has the simple form

$$ \langle H(t)\rangle_{\{\mathbf{r}_i\}} = \sqrt{N_a}\, s_s^\dagger \int g_k a_k^\dagger e^{i\Delta\omega_k t} \langle u(\mathbf{r}_i,t) e^{-i\Delta\mathbf{k}\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}} d^3k + \mathrm{H.c.} , \qquad (13) $$

where $s_s$ is the symmetric collective atomic operator defined in Eq. (1). We will write the initial vacuum state $|\mathrm{vac}\rangle$ as a tensor product of the atomic part $|\mathrm{vac}\rangle_a$ and the optical part $|\mathrm{vac}\rangle_p$, and denote the atomic state $\sigma_{sg}^i|\mathrm{vac}\rangle_a$ simply by $|s\rangle_i$. With this notation, a combination of Eqs. (8)-(13) yields the following form for the averaged state $\rho_f$:


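$$ \rho_f = (1 - p_2 - p_c) |\Psi_{\mathrm{eff}}\rangle\langle\Psi_{\mathrm{eff}}| + p_2 \rho_n + o(p_2) , \qquad (14) $$

where the effective pure state $|\Psi_{\mathrm{eff}}\rangle$ (not normalized) is

$$ |\Psi_{\mathrm{eff}}\rangle = \left( 1 - i\int_0^{t_0} \langle H(\tau)\rangle_{\{\mathbf{r}_i\}} d\tau \right) |\mathrm{vac}\rangle , \qquad (15) $$

with $\langle\Psi_{\mathrm{eff}}|\Psi_{\mathrm{eff}}\rangle = 1 + p_c$, and the noise component $p_2\rho_n$ is given by

$$ p_2\rho_n = \sum_{i=1}^{N_a} |s\rangle_i\langle s| \otimes \int_0^{t_0}\int_0^{t_0} d\tau_1 d\tau_2 \int d^3k \int d^3k'\, g_k g_{k'}^{*} e^{i(\Delta\omega_k\tau_1 - \Delta\omega_{k'}\tau_2)} a_k^\dagger |\mathrm{vac}\rangle_p\langle\mathrm{vac}| a_{k'} $$
$$ \times \left\{ \langle u(\mathbf{r}_i,\tau_1) u^{*}(\mathbf{r}_i,\tau_2) e^{-i(\mathbf{k}-\mathbf{k}')\cdot\mathbf{r}_i} \rangle_{\{\mathbf{r}_i\}} - \langle u(\mathbf{r}_i,\tau_1) e^{-i\Delta\mathbf{k}\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}} \langle u^{*}(\mathbf{r}_i,\tau_2) e^{i\Delta\mathbf{k}'\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}} \right\} , \qquad (16) $$

with the value of $p_2$ determined by the normalization of $\rho_n$, i.e., by $\mathrm{tr}(\rho_n) = 1$. In Eq. (14), $o(p_2)$ represents the higher-order terms compared with the magnitude of $p_2$ and $p_c$, which will be neglected in the following. We also ignored the second-order cross terms proportional to $\sigma_{sg}^i a_k^\dagger |\mathrm{vac}\rangle\langle\mathrm{vac}|$ or $|\mathrm{vac}\rangle\langle\mathrm{vac}| \sigma_{gs}^i a_k$ in writing Eq. (14), since they make no contribution to the quantities we will calculate below. The solution (14) serves as the starting point for the discussion of various properties of this system in the following section.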


IV.  PROPERTIES  OF  THE  LIGHT-ATOMIC-ENSEMBLE 
INTERACTION 

Now, let us analyze the properties of the light-atomic-ensemble interaction revealed by the solution (14). First, we look at the effective pure state $|\Psi_{\mathrm{eff}}\rangle$ by neglecting the noise component $p_2\rho_n$. The state $|\Psi_{\mathrm{eff}}\rangle$ has exactly the same form as the ideal state (3) that is the starting point for all the applications. To see this clearly, we can write $|\Psi_{\mathrm{eff}}\rangle$ as $|\Psi_{\mathrm{eff}}\rangle = (1 + \sqrt{p_c}\, s_s^\dagger a_s^\dagger)|\mathrm{vac}\rangle$, by defining the signal mode

$$ a_s^\dagger = \int f_k a_k^\dagger d^3k = -i\sqrt{\frac{N_a}{p_c}} \int d^3k \int_0^{t_0} d\tau\, g_k a_k^\dagger e^{i\Delta\omega_k\tau} \langle u(\mathbf{r}_i,\tau) e^{-i\Delta\mathbf{k}\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}} , \qquad (17) $$

where the value of $p_c$ is determined by the normalization of the operator $a_s^\dagger$, i.e., by the condition $[a_s, a_s^\dagger] = 1$.

We are particularly interested in the spatial structure of the signal mode $a_s^\dagger$, since one needs to know this spatial structure to design the mode matching in practical experiments. The integration $\int d^3k$ can be expressed as $\int d^3k = \int_{\omega_0-\omega_{sg}-\delta/2}^{\omega_0-\omega_{sg}+\delta/2} d\omega \int_{4\pi} d\Omega$, where $d\Omega = \sin\theta\, d\theta\, d\varphi$ represents the infinitesimal solid angle and $\delta$ is the bandwidth of the spontaneous emission field, which is of the order of the natural width $\Gamma$ of the excited level $|e\rangle$. The spatial structure of the mode $a_s^\dagger$ is determined by the superposition coefficient $f_k = f(\omega,\Omega)$ as a function of the solid angle $\Omega$. Typically, the
size $L$ of the atomic ensemble (a centimeter long or less) satisfies the conditions $\Gamma L/c \ll 1$ and $\omega_{sg} L/c \ll 1$ ($\omega_{sg}$ is either around a GHz or zero, depending on whether one uses a hyperfine level or a Zeeman sublevel for the $|s\rangle$ state). In this case, $\Delta\mathbf{k}\cdot\mathbf{r}_i = (\mathbf{k} - k_0\mathbf{z}_0)\cdot\mathbf{r}_i \approx k_0(\mathbf{k}/|\mathbf{k}| - \mathbf{z}_0)\cdot\mathbf{r}_i$, which only depends on the direction of $\mathbf{k}$ and becomes independent of the frequency $\omega$. We assume that the Raman pumping field can be factorized as $u(\mathbf{r},t) = u_\perp(\mathbf{r}) u_\parallel(z,t)$, where $u_\parallel(z,t)$ is a slowly changing function of the coordinate $z$ and can be well approximated as a constant along the size of the atomic ensemble. With this condition, the superposition function $f(\omega,\Omega)$ factorizes into a product of a frequency part and the spatial part $f_\Omega(\Omega)$, and the spatial part $f_\Omega(\Omega)$ becomes independent of the emission frequency $\omega$ and the interaction time $t_0$. To have a more explicit expression for the spatial structure $f_\Omega(\Omega)$ of the signal mode $a_s^\dagger$, we assume that the atoms are distributed according to the normalized distribution function $\rho_{\mathrm{dis}}(\mathbf{r})$ [with $\int \rho_{\mathrm{dis}}(\mathbf{r}) d^3r = 1$], which is determined by the geometry of the ensemble. As we mentioned before, the interaction coefficient $g_k$ normally also depends on the direction of the wave vector $\mathbf{k}$ in the form of a dipole pattern, but it varies very slowly with the angle $\Omega$ compared with the other contributions to $f_\Omega(\Omega)$, so we drop it when considering the spatial structure of the signal mode. Under this circumstance, $f_\Omega(\Omega)$ is simply expressed as


$$ f_\Omega(\Omega) = \int d^3r\, u_\perp(\mathbf{r}) \rho_{\mathrm{dis}}(\mathbf{r}) e^{i k_0 z(1-\cos\theta) - i k_0 \sin\theta (x\cos\varphi + y\sin\varphi)} . \qquad (18) $$

To further simplify this expression, we assume a Gaussian pump beam with $u_\perp(\mathbf{r}) = e^{-(x^2+y^2)/r_0^2}$, and a Gaussian form for the transverse part of the atomic distribution function, with [18]

$$ \rho_{\mathrm{dis}}(\mathbf{r}) = \frac{1}{\pi L R_0^2} e^{-(x^2+y^2)/R_0^2} \quad (-L/2 \le z \le L/2), $$
$$ \rho_{\mathrm{dis}}(\mathbf{r}) = 0 \quad (z < -L/2 \ \mathrm{or}\ z > L/2), \qquad (19) $$

where $r_0$ and $R_0$ characterize the radius of the pump beam and the radius of the atomic ensemble, respectively, and $L$ is the length of the ensemble. In this case, we have an analytic expression for the signal mode function,

$$ f_\Omega(\Omega) = \frac{r_0^2}{r_0^2+R_0^2}\, e^{-(1/4) k_0^2 r_0^2 R_0^2 \sin^2\theta/(r_0^2+R_0^2)}\, \mathrm{sinc}\left[ \frac{k_0 L}{2}(1-\cos\theta) \right] , \qquad (20) $$

where the function sinc is defined as $\mathrm{sinc}(x) = \sin(x)/x$. From this expression, we see that the signal photon mainly goes in the forward direction. The signal mode is inside a small cone around $\theta = 0$, with $\Delta\theta$ characterized by $\min[\sqrt{r_0^2+R_0^2}/(k_0 r_0 R_0),\, 1/\sqrt{k_0 L}]$.
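The closed form for the signal mode function can be cross-checked against a direct numerical evaluation of the defining integral with the Gaussian pump and the atomic distribution (19). The sketch below is illustrative only: the function names are hypothetical, the units are arbitrary (lengths measured in units of $1/k_0$), the azimuth is fixed at $\varphi = 0$ because the result does not depend on it, and the integral is done by a plain midpoint rule.

```python
import cmath
import math


def sinc(x):
    """sinc(x) = sin(x)/x with the limiting value 1 at x = 0."""
    return 1.0 if x == 0.0 else math.sin(x) / x


def f_omega_analytic(theta, k0, r0, R0, L):
    """Closed-form signal mode function for Gaussian pump and ensemble profiles."""
    s2 = r0 ** 2 + R0 ** 2
    gauss = math.exp(-0.25 * (k0 * r0 * R0 * math.sin(theta)) ** 2 / s2)
    return (r0 ** 2 / s2) * gauss * sinc(0.5 * k0 * L * (1.0 - math.cos(theta)))


def f_omega_numeric(theta, k0, r0, R0, L, n=4000):
    """Midpoint-rule evaluation of the defining integral at azimuth phi = 0."""
    a = 1.0 / r0 ** 2 + 1.0 / R0 ** 2      # combined Gaussian exponent of pump and ensemble
    half = 10.0 / math.sqrt(a)             # transverse cutoff, far in the Gaussian tail
    h = 2.0 * half / n
    q = k0 * math.sin(theta)               # transverse wave-vector mismatch
    xs = [-half + (j + 0.5) * h for j in range(n)]
    int_x = sum(cmath.exp(-a * x * x - 1j * q * x) for x in xs) * h
    int_y = sum(math.exp(-a * y * y) for y in xs) * h
    hz = L / n
    kz = k0 * (1.0 - math.cos(theta))      # longitudinal wave-vector mismatch
    int_z = sum(cmath.exp(1j * kz * (-0.5 * L + (j + 0.5) * hz)) for j in range(n)) * hz
    return int_x * int_y * int_z / (math.pi * L * R0 ** 2)


def cone_half_angle(k0, r0, R0, L):
    """Characteristic angular width of the signal mode."""
    return min(math.sqrt(r0 ** 2 + R0 ** 2) / (k0 * r0 * R0), 1.0 / math.sqrt(k0 * L))
```

The first entry of the minimum comes from the transverse Gaussian factor and the second from the first zero region of the sinc factor, so whichever of the two geometric constraints is tighter sets the width of the cone.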

We have shown that the first component of the solution (14) exactly contributes to the ideal coherent process, and have specified the spatial structure of the signal mode. Now we analyze the contributions of the noise described by the second component of Eq. (14). From Eq. (16), we see that the noise component $p_2\rho_n$ is expressed as a tensor product of the atomic density operator $\rho_{an} = \sum_{i=1}^{N_a} |s\rangle_i\langle s|$ and the remaining optical density operator $\rho_{pn}$. Besides the symmetric collective atomic mode $s_s$ and the signal optical mode $a_s$, there are many other noise modes contributing to the atomic density operator $\rho_{an}$ and the optical density operator $\rho_{pn}$. These noise modes correlate with each other in a complicated way. However, these correlations do not contribute to the noise of the relevant dynamics if the two signal modes are not involved in the correlations, since the reduced density operator of the modes $s_s$ and $a_s$, which describes the relevant dynamics, is not influenced by these correlations after we take the trace over all the noise modes. Nevertheless, the correlations between the good modes and the noisy modes do influence the relevant dynamics, and these contribute to the two sources of noise we have mentioned before: the spontaneous emission loss and the inherent mode-mismatching noise. These two sources of noise are described quantitatively by the conditional probabilities $p_{\mathrm{spon}}$ and $p_{\mathrm{mode}}$, respectively. The probability $p_{\mathrm{spon}}$ is the probability that an atom is excited to the right mode $s_s$, but the accompanying photon does not go to the signal mode $a_s$. From the solution (14) for the whole density operator $\rho_f$, this probability can be expressed as

$$ p_{\mathrm{spon}} = 1 - \frac{\mathrm{tr}\left( \langle 0_a 0_p| a_s s_s \rho_f s_s^\dagger a_s^\dagger |0_a 0_p\rangle \right)}{\mathrm{tr}\left( \langle 0_a| s_s \rho_f s_s^\dagger |0_a\rangle \right)} , \qquad (21) $$

where $\mathrm{tr}(\cdots)$ represents the trace over all the remaining atomic and optical modes involved in the operator $\rho_f$. Similarly, $p_{\mathrm{mode}}$ represents the probability that a photon is emitted to the signal mode $a_s$, but the accompanying atomic excitation is not in the right mode $s_s$, and this probability can be expressed from the density operator $\rho_f$ as

$$ p_{\mathrm{mode}} = 1 - \frac{\mathrm{tr}\left( \langle 0_a 0_p| a_s s_s \rho_f s_s^\dagger a_s^\dagger |0_a 0_p\rangle \right)}{\mathrm{tr}\left( \langle 0_p| a_s \rho_f a_s^\dagger |0_p\rangle \right)} . \qquad (22) $$


In  the  following,  we  need  to  calculate  these  two  probabilities 
to  quantify  the  noise  magnitudes. 

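From the solution (14), we can derive the probability $\mathrm{tr}(\langle 0_a| s_s \rho_f s_s^\dagger |0_a\rangle)$ for one atomic excitation in the mode $s_s$, the probability $\mathrm{tr}(\langle 0_p| a_s \rho_f a_s^\dagger |0_p\rangle)$ for one photon in the mode $a_s$, and the joint probability $\mathrm{tr}(\langle 0_a 0_p| a_s s_s \rho_f s_s^\dagger a_s^\dagger |0_a 0_p\rangle)$ for one atom in the mode $s_s$ and one photon in the mode $a_s$. They are respectively given by

$$ \mathrm{tr}\!\left(\langle 0_a|\, s_s\, \rho_f\, s_s^\dagger\, |0_a\rangle\right) = p_c + p_2/N_a, \tag{23} $$

$$ \mathrm{tr}\!\left(\langle 0_p|\, a_s\, \rho_f\, a_s^\dagger\, |0_p\rangle\right) = p_c + \chi p_c, \tag{24} $$

$$ \mathrm{tr}\!\left(\langle 0_a 0_p|\, a_s s_s\, \rho_f\, s_s^\dagger a_s^\dagger\, |0_a 0_p\rangle\right) = p_c + \chi p_c/N_a, \tag{25} $$

where $p_2$ and $p_c$ are obtained, respectively, from the normalization of $\rho_n$ and $a_s^\dagger$ [see Eq. (16) and Eq. (17)], and the ratio $\chi$ is defined as

$$ \chi = N_a\, \mathrm{tr}\!\left(\langle 0_p|\, a_s\, \rho_{pn}\, a_s^\dagger\, |0_p\rangle\right)/p_c, \tag{26} $$

with $\rho_{pn}$ the optical part of the noise component $p_2\rho_n$, as we have specified before.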

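From the above equations, we see that we only need to calculate the two ratios $p_c/p_2$ and $\chi$ to determine the noise probabilities $p_{\text{spon}}$ and $p_{\text{mode}}$. As we have mentioned before, the shape $u(\mathbf{r},t)$ of the pump beam is typically decomposed as $u(\mathbf{r},t) = u_\perp(\mathbf{r})\,u_\parallel(z,t)$ with $u_\parallel(z,t)$ approximately independent of $z$ along the size of the atomic ensemble. In this case, the two ratios $p_c/p_2$ and $\chi$ have much simplified expressions, which become independent of the interaction details, such as the interaction time or the bandwidth of the coupling field, and depend only on some spatial integrations determined by the geometry of the atomic ensemble and the pump beam. For simplicity, we also neglect the slow variation of the coupling coefficient $g_{\mathbf{k}}$ with the direction of $\mathbf{k}$. Under these conditions, the ratios $p_c/p_2$ and $\chi$ are given respectively by

$$ \frac{p_c}{p_2} = \frac{\displaystyle\int \frac{d\Omega}{4\pi} \left|\langle u_\perp(\mathbf{r}_i)\, e^{-i\Delta\mathbf{k}\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}}\right|^2}{\displaystyle\int \frac{d\Omega}{4\pi} \left\{\langle |u_\perp(\mathbf{r}_i)|^2\rangle_{\{\mathbf{r}_i\}} - \left|\langle u_\perp(\mathbf{r}_i)\, e^{-i\Delta\mathbf{k}\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}}\right|^2\right\}}, \tag{27} $$

$$ \chi = \left[\int \frac{d\Omega}{4\pi} \left|\langle u_\perp(\mathbf{r}_i)\, e^{-i\Delta\mathbf{k}\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}}\right|^2\right]^{-2} \int \frac{d\Omega}{4\pi} \int \frac{d\Omega'}{4\pi} \left[\langle |u_\perp(\mathbf{r}_i)|^2 e^{-i(\mathbf{k}-\mathbf{k}')\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}}\, \langle u_\perp^*(\mathbf{r}_i)\, e^{i\Delta\mathbf{k}\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}}\, \langle u_\perp(\mathbf{r}_i)\, e^{-i\Delta\mathbf{k}'\cdot\mathbf{r}_i}\rangle_{\{\mathbf{r}_i\}}\right] - 1. \tag{28} $$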

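Similar to the case of calculating the signal mode structure, we also assume a Gaussian pump beam and a Gaussian form for the transverse atomic distribution function, as shown by Eq. (19). In this case, we have the following analytic expressions for these two ratios:

$$ \frac{p_c}{p_2} = \frac{r_0^2(r_0^2+2R_0^2)}{2(r_0^2+R_0^2)^2} \int_0^\pi \sin\theta\, d\theta \left\{ e^{-(1/2)\,k_0^2 r_0^2 R_0^2 \sin^2\theta/(r_0^2+R_0^2)}\, \mathrm{sinc}^2\!\left[k_0 L \sin^2\frac{\theta}{2}\right] \right\}, \tag{29} $$

$$ \chi = \frac{(r_0^2+R_0^2)^2}{r_0^2(r_0^2+2R_0^2)} \left\{ \int_0^\pi \sin\theta\, d\theta\, e^{-(1/2)\,k_0^2 r_0^2 R_0^2 \sin^2\theta/(r_0^2+R_0^2)}\, \mathrm{sinc}^2\!\left[k_0 L \sin^2\frac{\theta}{2}\right] \right\}^{-2} \int_0^\pi\!\!\int_0^\pi \sin\theta\, \sin\theta'\, d\theta\, d\theta' \left\{ \left[ \frac{1}{2\pi}\int_0^{2\pi} d\varphi\, e^{-(1/4)\,k_0^2 r_0^2 R_0^2 (\sin^2\theta + \sin^2\theta' - 2\sin\theta\sin\theta'\cos\varphi)/(r_0^2+2R_0^2)} \right] e^{-(1/4)\,k_0^2 r_0^2 R_0^2 (\sin^2\theta + \sin^2\theta')/(r_0^2+R_0^2)}\, \mathrm{sinc}\!\left[k_0 L\left(\sin^2\frac{\theta}{2} - \sin^2\frac{\theta'}{2}\right)\right] \mathrm{sinc}\!\left[k_0 L \sin^2\frac{\theta}{2}\right] \mathrm{sinc}\!\left[k_0 L \sin^2\frac{\theta'}{2}\right] \right\} - 1. \tag{30} $$

In writing Eq. (29), we have assumed $k_0 L \gg 1$ and have neglected the term which is about $1/k_0 L$ times smaller.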

From the two ratios $p_c/p_2$ and $\chi$, the spontaneous emission loss and the inherent mode-matching inefficiency are directly written as

$$ p_{\text{spon}} \approx (1 + N_a p_c/p_2)^{-1}, \tag{31} $$

$$ p_{\text{mode}} \approx \chi/(1+\chi). \tag{32} $$

The approximations in Eqs. (31) and (32) are valid under the condition $N_a \gg 1$ (we neglected the terms that are $1/N_a$ times smaller). To minimize the two sources of noise $p_{\text{spon}}$ and $p_{\text{mode}}$, it is better to have a large $N_a p_c/p_2$ and a small $\chi$. In the following section we discuss in detail how to minimize these two sources of noise by using different kinds of mode-matching methods.

At the end of this section, we would like to make a brief comparison with the single-atom case. For the case of a single atom, if the position of this atom is also fluctuating in space, the above calculation is still valid but with $N_a = 1$. From Eqs. (21)-(26), we see that in the case of $N_a = 1$ we always have $p_{\text{mode}} = 0$. Thus, for the case of a single atom, the spontaneous emission loss is the only source of noise, as has been shown intuitively in Fig. 4. However, in this case, the spontaneous emission inefficiency $p_{\text{spon}}$, which can be approximated by $p_{\text{spon}} \approx 1 - (1+\chi)\,p_c/p_2$ since $p_c \ll p_2$, becomes much larger. If we compare the efficiency (defined as $1 - p_{\text{spon}}$) with respect to the spontaneous emission noise between the single-atom case and the atomic-ensemble case, we see that this efficiency is increased by about a factor of $N_a$ for the atomic ensemble. This is what we call the collective enhancement of the signal-to-noise ratio for the many-atom ensemble.
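The relations among Eqs. (21)-(26), the large-$N_a$ approximations (31) and (32), and the single-atom limit can be checked numerically. The sketch below uses hypothetical function names and purely illustrative values of $p_c$, $p_2$, and $\chi$ (they are assumptions, not results from the calculation in the text).

```python
def p_spon_exact(pc, p2, chi, Na):
    """Eq. (21) evaluated with the probabilities of Eqs. (23) and (25)."""
    joint = pc + chi * pc / Na      # Eq. (25): atom in s_s and photon in a_s
    atom = pc + p2 / Na             # Eq. (23): atom in s_s
    return 1.0 - joint / atom

def p_mode_exact(pc, chi, Na):
    """Eq. (22) evaluated with the probabilities of Eqs. (24) and (25)."""
    joint = pc + chi * pc / Na      # Eq. (25)
    photon = pc + chi * pc          # Eq. (24): photon in a_s
    return 1.0 - joint / photon

def p_spon_approx(pc, p2, Na):
    """Eq. (31), valid for Na >> 1."""
    return 1.0 / (1.0 + Na * pc / p2)

def p_mode_approx(chi):
    """Eq. (32), valid for Na >> 1."""
    return chi / (1.0 + chi)

# Illustrative (assumed) numbers with pc << p2.
pc, p2, chi = 1.0e-5, 1.0, 0.3

# Many-atom ensemble: the approximations should track the exact expressions.
Na = 1.0e8
ensemble = (p_spon_exact(pc, p2, chi, Na), p_mode_exact(pc, chi, Na))

# Single atom: p_mode vanishes and p_spon is close to 1 - (1 + chi) pc / p2.
single = (p_spon_exact(pc, p2, chi, 1.0), p_mode_exact(pc, chi, 1.0))
```

With these assumed numbers the single-atom efficiency $1 - p_{\text{spon}}$ is of order $10^{-5}$, while for the ensemble it is close to one, which is the collective enhancement described above.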

V.  MODE-MATCHING  METHODS 

The first mode-matching method is to choose the signal mode as the mode for detection, with the mode function exactly given by the inherent spatial structure (20) of the signal light. The mode function (20) is similar to a Gaussian function, especially when $k_0 r_0^2 \gg L$ and $k_0 R_0^2 \gg L$. [It would be an exact Gaussian function if the axial distribution of $p_{\text{dis}}(\mathbf{r})$ were Gaussian.] One can couple a Gaussian mode into a single-mode optical fiber with a good technical mode-matching efficiency. For this case with exact mode matching, the two noise probabilities $p_{\text{spon}}$ and $p_{\text{mode}}$ are exactly given by Eqs. (29)-(32). We now turn to the calculation of these two noise probabilities. For this purpose, we first need to understand how to get a large $N_a p_c/p_2$ and a small $\chi$ to minimize the noise $p_{\text{spon}}$ and $p_{\text{mode}}$.

First, we give an estimation of the ratio $N_a p_c/p_2$. It has a definite physical meaning. From Eq. (29), we see that the integrand is significantly different from zero only when the integration variable $\theta$ falls into a small cone with $0 \le \theta < \theta_f$, where $\theta_f$ is estimated by $\theta_f \sim \min\!\left(1/\sqrt{k_0 L},\ 1/(k_0 r_{\text{eff}})\right)$ with $r_{\text{eff}} = r_0 R_0/\sqrt{r_0^2+R_0^2}$. Thus, the result of the integration can be estimated by $\theta_f^2$. We also know that the total atom number $N_a$ can be approximated in terms of $n_a$, where $n_a$ denotes the average atom number density. From this, we see that the ratio $N_a p_c/p_2$ is estimated by

$$ \frac{N_a p_c}{p_2} \sim \min\!\left(\frac{1}{2}\, n_a \lambda_0 r_{\text{eff}}^2,\ \frac{1}{4\pi}\, n_a \lambda_0^2 L\right). \tag{33} $$

If we assume that the two bounds for $N_a p_c/p_2$ are comparable, which means that $1/\sqrt{k_0 L} \sim 1/(k_0 r_{\text{eff}})$, or that the Fresnel number defined as $F = \pi r_{\text{eff}}^2/(\lambda_0 L)$ is of the order of 1, we have $N_a p_c/p_2 \sim n_a \lambda_0^2 L/(4\pi) \sim d_0/(4\pi)$, where the on-resonance optical depth $d_0$ is simply defined as $d_0 = n_a \lambda_0^2 L$. Therefore, to have a small spontaneous emission inefficiency $p_{\text{spon}}$, we need to use an optically dense ensemble with $d_0 \gg 1$ (the off-resonance optical depth can still be much smaller than 1 due to the large detuning).
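The quantities entering this estimate are simple to evaluate. The sketch below uses hypothetical helper names and assumed illustrative values ($\lambda_0 = 0.8\ \mu$m, $L = 1$ cm, $n_a = 10^{12}$ cm$^{-3}$) to compute the effective radius, the Fresnel number, the on-resonance optical depth, and the rough scale $d_0/(4\pi)$ of $N_a p_c/p_2$.

```python
import math

def effective_radius(r0, R0):
    """r_eff = r0 R0 / sqrt(r0^2 + R0^2)."""
    return r0 * R0 / math.sqrt(r0**2 + R0**2)

def fresnel_number(r_eff, lam0, L):
    """F = pi r_eff^2 / (lam0 L)."""
    return math.pi * r_eff**2 / (lam0 * L)

def optical_depth(n_a, lam0, L):
    """On-resonance optical depth d0 = n_a lam0^2 L."""
    return n_a * lam0**2 * L

# Illustrative (assumed) parameters in SI units.
lam0 = 0.8e-6          # wavelength, 0.8 micrometers
L = 1.0e-2             # cell length, 1 cm
n_a = 1.0e18           # 10^12 atoms per cm^3 expressed per m^3

d0 = optical_depth(n_a, lam0, L)
rough_ratio = d0 / (4.0 * math.pi)     # order-of-magnitude estimate of Na*pc/p2

R0 = math.sqrt(lam0 * L / math.pi)     # ensemble radius giving F = 1
F = fresnel_number(R0, lam0, L)
r_eff_broad_pump = effective_radius(100.0 * R0, R0)   # r0 >> R0 gives r_eff close to R0
```

This is only an order-of-magnitude guide; the accurate dependence on geometry requires the full integrals of Eqs. (29) and (30).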

Next, we consider how to minimize the ratio $\chi$ by choosing appropriate interaction configurations. From Eq. (30), we see that $\chi$ increases with the ratio $R_0/r_0$. This can be intuitively understood as follows: the noise characterized by $p_{\text{mode}}$ and $\chi$ comes from the density fluctuation of the atomic ensemble. With a larger $R_0$, the atoms have more space to move around, and one thus has a relatively larger density fluctuation and a larger noise ratio $\chi$ [19]. Thus, to minimize $\chi$, we choose a configuration with $r_0 \gg R_0$ in the following, which means that the pumping laser illuminates all the atoms with a broad cross section. In this case, the effective radius $r_{\text{eff}} \approx R_0$.

Now we would like to calculate the noise probabilities $p_{\text{spon}}$ and $p_{\text{mode}}$ more accurately in the case of exact mode matching. This requires us to carry out the complicated integrations in Eqs. (29) and (30), which is only possible by using numerical methods. The spontaneous emission inefficiency $p_{\text{spon}}$ depends on the optical depth $d_0 = n_a \lambda_0^2 L$. We have numerically calculated the value of $p_{\text{spon}}$ versus the optical depth $d_0$ under different geometries of the atomic ensemble. The geometry of the ensemble is described by the Fresnel number $F = \pi R_0^2/(\lambda_0 L)$. The calculation result is shown in Fig. 5. From the figure, we see that the spontaneous emission inefficiency is insensitive to the geometry of the ensemble. The two curves with $F = 1$ and $F = 10$ basically overlap. With a much smaller $F = 0.1$, the inefficiency $p_{\text{spon}}$ increases by about a factor of 2. On the other hand, $p_{\text{spon}}$ is sensitive to the optical depth $d_0$. It decreases approximately linearly with $1/d_0$, which confirms the rough estimation above. From the curves, we can write $p_{\text{spon}}$ approximately as $p_{\text{spon}} \approx 1/(1 + d_0/26)$ for the Fresnel number $F$ from 1 to 10. In the above numerical calculation, we assumed some typical values for the parameters with $\lambda_0 \approx 0.8\ \mu$m and $L \approx 1$ cm, but actually the result is very insensitive to the parameter $k_0 L$ as long as it is still much larger than 1. For an atomic cell with a typical number density $n_a \sim 10^{12}/\text{cm}^3$ and length $L \approx 1$ cm, the optical depth is $d_0 \approx 6.4\times10^3$, and we have a spontaneous emission inefficiency $p_{\text{spon}} \approx 0.4\%$, which is very small.
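The empirical fit quoted above is easy to tabulate. The following sketch (the function name and the sample optical depths are illustrative assumptions) evaluates $p_{\text{spon}} \approx 1/(1 + d_0/26)$, which the text states for Fresnel numbers between 1 and 10.

```python
def p_spon_fit(d0, scale=26.0):
    """Empirical fit p_spon ~ 1/(1 + d0/26) quoted for F between 1 and 10."""
    return 1.0 / (1.0 + d0 / scale)

# A few assumed optical depths, including the two values used in the text.
for d0 in (10.0, 100.0, 1.9e3, 6.4e3):
    print(f"d0 = {d0:8.1f}   p_spon ~ {100.0 * p_spon_fit(d0):6.2f} %")
```

For $d_0 \approx 6.4\times10^3$ the fit gives roughly 0.4%, consistent with the value stated in the text; for small $d_0$ it approaches 1, as expected when almost every excitation is accompanied by a photon outside the signal mode.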

The inherent mode-matching inefficiency $p_{\text{mode}}$ is determined by the geometry of the ensemble, and does not depend on the value of the optical depth. We have numerically calculated the value of $p_{\text{mode}}$. It turns out that $p_{\text{mode}}$ is actually also insensitive to the geometric parameters $k_0 L$ and $F$ as long as they are still in the typical region. For instance, we have $p_{\text{mode}} \approx 24\%$ for $F = 1$, $p_{\text{mode}} \approx 25\%$ for $F = 10$, and $p_{\text{mode}} \approx 23\%$ for $F = 0.1$. In each case, we change the length $L$ from 0.3 cm to 3 cm (with $\lambda_0 \approx 0.8\ \mu$m), and the value of $p_{\text{mode}}$ basically does not change at all. Note that the typical inherent mode-matching inefficiency $p_{\text{mode}} \approx 24\%$ is quite large, and we cannot efficiently reduce it by controlling the geometry of the ensemble. As we have mentioned before, the mode-mismatching noise has a more significant influence on the efficiency of the application schemes than the spontaneous emission noise. The large inherent mode-matching inefficiency is a disadvantage of the exact mode-matching method.

FIG. 5. The spontaneous emission inefficiency $p_{\text{spon}}$ versus the on-resonance optical depth $d_0$ of the ensemble under different geometries characterized by the Fresnel number $F$. We have $F = 10$ for the solid curve, $F = 1$ for the dotted curve, and $F = 0.1$ for the dashed curve.

To reduce the inefficiency $p_{\text{mode}}$, we consider the following simple improvement to the exact mode-matching method. After the light-atomic-ensemble interaction, the scattered light is focused by a lens, and we apply an aperture on the focal plane of the lens to collect the signal light only from a small cone in the forward direction with $0 \le \theta \le \theta_D$, where the detection angle $\theta_D$ can be controlled by the size of the aperture. We call this kind of mode matching the filtered exact mode matching. By the exact mode matching, we detect the photon scattered to the signal mode; by the filtered exact mode matching, we only detect the scattered photon that is in the signal mode and at the same time lies in the filtering cone with $0 \le \theta \le \theta_D$. Due to the additional restriction, we now have a lower probability of success in registering the photon. The spontaneous emission inefficiency $p_{\text{spon}}$ should increase, but through this sacrifice we can significantly reduce the noise $p_{\text{mode}}$, which means that, if we register the photon with this additional restriction, we are more confident that the accompanying atomic excitation goes to the collective atomic mode $s_s$. To calculate $p_{\text{spon}}$ and $p_{\text{mode}}$ for the filtered exact mode matching, we can still use Eqs. (21)-(26), but the signal mode $a_s$ should be confined inside the small cone with $0 \le \theta \le \theta_D$. Inside this filtering cone, the signal mode has the same mode structure as given by Eq. (20). With this modified signal mode, we find that $p_{\text{mode}}$ is still given by Eqs. (32) and (30), but all the integrations over $\theta$ and $\theta'$ in Eq. (30) are taken only from 0 to the filtering angle $\theta_D$ instead of from 0 to $\pi$. The spontaneous emission inefficiency $p_{\text{spon}}$ now has the following expression:

$$ p_{\text{spon}} \approx 1 - \frac{N_a\, p_c|_{\theta\le\theta_D}}{N_a p_c + p_2}, \tag{34} $$

where we have assumed $N_a \gg 1$ for the approximation, and the symbol $p_c|_{\theta\le\theta_D}$ means that the integration over $\theta$ in $p_c$ is taken only from 0 to $\theta_D$. From Eq. (34), we need to calculate two ratios, $p_c/p_2$ and $p_c|_{\theta\le\theta_D}/p_2$, for $p_{\text{spon}}$. The ratio $p_c/p_2$ is exactly given by Eq. (29), and the ratio $p_c|_{\theta\le\theta_D}/p_2$ has the same form as Eq. (29), but with the integration over $\theta$ taken only from 0 to $\theta_D$.

FIG. 6. The spontaneous emission inefficiency $p_{\text{spon}}$ (solid curve) and the inherent mode-mismatching probability $p_{\text{mode}}$ (dashed curve) versus the filtering angle $\theta_D$ with the Fresnel number $F = 1$.
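Equation (34) can be evaluated once the fraction of $p_c$ that falls inside the filtering cone is known. The sketch below is an illustration under stated assumptions, not the calculation used for the figures: it takes the limit $r_0 \gg R_0$ (so $r_{\text{eff}} \approx R_0$), uses the integrand of Eq. (29), truncates the full-angle integral at an assumed cutoff `theta_max` beyond which the Gaussian factor is negligible, and takes $N_a p_c/p_2 \approx d_0/26$ from the fit quoted earlier. All names and parameter values are hypothetical, and the absolute numbers depend on these conventions.

```python
import math

def sinc(x):
    """Unnormalized sinc, sin(x)/x, with the x -> 0 limit set to 1."""
    return 1.0 if x == 0.0 else math.sin(x) / x

def integrand(theta, k0, r_eff, L):
    """Integrand of Eq. (29): sin(theta) * Gaussian factor * sinc^2 factor."""
    gauss = math.exp(-0.5 * (k0 * r_eff * math.sin(theta))**2)
    return math.sin(theta) * gauss * sinc(k0 * L * math.sin(theta / 2.0)**2)**2

def simpson(f, a, b, n=4000):
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0

def cone_fraction(theta_D, k0, r_eff, L, theta_max=0.05):
    """Ratio p_c restricted to theta <= theta_D over the full p_c.

    Assumption: contributions beyond theta_max are negligible.
    """
    g = lambda t: integrand(t, k0, r_eff, L)
    return simpson(g, 0.0, min(theta_D, theta_max)) / simpson(g, 0.0, theta_max)

def filtered_p_spon(theta_D, x_full, k0, r_eff, L):
    """Eq. (34) with x_full = Na * pc / p2 taken over the full solid angle."""
    return 1.0 - cone_fraction(theta_D, k0, r_eff, L) * x_full / (x_full + 1.0)

# Illustrative (assumed) parameters, SI units.
LAM0, L_CELL = 0.8e-6, 1.0e-2
K0 = 2.0 * math.pi / LAM0
R_EFF = math.sqrt(LAM0 * L_CELL / math.pi)   # Fresnel number 1
X_FULL = 1.9e3 / 26.0                        # from the fit p_spon ~ 1/(1 + d0/26)

narrow = filtered_p_spon(0.001, X_FULL, K0, R_EFF, L_CELL)
wide = filtered_p_spon(0.004, X_FULL, K0, R_EFF, L_CELL)
```

A narrower aperture passes a smaller fraction of the signal mode, so the computed inefficiency rises as $\theta_D$ decreases, and it reduces to Eq. (31) when the cone covers the whole mode.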

We have numerically calculated the noise probabilities $p_{\text{spon}}$ and $p_{\text{mode}}$ versus the filtering angle $\theta_D$, and the result is shown in Fig. 6 with the Fresnel number $F = 1$ and the optical depth $d_0 \approx 1.9\times10^3$. From the figure, we see that by decreasing the filtering angle $\theta_D$, we can significantly reduce the noise $p_{\text{mode}}$. The cost is that at the same time the inefficiency $p_{\text{spon}}$ significantly increases. However, as we have mentioned before, though both of the noises $p_{\text{spon}}$ and $p_{\text{mode}}$ can be corrected in the application schemes in Refs. [12,13], the mode-mismatching noise $p_{\text{mode}}$ has a more severe influence on the final efficiency of the schemes than the spontaneous emission noise $p_{\text{spon}}$. Thus, for these applications, it is worth choosing an appropriate filtering angle $\theta_D$ with $p_{\text{mode}}$ significantly smaller than $p_{\text{spon}}$. For instance, if we choose the angle $\theta_D \approx 0.002$, the mode-mismatching noise is $p_{\text{mode}} \approx 0.9\%$, which is basically negligible compared with other sources of noise. At the same time, $p_{\text{spon}} \approx 66.6\%$, which seems to be quite large. However, to overcome this noise, we need only to increase the repetitions of the entanglement generation scheme in Refs. [12,13] by another factor of 3 [given by $1/(1 - p_{\text{spon}})$], which is actually only a moderate cost. This choice would be much better than the case without any filtering, where we have an inherent mode-mismatching noise $p_{\text{mode}} \approx 24\%$. For the purpose of guiding the choice of the best experimental configuration, we list several important values of $p_{\text{spon}}$ and $p_{\text{mode}}$ versus the angle $\theta_D$ in the following table:

theta_D    0.0015   0.0020   0.0025   0.0040   0.0055
p_mode     0.31%    0.92%    2.05%    8.98%    17.5%
p_spon     79.6%    66.6%    52.9%    19.5%    5.02%

We have also calculated the noise probabilities $p_{\text{spon}}$ and $p_{\text{mode}}$ versus the angle $\theta_D$ under different geometries of the ensemble characterized by the Fresnel number $F$. For $F = 10$ and $F = 0.1$, the results are shown in Fig. 7. In all the calculations, we assumed the same optical depth $d_0 \approx 1.9\times10^3$ and the cell length $L \approx 1$ cm. The qualitative properties of the curves for different Fresnel numbers are quite similar, but one needs to shift the appropriate angle $\theta_D$. With a large Fresnel number $F$, one needs to choose a smaller angle $\theta_D$. The appropriate filtering angle $\theta_D$ changes by a factor of 2-3 if the Fresnel number changes by a factor of 10.

FIG. 7. The spontaneous emission inefficiency $p_{\text{spon}}$ and the inherent mode-mismatching probability $p_{\text{mode}}$ versus the filtering angle $\theta_D$ with the Fresnel number $F = 10$ (the two dashed curves) and $F = 0.1$ (the two solid curves). The spontaneous emission inefficiency increases with the decrease of the angle $\theta_D$ while the mode-mismatching noise decreases.

Finally, we describe a mode-matching method that is the simplest to implement. For this method, we still use an aperture to select the scattered light from the small cone with $0 \le \theta \le \theta_D$. But now we do not use any technical mode matching to choose only the signal mode with the right mode structure $f(\Omega)$ for detection. Instead, we detect the photon in all the modes that lie in the small cone with $0 \le \theta \le \theta_D$. We call this the simple filtering method. This seems to be a bad mode-matching method, since one does not make any distinction between the modes in this small cone. If the registered photon does not come from the right mode $a_s$, the accompanying atomic excitation will most probably not be in the collective mode $s_s$, and one thus has a significantly larger inherent mode-mismatching noise $p_{\text{mode}}$. However, the observation here is that if $\theta_D$ is sufficiently small, basically only the signal mode exists in this small cone, and all the other modes make negligible contributions. The important question is how small the angle $\theta_D$ should be to guarantee a small $p_{\text{mode}}$. Note that for the simple filtering method, the spontaneous emission inefficiency $p_{\text{spon}}$ can be calculated in the same way as for the filtered exact mode-matching method by the use of Eq. (34). But we have a different expression for $p_{\text{mode}}$, which in this case is defined as the probability that the accompanying atomic excitation does not go to the collective mode $s_s$ when one photon (from any optical mode) is detected in the small cone with $0 \le \theta \le \theta_D$. This probability can be expressed as

$$ p_{\text{mode}} = 1 - \frac{p_c|_{\theta\le\theta_D} + (1/N_a)\,\mathrm{tr}(p_2\rho_n)|_{\theta\le\theta_D}}{p_c|_{\theta\le\theta_D} + \mathrm{tr}(p_2\rho_n)|_{\theta\le\theta_D}}, \tag{35} $$

where the trace $\mathrm{tr}(\cdots)$ is over all the atomic and the optical modes, and the symbol $|_{\theta\le\theta_D}$ has the same meaning as before, namely that the integrations over $\theta$ in $p_c$ and $\mathrm{tr}(p_2\rho_n)$ are taken only from 0 to $\theta_D$. In the limit of $N_a \gg 1$, $p_{\text{mode}}$ can still be written in the form $p_{\text{mode}} \approx \chi_s/(1+\chi_s)$, with

$$ \frac{1}{\chi_s + 1} = \frac{p_c|_{\theta\le\theta_D}}{\mathrm{tr}(p_2\rho_n)|_{\theta\le\theta_D} + p_c|_{\theta\le\theta_D}} = \frac{1}{1-\cos\theta_D} \int_0^{\theta_D} \sin\theta\, d\theta \left\{ e^{-(1/2)\,k_0^2 R_0^2 \sin^2\theta}\, \mathrm{sinc}^2\!\left[k_0 L \sin^2\frac{\theta}{2}\right] \right\}. \tag{36} $$


In writing Eq. (36), we have assumed the same kind of atomic distribution function $p_{\text{dis}}(\mathbf{r})$ and pump mode function $u_\perp(\mathbf{r})$ as we have specified before. We can use Eq. (36) to numerically calculate the mode-mismatching probability $p_{\text{mode}}$ versus the filtering angle $\theta_D$ for the simple filtering method. The result, together with the curve for $p_{\text{spon}}$ versus $\theta_D$, is shown in Fig. 8 for the Fresnel number $F = 1$. In the calculation, we assumed the same optical depth $d_0 \approx 1.9\times10^3$. From the figure, we see that if $\theta_D$ is large, the mode-mismatching probability $p_{\text{mode}}$ is very large, but as $\theta_D$ decreases, $p_{\text{mode}}$ still tends to zero. This confirms our intuitive observation. With the same parameters, for the simple filtering method, we should further decrease the filtering angle $\theta_D$ by a factor of 2-3 to get the optimal configuration compared with the filtered exact mode-matching method. Of course, due to the decrease of the optimal $\theta_D$, the corresponding spontaneous emission inefficiency $p_{\text{spon}}$ significantly increases. To guide the experimental choice of the optimal $\theta_D$, we also list some important values of $p_{\text{spon}}$ and $p_{\text{mode}}$ for the simple filtering method:

theta_D    0.0006   0.0010   0.0014   0.0020   0.0032
p_mode     1.76%    4.78%    9.10%    17.5%    37.4%
p_spon     96.4%    90.3%    82.0%    66.6%    35.1%

FIG. 8. The spontaneous emission inefficiency $p_{\text{spon}}$ (dashed curve) and the inherent mode-mismatching probability $p_{\text{mode}}$ (solid curve) versus the filtering angle $\theta_D$ with the Fresnel number $F = 1$ for the simple filtering method.
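The one-dimensional integral of Eq. (36) is straightforward to evaluate numerically. The sketch below uses hypothetical function names and assumed parameters ($\lambda_0 = 0.8\ \mu$m, $L = 1$ cm, $R_0$ chosen so that $\pi R_0^2/(\lambda_0 L) = 1$); it illustrates the trend of $p_{\text{mode}} \approx \chi_s/(1+\chi_s)$ with $\theta_D$ and is not expected to reproduce the tabulated values exactly, since those depend on the parameter conventions of the original calculation.

```python
import math

def sinc(x):
    """Unnormalized sinc, sin(x)/x, with the x -> 0 limit set to 1."""
    return 1.0 if x == 0.0 else math.sin(x) / x

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0

def inv_one_plus_chi_s(theta_D, k0, R0, L):
    """Right-hand side of Eq. (36), i.e. 1/(chi_s + 1)."""
    def g(t):
        gauss = math.exp(-0.5 * (k0 * R0 * math.sin(t))**2)
        return math.sin(t) * gauss * sinc(k0 * L * math.sin(t / 2.0)**2)**2
    # 1 - cos(theta_D) written as 2 sin^2(theta_D/2) to avoid cancellation.
    return simpson(g, 0.0, theta_D) / (2.0 * math.sin(theta_D / 2.0)**2)

def p_mode_simple(theta_D, k0, R0, L):
    """p_mode ~ chi_s/(1 + chi_s) = 1 - 1/(chi_s + 1) for the simple filter."""
    return 1.0 - inv_one_plus_chi_s(theta_D, k0, R0, L)

# Illustrative (assumed) parameters, SI units.
LAM0, L_CELL = 0.8e-6, 1.0e-2
K0 = 2.0 * math.pi / LAM0
R0 = math.sqrt(LAM0 * L_CELL / math.pi)

angles = (0.0006, 0.0010, 0.0014, 0.0020, 0.0032)
p_mode_values = [p_mode_simple(t, K0, R0, L_CELL) for t in angles]
```

The computed values grow monotonically with $\theta_D$ and shrink toward zero for small apertures, which is the qualitative behavior described in the text.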

For the simple filtering method, we also calculated the noise probabilities $p_{\text{spon}}$ and $p_{\text{mode}}$ under different geometries of the ensemble. For the Fresnel numbers $F = 10$ and $F = 0.1$, the results are shown in Fig. 9. We assumed the same optical depth and cell length in these calculations. Since the qualitative picture revealed by this figure is quite similar to the case of the filtered exact mode matching, we do not need to discuss its properties in detail.

FIG. 9. The spontaneous emission inefficiency $p_{\text{spon}}$ and the inherent mode-mismatching probability $p_{\text{mode}}$ versus the filtering angle $\theta_D$ with the Fresnel number $F = 10$ (the two dashed curves) and $F = 0.1$ (the two solid curves) for the simple filtering method.

For experimental realizations, the simple filtering method can be much easier than the other mode-matching methods. The calculation here shows that the price we need to pay for this simplification is a smaller filtering angle with a significantly increased spontaneous emission inefficiency. Note that for the application schemes in Refs. [12,13], this price still seems acceptable. For instance, if we choose $\theta_D \approx 0.001$, the mode-mismatching noise $p_{\text{mode}} \approx 4.8\%$ is already small. In this case, the number of additional repetitions of the entanglement generation scheme in Refs. [12,13] is given by the factor $1/(1 - p_{\text{spon}}) \approx 10$, which is not too much.
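The repetition overhead follows directly from the spontaneous emission inefficiency. As a small worked illustration (the function name is hypothetical; the two $p_{\text{spon}}$ values are the ones quoted in the text for $\theta_D \approx 0.002$ with filtered exact mode matching and $\theta_D \approx 0.001$ with simple filtering):

```python
def repetition_factor(p_spon):
    """Extra repetitions needed to compensate the loss, 1/(1 - p_spon)."""
    if not 0.0 <= p_spon < 1.0:
        raise ValueError("p_spon must lie in [0, 1)")
    return 1.0 / (1.0 - p_spon)

# Values of p_spon quoted in the text for the two filtering methods.
filtered_exact = repetition_factor(0.666)   # theta_D ~ 0.002
simple_filter = repetition_factor(0.903)    # theta_D ~ 0.001
```

These evaluate to about 3 and about 10, matching the factors stated in the text.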

VI.  SUMMARY 

In summary, we have developed a theory in the weak-pumping limit to describe the three-dimensional effects in the interaction between free-space light and a many-atom ensemble. The calculations demonstrate some interesting results that are not known from other approaches. First, they show that the signal light has an inherent spatial mode structure, which is determined jointly by the geometry of the ensemble and the mode structure of the pump beam. Second, they reveal that there are two sources of noise during the light-atom interaction. One is the spontaneous emission inefficiency, which is inversely proportional to the on-resonance optical depth of the ensemble, and the other is the inherent mode-mismatching noise, which arises from the density fluctuation of the ensemble and should be fully determined by the geometry of the ensemble for room-temperature atomic cells. It turns out that the inherent mode-mismatching noise is quite large if one collects the signal light through the exact mode-matching method, and one cannot efficiently reduce this noise by optimizing the geometry of the ensemble, since in the typical parameter region this noise is insensitive to the cell geometry. Finally, we show an effective way to reduce the inherent mode-mismatching noise by adding an aperture to select the light only from a small emission cone. By this means, we can efficiently reduce the mode-mismatching noise at the price of increasing the spontaneous emission inefficiency. It is worth doing this since the spontaneous emission inefficiency is far less important for some application schemes. There are two methods for this purpose: the filtered exact mode-matching method and the simple filtering method. Both methods can reduce the inherent mode-mismatching noise if one chooses an appropriate filtering angle.

The calculations in this paper show that there is a large collective enhancement of the signal-to-noise ratio if one compares the many-atom ensemble with the single-atom case, which confirms the prediction from some simple theoretical models, and demonstrates the validity of this important observation for complicated realistic situations.

At the end of this paper, we should mention that the calculations here also show that, for any mode-matching method, we cannot make both the spontaneous emission inefficiency $p_{\text{spon}}$ and the inherent mode-mismatching noise $p_{\text{mode}}$ negligible for the room-temperature atomic cell, even if the cell has a large on-resonance optical depth. This result does not have much influence on the application schemes in Refs. [12,13], since they are inherently robust to the noise $p_{\text{spon}}$ and $p_{\text{mode}}$. As a consequence of this inherent robustness, a non-negligible, but not huge, noise has only a moderate influence on the efficiency of the schemes. However, for any application scheme of atomic ensembles that is not inherently robust, such as the quantum light memory scheme [7-9] or the continuous variable teleportation scheme [4-6], a non-negligible noise will decrease the scheme fidelity. The calculations here do not directly apply to these schemes, since they work outside the perturbative regime. However, they do raise the interesting question of whether there is some theoretical limit on the best achievable fidelity of these schemes if one uses free-space atomic cells for their implementation.
We propose an experimentally feasible scheme to generate the Greenberger-Horne-Zeilinger-type of maximal entanglement between many atomic ensembles based on laser manipulation and single-photon detection. The scheme, with inherent fault tolerance to the dominant noise and efficient scaling of the efficiency with the number of ensembles, allows one to maximally entangle many atomic ensembles within the reach of current technology. Such a maximum entanglement of many ensembles has wide applications in the demonstration of quantum nonlocality, high-precision spectroscopy, and quantum information processing.
Quantum entanglement links two or more distant subsystems in a profound quantum mechanical way. Such a link has found wide applications in the demonstration of quantum nonlocality [1,2], high-precision spectroscopy [3], and quantum information processing including computation, communication, and cryptography [4,5]. There were great experimental efforts recently to get more and more subsystems entangled [6-9], since, with more subsystems entangled, quantum nonlocality becomes more striking [1,2], and the entanglement is more useful for various applications [3-5]. In most of the experimental efforts, the subsystems are taken as single particles, and until now three to four atoms or photons have been entangled with a linear ion trap [8], with a spontaneous parametric down converter [6,9], or with a high-Q cavity [7]. There are also proposals to entangle indistinguishable atoms in Bose-Einstein condensates [10], or to weakly entangle two macroscopic atomic ensembles [11], and the latter has been demonstrated in a recent exciting experiment [12].

In all the experimental efforts, it is hard to continuously increase the number of the entangled subsystems due to the fast exponential decrease of the preparation efficiency [6,7,9] or due to noise and imperfections in the setup [7,8]. Here, we propose a scheme to generate the GHZ type of maximal entanglement between many atomic ensembles with the following features: First, the scheme has built-in fault tolerance and is robust to realistic noise and imperfections. As a result, the physical requirements of the scheme are moderate and well fit the experimental technique. Second, the preparation efficiency of the Greenberger-Horne-Zeilinger (GHZ) entanglement decreases only with the number of ensembles by a slow polynomial law. Such an efficient scaling makes it possible to maximally entangle many (such as tens of) ensembles with the current technology. Our scheme is based on Raman-type laser manipulation of the ensembles and single-photon detection which postselects the desired entangled state in a probabilistic fashion. In contrast to the belief that entangling schemes based on postselections will necessarily suffer from the fast exponential degradation of the efficiency, we design a scheme which circumvents this problem by making use of quantum memory available in atomic internal levels.

PACS numbers: 03.65.Ud, 03.67.-a, 42.50.Gy

0031-9007/02/88(17)/170402(4)$20.00

The basic element of our system is an ensemble of
many identical alkali atoms, whose experimental realization
can be either a room-temperature atomic gas [12,13] or
a sample of cold trapped atoms [14,15]. The relevant level
structure of the atom is shown in Fig. 1. From the three
levels $|g\rangle$, $|h\rangle$, $|v\rangle$, we can define two collective atomic
operators $s = (1/\sqrt{N_a}) \sum_{i=1}^{N_a} |g\rangle_i\langle s|$ with $s = h, v$, where
$N_a \gg 1$ is the total atom number. The atoms are initially
prepared through optical pumping to the ground state $|g\rangle$,
which is effectively a vacuum state $|\mathrm{vac}\rangle$ of the operators
$h$, $v$. The operators $h$, $v$ behave similarly to independent bosonic mode
operators as long as most of the atoms remain in the state
$|g\rangle$. A basis of the “polarization” qubit (in analogy to the
language for photons) can be defined from the states $|H\rangle = h^\dagger|\mathrm{vac}\rangle$
and $|V\rangle = v^\dagger|\mathrm{vac}\rangle$, which have an experimentally
demonstrated long coherence time [12-15]. Single-bit
rotations in this basis can be done with high precision by
shining Raman pulses or radio-frequency pulses on all the
atoms. The excitations in the mode $h$ can be transferred
to optical excitations [16] and then detected by single-photon
detectors. Such a transfer has a high efficiency
even for a free-space ensemble due to the collectively
enhanced coherent interaction, as has been demonstrated
both in theory [16] and in experiments [13,14].

FIG. 1. The relevant atomic level structure with $|g\rangle$ the ground
state, $|e\rangle$ the excited state, and $|h\rangle$ and $|v\rangle$ the two metastable
states (e.g., Zeeman or hyperfine sublevels) for storing a qubit
of information. The three levels $|g\rangle$, $|e\rangle$, and $|h\rangle$ can be coupled
through a Raman process which is useful for measurement of
the collective atomic excitation in the state $|h\rangle$ [16] and for generating
preliminary entanglement between two ensembles [17].

The first step for generation of many-party entanglement
is to share an excitation between the modes $h_i$, $h_j$ in two
distant ensembles $i$, $j$. This can be readily done through
a scheme in the recent quantum repeater proposal [17],
where one prepares the state $(h_i^\dagger + e^{i\phi_{ij}} h_j^\dagger)/\sqrt{2}\,|\mathrm{vac}\rangle$ [18],
with $\phi_{ij} = \phi_j - \phi_i$ an unknown phase difference fixed
by the optical channel connecting the $i$, $j$ ensembles. This
state, after a single-bit rotation, can be transferred to the
useful form,

$$|\Psi_{ij}\rangle = (h_i^\dagger + e^{i\phi_{ij}} v_j^\dagger)/\sqrt{2}\,|\mathrm{vac}\rangle. \qquad (1)$$

The basic idea of the preparation scheme in Ref. [17] is as
follows: One excites the ensembles $i$ and $j$, respectively,
through a short weak Raman pulse applied to the transition
$|g\rangle \to |e\rangle$ so that the forward-scattered Stokes light from
the transition $|e\rangle \to |h\rangle$ has a mean photon number much
smaller than 1. The forward-scattered Stokes light from
the two ensembles is then interfered at a beam splitter and
further detected by two single-photon detectors. If we successfully
get a detector click, we do not know from which
ensemble this registered photon comes; due to this indistinguishability,
the accompanying collective atomic excitation
should be distributed over the ensembles $i$ and $j$
with an equal probability amplitude, and we get the state
described previously. This preparation scheme has the following
two features: First, the preparation succeeds only
with a controllable small probability $p_0$ for each Raman
driving pulse, and needs to be repeated on average $1/p_0$
times for the final successful state generation, with the total
preparation time $t_0 \sim 1/(p_0 f_p)$, where $f_p$ is the repetition
frequency of the Raman pulses. Second, the scheme,
with inherent resilience to noise, is well based on the current
technology of laser manipulation. We can safely use
it as our first step, to generate the state (1) with a fidelity
$F = 1 - p_0$ very close to unity by controlling the
probability $p_0$. For instance, with a typical repetition frequency
$f_p = 10$ MHz, one may prepare the state (1) with
a fidelity $F = 1 - p_0 \sim 99\%$ in a time $t_0 \sim 10$ μs.
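
The timing estimate above follows from the relation between the per-pulse success probability, the pulse repetition frequency, and the mean preparation time. A minimal Python sketch of that arithmetic is given here; the function name `preparation_time` is illustrative and not part of the original proposal.

```python
def preparation_time(p0: float, f_p: float) -> float:
    """Mean time to prepare state (1): about 1/p0 attempts at repetition rate f_p (Hz)."""
    if not 0.0 < p0 <= 1.0:
        raise ValueError("p0 must lie in (0, 1]")
    if f_p <= 0.0:
        raise ValueError("f_p must be positive")
    return 1.0 / (p0 * f_p)


p0 = 0.01    # success probability per Raman pulse, so F = 1 - p0 = 99%
f_p = 10e6   # repetition frequency of 10 MHz, as quoted in the text
t0 = preparation_time(p0, f_p)
fidelity = 1.0 - p0
print(f"t0 = {t0 * 1e6:.1f} us, F = {fidelity:.2%}")
```

Lowering `p0` raises the fidelity but lengthens `t0` in inverse proportion, which is the trade-off the paragraph describes.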

Based on the preparation of the state (1), now we show
how to generate effective many-party entanglement between
$n$ such atomic ensembles. We prepare the state (1)
between the $i$ and $i+1$ ensembles for each $i$ from 1 to $n$,
and get the following state:

$$|\Psi\rangle = (1/\sqrt{2^n}) \prod_{i=1}^{n} (h_i^\dagger + e^{i\phi_{i,i+1}} v_{i+1}^\dagger)\,|\mathrm{vac}\rangle, \qquad (2)$$

where for convenience we have assumed the notation $n + 1 \equiv 1$
for the subscripts, and have used the same symbol
$|\mathrm{vac}\rangle$ to denote the vacuum of all $n$ ensembles. In
the expansion of the state (2), there are only two components
which have one excitation on each ensemble. This
component state is given by

$$|\Psi_{\mathrm{eff}}\rangle = (1/\sqrt{2}) \Big( \prod_{i=1}^{n} h_i^\dagger + e^{i\phi_t} \prod_{i=1}^{n} v_i^\dagger \Big)\,|\mathrm{vac}\rangle, \qquad (3)$$

with $\phi_t = \sum_{i=1}^{n} \phi_{i,i+1}$, which is exactly the $n$-party GHZ-type
maximally entangled state in the polarization basis.
Note that, for any practical application of the GHZ
entanglement [1-4], the state preparation should be followed
by a measurement of the polarization of the excitation
on each ensemble, which can be done for our system
by combining single-bit rotations, such as Hadamard transformations,
with the number detection of the mode $h_i$
through single-photon detectors. If in this measurement
we keep only the results for which an excitation appears
on each ensemble (i.e., postselect the case when the detector
on each side registers a click), the states (2) and (3)
become effectively equivalent since the other components
in the state (2) have no contributions to the measurement.
Through this postselection technique, we can simply prepare
the state (2), which, whenever we put it into applications,
yields effectively the GHZ entanglement described
by the effective state (3). Here and in the following, we
label a component of the full state as the effective state if
only this component has contributions to the application
measurements (which are the measurements required for
the detection or application of the generated state). The
effective state is the state postselected by the application
measurements.
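
The claim that only two components of the product state carry exactly one excitation on each ensemble can be checked by brute force. The sketch below is an illustration, not the authors' procedure: it expands a ring of $n$ two-term factors, ignores phases, and tracks only which ensemble each term excites. The function name and the string encoding of the components are assumptions made for this example.

```python
from itertools import product


def one_excitation_components(n: int) -> list[str]:
    """Expand the n-factor product state and keep the terms with exactly one
    excitation per ensemble.

    Factor i (0-based) contributes either an 'h' excitation on ensemble i or a
    'v' excitation on ensemble (i + 1) % n. Phases are ignored.
    """
    kept = []
    for choice in product("hv", repeat=n):
        occupation = [[] for _ in range(n)]
        for i, mode in enumerate(choice):
            target = i if mode == "h" else (i + 1) % n
            occupation[target].append(mode)
        if all(len(modes) == 1 for modes in occupation):
            kept.append("".join(modes[0] for modes in occupation))
    return kept


n = 6
components = one_excitation_components(n)
# Each of the 2**n terms has weight 1/2**n, so the postselection probability
# (without loss) is the number of surviving terms divided by 2**n.
projection_probability = len(components) / 2**n
print(components, projection_probability)
```

For any ring size the surviving terms are the all-`h` and all-`v` strings, so the lossless projection probability is $2/2^n = 1/2^{n-1}$.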

For applications of the GHZ entanglement, we need also
to know the phase $\phi_t$ in the effective state (3), which is
fixed by the whole setup and in principle can be measured.
However, a better way is to directly cancel this unknown
phase $\phi_t$ with the following method. Assume that we have
an even number $n$ of the ensembles. The pair of ensembles
$i$ and $i' = n + 2 - i$ are put in close proximity so that
the ensembles $i$, $i+1$ and $i'$, $(i+1)'$ can be connected
through the same optical channel, which fixes the phases to
satisfy the relation $\phi_{i,i+1} = \phi_{i',(i+1)'} = -\phi_{(i+1)',i'}$ [19].
With this relation, the accumulated phase $\phi_t$ is exactly
canceled to zero.

The above preparation scheme of the effective GHZ entanglement
is robust to realistic noise and imperfections.
The dominant noise in this system is the photon detector
inefficiency, the transferring inefficiency (induced by
the spontaneous emission loss) of the excitation from the
atomic mode $h_i$ to the optical mode, and the small decay
of the atomic excitation in each ensemble. All the
above noise is well described by loss of excitations with
an overall loss probability denoted by $\eta$. Note that, by
including the detector inefficiency, we have automatically
taken into account that the single-photon detectors cannot
perfectly distinguish between single and two photons. It
is easy to see that loss of excitations has influence only on
the success probability to register an excitation from each
ensemble. Whenever the excitation is registered, its polarization
is still perfectly entangled as shown by the effective
state (3).

Now we consider the efficiency of this scheme, which
can be described by the total time needed to successfully
register the effective GHZ entanglement. The preparation
of the factor state (1) is probabilistic; however, due to
the available quantum memory provided by the metastable
atomic modes $h$, $v$, the preparation time $t_1$ of the state (2)
is at most $n t_0$ if its factor states are prepared one after
the other, and can be reduced to $t_1 \sim t_0$ (in the case of
$n < 1/p_0$) if its factor states are prepared independently
at the same time. In contrast to this, in the case of no
quantum memory, one would need about $1/p_0^n$ repeats
of the Raman pulses for a successful preparation of the
state (2), and a total time $t_0/p_0^{n-1} \gg t_0$. After preparation
of the state (2), the projection efficiency (success
probability) from the state (2) to the effective GHZ state
(3) is given by $(1-\eta)^n/2^{n-1}$, where we have assumed
the same loss probability $\eta$ for each ensemble. So the total
time for registering the $n$-party GHZ entanglement is
$T \sim t_0 2^{n-1}/(1-\eta)^n$, which increases with the number
of ensembles exponentially by the factor $2/(1-\eta)$. Note
that this increase is much slower than in the case of
spontaneous parametric down-conversion, where the exponential
increasing factor is about 2 orders larger due to the
absence of quantum memory [6,9].
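
To make the exponential scaling of the direct scheme concrete, the following Python sketch evaluates the total registration time as the factor-state preparation time divided by the projection efficiency. It assumes the factor states are prepared in parallel (so the preparation time of the product state is about $t_0$); the numerical values of the loss and of $t_0$ are illustrative choices, and the function name is hypothetical.

```python
def direct_scheme_time(n: int, eta: float, t0: float) -> float:
    """Total time T ~ t0 * 2**(n-1) / (1 - eta)**n for the direct scheme.

    Assumes the n factor states are prepared in parallel, so the product state
    is ready after about t0, and each ensemble has the same loss probability eta.
    """
    if not 0.0 <= eta < 1.0:
        raise ValueError("eta must lie in [0, 1)")
    projection_efficiency = (1.0 - eta) ** n / 2 ** (n - 1)
    return t0 / projection_efficiency


t0 = 10e-6     # illustrative preparation time of one factor state (seconds)
eta = 1.0 / 3  # illustrative loss probability per ensemble
for n in (4, 8, 16):
    print(n, direct_scheme_time(n, eta, t0))

# Adding one ensemble multiplies the time by 2 / (1 - eta).
growth = direct_scheme_time(9, eta, t0) / direct_scheme_time(8, eta, t0)
print(growth)
```

With this loss value each additional ensemble triples the required time, which is the behaviour the improved scheme below is designed to avoid.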

We  can  in  fact  further  improve  the  scheme  to  get  a  much 
more  efficient  scaling  of  the  efficiency,  with  the  time  T 
increasing  with  the  party  number  n  only  polynomially.  The 
improved  scheme  is  divided  into  the  following  three  steps. 

(i) We start with two pairs of ensembles 1,2 and 3,4,
prepared in the state $|\Psi_{1,2}\rangle \otimes |\Psi_{3,4}\rangle$ with $|\Psi_{ij}\rangle$ in the form
of Eq. (1). We then connect these two disjoint pairs by
preparing the state $|\Psi_{2,3}\rangle$. The ensembles 2 and 3 will
not be involved any more in the following steps for state
preparation, so we can immediately put them into applications
by doing the same type of measurements on them as
if we had generated $n$-party GHZ entanglement. In these
measurements, if one excitation is registered from each
ensemble 2 and 3, we succeed and will go on with the next
step. Otherwise, we simply repeat the above process until
we succeed. Upon success, only the component $|\Psi_{1-4}\rangle$ of
the state $|\Psi_{1,2}\rangle \otimes |\Psi_{3,4}\rangle \otimes |\Psi_{2,3}\rangle$ has contributions to the
measurement with

$$|\Psi_{1-4}\rangle = (1/\sqrt{2})(h_1^\dagger h_2^\dagger h_3^\dagger + v_2^\dagger v_3^\dagger v_4^\dagger)\,|\mathrm{vac}\rangle, \qquad (4)$$

where for simplicity we have neglected the phases $\phi_{i,i+1}$,
since they will finally cancel each other with the method
described previously. If loss of excitations with a loss
probability $\eta$ is taken into account for detections on the
ensembles 2, 3, a registered click might result from two
excitations, and in this case there will be no excitation in
the ensembles 1 and 4. So with the loss, upon success
of step (i), instead of $|\Psi_{1-4}\rangle$ the effective state of the
ensembles 1-4 is actually described by

$$\rho_{1-4} = (|\Psi_{1-4}\rangle\langle\Psi_{1-4}| + c_1 \rho_{\mathrm{vac}})/(1 + c_1), \qquad (5)$$

with the vacuum coefficient $c_1 = 2\eta$, where $\rho_{\mathrm{vac}}$ stands
for the vacuum component with no excitation in the undetected
ensembles 1 and 4. The probability of a successful
detection on both of the ensembles 2 and 3 is given by
$p_1 = (1-\eta)^2(1+2\eta)/4$, which means that we need to
repeat the process on average $1/p_1$ times for the final success
of step (i).

(ii) In step (ii) we further extend the number of entangled
ensembles in the effective state (5). Assume that we
have applied the method of step (i) in parallel to the two
disjoint sets of ensembles 1-4 and 5-8, with their effective
states [each in the form of Eq. (5)] denoted by $\rho_{1-4}$
and $\rho_{5-8}$, respectively. We connect these two sets by first
preparing the state $|\Psi_{4,5}\rangle$ [in the form of Eq. (1)] and then
putting the ensembles 4, 5 into application measurements
as described in step (i). Upon success of these measurements
with one excitation registered from each ensemble,
the postselected state of the ensembles 1-8 is effectively
described by $\rho_{1-8}$, which is similar to Eq. (5), but with
an increased vacuum coefficient and with $|\Psi_{1-4}\rangle$ replaced
by $|\Psi_{1-k}\rangle = (1/\sqrt{2})\big(\prod_{i=1}^{k-1} h_i^\dagger + \prod_{i=2}^{k} v_i^\dagger\big)|\mathrm{vac}\rangle$
($k = 8$). Whenever the measurement fails, we repeat
the whole state preparation from step (i). The above
connection process can be continued with the number $n$ of
effectively entangled ensembles doubled for each time of
connection. After $i$ connections, we have $n = 2^{i+1}$.
The success probability and the new vacuum coefficient
of the $i$th connection are denoted, respectively, by $p_i$
and $c_i$, which satisfy the following recursion relations
with the previous vacuum coefficient $c_{i-1}$:
$p_i = (1-\eta)^2(1 + 2\eta + 2c_{i-1})/[4(1+c_{i-1})^2]$ and
$c_i = 2c_{i-1} + 2\eta$. From these recursion relations, we have
$c_i = 2\eta(2^i - 1)$, which, after being substituted into $p_i$, yields
an explicit expression for the repetition number $1/p_i$ of the
$i$th connection.

(iii) After a desired number $n = 2^{i+1}$ of the ensembles
have been entangled in the effective state $\rho_{1-n}$, we close
the loop in the last step by first preparing the state $|\Psi_{n,1}\rangle$ [in
the form of Eq. (1)] and then putting the last two ensembles
$n$, 1 into application measurements. As usual, we keep
the results only when one excitation appears from each
detected ensemble, and this automatically eliminates contributions
from the vacuum component in the state $\rho_{1-n}$.
So the effective state of the whole set of ensembles postselected
by all the application measurements is still described
by the exact GHZ state (3), and the application
measurement results should reveal perfect GHZ entanglement
between the $n$ ensembles in the polarization degree
of freedom. The application measurements on the ensembles
$n$, 1 in the last step succeed with a probability
$p_l = (1-\eta)^2/[2(1+c_i)]$, so the whole process needs
to be repeated on average $1/p_l$ times.
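
The recursion relations of step (ii) and the closing probability of step (iii) are easy to iterate numerically. The Python sketch below does so under this reading of the formulas: the vacuum coefficient starts from zero, doubles and gains $2\eta$ at each connection, and the connection probability depends on the previous coefficient. The function names and the choice $\eta = 1/3$ are illustrative assumptions.

```python
def connection_statistics(eta: float, steps: int) -> tuple[list[float], list[float]]:
    """Iterate c_i = 2*c_{i-1} + 2*eta and
    p_i = (1-eta)^2 * (1 + 2*eta + 2*c_{i-1}) / (4 * (1 + c_{i-1})^2),
    starting from c_0 = 0. Returns ([c_1..c_steps], [p_1..p_steps])."""
    c_prev = 0.0
    cs, ps = [], []
    for _ in range(steps):
        p = (1 - eta) ** 2 * (1 + 2 * eta + 2 * c_prev) / (4 * (1 + c_prev) ** 2)
        c = 2 * c_prev + 2 * eta
        ps.append(p)
        cs.append(c)
        c_prev = c
    return cs, ps


def closing_probability(eta: float, c_last: float) -> float:
    """Success probability of the loop-closing measurement in step (iii)."""
    return (1 - eta) ** 2 / (2 * (1 + c_last))


eta = 1.0 / 3
cs, ps = connection_statistics(eta, steps=3)
closed_form = [2 * eta * (2**i - 1) for i in range(1, 4)]
p_close = closing_probability(eta, cs[-1])
print(cs, closed_form)
print(ps, p_close)
```

The iterated coefficients agree with the closed form $2\eta(2^i - 1)$, and the first connection probability reduces to the expression given in step (i).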

Now we calculate in this improved scheme how much
time is needed in total for a successful detection of the
$n$-party GHZ entanglement. This time is given by
$T_{\mathrm{imp}} = t_0/[p_l\,p_1 \prod_{j=2}^{i} p_j]$, where $t_0$ is the preparation time of
the state (1), $p_j$ is the success probability of the $j$th
connection, and $p_l$ is the success probability of the last,
loop-closing step. We consider two limiting cases. In the
first case with a negligible loss probability $\eta$ for each ensemble,
we have $p_l = 2p_j = 1/2$ and $T_{\mathrm{imp}} = 2^{2i+1} t_0 = n^2 t_0/2$,
which increases with the number $n$ of entangled
ensembles by the slow quadratic law. In the second case
with a considerably large loss probability $\eta$, the total time
$T_{\mathrm{imp}}$ is approximated by
$T_{\mathrm{imp}} \sim t_0 [2\eta n/(1-\eta)^2]\,(n/2)^{\log_2[2\eta\sqrt{n}/(1-\eta)^2]}$,
which increases with $n$ faster, but still
polynomially (or, more accurately, subexponentially). The
basic reason for the improvement from the exponential
scaling to the much slower polynomial scaling is that
we have divided the whole preparation process into many
small steps, checking in each step whether the preparation
is successful, and repeating this small step instead of the
whole process if it fails.
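
The two limiting cases can be checked numerically. The sketch below is an illustration under stated assumptions: it takes the total time to be $t_0$ divided by the product of all connection probabilities and the closing probability, with the recursion read as in steps (i) to (iii), and it implements the large-loss expression as reconstructed from the text. Function names and the sample values are hypothetical.

```python
import math


def improved_scheme_time(n: int, eta: float, t0: float) -> float:
    """T_imp = t0 / (p_last * prod_j p_j) for n = 2**(i+1) ensembles."""
    if n < 4 or n & (n - 1):
        raise ValueError("n must be a power of two with n >= 4")
    connections = n.bit_length() - 2          # i such that n = 2**(i+1)
    c, prod_p = 0.0, 1.0
    for _ in range(connections):
        prod_p *= (1 - eta) ** 2 * (1 + 2 * eta + 2 * c) / (4 * (1 + c) ** 2)
        c = 2 * c + 2 * eta
    p_last = (1 - eta) ** 2 / (2 * (1 + c))   # loop-closing step
    return t0 / (p_last * prod_p)


def improved_scheme_time_large_loss(n: int, eta: float, t0: float) -> float:
    """Large-loss approximation
    t0 * [2*eta*n/(1-eta)^2] * (n/2)**log2(2*eta*sqrt(n)/(1-eta)^2); needs eta > 0."""
    prefactor = 2 * eta * n / (1 - eta) ** 2
    exponent = math.log2(2 * eta * math.sqrt(n) / (1 - eta) ** 2)
    return t0 * prefactor * (n / 2) ** exponent


lossless = improved_scheme_time(16, 0.0, 1.0)               # in units of t0
approx = improved_scheme_time_large_loss(16, 1.0 / 3, 1.0)  # in units of t0
exact = improved_scheme_time(16, 1.0 / 3, 1.0)              # in units of t0
print(lossless, approx, exact)
```

In the lossless limit the result is $n^2/2$ in units of $t_0$. For $n = 16$ and a loss of 1/3 the approximate expression gives 5184 $t_0$, about 52 ms for $t_0 = 10$ μs, while the exact product of the same recursion comes out roughly three times larger, so the closed form is best read as a scaling estimate rather than a precise value.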

Finally, we briefly discuss the practical implication of
this proposal. With the improved scheme, for example, we
can generate high-fidelity GHZ entanglement over $n = 16$
ensembles in a time $T_{\mathrm{imp}} \sim 50$ ms with a notable loss
$\eta \sim 1/3$ and a typical choice $t_0 \sim 10$ μs. With such a
short preparation time $T_{\mathrm{imp}}$, the noise that we have not included,
such as the nonstationary phase drift induced by
the pumping phase or by the optical channel, is negligible.
As long as the number $n$ of the ensembles is not huge,
we can also safely neglect the single-bit rotation error (below
$10^{-4}$ with the use of accurate polarization techniques
for Zeeman sublevels [20]) and the dark count probability
(about $10^{-5}$ in a typical detection time window of 0.1 μs)
of single-photon detectors. Because of the efficient scaling
of this scheme, one can use it to steadily increase
the number of entangled ensembles, and it seems reasonable
to generate GHZ entanglement over tens of ensembles
with the current technology. Such an extraordinary possibility
opens up prospects for many exciting experiments
and applications.

Abstract — We show that by using photonic crystals, we can
confine, guide, and emit light efficiently. By precise control over the
geometry  and  three-dimensional  design,  it  is  possible  to  obtain  high 
quality  optical  devices  with  extremely  small  dimensions.  Here  we 
describe  examples  of  high-Q  optical  nanocavities,  photonic  crystal 
waveguides, and surface plasmon enhanced light-emitting diodes
(LEDs).

Index  Terms — Finite-difference  time-domain  (FDTD)  methods, 
light-emitting  diodes  (LEDs),  microcavities,  nanooptics,  photonic 
bandgap  (PBG)  materials,  photonic  crystal  waveguides,  photonic 
crystals,  quantum-well  laser,  semiconductor  device  fabrication, 
spontaneous  emission,  surface  plasmons. 


I.  Photonic  Crystal  Nanocavities 


THE PAST rapid emergence of optical microcavity devices,
such as vertical-cavity surface-emitting lasers (VCSELs)
[1] and [2], can be largely attributed to the high precision over the
layer thickness control available during semiconductor crystal
growth. High reflectivity mirrors can, thus, be grown with subnanometer
accuracy to define high-Q cavities in the vertical dimension.
Recently, it has also become possible to microfabricate
high reflectivity mirrors by creating two- and three-dimensional
periodic structures. These periodic “photonic crystals”
[3] and [4] can be designed to open up frequency bands within
which the propagation of electromagnetic waves is forbidden
irrespective of the propagation direction in space and define
photonic bandgaps. When combined with high index contrast
slabs in which light can be efficiently guided, microfabricated
two-dimensional photonic bandgap (PBG) mirrors provide us
with the geometries needed to confine and concentrate light into
extremely small volumes and to obtain very high field intensities.
Here we show that it is possible to use these “artificially”
microfabricated crystals in functional optical devices, such as
lasers, modulators, add-drop filters, polarizers and detectors.

Fabrication of optical structures has evolved to a precision
which allows us to control light within etched nanostructures
[5]. For example, subwavelength nano-optic cavities can be used
for efficient and flexible control over the emission wavelength
[6] and [7]. Similarly, nanofabricated optical waveguides can
be used for efficient coupling of light between devices. This
new capability allows the reduction of the size of optical components
and leads to their integration in large numbers, much in
the same way as electronic components have been integrated for
improved functionality to form microchips. As high-Q optical
and electronic cavity sizes approach a cubic half-wavelength,
the spatial and spectral densities (both electronic and optical)
increase to a point where the light-matter coupling becomes so
strong that spontaneous emission is replaced by the coherent exchange
of energy between the two systems [8]-[11].

Fig. 1. Field intensity of light leakage from a single defect surrounded by
three periods of photonic crystal and slightly enlarged holes in the horizontal
direction. Light preferentially emits in the y dipole.

We can use the lithographic control over the wavelength
and polarization supported within photonic crystal cavities to
construct compact nanophotonic laser (see Fig. 1) and detector
arrays, as well as all-optical gates and routers. We have already
demonstrated room temperature lasing in the smallest optical
cavities, with mode volumes down to $2.5(\lambda/2n_{\mathrm{slab}})^3$, or
0.03 μm³ in InGaAsP emitting at 1.55 μm. We have also been
able to tune the emission wavelength of these lasers from
1450 nm to 1620 nm within a 10 × 10 laser array in an area
of 100 μm × 100 μm by local lithographic modification of
the cavity lengths (see Fig. 2) [5]. As the mode volumes of
nanocavities are decreased, the coupling efficiency between
the spontaneous emission [12] within the cavity and the lasing
mode can be significantly improved. We have calculated
spontaneous emission coupling factors ($\beta$) above 85% [13]
for optimized photonic crystal lasers constructed in active
quantum-well (QW) material. This spontaneous emission
coupling efficiency can be even higher if the linewidth of the
semiconductor emission is narrowed, as will be the case when
using quantum dot active material. Therefore, single defect
photonic crystal lasers represent in many ways the ultimate
evolution of VCSELs, since control over both vertical and
lateral spontaneous emission is possible. With most of the
spontaneous emission funneled into a single optical mode, the
photonic crystal laser can be modulated at much higher frequencies
even close to threshold. The photonic crystal provides
us with the unique opportunity of coupling light emitted by one
cavity, and using it to optically pump another with negligible
diffraction losses. Photonic crystals are also the perfect medium
for constructing what have been termed “photonic molecules,”
or interconnected cavities which can share and exchange photons.
Finally, we have shown that the emission wavelength of
light from these photonic crystal lasers can be varied by simple
adjustments of the lithographic pattern during their fabrication.
Thus, single mode lasers emitting at 1450 nm can be fabricated
only microns apart from lasers emitting at 1600 nm, and could
share the same waveguide slab. Photonic crystals provide
us with the opportunity of constructing very compact laser
sources with designed frequencies (Fig. 3) and polarization as
well as wavelength and polarization sensitive detector arrays.
Moreover, they can form very flexible platforms for connecting
optical sources, detectors, routers, modulators, polarizers and
filters in very compact microfabricated systems.

Fig. 2. 10 × 10 multiwavelength laser array within a 100 × 100 μm area. Each laser emits at a lithographically controlled wavelength.

Fig. 3. Sample spectra taken from lasers defined within the 10 × 10 array.

Fig. 4. Five photonic crystal cavities coupled together lithographically by
arrangement in the same slab.
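
As a quick consistency check on the quoted mode volume, the Python sketch below converts 2.5 cubic half-wavelengths in the slab material into cubic micrometres at an emission wavelength of 1.55 μm. The slab refractive index of 3.4 is an assumed, typical value for InGaAsP and is not stated in the text; the function name is illustrative.

```python
def mode_volume_um3(wavelength_um: float, n_slab: float, cubic_half_wavelengths: float) -> float:
    """Mode volume in um^3 for a cavity of the given number of cubic
    half-wavelengths (lambda / (2 * n_slab))**3 in the slab material."""
    half_wavelength_um = wavelength_um / (2.0 * n_slab)
    return cubic_half_wavelengths * half_wavelength_um**3


wavelength_um = 1.55   # emission wavelength from the text
n_slab = 3.4           # assumed slab refractive index (not given in the text)
volume = mode_volume_um3(wavelength_um, n_slab, 2.5)
print(f"mode volume = {volume:.3f} um^3")
```

With these inputs the result rounds to 0.03 μm³, in line with the value quoted for the smallest lasing cavities.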

Another  unique  feature  of  active  photonic  crystal  cavities  (see 
Fig.  4),  which  arises  from  their  ability  to  limit  the  number  of 
modes  supported  within  the  laser,  is  the  ability  to  build  high 
contrast  modulators. 

Fig. 5 shows an example of such a single defect photonic
crystal cavity, which supports a few modes within the same
cavity. Depending on the diameter of the pump beam (shown on
the left-hand side of the figure), we find that different modes are
excited, and these in turn exhibit different spectra. Finite-difference
time-domain simulations of the expected geometric distribution
of the field intensities within these modes are also shown,
and the relative overlap of the pump beam with the expected
mode geometries matches well to the observations.

Fig. 5. Geometries of modes supported by a single defect photonic crystal cavity. We show the measured spectra and the modeled field distributions of the modes.


Fig.  6.  Photonic  crystal  waveguide  defined  into  a  SOI  wafer. 


II.  Photonic  Crystal  Waveguides 

We have shown above that it is possible to generate light and
modulate it with simple high-Q optical resonator cavities. However,
if a nanophotonic system is to be constructed, it is also necessary
to guide light from one device to another. We have conducted
extensive tests on optical waveguides based on photonic
crystals to explore the opportunities of using this geometry to
efficiently guide light, especially around very sharp bends [18].
In the simplest geometry, a photonic crystal waveguide can be
constructed by lithographically removing a line of holes from
a perfect photonic crystal. This geometry provides the opportunity
for light to propagate through the photonic crystal (Fig. 6)
[15]-[17].

Although it is difficult to couple light into such structures,
we have been able to measure light guiding in such photonic
crystal waveguides through sharp bends. We have both calculated
(see Fig. 8) and experimentally obtained [19] the dispersion
diagram of such waveguides, and have found that several
modes can propagate within them, making them less desirable
for many telecommunication applications. More recently, we
have developed new single mode photonic crystal waveguide
designs, in which the dispersion properties can be controlled
[20]. Fig. 7 shows modified waveguide designs, fabricated in
silicon-on-insulator (SOI) material, in which only one waveguiding
mode is supported at the desired frequency. Photonic crystal
waveguides offer the opportunity to ensure mode-matching between
an optical cavity and the waveguide, but much work still
needs to be done to determine the dispersive character of such
structures with periodic edges.

Very interesting structures can also be defined by using waveguides
with periodic cavities [Fig. 7(b)], or coupled resonant
cavity structures [21]. In our fabricated structures, we expect to
define geometries with designed dispersive characteristics.

III.  Surface  Plasmon  Light  Emitters 

Surface plasmons (SP), also called surface plasmon polaritons,
are electromagnetic surface waves that propagate along
metallic surfaces, with their intensity maximum at the surface
and exponentially decaying fields perpendicular to it. The condition
for occurrence of such localized waves at the boundary
between two media is that the two materials joined at the boundary
must have opposite signs of the real part of the dielectric constant
(otherwise, a strong field localization at the surface cannot
be obtained). This is possible at the interface between a metal
operating at frequencies below the plasma frequency and any
dielectric.

Fig. 7. (a) Novel photonic band gap (PBG) waveguide designs. (b) Coupled resonator waveguide.

Numerical electromagnetic calculations for metallic structures that manipulate radiation at the nanometer scale are very demanding. An important tool is the set of FDTD methods [23], [24] for metal/dielectric media developed in Scherer's group to analyze theoretically the electromagnetic fields within a microcavity. In order to analyze metals as well as dielectric materials at optical frequencies, it is necessary to make certain changes [25] to Yee's standard FDTD scheme [26]. Electromagnetic fields in metals are described by adding a current term to Maxwell's curl equations via a Drude model. Fig. 9 illustrates results of FDTD calculations which yield the photonic band structure and the electromagnetic field profile for a metal-clad microcavity light-emitting diode [19]. In the frequency range where the cavity thickness is of the order of a wavelength or less, coupled surface plasmon modes of the top and bottom surfaces can exist and can be used to efficiently extract light from the semiconductor slab. Very small discretization steps of ~1 nm must be used for the finite difference algorithm to describe the propagation of electromagnetic waves in metals, and this in turn requires large computational volumes. We have developed distributed FDTD algorithms which are supported on multiprocessor computer clusters to manage the large memory requirements.
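
The sign condition on the dielectric constant can be illustrated with a simple lossy Drude model. The sketch below is only an illustration and is not the distributed FDTD code described above. It assumes the permittivity form eps(E) = eps_inf - E_p^2 / (E (E + i*Gamma)) with eps_inf = 1, takes the silver parameters quoted in the Fig. 9 caption (8.8 eV and 0.05 eV), and uses hypothetical helper names.

```python
import numpy as np

HBAR_OMEGA_P_EV = 8.8    # plasma energy of silver, as quoted in the Fig. 9 caption
HBAR_GAMMA_EV = 0.05     # damping energy of silver, as quoted in the Fig. 9 caption
HC_EV_NM = 1239.841984   # h*c in eV*nm, converts wavelength to photon energy


def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a vacuum wavelength in nm."""
    return HC_EV_NM / wavelength_nm


def drude_permittivity(energy_ev, eps_inf=1.0):
    """Lossy Drude permittivity (assumed form; eps_inf = 1 is an assumption)."""
    return eps_inf - HBAR_OMEGA_P_EV**2 / (energy_ev * (energy_ev + 1j * HBAR_GAMMA_EV))


def supports_surface_plasmon(eps_metal, eps_dielectric):
    """True when the real parts of the two permittivities have opposite signs."""
    return np.real(eps_metal) * np.real(eps_dielectric) < 0


eps_silver = drude_permittivity(photon_energy_ev(990.0))  # near the main emission peak
eps_gaas = 3.5**2                                          # refractive index n = 3.5
print("silver permittivity at 990 nm:", eps_silver)
print("SP allowed at silver/GaAs interface:", supports_surface_plasmon(eps_silver, eps_gaas))
```

Above the plasma energy the real part of the Drude permittivity becomes positive, so within this model the interface no longer supports a surface mode.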


IV. SP Enhanced Light-Emitting Diodes (SP-LEDs)

Much scientific work has focused on improving the extraction efficiency of light-emitting diodes (LEDs). Many interesting approaches have been proposed to accomplish this, such as the use of thin light-emitting layers with surface texturing, resonant cavities, or photon recycling. External quantum efficiencies of 31% were achieved by employing reflection from a bottom metal mirror together with a textured top semiconductor surface. Apart from efforts to improve light extraction from a semiconductor device, it is also possible to enhance the light emission rate within a semiconductor. This approach is based on Purcell's prediction in 1946 [12] that the radiation rate of an atom placed within a wavelength-sized cavity can be changed. A 12-fold enhancement of spontaneous emission was recently measured in a semiconductor optical microcavity at low temperatures. Metallic structures were identified as candidates for very large decay rate enhancement, much larger than that achievable with semiconductor photonic crystals. Unfortunately, much of the previous work on surface plasmon emitters was limited by the need to efficiently extract the light generated within such structures.

Fig. 8. Dispersion diagram for vertically even TE-like guided modes in the single-line defect PBG waveguide based on the triangular lattice. Insets show field patterns of $B_z$ of a surface plasmon wave.

Fig. 9. Band diagram (left) and transverse magnetic (TM) field profile (right) for p-polarized light of a metal-clad microcavity. The core is assumed to be made of 150-nm-thick GaAs (refractive index $n = 3.5$). Top and bottom metal layers are made of silver with the following parameters: $\hbar\omega_p = 8.8$ eV and $\hbar\nu = 0.05$ eV. The top layer is only 40 nm thick and the bottom layer is 200 nm thick. The analyzed structure is surrounded by air on top and bottom and the slab is infinite in lateral dimension. Only the bottom three bands are shown, since they are within the frequency range of interest.

In order to build an ideal, highly efficient LED, it is desirable both to improve the extraction efficiency and to enhance the spontaneous emission rate. Metallic photonic crystals have attracted significant scientific attention in recent years for both of these purposes. In 1996, Bill Barnes at the University of Exeter observed a PBG for surface modes on a silver surface textured with a hexagonal array of dots. Two years later, researchers at NEC and their collaborators observed sharp peaks in the transmission of light through a metallic layer patterned with a hexagonal array of subwavelength holes. Both experimental results were attributed to the peculiar properties of surface plasmon polaritons, modes that exist at the interface of two media with opposite signs of the real part of the dielectric constant.

Fig. 10. Design of the surface plasmon LED structure.

Fig. 11. Fabrication sequence for constructing a SP-LED.

In  order  to  couple  the  emission  to  SP  waves  from  a  single 
metal-semiconductor  surface,  it  is  typically  necessary  to  po¬ 
sition  the  QW  very  close  to  that  metal  surface  and  within  the 
fringing  field  depth  of  the  SP.  However,  the  nonradiative  transfer 
between  a  QW  and  the  metal  increases  as  their  distance  de¬ 
creases.  The  question  is  then  how  much  of  the  observed  large 
reduction  in  lifetime  at  small  distances  is  due  to  the  coupling  to 
SP  waves,  and  what  portion  of  it  comes  from  other  nonradiative 
processes. Only the portion coupled to SP waves can possibly be extracted from the structure by properly designing an “antenna” (i.e., by structuring the surface of the metal).
To  reduce  the  nonradiative  transfer,  we  decided  to  position  the 
QW  further  from  the  metal  surface  (at  distances  of  40  to  50  nm). 
However,  the  intensity  of  SP  waves  decreases  exponentially  as 
a  function  of  distance  from  the  metallic  surface.  In  order  to  in¬ 
crease  the  field  intensity  at  the  position  of  the  QW  even  more, 
it  is  possible  to  use  the  coupled  surface  plasmon  modes  of  two 
closely  spaced  metallic  interfaces. 

The analyzed structure consists of a 90-nm-thick semiconductor layer sandwiched between two silver films. A single 8-nm-thick InGaAs quantum well is positioned in the middle of the semiconductor membrane. The fabrication sequence is summarized in Fig. 11 and a scanning electron micrograph of a completed surface plasmon LED is shown in Fig. 12. The main emission peak is at a wavelength of 990 nm, corresponding to conduction-to-heavy-hole (C-HH) band transitions. Another peak can be observed at 930 nm, corresponding to conduction-to-light-hole (C-LH) band transitions. If a periodic pattern is defined in the top semitransparent metal layer by lithography (Fig. 10), it is possible to efficiently couple out the light emitted from the semiconductor and to simultaneously enhance the spontaneous emission rate. For the analyzed designs, we theoretically estimate extraction efficiencies as high as 37% and Purcell factors ($F_p$) of up to 4.5 [22]. We have experimentally measured photoluminescence intensities up to 46 times higher in fabricated structures than in unprocessed wafers. The increased light emission is due to an increase in the efficiency and an increase in the pumping intensity resulting from trapping of pump photons within the microcavity. The measured photoluminescence spectra are shown in Fig. 13. The spectrum labels correspond to: (a) the unprocessed wafer; (b) the half-processed wafer (i.e., a 90-nm-thick semiconductor membrane on top of a thick, nontransparent silver layer); (c) the unpatterned metal-clad microcavity (i.e., a semiconductor membrane sandwiched between two metal films, without patterning of the top silver layer); and (d) the fully processed structure (where silver stripes are defined in the top silver layer, with a grating periodicity of 250 nm and a 160-nm gap between silver stripes). From the calculated values shown in Table I, we conclude that the grating with a periodicity of 250 nm will have the best performance, since it has the largest diffraction efficiency. The grating with a periodicity of 650 nm and a gap between stripes of $s = 160$ nm will have the worst performance, which can be somewhat improved by reducing the gap to 100 nm.

Fig. 12. Scanning electron micrograph of a surface plasmon enhanced LED showing the metal grating pattern which is used to couple out the radiation.

Fig. 13. Luminescence spectra from a surface plasmon enhanced LED at various stages of fabrication (see text).

TABLE I
Parameters of Gratings Used for Extracting the Emission From the Metal Clad Microcavity. $a$ Is the Grating Periodicity, $s$ Is the Gap Between Silver Stripes, $\theta$ Is the Angle of the Outcoupled Photon With Respect to Normal, and $\eta$ Is the Normalized Diffraction Efficiency Into the $n$th Order of a Grating. The Outcoupling Happens Through the $n$th Order of a Grating

a [nm]   n    θ (at 986 nm)   θ (at 930 nm)   s [nm]   η
250      -1   6°              25°             160      0.82
480      -2   3.5°            15.3°           160      0.19
650      -3   30°             9°              160      0.06
650      -3   30°             9°              100      0.11
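
As a rough consistency check on the outcoupling angles, one can assume simple in-plane momentum matching, sin(theta) = n_eff + n * lambda / a, where n is the grating order. This relation and the effective mode index n_eff are assumptions made here for illustration; they are not stated in the text. The sketch calibrates n_eff from the 250-nm grating (6° at 986 nm and 25° at 930 nm) and then predicts the angle magnitudes for the other periods. The function names are hypothetical.

```python
import math


def effective_index(theta_deg, wavelength_nm, period_nm, order):
    """Solve sin(theta) = n_eff + order * wavelength / period for n_eff."""
    return math.sin(math.radians(theta_deg)) - order * wavelength_nm / period_nm


def outcoupling_angle_deg(n_eff, wavelength_nm, period_nm, order):
    """Magnitude of the emission angle in degrees, or None if the order is evanescent."""
    sin_theta = n_eff + order * wavelength_nm / period_nm
    if abs(sin_theta) > 1.0:
        return None
    return abs(math.degrees(math.asin(sin_theta)))


# Calibrate the assumed effective index from the 250-nm, order -1 grating.
n_eff_986 = effective_index(6.0, 986.0, 250.0, -1)
n_eff_930 = effective_index(25.0, 930.0, 250.0, -1)

predictions = {}
for period_nm, order in [(480.0, -2), (650.0, -3)]:
    predictions[(period_nm, order)] = (
        outcoupling_angle_deg(n_eff_986, 986.0, period_nm, order),
        outcoupling_angle_deg(n_eff_930, 930.0, period_nm, order),
    )

for (period_nm, order), (theta_986, theta_930) in predictions.items():
    print(f"a = {period_nm:.0f} nm, order {order}: "
          f"{theta_986:.1f} deg at 986 nm, {theta_930:.1f} deg at 930 nm")
```

Under these assumptions the predicted magnitudes come out at roughly 3.4° and 15.5° for the 480-nm grating and 30.1° and 8.6° for the 650-nm grating.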

For all fully processed wafers, including the unpatterned metal-clad microcavity, the full-width at half-maximum (FWHM) of the emission spectra is in the range of 60 to 110 nm. Therefore, their quality factors are between 10 and 15. For the half-processed wafer, the FWHM is 32 nm. Because of a bulk emission tail at shorter wavelengths, the luminescence peak at 930 nm for unprocessed wafers cannot be clearly resolved. The spectra of unprocessed, half-processed, and fully processed wafers are shown in Fig. 13. The PL peak of the unprocessed wafer at 986 nm was normalized to 1. A GaAs filter was placed in front of the detector to cut off wavelengths below 890 nm. As can be seen from the luminescence curves, it is possible to increase the luminescence intensity of a diode 48 times over a grown heterostructure by introducing a metal-clad waveguide with a semitransparent patterned surface plasmon extractor. More interestingly, when comparing curve (b), the half-processed wafer on top of a silver mirror, with curve (d), the fully processed and metalized structure, much higher light extraction is obtained through the top metal layer. Such a structure is ideally suited for electrical pumping, since the metal layers can be used for both light extraction and carrier injection. Moreover, our calculations predict that efficient collimated light emission can be generated from surface plasmon LEDs (Fig. 14). Here we propose to optimize this diode structure for electrical pumping, which is expected to lead to highly efficient LEDs with very fast modulation speeds. Our preliminary tests have indicated that in our structures, the spontaneous emission rate is increased approximately fivefold (at room temperature) over conventional LED material.

Fig. 14. FDTD calculation of TE field intensities emitted from a surface plasmon LED with a 250-nm-period grating. Collimated light emission is observed in the time evolution of the field.

V.  Conclusion 

Microfabricated structures with dimensions smaller than the wavelength of light allow the efficient confining and guiding of light. High index contrast systems allow such structures to be defined with very small sizes and thus lead to the integration of several optical components into more functional systems. Although more complex three-dimensional geometries will eventually be developed [27], one very promising geometry is the two-dimensional photonic crystal, into which cavities, waveguides, and dispersive elements can be embedded. The most important advantage of this geometry lies in the opportunity to lithographically couple devices together with minimal diffraction losses and excellent coupling efficiencies. We believe that photonic crystals will be extremely useful for the definition of integrated optical systems, as well as for the definition of extremely small optical cavities, which would be applicable toward strong coupling experiments with quantum dot or atom emitters. On the other hand, metallic structures with high index contrast offer the opportunity to harness surface plasmons, and we show that even with a very simple design it is possible to construct very efficient surface LEDs.
I. INTRODUCTION

The  method  of  quantum  computation  by  adiabatic  evolu¬ 
tion  has  been  proposed  as  a  general  way  of  solving  combi¬ 
natorial  search  problems  on  a  quantum  computer  [1]. 
Whereas  a  conventional  quantum  algorithm  is  implemented 
as  a  sequence  of  discrete  unitary  transformations  that  form  a 
quantum  circuit  involving  many  energy  levels  of  the  com¬ 
puter,  the  adiabatic  algorithm  works  by  keeping  the  state  of 
the  quantum  computer  close  to  the  instantaneous  ground 
state  of  a  Hamiltonian  that  varies  continuously  in  time. 
Therefore,  an  imperfect  quantum  computer  implementing  a 
conventional  quantum  algorithm  might  experience  different 
sorts  of  errors  than  an  imperfect  adiabatic  quantum  com¬ 
puter.  In  fact,  we  claim  that  an  adiabatic  quantum  computer 
has  an  inherent  robustness  against  errors  that  might  enhance 
the  usefulness  of  the  adiabatic  approach. 

The  adiabatic  algorithm  works  by  applying  a  time- 
dependent  Hamiltonian  that  interpolates  smoothly  from  an 
initial  Hamiltonian  whose  ground  state  is  easily  prepared  to  a 
final  Hamiltonian  whose  ground  state  encodes  the  solution  to 
the  problem.  If  the  Hamiltonian  varies  sufficiently  slowly, 
then  the  quantum  adiabatic  theorem  guarantees  that  the  final 
state  of  the  quantum  computer  will  be  close  to  the  ground 
state  of  the  final  Hamiltonian,  so  a  measurement  of  the  final 
state  will  yield  a  solution  of  the  problem  with  high  probabil¬ 
ity.  This  method  will  surely  succeed  if  the  Hamiltonian 
changes  slowly.  But  how  slow  is  slow  enough? 

Unfortunately, this question has proved difficult to analyze in general. Some numerical evidence suggests the possibility that the adiabatic algorithm might efficiently solve computationally interesting instances of hard combinatorial search problems, outperforming classical methods [1-4]. Whether the adiabatic algorithm provides a definite speedup over classical methods remains an interesting open question. As we will discuss in Sec. II, the time required by the algorithm for a particular instance can be related to the minimum gap $\Delta$ between the instantaneous ground state and the rest of the spectrum. Roughly speaking, the required time goes like $\Delta^{-2}$. Thus, if $\Delta^{-2}$ increases only polynomially with the size of the problem, then so does the time required to run the algorithm. However, determining $\Delta$ has not been possible in general.

Our  objective  in  this  paper  is  not  to  explore  the  computa¬ 
tional  power  of  the  adiabatic  model,  but  rather  to  investigate 
its  intrinsic  fault  tolerance.  Since  quantum  computers  are  far 
more  susceptible  to  making  errors  than  classical  digital  com¬ 
puters,  fault  tolerant  protocols  will  be  necessary  for  the  op¬ 
eration  of  large-scale  quantum  computers.  General  proce¬ 
dures  have  been  developed  that  allow  any  quantum 
algorithm  to  be  implemented  fault  tolerantly  on  a  universal 
quantum  computer  [5],  but  these  involve  a  substantial  com¬ 
putational  overhead.  Therefore,  it  would  be  highly  advanta¬ 
geous  to  weave  fault  tolerance  into  the  design  of  our  quan¬ 
tum  hardware. 

We  therefore  will  regard  adiabatic  quantum  computation 
not  as  a  convenient  language  for  describing  a  class  of  quan¬ 
tum  circuits,  but  as  a  proposed  physical  implementation  of 
quantum  information  processing.  We  do  not  cast  the  algo¬ 
rithm  into  the  conventional  quantum  computing  paradigm  by 
approximating  it  as  a  sequence  of  discrete  unitary  transfor¬ 
mations  acting  on  a  few  qubits  at  a  time.  Instead,  suppose  we 
can  design  a  physical  device  that  implements  the  required 
time-dependent  Hamiltonian  with  reasonable  accuracy.  We 
then  imagine  implementing  the  algorithm  by  slowly  chang¬ 
ing  the  parameters  that  control  the  physical  Hamiltonian. 
How  well  does  such  a  quantum  computer  resist  decoherence, 
and  how  well  does  it  perform  if  the  algorithm  is  imperfectly 
implemented? 

Regarding resistance to decoherence, we can make a few simple observations. The phase of the ground state has no effect on the efficacy of the algorithm, and therefore dephasing in the energy eigenstate basis is presumably harmless. Only the interactions with the environment that induce transitions between eigenstates of the Hamiltonian might cause trouble. In principle, these may be well controlled by running the algorithm at a temperature that is small compared to the minimum gap $\Delta$. (We use units in which Boltzmann's constant $k_B = 1$, so that temperature has units of energy.) If $\Delta$ decreases slowly as the size of the problem increases, then the resources required to run at a sufficiently low temperature may be reasonable. Since the adiabatic method is only efficient if $\Delta$ is not too small, we conclude that whenever the method works on a perfectly functioning quantum computer, it is robust against decoherence.

In  addition  to  environmental  decoherence,  we  must  also 
consider  the  consequences  of  imperfect  implementation.  Our 
chosen  algorithm  may  call  for  the  time-dependent  Hamil¬ 
tonian  H(t),  but  when  we  run  the  algorithm,  the  actual 
Hamiltonian  will  be  H(t)  +  K(t),  where  K(t )  is  an  “error.” 
An  interesting  feature  of  adiabatic  quantum  computation  is 
that  K(t)  need  not  remain  small  during  the  evolution  in  order 
for  the  algorithm  to  work  effectively.  A  reasonably  large  ex¬ 
cursion  away  from  the  intended  Hamiltonian  is  acceptable, 
as  long  as  K(t)  is  slowly  varying  and  has  initial  and  final 
values  that  are  not  too  large.  A  very  rapidly  fluctuating  K(t) 
may  also  be  acceptable,  if  the  characteristic  frequency  of  the 
fluctuations  is  large  compared  to  the  energy  scale  of  H(t). 

In  this  paper,  we  use  numerical  simulations  to  investigate 
the  sensitivity  of  an  adiabatic  computer  to  decohering  tran¬ 
sitions  and  to  a  certain  class  of  unitary  perturbations  induced 
by  a  Hamiltonian  K(t).  The  results  are  consistent  with  the 
idea  that  the  algorithm  remains  robust  as  long  as  the  tem¬ 
perature  of  the  environment  is  not  too  high  and  K(t)  varies 
either  sufficiently  slowly  or  sufficiently  rapidly.  Thus,  the 
adiabatic  model  illustrates  the  principle  that  when  the  char¬ 
acteristics  of  the  noise  are  reasonably  well  understood,  it 
may  be  possible  to  design  suitable  quantum  hardware  that 
effectively  resists  the  noise.  However,  note  that  some  of  the 
effects  of  decoherence  and  unitary  control  error  may  not  be 
significant  for  the  small  problems  we  are  able  to  study — 
especially  in  the  case  of  decoherence,  where  the  time  re¬ 
quired  by  the  simulation  restricts  us  to  systems  with  only 
four  qubits — and  hence,  our  data  may  not  be  indicative  of  the 
performance  of  the  algorithm  working  on  larger  inputs. 

A  technique  closely  related  to  adiabatic  computation  was 
described  by  Kadowaki  and  Nishimori  [6]  and  has  been 
tested  experimentally  (in  conjunction  with  a  cooling  proce¬ 
dure)  by  Brooke  et  al.  [7].  In  a  different  guise,  the  principles 
that  make  quantum  adiabatic  evolution  robust  also  underlie 
the  proposal  by  Kitaev  [8]  to  employ  nonabelian  anyons  for 
fault-tolerant  quantum  computation.  The  fact  that  adiabatic 
evolution  incorporates  a  kind  of  intrinsic  fault  tolerance  has 
also  been  noted  in  [9-14]. 

In Sec. II we review the adiabatic model of quantum computation, and in Sec. III we describe the specific combinatorial search problem (three-bit exact cover) that we use in our simulations. Sections IV and V report our numerical results on decoherence and unitary control error, and Sec. VI summarizes our conclusions.


II. ADIABATIC QUANTUM COMPUTATION

We briefly review the adiabatic model of quantum computation introduced in [1]. Let $h(z)$ be a function of $n$ bits $z = (z_1, z_2, z_3, \ldots, z_n)$, and consider the computational problem of finding a value of $z$ that minimizes $h(z)$. We will typically be interested in the case where this value of $z$ is unique. We may associate with this function the Hermitian operator

$$H_P = \sum_{z=0}^{2^n-1} h(z)\,|z\rangle\langle z|, \qquad (1)$$

so that the computational basis state $|z\rangle$ is an eigenstate of $H_P$ with eigenvalue $h(z)$. Then the problem is to determine which state $|z\rangle$ is the ground state (eigenstate with lowest eigenvalue) of $H_P$. We refer to $H_P$ as the problem Hamiltonian.

The strategy for finding the ground state of $H_P$ is to prepare the ground state of some other beginning Hamiltonian $H_B$ and slowly interpolate to $H_P$. A simple choice for the interpolation is given by the one-parameter family of Hamiltonians

$$\tilde{H}(s) = (1-s)H_B + sH_P \qquad (2)$$

that interpolates between $H_B$ and $H_P$ as $s$ varies from 0 to 1. We prepare the ground state of $H_B$ at time $t = 0$, and then the state evolves from $t = 0$ to $T$ according to the Schrödinger equation,

$$i\frac{d}{dt}|\psi(t)\rangle = H(t)|\psi(t)\rangle, \qquad (3)$$

where the Hamiltonian is

$$H(t) = \tilde{H}(t/T). \qquad (4)$$

At time $T$ (the run time of the algorithm), we measure the state in the computational basis. If we let $|\varphi\rangle$ denote the (unique) ground state of $H_P$ for a given instance of the problem, then the success probability of the algorithm for this instance is

$$\mathrm{Prob}(T) = |\langle\varphi|\psi(T)\rangle|^2. \qquad (5)$$

Does the algorithm work? According to the quantum adiabatic theorem [15,16], if there is a nonzero gap between the ground state and the first excited state of $\tilde{H}(s)$ for all $s \in [0,1]$, then $\mathrm{Prob}(T)$ approaches 1 in the limit $T \to \infty$. Furthermore, level crossings are nongeneric in the absence of symmetries, so a nonvanishing gap is expected if $H_B$ does not commute with $H_P$. Thus, the success probability $\mathrm{Prob}(T)$ of the algorithm will be high if the evolution time $T$ is large enough. The question is: how large a $T$ is large enough so that $\mathrm{Prob}(T)$ is larger than some fixed constant?

We can reformulate this question in terms of

$$\Delta = \min_{s\in[0,1]} [E_1(s) - E_0(s)] \qquad (6)$$

and

$$\mathcal{E} = \max_{s\in[0,1]} \left|\left\langle 1,s\left|\frac{d\tilde{H}}{ds}\right|0,s\right\rangle\right|, \qquad (7)$$


where $E_0(s)$ is the lowest eigenvalue of $\tilde{H}(s)$, $E_1(s)$ is the second-lowest eigenvalue, and $|0,s\rangle$, $|1,s\rangle$ are the corresponding eigenstates. By calculating the transition probability to lowest order in the adiabatic expansion [16], one finds that the probability of a transition from ground state to first excited state is small provided that the run time $T$ satisfies

$$T \gg \frac{\mathcal{E}}{\Delta^2}. \qquad (8)$$

If the spectrum consists of only two levels, then this condition is sufficient to ensure that the system remains in the ground state with high probability. In general, the required run time $T$ will be bounded by a polynomial in $n$ so long as $\Delta^{-1}$ and $\mathcal{E}$ are polynomially bounded. For the problems we are interested in, $\mathcal{E}$ is polynomially bounded, so we only have to consider the behavior of $\Delta$.

By rescaling the time, we can think of the evolution as taking place in the unit time interval between $s = 0$ and 1, but in this case, the energy eigenvalues are rescaled by the factor $T$. Roughly speaking, we can think of $d\tilde{H}(s)/ds$ as a perturbation that couples the levels of the instantaneous Hamiltonian $\tilde{H}(s)$, and has the potential to drive a transition from $|0,s\rangle$ to $|1,s\rangle$. But if $T$ is large, the effects of this perturbation are washed out by the rapid oscillations of the relative phase $\exp\{-iT\int_0^s ds'\,[E_1(s') - E_0(s')]\}$.
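
The two quantities that control the run time, the minimum gap and the largest matrix element of the derivative of the interpolating Hamiltonian between the two lowest levels, can be estimated numerically on a grid in $s$. The sketch below does this for a single qubit. The choice $H_B = (1 - \sigma_x)/2$, $H_P = \mathrm{diag}(0, 1)$, the grid size, and the function name are assumptions made purely for illustration.

```python
import numpy as np


def gap_and_coupling(H_B, H_P, num_points=1001):
    """Estimate the minimum gap and the maximum |<1,s| dH/ds |0,s>| on a grid in s."""
    dH = H_P - H_B  # derivative of (1 - s) * H_B + s * H_P with respect to s
    min_gap, max_coupling = np.inf, 0.0
    for s in np.linspace(0.0, 1.0, num_points):
        evals, evecs = np.linalg.eigh((1.0 - s) * H_B + s * H_P)
        min_gap = min(min_gap, evals[1] - evals[0])
        element = evecs[:, 1].conj() @ dH @ evecs[:, 0]
        max_coupling = max(max_coupling, abs(element))
    return min_gap, max_coupling


sigma_x = np.array([[0.0, 1.0], [1.0, 0.0]])
H_B = 0.5 * (np.eye(2) - sigma_x)   # assumed beginning Hamiltonian
H_P = np.diag([0.0, 1.0])           # assumed problem Hamiltonian, minimum at z = 0

gap, coupling = gap_and_coupling(H_B, H_P)
print("minimum gap:", gap)
print("maximum coupling:", coupling)
print("run-time scale coupling / gap**2:", coupling / gap**2)
```

For this two-level example both quantities equal $1/\sqrt{2}$, attained at $s = 1/2$, so the run-time scale is of order one. For larger instances the same routine applies, but the cost of diagonalization grows exponentially with $n$.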

Note  that  the  Hamiltonian  may  be  regarded  as  reasonable 
only  if  it  is  “local,”  that  is,  if  it  can  be  expressed  as  a  sum  of 
terms,  where  each  term  acts  on  a  bounded  number  of  qubits 
(a  number  that  does  not  grow  with  n).  Indeed,  in  this  case, 
the  Hamiltonian  evolution  may  be  accurately  and  efficiently 
simulated by a universal quantum computer [17]. Many combinatorial search problems (e.g., 3SAT) can be formulated as a search for a minimum of a function that is local in this sense. Along with a local choice of $H_B$, this results in a full $H(t)$ that is also local.

A direct physical implementation of the continuously varying $H(t)$ would presumably be possible only under a somewhat stronger locality condition. We might require that each qubit is coupled to only a few other qubits, or perhaps that the qubits can be physically arranged in such a way that the interactions are spatially local. Fortunately, there are interesting computational problems that have such forms, such as 3SAT restricted to having each bit involved in only three clauses or the problem of finding the ground state of a spin glass on a cubic lattice [18]. However, for the purposes of
our  simulation,  we  will  only  consider  small  instances,  and 
since  we  do  not  have  a  specific  physical  implementation  in 
mind,  we  will  not  concern  ourselves  with  the  spatial  arrange¬ 
ment  of  the  qubits. 


III.  AN  EXAMPLE:  THE  EXACT  COVER  PROBLEM 

For definiteness, we study the robustness of the adiabatic algorithm via its performance on the problem known as “three-bit exact cover” (EC3). An $n$-bit instance of EC3 consists of a set of clauses, each of which specifies three of the $n$ bits. A clause is said to be satisfied if and only if exactly one of its bits has the value 1. The problem is to determine if any of the $2^n$ assignments of the $n$ bits satisfies all of the clauses.

For this problem, the function $h(z)$ is a sum

$$h(z) = \sum_C h_C(z_{i_C}, z_{j_C}, z_{k_C}) \qquad (9)$$

of three-bit clauses, where

$$h_C(z_{i_C}, z_{j_C}, z_{k_C}) = \begin{cases} 0, & (z_{i_C}, z_{j_C}, z_{k_C}) \text{ satisfies clause } C \\ 1, & (z_{i_C}, z_{j_C}, z_{k_C}) \text{ violates clause } C. \end{cases} \qquad (10)$$

The value of the function $h(z)$ is the number of clauses that are violated; in particular, $h(z) = 0$ if and only if $z$ is an assignment that satisfies all the clauses.
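
The cost function can be written down directly from this definition. The following minimal sketch uses zero-based bit indices and a small four-bit instance chosen here only as an example; the clause list and function names are hypothetical.

```python
from itertools import product


def clause_cost(bits, clause):
    """0 if exactly one of the three bits named by the clause is 1, else 1."""
    i, j, k = clause
    return 0 if bits[i] + bits[j] + bits[k] == 1 else 1


def h(bits, clauses):
    """Number of violated clauses for the assignment `bits`."""
    return sum(clause_cost(bits, clause) for clause in clauses)


def satisfying_assignments(n, clauses):
    """Brute-force list of all n-bit assignments with h = 0."""
    return [z for z in product((0, 1), repeat=n) if h(z, clauses) == 0]


# Example instance (an assumption for illustration): 4 bits, 3 clauses.
clauses = [(0, 1, 2), (1, 2, 3), (0, 1, 3)]
solutions = satisfying_assignments(4, clauses)
print("satisfying assignments:", solutions)
print("h(0,0,0,0) =", h((0, 0, 0, 0), clauses))
```

For this instance the only satisfying assignment is (0, 1, 0, 0), so the corresponding problem Hamiltonian has a unique ground state.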

To solve EC3 by the adiabatic algorithm, a sensible choice for the beginning Hamiltonian is

$$H_B = \sum_C H_{B,C}, \qquad (11)$$

where 


The  resulting  H(t)  is  local  in  the  sense  that  it  is  a  sum  of 
terms,  each  of  which  acts  on  only  a  few  qubits.  A  stronger 
kind  of  locality  may  be  imposed  by  restricting  the  instances 
so  that  each  bit  is  involved  in  at  most  a  fixed  number  of 
clauses.  The  computational  complexity  of  the  problem  is  un¬ 
changed  by  this  restriction. 
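
A small numerical experiment shows how the success probability depends on the run time $T$. The sketch below is not the simulation code used for the results reported here. It assumes that each clause contributes a term $(1 - \sigma_x)/2$ for each of its three bits to the beginning Hamiltonian, uses a hypothetical four-bit instance with a unique satisfying assignment, and integrates the Schrödinger equation with a piecewise-constant midpoint rule.

```python
import numpy as np
from itertools import product

N_BITS = 4
CLAUSES = [(0, 1, 2), (1, 2, 3), (0, 1, 3)]  # hypothetical instance, solution 0100


def problem_hamiltonian(n, clauses):
    """Diagonal matrix whose entries count the violated clauses."""
    costs = [sum(0 if z[i] + z[j] + z[k] == 1 else 1 for i, j, k in clauses)
             for z in product((0, 1), repeat=n)]
    return np.diag(np.array(costs, dtype=float))


def beginning_hamiltonian(n, clauses):
    """Assumed form: one (1 - sigma_x)/2 term per bit per clause."""
    sigma_x = np.array([[0.0, 1.0], [1.0, 0.0]])
    identity = np.eye(2)
    H = np.zeros((2**n, 2**n))
    for clause in clauses:
        for bit in clause:
            term = np.array([[1.0]])
            for q in range(n):
                term = np.kron(term, 0.5 * (identity - sigma_x) if q == bit else identity)
            H += term
    return H


def success_probability(H_B, H_P, T, steps=4000):
    """Evolve the ground state of H_B for time T and project on the ground state of H_P."""
    dim = H_B.shape[0]
    psi = np.full(dim, dim**-0.5, dtype=complex)  # uniform superposition
    dt = T / steps
    for m in range(steps):
        s = (m + 0.5) / steps  # midpoint of the current time slice
        evals, evecs = np.linalg.eigh((1.0 - s) * H_B + s * H_P)
        psi = evecs @ (np.exp(-1j * evals * dt) * (evecs.conj().T @ psi))
    target = int(np.argmin(np.diag(H_P)))
    return float(abs(psi[target]) ** 2)


H_P = problem_hamiltonian(N_BITS, CLAUSES)
H_B = beginning_hamiltonian(N_BITS, CLAUSES)
p_fast = success_probability(H_B, H_P, T=1e-3, steps=10)
p_slow = success_probability(H_B, H_P, T=100.0)
print("success probability for T = 0.001:", p_fast)
print("success probability for T = 100:", p_slow)
```

In the sudden limit the state stays close to the uniform superposition, so the success probability is near $1/2^n$; a slow sweep transfers most of the weight to the solution state.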

Numerical studies of the adiabatic algorithm applied to this problem were reported in [2,4]. Instances of EC3 with $n$ bits were generated by adding random clauses until there was a unique satisfying assignment, giving a distribution of instances that one might expect to be computationally difficult to solve. The results for a small number of bits ($n \le 20$) were consistent with the possibility that the adiabatic algorithm requires a time that grows only as a polynomial in $n$ for typical instances drawn from this distribution. If this is the case, then the gap $\Delta$ does not shrink exponentially. Although the typical spacing between levels must be exponentially small, since there are an exponential number of levels in a polynomial range of energies, it is possible that the gap at the bottom is larger. For example, Fig. 1 shows the spectrum of a randomly generated seven-bit instance of EC3. The gap at the bottom of the spectrum is reasonably large compared to the typical spacing. This feature is not specific to this one instance, but is characteristic of randomly generated instances, at least for $n \lesssim 10$, beyond which the repeated matrix diagonalization required to create a picture of the spectrum becomes computationally costly. A large gap makes an instance readily solvable by the adiabatic algorithm, and also provides robustness against thermal transitions out of the ground state.

FIG. 1. Spectrum of a randomly generated $n = 7$ bit instance of EC3 with a unique satisfying assignment. Note that the energy gap between the ground state and the first excited state is significantly larger than all other gaps. An expanded view would show that there are no level crossings anywhere in the spectrum (except for the degeneracies at $s = 0$ and 1).

IV.  DECOHERENCE 

Perhaps  the  most  significant  impediment  to  building  a 
large-scale  quantum  computer  is  the  problem  of  decoher¬ 
ence.  No  quantum  device  may  be  perfectly  isolated  from  its 
environment,  and  interactions  between  a  device  and  its  envi¬ 
ronment  will  inevitably  introduce  noise.  Fortunately,  such 
effects can be countered using fault-tolerant protocols, but as
we  have  already  mentioned,  these  protocols  may  be  costly. 
Therefore,  we  would  like  to  consider  quantum  systems  with 
inherent  resistance  to  decohering  effects.  If  the  ground  state 
of  our  adiabatic  quantum  computer  is  separated  from  the 
excited  states  by  a  sizable  energy  gap,  then  we  expect  it  to 
exhibit  such  robustness.  Here,  we  consider  how  the  adiabatic 
algorithm  for  EC3  is  affected  by  decoherence. 

First, we briefly review the master equation formalism for describing the decohering effects of an environment on a quantum system. Suppose that our quantum computer is a collection of spin-1/2 particles interacting with each other according to the Hamiltonian $H_S$ and weakly coupled to a large bath of photons. The total Hamiltonian of the quantum computer and its environment is

$$H = H_S + H_E + \lambda V, \qquad (14)$$

where $H_E$ is the Hamiltonian of its environment, $V$ is an interaction that couples the quantum computer and the photon bath, and $\lambda$ is a coupling constant. We may describe the state of the quantum computer alone by the density matrix $\rho$ found by tracing over the environmental degrees of freedom. In general, the time evolution of $\rho$ is complicated, but under reasonable assumptions, we may approximate its evolution using a Markovian master equation.

One way of deriving such a master equation is to consider
the weak coupling limit, in which $\lambda \ll 1$ [19]. If the
environment is very large and only weakly coupled to the quantum
computer, it will be essentially unchanged by the interaction.
Furthermore, in this limit, we expect the evolution of the
quantum computer to be Markovian, or local in time, if we
filter out high-frequency fluctuations by some coarse-graining
procedure. Assuming that the combined state of the
quantum computer and its environment begins in a product
state $\rho(0) \otimes \rho_E$, Davies derives the master equation

$$\frac{d\rho}{dt} = -i[H_S, \rho] + \lambda^2 K^\natural \rho , \qquad (15)$$

where

$$K\rho = -\int_0^\infty dx \, \mathrm{Tr}_E \big[ U(-x) V U(x), [V, \rho \otimes \rho_E] \big] , \qquad (16)$$

$$K^\natural \rho = \lim_{x \to \infty} \frac{1}{x} \int_0^x dy \, U(-y) \{ K[U(y) \rho U(-y)] \} U(y) , \qquad (17)$$

with

$$U(x) = e^{-ix(H_S + H_E)} , \qquad (18)$$

where we have (temporarily) assumed that $H_S$ is time
independent. Although the $\natural$ operation defined by Eq. (17) does
not appear in some formulations of the Markovian master
equation, it appears to be essential for the equation to properly
describe the weak-coupling limit [20], and in particular,
for it to capture the physics of relaxation to thermal
equilibrium. The master equation (15) has the property that if the
environment is in thermal equilibrium at a given temperature,
then the decohering transitions drive the quantum computer
towards the Gibbs state of $H_S$ at that temperature.
While not an exact description of the dynamics, Eq. (15)
should provide a reasonable caricature of a quantum computer
in a thermal environment.

Note that Eq. (15) is derived assuming a time-independent
Hamiltonian $H_S$; with a time-varying $H_S(t)$, we should expect
the generator of time evolution at any particular time to
depend on the Hamiltonian at all previous times [21]. However,
if $H_S(t)$ is slowly varying, then it is a good approximation
to imagine that the generator at any particular time
depends only on $H_S$ at that time [22]. In particular, since we
are interested in nearly adiabatic evolution, $H_S(t)$ varies
slowly, so Eq. (15) remains a good approximation, where at
any given time $t$ we compute $K^\natural$ using only $H_S(t)$. Note that
with $H_S(t)$ time dependent, $U(x)$ defined by Eq. (18) is not
the time evolution operator; it depends on the time $t$ only
implicitly through $H_S(t)$.

For a system of spins coupled to photons, we choose the
interaction

$$V = \sum_i \int_0^\infty d\omega \left[ g(\omega) \, a_\omega \, \sigma_+^{(i)} + g^*(\omega) \, a_\omega^\dagger \, \sigma_-^{(i)} \right] , \qquad (19)$$

where $\sum_i$ is a sum over the spins, $\sigma_\pm^{(i)}$ are raising and
lowering operators for the $i$th spin, $a_\omega$ is the annihilation
operator for the photon mode with frequency $\omega$, and $\lambda g(\omega)$ is the
product of the coupling strength and spectral density for that
mode. Note that if the coupling strength is frequency dependent,
we may absorb that dependence into $g(\omega)$, leaving $\lambda$
as a frequency-independent parameter. With this specific
choice for $V$, we can perform the integrals and trace in Eqs.
(15)-(18). If we assume that all spacings between eigenvalues
of $H_S$ are distinct, the resulting expression simplifies
considerably, and we find

$$\frac{d\rho}{dt} = -i[H_S, \rho] - \sum_{i,a,b} \Big[ N_{ba} |g_{ba}|^2 \langle a|\sigma_-^{(i)}|b\rangle \langle b|\sigma_+^{(i)}|a\rangle + (N_{ab} + 1) |g_{ab}|^2 \langle b|\sigma_-^{(i)}|a\rangle \langle a|\sigma_+^{(i)}|b\rangle \Big] \Big\{ (|a\rangle\langle a|\rho) + (\rho|a\rangle\langle a|) - 2 |b\rangle\langle a|\rho|a\rangle\langle b| \Big\} , \qquad (20)$$

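where the states $|a\rangle$ are the time-dependent instantaneous
eigenstates of $H_S$ with energy eigenvalues $\omega_a$,

$$N_{ba} = \frac{1}{\exp[\beta(\omega_b - \omega_a)] - 1} \qquad (21)$$

is the Bose-Einstein distribution at temperature $1/\beta$, and

$$g_{ba} = \begin{cases} \lambda g(\omega_b - \omega_a), & \omega_b > \omega_a, \\ 0, & \omega_b \le \omega_a . \end{cases} \qquad (22)$$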

We simulated the effect of thermal noise by numerically
integrating the master equation (20) with a Hamiltonian $H_S$ given
by Eq. (4) and with the initial pure state density matrix
$\rho(0) = |\psi(0)\rangle\langle\psi(0)|$ given by Eq. (13). For simplicity, we
chose $g(\omega) = 1$ for $\omega \ge 0$ and zero otherwise. Although we
would expect that $g(\omega) \to 0$ as $\omega \to \infty$, for the small systems
we are able to simulate, it should be a reasonable approximation
to treat $g(\omega)$ as constant and tune the overall coupling
strength using $\lambda^2$.
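
As a hedged illustration of why a master equation of the form (20) relaxes toward the Gibbs state, the sketch below keeps only the populations of a hypothetical two-level system with gap `omega`. In Eq. (20) the upward rate is proportional to $N$ and the downward rate to $N + 1$, so their ratio is $e^{-\beta\omega}$. The function names, the unit matrix element, $g(\omega) = 1$, and the fixed-step Euler integrator are assumptions made for this example; this is not the simulation code used for the results reported here.

```python
import math

def bose_einstein(beta, omega):
    """Eq. (21): mean photon number at inverse temperature beta (omega > 0)."""
    return 1.0 / math.expm1(beta * omega)

def gibbs_excited(beta, omega):
    """Excited-state population of the two-level Gibbs state."""
    return 1.0 / (1.0 + math.exp(beta * omega))

def relax_two_level(beta, omega, lam2=0.1, t_final=200.0, dt=0.01, p_excited=0.5):
    """Integrate the population rate equation implied by Eq. (20).

    Assumes a unit spin matrix element and g(omega) = 1, so the
    transition rates are 2*lam2*N (up) and 2*lam2*(N + 1) (down).
    """
    n = bose_einstein(beta, omega)
    rate_up = 2.0 * lam2 * n            # absorption: ground -> excited
    rate_down = 2.0 * lam2 * (n + 1.0)  # emission: excited -> ground
    steps = int(round(t_final / dt))
    for _ in range(steps):
        p_ground = 1.0 - p_excited
        p_excited += dt * (rate_up * p_ground - rate_down * p_excited)
    return p_excited

if __name__ == "__main__":
    for temperature in (0.1, 0.5, 1.0, 2.0, 10.0):
        beta = 1.0 / temperature
        print(temperature, relax_two_level(beta, 1.0), gibbs_excited(beta, 1.0))
```

The two printed columns should agree at each temperature, which is the detailed-balance property that makes the steady state thermal.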

How should we expect the success probability
$\langle\varphi|\rho(T)|\varphi\rangle$, where $|\varphi\rangle$ is the ground state of $H_P$, to depend
on the run time $T$ and the temperature? If the run time $T$ is
sufficiently long, then regardless of its initial state, the
quantum computer will come to thermal equilibrium. At the time
of the final readout, it will be close to the Gibbs state

$$\lim_{T \to \infty} \rho(T) = \frac{e^{-\beta H_P}}{\mathrm{Tr} \, e^{-\beta H_P}} \equiv \rho_P \qquad (23)$$

of the problem Hamiltonian $H_P$, and the success probability
will be approximately $\langle\varphi|\rho_P|\varphi\rangle$. This probability may be
appreciable if the temperature is small compared to the gap
between the ground state and first excited state of $H_P$. Thus,
one way to find the ground state of $H_P$ is to prepare the
computer in any initial state, put it in a cold environment,
wait a long time, and measure. However, this thermal
relaxation method is not an efficient way to solve hard optimization
problems. Although it may work well on some instances
of a given problem, this method will not work in cases where
the computer gets stuck in local minima from which downward
transitions are unlikely. In such cases, the time for
equilibration is expected to be exponentially large in $n$.
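
The thermal success probability $\langle\varphi|\rho_P|\varphi\rangle$ of Eq. (23) is simple to evaluate when $H_P$ is diagonal in the computational basis. The following minimal sketch assumes a unique ground state; the function name and the example spectrum are hypothetical and are not taken from any instance studied here.

```python
import math

def thermal_success_probability(energies, temperature):
    """<phi|rho_P|phi> for a diagonal H_P with a unique ground state.

    energies: eigenvalues of H_P, one per basis state.
    Energies are shifted by the minimum for numerical stability.
    """
    beta = 1.0 / temperature
    e0 = min(energies)
    partition = sum(math.exp(-beta * (e - e0)) for e in energies)
    return 1.0 / partition  # the shifted ground-state weight is exp(0) = 1

if __name__ == "__main__":
    spectrum = [0, 1, 1, 2, 2, 3]  # hypothetical spectrum with gap 1
    for temperature in (0.1, 0.5, 1.0, 2.0, 10.0):
        print(temperature, thermal_success_probability(spectrum, temperature))
```

For temperatures well below the gap the probability approaches one, and for temperatures well above the largest energy it approaches one over the number of states.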

Consider an instance with a long equilibration time so that
cooling alone is not an efficient way to find the ground state
of $H_P$. It is possible that the minimum gap $\Delta$ associated
with the quantum algorithm is not small, and the idealized
quantum computer, running without decohering effects,
would find the ground state of $H_P$ in a short time. In this
situation, if we include the coupling of the system to the
environment and we run at a temperature much below $\Delta$,
then thermal transitions are never likely, and the adiabatic
algorithm should perform nearly as well as in the absence of
decoherence. But if the temperature is comparable to $\Delta$, then
the performance may be significantly degraded.

On the other hand, consider an instance for which the
equilibration time is short, so that cooling alone is a good
algorithm. Furthermore, suppose that the adiabatic algorithm
would find the ground state of $H_P$ in a short time in the
absence of decohering effects. In this case, the combined
effects of cooling and adiabatic evolution will surely find the
ground state of $H_P$ in a short time. But note that $\Delta$ alone
does not control the success of the algorithm. Even if $H(t)$
changes too quickly for the evolution to be truly adiabatic so
that a transition occurs where the gap is smallest, the system
may be cooled back into its ground state at a later time.

Typical results of the simulation are shown in Fig. 2 for
two $n = 4$ bit instances of EC3 with unique satisfying
assignments. These two instances have minimum gaps of
$\Delta_1 \approx 0.301$ and $\Delta_2 \approx 0.425$. For each instance, we plot the
success probability as a function of the run time $T$. With
$\lambda^2 = 0.1$, we consider five temperatures: 1/10, 1/2, 1, 2, and 10.
We also present the data with no decoherence ($\lambda^2 = 0$) for
comparison.

Unfortunately, the time required to integrate Eq. (20)
grows very rapidly with $n$. Whereas a state vector contains
$2^n$ entries, the density matrix contains $4^n$ entries; and in
addition, calculating $d\rho/dt$ at each timestep requires
evaluating a double sum over $2^n$ energy eigenstates. For this reason,
we were only able to consider instances with $n \le 4$.
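
To make the growth concrete, the short sketch below tabulates the counts named in the preceding paragraph. The function name and dictionary keys are illustrative only, and the counts ignore constant factors such as the sum over spins in Eq. (20).

```python
def simulation_sizes(n):
    """Entry counts for an n-qubit pure-state vs. density-matrix simulation."""
    dim = 2 ** n
    return {
        "state_vector_entries": dim,          # 2^n amplitudes
        "density_matrix_entries": dim * dim,  # 4^n matrix elements
        "eigenstate_pairs": dim * dim,        # (a, b) terms in the double sum
    }

if __name__ == "__main__":
    for n in (4, 7, 10):
        print(n, simulation_sizes(n))
```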

The results are consistent with our general expectations.
In the absence of decoherence, the success probability
becomes appreciable for sufficiently long run times. This
probability rises faster for the problem with a larger gap. When
we add decoherence at high temperature, the success
probability never becomes very large (note the lowest curves in
Fig. 2). As the temperature is decreased to a value of order
one, the presence of decoherence has a less significant effect
on the success probability. In fact, for sufficiently low
temperatures, the success probability may actually be higher in
the presence of decoherence than when there is no
decoherence. This is because the primary effect of decoherence at
low temperature is to drive transitions towards the ground
state, improving performance.

However, these results do not illustrate a definitive
connection between the minimum gap $\Delta$ and the temperature
above which the algorithm no longer works. These simple
$n = 4$ bit instances fall into the second category discussed
above: the equilibration time is short, so cooling alone is a
good algorithm. In other words, no sharp distinction can be
drawn between the run time required for the adiabatic
algorithm to perform well in the absence of decoherence and the
run time required for equilibration. Accordingly, the
dependence of the success probability on temperature and run time
is similar for the two instances shown in Fig. 2, even though
the minimum gaps for these instances are somewhat
different.

FIG. 2. The success probability as a function of run time $T$ for two instances of EC3 with $n = 4$ bits. The instance on the left has a gap
of $\Delta_1 \approx 0.301$ and the instance on the right has a gap of $\Delta_2 \approx 0.425$. The dotted line shows the behavior of the algorithm with no decoherence,
i.e., $\lambda^2 = 0$. Note that in the figure on the right, the dotted curve is partially obscured but can be seen slightly above the topmost solid curve.
The solid lines show the behavior of the algorithm in the presence of decoherence with $\lambda^2 = 0.1$ for five different temperatures. The triangles
at the far right show the thermal success probabilities $\langle\varphi|\rho_P|\varphi\rangle$ at each of these temperatures. From top to bottom, the temperatures are 1/10,
1/2, 1, 2, and 10.

V.  UNITARY  CONTROL  ERROR 

We now consider how the performance of the adiabatic
algorithm for EC3 is affected by adding three different kinds
of perturbations to the Hamiltonian. Each perturbation we
consider is a sum of single-qubit terms, where each term can
be interpreted as a magnetic field pointing in a random
direction. To simplify our analysis, we assume that the
magnitude of the magnetic field is the same for all qubits, but its
direction varies randomly from qubit to qubit. The
perturbations we consider are

$$K_1(s) = C_1 s \sum_{i=1}^{n} \hat{m}_i \cdot \vec{\sigma}^{(i)} , \qquad (24)$$

$$K_2(s) = C_2 \sin(\pi s) \sum_{i=1}^{n} \hat{m}_i \cdot \vec{\sigma}^{(i)} , \qquad (25)$$

$$K_3(s) = \frac{1}{2} \sin(C_3 \pi s) \sum_{i=1}^{n} \hat{m}_i \cdot \vec{\sigma}^{(i)} , \qquad (26)$$

which are added to Eq. (2) and give a time-dependent
Hamiltonian according to Eq. (4). Each $\hat{m}_i$ is a randomly generated
real three-component vector with unit length, $C_1$ and $C_2$ are
real numbers, and $C_3$ is a nonnegative integer.
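
The three perturbations share the same spatial structure and differ only in their scalar envelopes. A minimal sketch of those envelopes and of the random field directions follows; the function names are hypothetical, and drawing directions by normalizing Gaussian samples is one standard way to get uniformly distributed unit vectors, not necessarily the method used for the results reported here.

```python
import math
import random

def random_unit_vector(rng):
    """Uniformly random direction in 3D from normalized Gaussian samples."""
    while True:
        v = [rng.gauss(0.0, 1.0) for _ in range(3)]
        norm = math.sqrt(sum(x * x for x in v))
        if norm > 1e-12:  # reject the (vanishingly rare) near-zero draw
            return [x / norm for x in v]

def envelope(j, s, c):
    """Scalar prefactor of K_j(s) in Eqs. (24)-(26), with c playing the role of C_j."""
    if j == 1:
        return c * s
    if j == 2:
        return c * math.sin(math.pi * s)
    if j == 3:
        return 0.5 * math.sin(c * math.pi * s)
    raise ValueError("j must be 1, 2, or 3")

def field_terms(j, s, c, directions):
    """Coefficients of (sigma_x, sigma_y, sigma_z) on each qubit at parameter s."""
    f = envelope(j, s, c)
    return [[f * component for component in m] for m in directions]

if __name__ == "__main__":
    rng = random.Random(0)
    directions = [random_unit_vector(rng) for _ in range(7)]
    print(field_terms(2, 0.5, 1.0, directions)[0])
```

Note that the envelopes of $K_2$ and $K_3$ vanish at $s = 0$ and $s = 1$ (for integer $C_3$), whereas that of $K_1$ reaches $C_1$ at $s = 1$.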

The adiabatic algorithm was simulated by numerically
solving the time-dependent Schrödinger equation with initial
state $|\psi(0)\rangle$ given by Eq. (13) and Hamiltonian $H(t/T)
+ K_j(t/T)$ for a given $j \in \{1,2,3\}$. As in [2-4], we used a
fifth-order Runge-Kutta method with variable step size, and
checked the accuracy by verifying that the norm of the state
was maintained to one part in a thousand. For a specified
value of $n$, we randomly generated an instance of EC3 with a
unique satisfying assignment. Then we randomly generated
several different values of the magnetic field directions $\{\hat{m}_i\}$.
For each instance of the problem and the magnetic field, the
run time was chosen so that the success probability without
the perturbation was reasonably high. With this run time
fixed, we then determined the success probability for varying
values of the relevant $C_j$.
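
The procedure can be sketched on a toy problem. The example below is a deliberately simplified stand-in: it uses a single qubit with the hypothetical choices $H_B = (1 - \sigma_x)/2$ and $H_P = (1 - \sigma_z)/2$, a linear interpolation $H(s) = (1 - s) H_B + s H_P$, no perturbation, and a fixed-step fourth-order Runge-Kutta integrator rather than the variable-step fifth-order method described above. All function names are assumptions made for illustration.

```python
import math

def hamiltonian(s):
    """Toy one-qubit H(s) = (1 - s) H_B + s H_P as a 2x2 real matrix."""
    h_b = [[0.5, -0.5], [-0.5, 0.5]]  # (1 - sigma_x)/2, ground state |+>
    h_p = [[0.0, 0.0], [0.0, 1.0]]    # (1 - sigma_z)/2, ground state |0>
    return [[(1 - s) * h_b[r][c] + s * h_p[r][c] for c in range(2)]
            for r in range(2)]

def rhs(t, psi, run_time):
    """Right-hand side of i d|psi>/dt = H(t/T) |psi>."""
    h = hamiltonian(t / run_time)
    return [-1j * (h[r][0] * psi[0] + h[r][1] * psi[1]) for r in range(2)]

def step_rk4(t, psi, dt, run_time):
    """One classical fourth-order Runge-Kutta step."""
    k1 = rhs(t, psi, run_time)
    k2 = rhs(t + dt / 2, [p + dt / 2 * k for p, k in zip(psi, k1)], run_time)
    k3 = rhs(t + dt / 2, [p + dt / 2 * k for p, k in zip(psi, k2)], run_time)
    k4 = rhs(t + dt, [p + dt * k for p, k in zip(psi, k3)], run_time)
    return [p + dt / 6 * (a + 2 * b + 2 * c + d)
            for p, a, b, c, d in zip(psi, k1, k2, k3, k4)]

def success_probability(run_time, steps=2000):
    """Return (|<0|psi(T)>|^2, norm of psi(T)) starting from the ground state of H_B."""
    psi = [complex(1 / math.sqrt(2)), complex(1 / math.sqrt(2))]
    dt = run_time / steps
    for k in range(steps):
        psi = step_rk4(k * dt, psi, dt, run_time)
    norm = sum(abs(a) ** 2 for a in psi)
    return abs(psi[0]) ** 2, norm

if __name__ == "__main__":
    for run_time in (0.1, 1.0, 5.0, 20.0):
        print(run_time, success_probability(run_time))
```

The returned norm plays the role of the accuracy check mentioned above: a drift from one signals that the step size is too large.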

First, we consider the perturbation $K_1$. Since it turns on at
a constant rate, this perturbation can be thought of as an error
in $H_P$. Note that with $C_1 \ne 0$, the final Hamiltonian is not
simply $H_P$, so the algorithm will not work exactly even in
the adiabatic limit $T \to \infty$. This perturbation is potentially
dangerous because of the way its effect scales with the
number of bits $n$. Indeed, consider the case where $H_P$ can be
separated into a sum of Hamiltonians acting separately on
each qubit. If adding $K_1$ reduces the overlap of the ground
state $|\varphi\rangle$ of $H_P$ with the perturbed ground state $|\varphi'\rangle$ by
some fixed value $\epsilon$ for each of the $n$ qubits, then the total
overlap is $(1 - \epsilon)^n$, which is exponentially small in the
number of bits. Thus, the algorithm clearly fails in this factorized
case. In general, if the magnitude of $K_1$ is independent of $n$,
then we expect the algorithm to fail. However, if the
magnitude of $K_1$ falls as $1/n$ or faster, then the shift of the ground
state may be small enough (as it would be in the factorized
case) that the algorithm is not significantly affected. Note
that for any $n$ there is some value of $C_1$ that is small enough
that the disadvantage of reduced overlap with the ground
state of $H_P$ may be overcome if the perturbation happens to
increase the minimum gap $\Delta$. For this reason, we expect to
sometimes see an increase in success probability for small
$C_1$ that goes away as $C_1$ is increased.

FIG. 3. (Top) The success probability of the adiabatic algorithm for two randomly generated instances of EC3 with $n = 7$ bits (left) and
$n = 10$ bits (right) under the perturbation $K_1$ defined by Eq. (24) for four different sets of magnetic-field directions. For each $n$, the run time
is the same for each random perturbation. (Bottom) The corresponding overlaps $|\langle\varphi|\varphi'\rangle|^2$ of the ground state $|\varphi\rangle$ of $H_P$ with the perturbed
ground state $|\varphi'\rangle$ at $s = 1$.
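
The factorized argument can be checked numerically. The sketch below assumes, purely for illustration, that each qubit contributes a term $(1 - \sigma_z)/2$ to $H_P$, so that the perturbed single-qubit Hamiltonian at $s = 1$ is $\tfrac12 + \vec{b} \cdot \vec{\sigma}$ with $\vec{b} = C_1 \hat{m} - \tfrac12 \hat{z}$. Its ground state has Bloch vector $-\vec{b}/|\vec{b}|$, so the overlap with $|0\rangle$ is $(1 - b_z/|\vec{b}|)/2$. The function names are hypothetical.

```python
import math

def single_qubit_overlap(c1, m):
    """|<phi|phi'>|^2 for H = (1 - sigma_z)/2 + c1 * m.sigma on one qubit.

    Undefined when the total field vanishes (c1 = 1/2 with m along +z).
    """
    bx, by, bz = c1 * m[0], c1 * m[1], c1 * m[2] - 0.5
    norm = math.sqrt(bx * bx + by * by + bz * bz)
    return 0.5 * (1.0 - bz / norm)

def factorized_overlap(c1, directions):
    """Total overlap as a product over independent qubits."""
    total = 1.0
    for m in directions:
        total *= single_qubit_overlap(c1, m)
    return total

if __name__ == "__main__":
    n = 100
    transverse = [[1.0, 0.0, 0.0]] * n  # worst-case-like transverse fields
    print("fixed C1 = 0.1 :", factorized_overlap(0.1, transverse))
    print("C1 = 0.1 / n   :", factorized_overlap(0.1 / n, transverse))
```

With a fixed field strength the product decays exponentially in $n$, while a strength that falls as $1/n$ leaves the total overlap close to one.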

The effect of the perturbation $K_1$ is shown in Fig. 3 for
$n = 7$ and 10 bit instances of EC3, with four different
randomly generated sets of magnetic-field directions for each
instance. The run time is chosen such that for $C_1 = 0$, the
success probability is around 1/2. The top plots show that for
small $C_1$, the success probability is not strongly suppressed;
in fact, in some cases it is significantly enhanced. For large
enough $C_1$, the success probability is heavily suppressed.
The bottom plots show the overlap $|\langle\varphi|\varphi'\rangle|^2$ between the
ground state of $H_P$ and the actual ground state in the
presence of the perturbation. As we expect, the suppression of
the success probability is correlated with the amount of
overlap. We also studied a similar perturbation in which $s$ is
replaced by $1 - s$, which may be thought of as an error in
$H_B$. Unsurprisingly, the results were qualitatively similar.

Next, we consider the low-frequency perturbation $K_2$. The
period of oscillation is chosen such that the perturbation
vanishes at $t = 0$ and $T$, so the perturbation does not affect the
algorithm in the adiabatic limit. Since the success probability
is quite sensitive to the value of the minimum gap $\Delta$, and it
is not a priori obvious whether a perturbation will increase
or decrease $\Delta$, we can guess that turning on a nonzero value
of $C_2$ may either increase the success probability or decrease
it. In fact, it would be surprising if $\Delta$ decreased for all
perturbations $K_2$. The Hamiltonian $H(s) + K_2(s)$ is another way
to interpolate from $H_B$ to $H_P$, and we know of no reason
why the choice $K_2 = 0$ should always be optimal, even when
the number of bits is large and $C_2$ is not decreasing with $n$.

Figure 4 shows the effect of the perturbation $K_2$, using the
same instances, magnetic field directions, and run times as in
Fig. 3. The top plots show the success probability as a
function of $C_2$. As in the case of $K_1$, some perturbations may
raise the success probability and some suppress it. Perhaps
unsurprisingly, a particular set of magnetic field directions
that raises the success probability under $K_1$ is also likely to
help when $K_2$ is applied. But unlike $K_1$, $K_2$ may improve
the success probability even with $C_2 \approx 2$, where the size of
the perturbation is comparable to the size of the unperturbed
Hamiltonian. The bottom plots show the minimum gap $\Delta$
when the perturbation is added. Note that there is a strong
correlation between the success probability and $\Delta$.

FIG. 4. (Top) The success probability of the adiabatic algorithm for the same instances used in Fig. 3 under the perturbation $K_2$ defined
by Eq. (25). The four different magnetic field directions for each instance are also the same as in Fig. 3. (Bottom) The minimum gap $\Delta$ in
the perturbed problem.

For both perturbations $K_1$ and $K_2$, similar results have
been observed (with fewer data points) for instances with as
many as $n = 14$ bits. Figures 3 and 4 present typical data. For
example, for a given instance, typically one or two out of
four sets of randomly chosen magnetic-field directions led to
an improvement in the success probability for some values of
$C_1$ and $C_2$, compared to the unperturbed case.

Finally, we consider the perturbation $K_3$, in which the
magnitude of the oscillating component is fixed, but we may
vary its frequency by varying $C_3$. As for $K_2$, the frequency is
chosen so that the perturbation vanishes at $t = 0$ and $T$. We
expect that for $C_3$ of order one, the perturbation will be
likely to excite a transition, and that the success probability
will be small. But since both $H_B$ and $H_P$ have a maximum
eigenvalue of order $n$, we may anticipate that for

$$C_3 \gg \frac{nT}{\pi} , \qquad (27)$$

the perturbation will be far from any resonance. Then the
probability that the perturbation drives a transition will be
low, and the success probability should be comparable to the
case where the perturbation vanishes.
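
The estimate (27) follows from comparing the drive frequency with the largest available transition energy. A short worked version of that reasoning is given below; the symbol $\omega_3$ for the angular frequency of the perturbation is introduced here only for illustration.

```latex
K_3(t/T) \propto \sin\!\left(\frac{C_3 \pi t}{T}\right)
\quad\Longrightarrow\quad
\omega_3 = \frac{C_3 \pi}{T},
\qquad
\omega_3 \gg \max_{a,b} |\omega_a - \omega_b| \sim n
\quad\Longleftrightarrow\quad
C_3 \gg \frac{nT}{\pi}.
```

In words, once the drive oscillates much faster than any energy difference in the spectrum, which is at most of order $n$, no pair of levels can be resonantly coupled.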

Some representative plots of the dependence of the
success probability on $C_3$ are shown in Fig. 5. Each plot
corresponds to a particular randomly generated instance of EC3
(with either $n = 8$ bits or $n = 10$ bits) and a randomly
generated set of magnetic field directions. In the top row of plots,
the run time is chosen so that the success probability is
around 1/8 with the perturbation absent (i.e., $C_3 = 0$). In the
bottom row, the run time is doubled. All of the data exhibit
the expected qualitative trend. The leftmost point
corresponds to $C_3 = 0$. For the smallest values of $C_3 > 0$, the
success probability may not be too badly damaged; for
somewhat larger values of $C_3$, it is heavily suppressed; and for
sufficiently large $C_3$, it recovers to a value near the success
probability in the absence of the perturbation. The value of
$nT/\pi$ is around 19 and 39 for the upper and lower $n = 8$
plots and is around 38 and 76 for the upper and lower
$n = 10$ plots, so the estimate (27) turns out to be reasonable.

FIG. 5. The success probability as a function of the frequency $C_3$ of the perturbation $K_3$ defined in Eq. (26). The data in each plot were
obtained for a randomly generated instance of EC3 with randomly generated magnetic-field directions. The data in the left column are for
two instances with $n = 8$ bits, and the data in the right column are for two instances with $n = 10$ bits. For the top row, the run time is chosen
so that the success probability is around 1/8 for $C_3 = 0$, and for the bottom row, the run time is twice as long. The leftmost points in each
plot correspond to $C_3 = 0$, so the perturbation is absent for all $t$. $C_3$ takes integer values, so the lines are included only to guide the eye.

Another conspicuous feature of the plots in Fig. 5 is that
the success probability tends to oscillate between even and
odd values of $C_3$, though whether even or odd values are
favored varies from case to case. This occurs because the
perturbation's time average vanishes for $C_3$ even, so that its
integrated effect is weaker than for $C_3$ odd. Since a small
perturbation might either help or hurt, the success probability
is slightly enhanced for odd $C_3$ in some cases, and is slightly
suppressed in other cases.
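
The even/odd statement about the time average can be verified directly from the envelope $\tfrac12 \sin(C_3 \pi s)$ of Eq. (26), whose integral over $0 \le s \le 1$ is $(1 - \cos C_3 \pi)/(2 C_3 \pi)$. The sketch below compares a midpoint-rule estimate with this closed form; the function names and the sample count are arbitrary choices for illustration.

```python
import math

def envelope_time_average(c3, samples=20000):
    """Midpoint-rule estimate of the integral of (1/2) sin(c3*pi*s) over [0, 1]."""
    h = 1.0 / samples
    total = sum(0.5 * math.sin(c3 * math.pi * (k + 0.5) * h)
                for k in range(samples))
    return total * h

def closed_form_average(c3):
    """(1 - cos(c3*pi)) / (2*c3*pi): zero for even c3, 1/(c3*pi) for odd c3."""
    return (1.0 - math.cos(c3 * math.pi)) / (2.0 * c3 * math.pi)

if __name__ == "__main__":
    for c3 in range(1, 7):
        print(c3, envelope_time_average(c3), closed_form_average(c3))
```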


VI.  CONCLUSIONS 

We have conducted numerical simulations to investigate
the fault tolerance of adiabatic quantum computation, and
our results are consistent with the claim that this algorithm is
robust against decoherence and certain kinds of random
unitary perturbations. Thus, if a physical system could be
engineered with interactions reasonably well described by a
Hamiltonian that smoothly interpolates from an initial $H_B$ to
a final $H_P$ corresponding to an interesting combinatorial
search problem, and if the gap remains large throughout the
interpolation, that system might be a powerful computational
device.

Although we have viewed unitary perturbations as noise,
the fact that they sometimes raise the success probability
suggests a possible way to speed up the adiabatic algorithm.
The algorithm finds the ground state of $H_P$ by starting the
system in the ground state of $H_B$. The quantum state evolves
as the system Hamiltonian smoothly interpolates from $H_B$ to
$H_P$. However, there are many possible choices for $H_B$ and
many smooth paths from a given $H_B$ to $H_P$. The choices
(11) and (2) are convenient but arbitrary, so choosing an
alternate route to $H_P$ might speed up the algorithm. An
example of this is seen in [23], where it is shown that
optimizing the time-dependent coefficients of $H_B$ and $H_P$ allows the
adiabatic algorithm to achieve a square-root speedup for an
unordered search problem. More generally, the interpolating
Hamiltonian might involve terms that have nothing to do
with $H_B$ or $H_P$, but that increase $\Delta$ and therefore improve
performance. For example, the perturbation $K_2$ sometimes
increases the success probability, as seen in Fig. 4. Rather
than being thought of as a source of error, such a
perturbation could be applied intentionally and might sometimes
enhance the effectiveness of the adiabatic algorithm.

We show that the nondemolition measurement of a spacelike Wilson loop operator $W(C)$ is impossible in a
relativistic non-Abelian gauge theory. In particular, if two spacelike-separated magnetic flux tubes both link
with the loop $C$, then a nondemolition measurement of $W(C)$ would cause electric charge to be transferred
from one flux tube to the other, a violation of relativistic causality. A destructive measurement of $W(C)$ is
possible in a non-Abelian gauge theory with suitable matter content. In an Abelian gauge theory, many
cooperating parties distributed along the loop $C$ can perform a nondemolition measurement of the Wilson loop
operator if they are equipped with a shared entangled ancilla that has been prepared in advance. We also note
that Abelian electric charge (but not non-Abelian charge) can be transported superluminally, without any
accompanying transmission of information.

I. INTRODUCTION AND SUMMARY

What measurements are possible in a gauge field theory?
Since the interactions of the elementary constituents of matter
are described by gauge theory, hardly any question could
be more fundamental. Yet definitive answers are elusive.

The Wilson loop operators associated with closed spacelike
paths provide a complete characterization of a gauge-field
configuration in terms of gauge-invariant quantities
[1,2]. Therefore, in formulations of gauge theories, Wilson
loops are often taken to be the basic observables. But we will
show that nondemolition measurements of spacelike Wilson
loops are impossible in a non-Abelian gauge theory that
respects relativistic causality. We reach this conclusion by
arguing that any procedure for nondemolition measurement of
a spacelike non-Abelian Wilson loop would allow information
to be transmitted outside the forward light cone.

Causality places no such restriction on the measurability
of an Abelian Wilson loop (one evaluated in a one-dimensional
irreducible representation of the gauge group),
and indeed we find that nondemolition measurement of an
Abelian Wilson loop is possible. We also find that, in gauge
theories with suitable matter content, destructive measurements
of non-Abelian Wilson loops are possible. By destructive
measurements we mean ones that, in contrast to nondemolition
measurements, inflict damage on Wilson loop
eigenstates.

In a quantum field theory in flat spacetime, described in
the Schrödinger picture, what do we mean by a nondemolition
“measurement” of an observable defined on a time
slice? Typically, such a measurement requires the cooperation
of many parties who are distributed over the slice, and is
a three-step process. In the first step (which might not be
necessary), a suitable entangled quantum state (the “ancilla”)
is prepared and distributed to the parties. Second,
each party performs a local operation on her local field
variables and her part of the entangled ancilla. Third, classical or
quantum information extracted by the parties in the second
step is shipped to a central location where the readout of the
result is completed.

Although the outcome of the measurement is not known
until the third step is completed, the coherence of a
superposition of eigenstates of the observable with distinct
eigenvalues is already destroyed in the second step, which is carried
out on the time slice where the operator is defined. At that
time, the density operator $\rho$ encoding the quantum state of
the field theory is transformed according to

$$\rho \to \mathcal{E}(\rho) \equiv \sum_a E_a \rho E_a , \qquad (1)$$

where $\{E_a\}$ is the set of orthogonal projectors onto the
eigenspaces of the observable. The term “nondemolition”
means that if the state prior to the measurement is an
eigenstate of the observable, then the state will be unaffected by
the measurement.
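
As a small illustration of Eq. (1), the sketch below applies the operation $\rho \to \sum_a E_a \rho E_a$ to a single qubit with projectors onto the two computational basis states. The qubit, the choice of projectors, and the function names are assumptions made for the example; they stand in for the eigenspaces of a generic observable, not for a Wilson loop.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def projective_operation(rho, projectors):
    """E(rho) = sum_a E_a rho E_a, as in Eq. (1)."""
    n = len(rho)
    out = [[0j] * n for _ in range(n)]
    for e in projectors:
        term = matmul(matmul(e, rho), e)
        out = [[out[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return out

# Orthogonal projectors onto the two eigenspaces (hypothetical observable).
E0 = [[1, 0], [0, 0]]
E1 = [[0, 0], [0, 1]]

if __name__ == "__main__":
    eigenstate = [[1, 0], [0, 0]]             # |0><0|
    superposition = [[0.5, 0.5], [0.5, 0.5]]  # |+><+|
    print(projective_operation(eigenstate, [E0, E1]))
    print(projective_operation(superposition, [E0, E1]))
```

The eigenstate is returned unchanged, which is the nondemolition property, while the off-diagonal coherences of the superposition are erased.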

Any permissible way in which a quantum state can
change is described by a quantum operation, a completely
positive trace-nonincreasing linear map of density operators
to density operators [3,4]. The orthogonal measurement $\mathcal{E}$ in
Eq. (1), summed over its possible outcomes, is a special type
of quantum operation. It is natural to ask, what are the
quantum operations that can really be executed on a time slice in
a relativistic quantum theory? The general answer is not
known, but it is known that many operations are unphysical
because they run afoul of relativistic causality [5-11].
Consider, as in Fig. 1, two parties Alice and Bob who perform
spacelike-separated actions. Just prior to the implementation
of $\mathcal{E}$, Alice performs a local operation on the fields in her
vicinity, and just after the implementation of $\mathcal{E}$, Bob performs
a local measurement of the fields in his vicinity. If Bob is
able to acquire any information about what local operation
Alice chose to apply, then Alice has successfully sent a
superluminal signal to Bob. If an operation allows such
superluminal signaling, we say that the operation is acausal;
otherwise, it is causal. Physically realizable operations must be
causal. We will apply this causality criterion to non-Abelian
gauge theories, and will argue that nondemolition
measurement of a Wilson loop operator is an acausal operation.

FIG. 1. Causality criterion for quantum operations. First Alice
applies a local operator in her vicinity, then the quantum operation
is executed, and finally Bob makes a local measurement that is
spacelike separated from Alice’s action. If Bob’s measurement
result allows him to acquire information about what local operator
Alice applied, then the quantum operation is not causal and hence
not physically implementable.

In  discussing  the  locality  properties  of  a  field  theory,  it  is 
convenient  to  use  the  concept  of  a  “reduced”  density  opera¬ 
tor  that  encodes  the  observations  that  are  accessible  to  an 
agent  acting  in  a  bounded  spatial  region.  This  density  opera¬ 
tor  is  obtained  from  a  density  operator  for  the  full  system  by 
“tracing  out”  the  degrees  of  freedom  in  the  unobserved  re¬ 
gion.  In  a  gauge  theory,  performing  a  partial  trace  involves 
potential subtleties arising from the Gauss law constraint
satisfied by physical states. For conceptual clarity, we will
sidestep these difficulties by founding our discussion on the
concept of charge superselection sectors [12]. Strictly speaking,
our analysis applies to the “free-charge” phase of a weakly
coupled gauge theory with non-Abelian gauge group G; the
local symmetry is unbroken and G charges are unconfined.
The  same  argument,  though,  shows  that  Wilson  loop  mea¬ 
surement  would  allow  superluminal  signaling  in  a  confining 
gauge  theory,  where  the  separation  between  the  communicat¬ 
ing  parties  is  small  compared  to  the  confinement  distance 
scale. 

The  protocol  by  which  Alice  can  exploit  measurement  of 
the  spacelike  Wilson  loop  operator  W(C)  to  send  a  signal  to 
Bob  is  illustrated  in  Fig.  2.  First  Alice  and  Bob,  acting  on  the 
weakly-coupled  ground  state  with  gauge-invariant  local 
probes,  both  create  magnetic  flux  tubes.  Bob’s  flux  tube  links 
with  the  loop  C;  Alice  encodes  one  bit  of  classical  informa¬ 
tion  by  placing  her  tube  in  one  of  two  possible  positions, 
either  linking  with  C  or  not.  In  the  framework  of  lattice 
gauge theory, we may imagine that Bob has control of a
single lattice link $l_B$ contained in the loop C, and he creates
his “flux tube” by manipulating his link, exciting the lattice
plaquettes that contain $l_B$ to a particular nontrivial conjugacy
class of G, as illustrated in Fig. 3. Similarly, Alice controls a
single link $l_A$ and she encodes a bit by either exciting her
link or not. Of course, since Alice and Bob act locally and
the theory respects a charge superselection rule, the flux
tubes created by Alice and Bob have trivial electric charge.

FIG. 2. Nondemolition measurement of the Wilson loop
operator W(C) [more precisely decoherence in the basis of eigenstates of
W(C)] allows Alice to signal Bob. Alice and Bob, with
gauge-invariant probes, can prepare magnetic flux tubes that carry trivial
electric charge. In (a), Bob’s tube links with the loop C but Alice’s
does not; when W(C) is measured, neither tube is affected. In (b),
Alice moves her tube into position so that it too links with C; when
W(C) is measured, then (with nonvanishing probability), Bob’s
tube and Alice’s acquire nontrivial and opposite electric charges. By
measuring the charge of his tube, Bob can tell how Alice positioned
her loop and so receive a message from Alice.

If  either  Bob’s  tube  or  Alice’s  tube  links  with  the  loop  C, 
but  not  both,  then  the  configuration  is  an  eigenstate  of  W(C) 
(or  close  to  an  eigenstate  in  the  weakly-coupled  case)  and 
will  be  unaffected  (or  little  affected)  by  the  measurement  of 
W(C).  But  if  both  tubes  link  with  C,  the  configuration  is  not 
an  eigenstate,  and  will  be  altered  by  the  measurement.  We 
will  see  that,  with  nonvanishing  probability,  the  measure¬ 
ment  will  generate  equal  and  opposite  nonzero  electric 
charges  on  Alice’s  tube  and  Bob’s.  Then,  by  measuring  the 
charge  on  his  tube,  Bob  can  infer  (with  a  success  probability 
better  than  a  random  guess)  whether  Alice’s  tube  linked  with 
C  or  not,  and  so  receive  a  superluminal  signal. 

In a non-Abelian gauge theory, a magnetic flux tube can
carry a peculiar kind of electric charge that has no localized
source, which has been called Cheshire charge [13,14]. (The
property that the charge of an excitation in a non-Abelian
gauge theory need not be the integral of a local density is
analogous to the property that the energy of an object in
general relativity need not be the integral of a local density.)
Our protocol for superluminal signaling is based on the
observation that Wilson loop measurement causes Cheshire
charge to be transferred from Alice’s flux tube to Bob’s.
Cheshire charge, while conceptually elusive, is physically
genuine and readily detected in principle.

FIG. 3. A small “magnetic flux tube” in lattice gauge theory. By
manipulating her link, Alice excites the plaquettes that contain the
link, creating a magnetic flux tube. The links dual to these
plaquettes form a closed loop on the dual lattice.

Our  conclusion  that  Wilson  loop  measurement  is  an 
acausal  operation  does  not  hold  in  the  case  of  an  Abelian 
gauge  theory.  Indeed,  we  will  show  that  a  nondemolition 
measurement  of  W(C)  is  possible  in  an  Abelian  gauge 
theory  that  includes  charged  matter.  In  our  analysis  of  this 
case,  we  adopt  the  convenient  idealization  that  the  parties 
who  perform  the  measurement  are  equipped  with  gauge- 
invariant  ancilla  variables  that  are  not  themselves  described 
by the gauge theory. We will also see that a destructive measurement
of a non-Abelian Wilson loop (a measurement that
determines  the  value  of  the  Wilson  loop  but  in  doing  so 
damages  Wilson  loop  eigenstates)  is  possible  in  a  gauge 
theory  that  includes  suitable  charged  matter.  In  particular,  if 
the  matter  transforms  faithfully  under  the  gauge  group,  then 
the  Wilson  loop  can  be  measured  destructively  in  any  repre¬ 
sentation  of  the  gauge  group. 

The conclusion that nondemolition measurement of
spacelike Wilson loops is impossible in a non-Abelian gauge
theory seems surprising and somewhat troubling, as it leaves
us  without  a  fully  satisfactory  way  to  characterize  the  con¬ 
figurations  of  a  quantized  relativistic  gauge  theory  in  terms 
of  measurable  quantities.  Related  difficulties  arise  in  quan¬ 
tum  theories  of  gravity.  That  nondemolition  measurement  of 
a  Wilson  loop  operator  would  allow  superluminal  signaling 
has  been  anticipated  by  Sorkin  [8]. 

We  formulate  the  properties  of  magnetic  flux  tubes  in  Sec. 
II,  and  analyze  a  protocol  for  superluminal  signaling  enabled 
by  Wilson  loop  measurement  in  Sec.  III.  In  Sec.  IV,  we  de¬ 
fend  the  legitimacy  of  the  magnetic  flux  tubes  that  are  used 
in  our  signaling  protocol,  through  explicit  constructions 
within  the  formalism  of  lattice  gauge  theory.  In  Sec.  V,  we 
explore  the  consequences  of  including  charged  matter  fields 
in  the  gauge  theory,  and  show  that  destructive  measurement 
of  a  spacelike  Wilson  loop  is  possible.  The  Abelian  case  is 
discussed  in  Sec.  VI,  and  we  show  that  in  an  Abelian  gauge 
theory  with  charged  matter,  nondemolition  measurement  of  a 
Wilson  loop  is  possible.  The  pure  Abelian  gauge  theory 
(without  matter)  is  considered  in  Sec.  VII;  in  that  case  non¬ 
demolition  measurement  of  homologically  trivial  Wilson 
loops  is  possible,  but  homologically  nontrivial  Wilson  loops 
are  unmeasurable  and  there  is  an  associated  superselection 
rule.  We  take  up  the  related  question  of  whether  electric 
charges  can  travel  faster  than  light  in  Sec.  VIII,  concluding 
that  superluminal  transport  of  Abelian  charge,  but  not  non- 
Abelian  charge,  is  possible.  Section  IX  contains  some  con¬ 
cluding  comments. 

II.  WILSON  LOOPS,  MAGNETIC  FLUX,  AND  ELECTRIC 
CHARGE 

In a theory with gauge group G, the effect of parallel
transport of a charged object around a closed path C that
begins and ends at the point $x_0$ can be encoded in a group
element $a(C,x_0) \in G$ given by

$$a(C,x_0) = P \exp\left( i \int_{C,x_0} A \right). \tag{2}$$

Here A is the gauge potential and P denotes path ordering;
the state $|q\rangle$ of a charged object carried along C is modified
according to

$$|q\rangle \to D^{(R)}(a(C,x_0))\,|q\rangle, \tag{3}$$

if $|q\rangle$ transforms as the unitary irreducible representation
$D^{(R)}$ of G. The element $a(C,x_0) \in G$ depends on a “gauge
choice” at the point $x_0$; that is, on how a basis is chosen in
the representation $D^{(R)}$. A basis-independent characterization
of the gauge holonomy is obtained if we evaluate the trace in
the representation $D^{(R)}$, obtaining the Wilson loop operator
associated with C given by

$$W^{(R)}(C) = \chi^{(R)}(a(C,x_0)), \tag{4}$$

where $\chi^{(R)}$ denotes the character of the representation $D^{(R)}$.
The Wilson loop operator does not depend on how the point
$x_0$ on the loop C is chosen. In much of what follows, we will
assume for notational simplicity that the unbroken gauge
group G is finite; however, our arguments can be easily
extended to the case of compact Lie groups.
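
The basis independence of Eq. (4) can be illustrated with a small finite group. The sketch below assumes, purely as an example, the order-8 quaternion group in its two-dimensional representation (the same group is used as an example in Sec. III); the helper names are hypothetical. It checks that the trace of the holonomy is unchanged when the holonomy is conjugated, and it lists the conjugacy classes.

```python
# Assumed example group: {±I, ±i*sigma_1, ±i*sigma_2, ±i*sigma_3} as 2x2 matrices.
# Matrices are nested tuples so that they can be stored in sets.

def mul(x, y):
    """2x2 matrix product on nested tuples."""
    return tuple(tuple(sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def neg(x):
    return tuple(tuple(-v for v in row) for row in x)

I2 = ((1, 0), (0, 1))
S1 = ((0, 1j), (1j, 0))      # i * sigma_1
S2 = ((0, 1), (-1, 0))       # i * sigma_2
S3 = ((1j, 0), (0, -1j))     # i * sigma_3
G = [I2, neg(I2), S1, neg(S1), S2, neg(S2), S3, neg(S3)]

def inv(g):
    """Group inverse, found by search over the eight elements."""
    return next(h for h in G if mul(g, h) == I2)

def conj(g, a):
    """Conjugation a -> g a g^-1 (a change of gauge at the basepoint)."""
    return mul(mul(g, a), inv(g))

def chi(a):
    """Character of the two-dimensional representation: the matrix trace."""
    return a[0][0] + a[1][1]

# The Wilson loop value chi(a) does not change under conjugation of the holonomy.
invariant = all(chi(conj(g, a)) == chi(a) for g in G for a in G)

# Conjugacy classes: the gauge-invariant content of the holonomy.
classes = {frozenset(conj(g, a) for g in G) for a in G}
print(invariant, len(classes))
```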

By acting with a gauge-invariant source on the weakly-coupled
ground state of the gauge theory, Alice (or Bob) can
create a “color magnetic flux tube” or “cosmic string” that
carries trivial “color electric” charge. This tube is an
eigenstate of the Wilson loop operator $W^{(R')}(C_A)$, where $C_A$ is a
loop that links once with the tube, for any irreducible
representation $(R')$ of the gauge group G; hence the tube can be
labeled by a conjugacy class $\alpha$ of G.

When we say that the tube has trivial gauge charge, we
mean that it transforms as the trivial representation of G
under global gauge transformations. To understand this
property it is helpful to specify a basepoint $x_{0,A}$ on the loop $C_A$
and to fix the gauge at this point. Then the effect of parallel
transport around the loop $C_A$, beginning and ending at $x_{0,A}$,
can be encoded (in this particular gauge) in a group element
$a(C_A, x_{0,A}) = a$. If the tube is associated with a particular
group element a, we call its quantum state a “flux
eigenstate,” denoted $|a\rangle$. But under a gauge transformation
$g \in G$ at $x_{0,A}$, this flux eigenstate is transformed as

$$|a\rangle \to |g a g^{-1}\rangle. \tag{5}$$

Thus a flux eigenstate is not a gauge singlet in general, if G
is non-Abelian. A gauge-singlet quantum state of the flux
tube is a coherent superposition of the flux eigenstates
belonging to conjugacy class $\alpha$,

$$|\alpha, 0\rangle = \frac{1}{\sqrt{|\alpha|}} \sum_{a \in \alpha} |a\rangle, \tag{6}$$

where $|\alpha|$ denotes the number of members of the class.

Other possible states of the flux tube can carry nontrivial
electric charge. For example, the state

$$|\alpha, R\rangle = N_{\alpha,R} \sum_{g \in G} \chi^{(R)}(g)^{*}\, |g a g^{-1}\rangle \tag{7}$$

(where $N_{\alpha,R}$ is a normalization factor and $a \in \alpha$) transforms
as the nontrivial irreducible representation (R) under global
gauge transformations. To verify this, first construct the
operator

$$E^{(R)} = \frac{n_R}{|G|} \sum_{g \in G} \chi^{(R)}(g)^{*}\, U(g), \tag{8}$$

where $|G|$ denotes the order of the group, $n_R$ is the
dimension of the irreducible representation (R), and U(g) is the
global gauge transformation that conjugates the flux by g,

$$U(g): |a\rangle \to |g a g^{-1}\rangle. \tag{9}$$

Using the group orthogonality relations

$$\frac{n_R}{|G|} \sum_{g \in G} \chi^{(R)}(g)^{*}\, \chi^{(R')}(gh) = \delta^{RR'} \chi^{(R)}(h), \tag{10}$$

we find that $E^{(R)}$ is the orthogonal projection onto the space
transforming as (R), which satisfies

$$E^{(R)} E^{(R')} = \delta^{RR'} E^{(R)}. \tag{11}$$

Applying the orthogonality relations once more, we see that

$$E^{(R)} |\alpha, R\rangle = |\alpha, R\rangle; \tag{12}$$

thus $|\alpha, R\rangle$ transforms as (R). The “Cheshire charge” carried
by the flux tube in this state can be detected through, for
example, the Aharonov-Bohm interactions of the tube with
other, distant, flux tubes [13,14].

Suppose that Alice and Bob, acting locally, each create
flux tubes with zero electric charge, where the flux of Alice’s
tube belongs to conjugacy class $\alpha$ and the flux of Bob’s
belongs to conjugacy class $\beta$. (The process of preparing the
flux tubes will be discussed in more detail in Sec. IV.) To
describe the quantum state of this configuration, we may
choose loops $C_A$ and $C_B$ that link with the tubes, and fix the
gauge at basepoints $x_{0,A}$ and $x_{0,B}$ as illustrated in Fig. 4(a).
Up to a normalization factor, the quantum state of the two
tubes can be expressed as

$$\left( \sum_{h \in G} |h a h^{-1}\rangle_A \right) \otimes \left( \sum_{g \in G} |g b g^{-1}\rangle_B \right), \tag{13}$$

where $a \in \alpha$ and $b \in \beta$. This configuration is a direct product
of the state of Alice’s tube with the state of Bob’s tube, a
simultaneous eigenstate of the commuting Wilson loop
operators $W(C_A)$ and $W(C_B)$.

FIG. 4. Loops linked with flux tubes. The flux tube
configuration created locally by Alice and by Bob is an eigenstate of the
Wilson loop operators associated with the paths $C_A$ and $C_B$, shown
in (a), but not an eigenstate of the Wilson loop operator associated
with the path C that links with both tubes, shown in (b).

But because multiplication of conjugacy classes is
ill-defined, this state is not an eigenstate of W(C), where C is
the loop shown in Fig. 4(b) that links with both tubes.
Rather, if Alice’s tube is an eigenstate of $W(C_A)$ and Bob’s
tube is an eigenstate of $W(C_B)$, then an eigenstate of W(C)
is not a product state but an entangled state of the form (up to
normalization)

$$\sum_{g \in G} |g a g^{-1}\rangle_A \otimes |g b g^{-1}\rangle_B . \tag{14}$$

This  state  has  zero  total  charge,  as  it  is  invariant  under  a 
global  gauge  transformation  applied  to  both  tubes.  But  it  is 
not  invariant  under  a  gauge  transformation  that  acts  on  just 
one  of  the  tubes;  it  can  be  expanded  in  a  basis  in  which 
Alice’s  tube  and  Bob’s  have  definite  and  opposite  charges. 
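Using the group orthogonality relations in the form

$$\sum_R \frac{n_R}{|G|}\, \chi^{(R)}(g) = \delta_{g,e} \tag{15}$$

(where e is the identity element), we may rewrite the state as
a sum over irreducible representations

$$\sum_R \frac{n_R}{|G|} \left( \sum_{g,h \in G} \chi^{(R)}(g h^{-1})^{*}\, |h a h^{-1}\rangle_A \otimes |g b g^{-1}\rangle_B \right). \tag{16}$$

The expression in parentheses transforms as (R) under gauge
transformations acting on Bob’s tube and as the conjugate
representation $(R^*)$ under gauge transformations acting on
Alice’s tube, as we can verify by applying the projector $E^{(R)}$
to Bob’s tube and the projector $E^{(R^*)}$ to Alice’s tube.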

Thus,  when  two  flux  tubes  are  prepared  in  a  quantum  state 
that  is  an  eigenstate  of  the  Wilson  loop  operator  W(C ), 
where  C  links  with  both  flux  tubes,  then  the  flux  tubes  carry 
correlated  nontrivial  electric  charges.  This  property  is  the 
basis  of  our  claim  that  Wilson  loop  measurement  is  acausal, 
as  we  elaborate  in  the  next  section. 
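
The charge projector of Eq. (8) can be checked on a small example. The sketch below again assumes the quaternion group of order eight and one of its one-dimensional representations (the sign character `chi_sign`, equal to +1 on ±I and ±iσ1 and −1 elsewhere); states of a single flux tube are stored as dictionaries from flux to amplitude. All names are illustrative assumptions rather than notation from the text.

```python
# Sketch of the charge projector E^(R) acting on single-tube flux states.
# Assumed example group: {±I, ±i*sigma_1, ±i*sigma_2, ±i*sigma_3}.

def mul(x, y):
    return tuple(tuple(sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def neg(x):
    return tuple(tuple(-v for v in row) for row in x)

I2 = ((1, 0), (0, 1))
S1 = ((0, 1j), (1j, 0))      # i * sigma_1
S2 = ((0, 1), (-1, 0))       # i * sigma_2
S3 = ((1j, 0), (0, -1j))     # i * sigma_3
G = [I2, neg(I2), S1, neg(S1), S2, neg(S2), S3, neg(S3)]

def inv(g):
    return next(h for h in G if mul(g, h) == I2)

def conj(g, a):
    return mul(mul(g, a), inv(g))

def chi_sign(g):
    """Assumed one-dimensional irrep: +1 on {±I, ±i sigma_1}, -1 on the rest."""
    return 1 if g in (I2, neg(I2), S1, neg(S1)) else -1

def project(chi_R, n_R, state):
    """Apply E^(R) = (n_R/|G|) sum_g chi_R(g)^* U(g) to a state {flux: amplitude}."""
    out = {}
    for g in G:
        weight = n_R / len(G) * chi_R(g).conjugate()
        for a, amp in state.items():
            key = conj(g, a)                 # U(g)|a> = |g a g^-1>
            out[key] = out.get(key, 0) + weight * amp
    return out

charged = project(chi_sign, 1, {S1: 1})          # proportional to the state in Eq. (7)
again = project(chi_sign, 1, charged)            # projecting twice changes nothing
neutral_part = project(lambda g: 1, 1, charged)  # component with trivial charge
print(charged, neutral_part)
```

Projecting the flux eigenstate with the sign character yields the antisymmetric combination of the two fluxes in the class, and that combination has no overlap with the trivial-charge sector.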

III.  NONDEMOLITION  MEASUREMENT 
OF  NON-ABELIAN  WILSON  LOOPS  IS  ACAUSAL 

For a static gauge field configuration, it is possible in
principle to measure $W^{(R)}(C)$ [and hence the conjugacy class
of the group element $a(C,x_0)$] by performing interference
experiments with projectiles that transform as $D^{(R)}$ [15]. But
what if the loop C lies in a time slice and the gauge field is
dynamical? In a relativistic field theory, no projectile can
follow a spacelike world line, so that a direct measurement
of the effect of parallel transport along C is not feasible.

However, it seems conceivable that a less direct measurement
strategy might succeed. When we speak of a “measurement”
of an operator whose support is on a spacelike slice,
we need not require that the result be instantaneously known
by anyone. We might imagine instead that, in order to
measure $W^{(R)}(C)$ at time t = 0, many parties distributed along
the loop C perform local operations at t = 0. Later, the data
collected by the parties can be assembled and processed at a
central laboratory, where the outcome of the measurement of
$W^{(R)}(C)$ can be determined. In such a protocol, we should
allow the parties to share any entangled quantum state that
they might have prepared prior to t = 0, and we should allow
them to ship quantum information (rather than just classical
data) to the central laboratory after they have performed their
local operations. Of course, the quantum or classical
variables that are sent to the central laboratory for analysis are
not the variables of the underlying field theory; they are
ancilla variables that are assumed to be available to assist with
the measurement.

Just prior to the measurement at time t = 0, the quantum
state of the gauge theory is $\rho$, a density operator that acts on
the physical gauge-invariant subspace. Even though the
measurement result may not be known until later, the operations
performed at t = 0 modify the state $\rho$ immediately. If the
local operations performed at t = 0 are to achieve a
measurement of $W^{(R)}(C)$, then the coherence of a superposition of
eigenstates of $W^{(R)}(C)$ with different eigenvalues must be
destroyed. At t = 0, then, the quantum state is modified
according to

$$\rho \to \mathcal{E}_{W(C)}(\rho) = \sum_w E_w \rho E_w , \tag{17}$$

where $E_w$ is the orthogonal projector onto the subspace of
states with

$$W^{(R)}(C) = w. \tag{18}$$

This operation describes a projective measurement of
$W^{(R)}(C)$ with an unknown outcome.

The operation $\mathcal{E}_{W(C)}$ is actually weaker than a
measurement of $W^{(R)}(C)$; conceivably decoherence in the basis of
eigenstates of an observable can be accomplished even if the
measurement outcome is never recorded. But if any record of
the value of $W^{(R)}(C)$ is written at t = 0 (even one that cannot
be read until later), then decoherence as described by Eq.
(17) must occur.

We will show that $\mathcal{E}_{W(C)}$ can be used to send superluminal
signals, and so establish that $\mathcal{E}_{W(C)}$ cannot be implemented in
a gauge theory that respects relativistic causality.

To devise a superluminal signaling protocol, Alice and
Bob use local gauge-invariant probes to prepare uncharged
flux tubes belonging to classes $\alpha$ and $\beta$ respectively, as
described in Sec. II. Bob moves his flux tube, which will
receive the message, into position so that it links once with the
loop C; Alice encodes one bit of information by choosing to
place her flux tube in one of two possible positions, either
linking with C or not. If Alice chooses to place her tube
where it does not link with C, then the configuration is an
eigenstate of $W^{(R)}(C)$ and will be unaffected when the
Wilson loop is measured. But if Alice moves her flux tube into
position to link with C, then the configuration is no longer an
eigenstate of $W^{(R)}(C)$, and it is affected by the operation
$\mathcal{E}_{W(C)}$. In fact, after the operation, though the total charge of
the system remains zero, there is a nonzero probability that
Alice’s tube and Bob’s tube carry equal and opposite nonzero
charges. This charge can be detected by Bob. For example,
he can determine whether his tube has vacuum quantum
numbers by allowing it to shrink and observing whether it
will annihilate and disappear; if the tube is charged, a stable
charged particle will be left behind.

Thus, if there were a way to implement the operation
$\mathcal{E}_{W(C)}$ at t = 0, then by observing whether his flux tube is
charged after t = 0, Bob would be able to infer (with a
probability of success better than a random guess) whether Alice
moved her tube into position or not. Therefore Alice can
transmit classical information to Bob over a noisy channel
with nonzero capacity; she is able to send a superluminal
signal to Bob. By the same method, Bob can send a signal to
Alice.

To understand this charge transfer process in more detail,
let us consider a specific example. Suppose that G is the
quaternionic group of order eight, whose two-dimensional
faithful unitary irreducible representation is

$$\{\pm I, \pm i\sigma_1, \pm i\sigma_2, \pm i\sigma_3\}, \tag{19}$$

where $\sigma_1, \sigma_2, \sigma_3$ are the Pauli matrices. Suppose that Alice
and Bob both have tubes carrying flux in the class
$\alpha = \beta = \{\pm i\sigma_1\}$. For tubes in this class, the quantum state with
trivial charge is


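$$|+\rangle = \frac{1}{\sqrt{2}} \left( |i\sigma_1\rangle + |{-i\sigma_1}\rangle \right), \tag{20}$$

and there is also a state of nontrivial charge

$$|-\rangle = \frac{1}{\sqrt{2}} \left( |i\sigma_1\rangle - |{-i\sigma_1}\rangle \right). \tag{21}$$

The state $|-\rangle$ transforms as the nontrivial one-dimensional
representation of G in which $\pm I$, $\pm i\sigma_1$ are represented by 1
and $\pm i\sigma_2$, $\pm i\sigma_3$ are represented by $-1$.

If Alice and Bob each have a charge-zero flux tube, the
quantum state of their two tubes is a product state

$$|\psi\rangle_{\rm init} = |+\rangle_A \otimes |+\rangle_B . \tag{22}$$

But if the loop C links once with each tube, then the value of
$W^{(R)}(C)$ in the two-dimensional irreducible representation
(R) can be either 2 or $-2$. If the initial state $|\psi\rangle_{\rm init}$ is
projected onto the state with $W^{(R)}(C) = 2$, Alice’s tube becomes
entangled with Bob’s; the resulting state is

$$|\psi\rangle_{{\rm fin},2} = \frac{1}{\sqrt{2}} \left( |i\sigma_1\rangle_A \otimes |{-i\sigma_1}\rangle_B + |{-i\sigma_1}\rangle_A \otimes |i\sigma_1\rangle_B \right). \tag{23}$$

Bob’s final density operator, obtained by tracing over the
state of Alice’s tube, is

$$\rho_{B,{\rm fin},2} = \frac{1}{2} \left( |i\sigma_1\rangle\langle i\sigma_1| + |{-i\sigma_1}\rangle\langle {-i\sigma_1}| \right), \tag{24}$$

an incoherent mixture of the two flux eigenstates. Similarly,
if $|\psi\rangle_{\rm init}$ is projected onto the state with $W^{(R)}(C) = -2$, the
resulting state is

$$|\psi\rangle_{{\rm fin},-2} = \frac{1}{\sqrt{2}} \left( |i\sigma_1\rangle_A \otimes |i\sigma_1\rangle_B + |{-i\sigma_1}\rangle_A \otimes |{-i\sigma_1}\rangle_B \right), \tag{25}$$

and again Bob’s final density operator is

$$\rho_{B,{\rm fin},-2} = \rho_{B,{\rm fin},2} . \tag{26}$$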


Each of the two flux eigenstates is an equally weighted
coherent superposition of the charge eigenstates $|+\rangle_B$ and
$|-\rangle_B$. Thus if Bob were to measure the charge of his tube
after the operation $\mathcal{E}_{W(C)}$ acts at t = 0, he would find the
charge to be $(-)$ with probability 1/2 if Alice’s tube linked
with the loop C, while he would never find the charge to be
$(-)$ if Alice’s tube did not link with C. Alice has sent a
superluminal signal to Bob.

We can easily generalize this construction to an arbitrary
finite gauge group G. If Alice’s tube initially carries flux in
the conjugacy class $\alpha$ and has trivial charge, while Bob’s
carries flux in the class $\beta$ and has trivial charge, then the
initial state of their tubes is a product state

$$|\psi\rangle_{\rm init} = \frac{1}{\sqrt{|\alpha| \cdot |\beta|}} \sum_{a \in \alpha} \sum_{b \in \beta} |a\rangle_A \otimes |b\rangle_B . \tag{27}$$

This state can be expanded in terms of eigenstates of

$$W^{(R)}(C) = \chi^{(R)}(ab). \tag{28}$$

Suppose that for fixed $a \in \alpha$ there are $n_i$ distinct elements
$b^{(a)}_{i,\mu} \in \beta$, $\mu = 1, 2, \ldots, n_i$, such that $\chi^{(R)}(ab) = w_i$. (This
number $n_i$ is independent of how the class representative a is
chosen.) Then the component of $|\psi\rangle_{\rm init}$ with $W^{(R)}(C) = w_i$ is
the (unnormalized) entangled state

$$|\psi_i\rangle = \frac{1}{\sqrt{|\alpha| \cdot |\beta|}} \sum_{a \in \alpha} |a\rangle_A \otimes \sum_{\mu=1}^{n_i} |b^{(a)}_{i,\mu}\rangle_B . \tag{29}$$

This state is invariant under a global gauge transformation
acting as

$$a \to g a g^{-1}, \qquad b \to g b g^{-1}, \tag{30}$$

so that its total charge is trivial. We see that if $W^{(R)}(C)$ is
measured, the outcome $w_i$ occurs with probability

$$\mathrm{Prob}(w_i) = \langle \psi_i | \psi_i \rangle = n_i/|\beta| ; \tag{31}$$

it is obvious from the definition of $n_i$ that these probabilities
sum to unity. Furthermore, if the state $|\psi_i\rangle$ is prepared by
the measurement of the Wilson loop, and Bob subsequently
measures the charge of his tube, he will find the charge to be
trivial with probability

$$\mathrm{Prob}(0|w_i) = \frac{\langle \psi_i | \left( I_A \otimes |\beta,0\rangle\langle \beta,0| \right) | \psi_i \rangle}{\langle \psi_i | \psi_i \rangle} = n_i/|\beta| = \mathrm{Prob}(w_i). \tag{32}$$

(Here $|\beta,0\rangle$ denotes the charge-0 state of a string whose flux
is in conjugacy class $\beta$.) Therefore, if Alice’s and Bob’s
tubes both link once with the loop C when the operation
$\mathcal{E}_{W(C)}$ is applied, then afterwards Bob will find his tube
carries trivial charge with probability

$$\mathrm{Prob}(0) = \sum_i \mathrm{Prob}(0|w_i) \cdot \mathrm{Prob}(w_i) = \sum_i \left( \mathrm{Prob}(w_i) \right)^2 = \sum_i \left( n_i/|\beta| \right)^2 . \tag{33}$$

We see that, unless the initial configuration is an eigenstate
of $W^{(R)}(C)$, we have Prob(0) < 1. We conclude that Bob’s
tube is charged with nonzero probability if Alice’s tube
linked with C, and it is guaranteed to be uncharged if Alice’s
tube did not link with C. Alice can send a superluminal
signal to Bob. (Of course, since Alice’s tube has an electric
charge equal and opposite to that of Bob’s tube, Bob can also
send a superluminal signal to Alice, with the same
probability of success.)

The argument also applies to compact Lie groups. For
example if the gauge group is

$$G = SU(2) = \{ g(\hat n, \theta) = \exp[ -i(\theta/2)\, \hat n \cdot \vec\sigma ],\ \hat n \in S^2,\ \theta \in [0, 2\pi] \}, \tag{34}$$

then conjugacy classes are labeled by $\theta$. If a flux tube has
trivial charge, its quantum state can be expressed as

$$|0, f\rangle = \int d\theta\, f(\theta) \int d\hat n\, |g(\hat n, \theta)\rangle, \tag{35}$$

where f is any square integrable class function. If the loop C
links with Alice’s tube and Bob’s, then the product state

$$|\psi\rangle_{\rm init} = |0, f_1\rangle_A \otimes |0, f_2\rangle_B \tag{36}$$

is in general not an eigenstate of the operator $W^{(R)}(C)$;
hence measurement of $W^{(R)}(C)$ would induce a detectable
transfer of charge from Alice’s tube to Bob’s.

The argument also applies in any spatial dimension $d \ge 2$.
In d = 2 dimensions, the flux tubes may be replaced by
pairs  of  pointlike  vortices;  in  d>3  dimensions,  the  tubes 
become  membranes  of  codimension  2. 
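
For a finite gauge group, the probability in Eq. (33) that Bob finds his tube uncharged can be evaluated by direct counting. The sketch below assumes the quaternion group of Eq. (19) in its two-dimensional representation and uses hypothetical helper names; it counts, for a fixed $a \in \alpha$, how many $b \in \beta$ give each Wilson loop value $\chi(ab)$, and then sums $(n_i/|\beta|)^2$.

```python
from collections import Counter
from fractions import Fraction

# Assumed example group: {±I, ±i*sigma_1, ±i*sigma_2, ±i*sigma_3} as 2x2 matrices.
def mul(x, y):
    return tuple(tuple(sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def neg(x):
    return tuple(tuple(-v for v in row) for row in x)

I2 = ((1, 0), (0, 1))
S1 = ((0, 1j), (1j, 0))      # i * sigma_1

def chi(a):
    """Character of the two-dimensional representation: the matrix trace."""
    return a[0][0] + a[1][1]

def prob_uncharged(alpha, beta):
    """Evaluate sum_i (n_i/|beta|)^2, with n_i counted at one fixed a in alpha."""
    a = alpha[0]
    counts = Counter(chi(mul(a, b)) for b in beta)   # n_i for each outcome w_i
    return sum(Fraction(n, len(beta)) ** 2 for n in counts.values())

alpha = [S1, neg(S1)]                        # the class {±i sigma_1}
both_linked = prob_uncharged(alpha, alpha)   # Alice and Bob both use this class
central = prob_uncharged(alpha, [neg(I2)])   # Bob's flux is the central element -I
print(both_linked, central)
```

With both tubes in the class {±iσ1} the two outcomes ±2 each occur once, so Bob finds an uncharged tube only half the time, in agreement with the discussion following Eq. (26). When Bob's flux is a central element the configuration is already an eigenstate of the Wilson loop and no charge is transferred.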

In our discussion, we have ignored the effects of magnetic
and electric quantum fluctuations; in particular we have not
considered whether gauge charges might be confined or
screened by the Higgs mechanism. We have implicitly
assumed that the G gauge symmetry is unbroken, and (if G
electric charges are confined) that the separation between
Alice and Bob is small compared to the characteristic
distance scale of electric confinement.

We  should  note  that  in  the  case  of  a  continuous  gauge 
group,  an  ultraviolet  regulator  is  implicitly  invoked  to  define 
the  Wilson  loop.  The  Wilson  loop  detects  the  magnetic  flux 
that links with C. If we think of C as a wire of infinitesimal
thickness, then W(C) will be dominated by
very-short-wavelength fluctuations of the gauge field near the wire. To
suppress these fluctuations, we allow the wire to have a
nonzero thickness a, removing the contributions of fluctuations
with wavelength below a. In 3+1 spacetime dimensions, the
fluctuations near the wire are unimportant provided that

$$e^2 \log(L/a) \ll 1, \tag{37}$$

where $e^2$ is the gauge coupling constant (renormalized at
distance scale a) and L is the characteristic size of the loop
C.

IV.  FLUX  TUBES  ON  THE  LATTICE 

Our  argument  that  Wilson  loop  measurement  would  allow 
Alice  to  send  a  superluminal  signal  to  Bob  had  two  crucial 
elements:  that  Alice  and  Bob  are  capable  of  creating  un¬ 
charged  magnetic  flux  tubes,  and  that  Bob  can  detect  the 
charge  on  his  tube.  Let  us  examine  more  deeply  whether  the 
preparation  of  the  flux  tube  is  really  possible  in  principle. 

In  considering  whether  flux  tubes  are  legitimate  objects,  it 
is  helpful  to  think  about  a  scenario  in  which  an  underlying 
continuous  gauge  symmetry  is  spontaneously  broken  to  a 
finite non-Abelian subgroup. To be specific, a generic
vacuum expectation value of a Higgs field in the
five-dimensional irreducible representation of SU(2) breaks the
gauge symmetry to the quaternionic group considered in Sec.
III.  In  this  Higgs  phase,  there  are  locally  stable  cosmic 
strings  that  carry  nontrivial  magnetic  flux;  these  serve  as  the 
flux  tubes  needed  for  the  signaling  protocol.  Alice  and  Bob 
both  require  closed  loops  of  string  that  have  vacuum  quan¬ 
tum  numbers;  in  principle,  these  could  be  created  in,  for 
example,  a  hard  collision  between  particles. 

Wilson  loop  measurement  can  change  the  transformation 
properties  of  a  string  loop  under  global  gauge 
transformations — it  transfers  charge  to  the  loop.  This  charge, 
like  any  charge  in  a  discrete  gauge  theory,  can  be  detected 
through  the  Aharonov-Bohm  interactions  of  the  string  loop 
with  other  string  loops  [13,14]. 

In  a  confining  gauge  theory  like  quantum  chromodynam¬ 
ics,  a  flux  tube  is  not  locally  stable,  but  it  is  still  possible  to 
engineer  one,  at  least  if  it  is  small  compared  to  the  confine¬ 
ment  distance  scale.  To  be  as  concrete  as  possible,  we  will 
describe  how  a  flux  tube  can  be  created  in  a  gauge  theory 
defined  on  a  spatial  lattice  (but  with  continuous  time).  In  this 
framework,  Bob  (or  Alice)  can  prepare  a  flux  tube  with  zero 
charge  by  acting  on  a  single  link  variable  with  a  gauge- 
invariant  local  operator,  as  indicated  in  Fig.  3. 

In our description of the construction of this operator, we
will again find it convenient to suppose that the gauge group
G is a finite group of order $|G|$, though there are no serious
obstacles to generalizing the discussion to the case of Lie
groups. Residing on the lattice links are variables that take
values in the G group algebra, a Hilbert space of dimension
$|G|$ for which an orthonormal basis can be chosen as
$\{|g\rangle, g \in G\}$. A local gauge transformation associates a group
element with each lattice site. Each link has an orientation,
and if a link connecting sites x and y is oriented so that it
points from y to x, then gauge transformations $U_x(h)$ and
$U_y(k)$ at sites x and y act on the link variable according to

$$U_x(h): |g\rangle_{xy} \to |hg\rangle_{xy}, \qquad U_y(k): |g\rangle_{xy} \to |g k^{-1}\rangle_{xy}. \tag{38}$$

Physical  states  are  invariant  under  all  local  gauge  transfor¬ 
mations.  Physical  observables  preserve  the  space  of  physical 
states,  and  hence  must  commute  with  the  local  gauge  trans¬ 
formations. 

Now consider an operator $H_l(a)$ that acts on a particular link l as

$$H_l(a):\ |g\rangle_l \to |ag\rangle_l. \tag{39}$$

Note that $H_l(a)$ is not a gauge transformation, since it acts only on a single link, rather than all of the links that meet at a site. This operator does not commute with local gauge transformations; rather, if l is oriented so that it points toward the site x, we have

$$U_x(h)\,H_l(a)\,U_x(h)^{-1}:\ |g\rangle_l \to |hah^{-1}\cdot g\rangle_l, \tag{40}$$

or

$$U_x(h)\,H_l(a)\,U_x(h)^{-1} = H_l(hah^{-1}). \tag{41}$$

But if we define an operator by summing $H_l(a)$ over a conjugacy class $\alpha$ of G,

$$H_l(\alpha) = \frac{1}{|\alpha|}\sum_{a\in\alpha} H_l(a), \tag{42}$$

then $H_l(\alpha)$ does commute with $U_x(h)$ and is therefore a gauge-invariant operator. This is the operator that Alice applies to her link to create a local flux tube excitation [16,17]. Of course, by acting on several adjacent links, Alice can create a larger flux tube if she wishes.
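
Equations (41) and (42) can be checked numerically on a single link. The following is a minimal sketch, assuming the small non-Abelian group S3 (chosen here only for illustration; the text does not single it out) and representing both $U_x(h)$ and $H_l(a)$ on one link pointing toward x as left-multiplication matrices on the group algebra. All function names are hypothetical.

```python
import itertools
import numpy as np

# Elements of S3 as permutation tuples (an illustrative choice of finite group).
G = list(itertools.permutations(range(3)))
index = {g: i for i, g in enumerate(G)}

def compose(p, q):
    """Group product p*q: apply q first, then p."""
    return tuple(p[q[i]] for i in range(3))

def inverse(p):
    inv = [0] * 3
    for i, pi in enumerate(p):
        inv[pi] = i
    return tuple(inv)

def left_mult(a):
    """Matrix of |g> -> |a g> on the group algebra, basis ordered as G."""
    M = np.zeros((len(G), len(G)))
    for g in G:
        M[index[compose(a, g)], index[g]] = 1.0
    return M

def conjugacy_class(a):
    return {compose(compose(h, a), inverse(h)) for h in G}

def class_operator(a):
    """Average of left_mult over the conjugacy class of a, as in Eq. (42)."""
    cls = conjugacy_class(a)
    return sum(left_mult(b) for b in cls) / len(cls)

a = (1, 0, 2)  # a transposition

# Eq. (41): U(h) H(a) U(h)^{-1} = H(h a h^{-1}) for every h.
eq41_holds = all(
    np.array_equal(
        left_mult(h) @ left_mult(a) @ left_mult(inverse(h)),
        left_mult(compose(compose(h, a), inverse(h))),
    )
    for h in G
)

# H(a) alone is not gauge invariant, but the class average is.
single_commutes = all(
    np.array_equal(left_mult(h) @ left_mult(a), left_mult(a) @ left_mult(h))
    for h in G
)
class_commutes = all(
    np.allclose(left_mult(h) @ class_operator(a), class_operator(a) @ left_mult(h))
    for h in G
)
```

In this toy setting the single-element operator fails to commute with the gauge transformations, while the class-averaged operator commutes with all of them, which is the content of the paragraph above.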

If Alice applies this operator to her link and Bob applies it to his link, then the state they prepare (acting on the weak-coupling vacuum) is not an eigenstate of the Wilson loop operator W(C), where C contains both links. There is an eigenstate of W(C) in which Alice’s link is excited to conjugacy class $\alpha$ and Bob’s to class $\beta$, and of course this state can be created by a gauge-invariant operator acting on the perturbative vacuum. But the operator cannot be local, since it creates charges on Alice’s link and Bob’s. It is instructive to construct the nonlocal gauge-invariant operator that creates this state.

FIG. 5. A nonlocal operator that creates correlated excitations at distantly separated links of a lattice gauge theory. An arbitrary basepoint $x_0$ is chosen, together with arbitrary paths $P_A$ and $P_B$ that connect Alice’s link and Bob’s link to the basepoint. By acting on the links, the operator excites the lattice plaquettes (shaded) that contain the links. The nonlocality is necessary because Alice’s excitation and Bob’s excitation carry nontrivial and opposite electric charges.

For this purpose, it is convenient to choose a basepoint lattice site $x_0$, and to choose oriented lattice paths $P_A$ and $P_B$ that connect Alice’s link A and Bob’s link B to the basepoint, as shown in Fig. 5. Let $g_P$ denote the path-ordered product of link variables associated with the path P,

$$g_P = \prod_{l\in P} g_l, \tag{43}$$

with later links along the path appearing further to the left. Then we may define a generalization of the operator $H_l$ that depends on the path and the basepoint. Acting on Alice’s link $l_A$ we have

$$H_{l_A}(a,P_A,x_0):\ |g\rangle_{l_A} \to |g\cdot g_{P_A}\,a\,g_{P_A}^{-1}\rangle_{l_A}, \tag{44}$$

and acting on Bob’s link $l_B$ we have

$$H_{l_B}(b,P_B,x_0):\ |g\rangle_{l_B} \to |g_{P_B}^{-1}\,b\,g_{P_B}\cdot g\rangle_{l_B}. \tag{45}$$

Hence $H_{l_B}(b,P_B,x_0)$, like $H_{l_B}(b)$, excites the plaquettes that contain Bob’s link. But while $H_{l_B}(b)$ left-multiplies the link variable by b, $H_{l_B}(b,P_B,x_0)$ left-multiplies by the conjugate group element $g_{P_B}^{-1}\,b\,g_{P_B}$. In a fixed gauge, the operator $H_{l_B}(b,P_B,x_0)$ creates an excitation such that the effect of gauge parallel transport about a closed path that begins and ends at $x_0$ and passes through link $l_B$ is encoded in the group element b. $H_{l_A}(a,P_A,x_0)$ is defined similarly, but acts by right multiplication because of the way we have chosen the orientation of the link $l_A$.

The operator $H_{l_A}(a,P_A,x_0)$ commutes with local gauge transformations acting in the vicinity of Alice’s link $l_A$, and $H_{l_B}(b,P_B,x_0)$ commutes with gauge transformations acting in the vicinity of Bob’s link $l_B$. But they do not commute with gauge transformations acting at the basepoint $x_0$; rather we have

$$U_{x_0}(g)\,H_{l_A}(a,P_A,x_0)\,U_{x_0}(g)^{-1} = H_{l_A}(gag^{-1},P_A,x_0),$$

$$U_{x_0}(g)\,H_{l_B}(b,P_B,x_0)\,U_{x_0}(g)^{-1} = H_{l_B}(gbg^{-1},P_B,x_0). \tag{46}$$

Again, we can obtain a gauge-invariant operator by summing a or b over a conjugacy class, e.g.,

$$H_{l_B}(\beta,P_B,x_0) = \frac{1}{|\beta|}\sum_{b\in\beta} H_{l_B}(b,P_B,x_0). \tag{47}$$

In fact, it is clear from the definitions that $H_{l_B}(\beta,P_B,x_0) = H_{l_B}(\beta)$; it is really a local operator in disguise.

But  we  can  also  construct  a  gauge-invariant  operator  that 
acts  simultaneously  on  Alice’s  link  and  Bob’s,  and  that  really 
is  nonlocal  [16,17]: 

$$\frac{1}{|G|}\sum_{g\in G} H_{l_A}(gag^{-1},P_A,x_0)\cdot H_{l_B}(gbg^{-1},P_B,x_0). \tag{48}$$

This  operator,  acting  on  the  weak-coupling  vacuum,  creates  a 
state  in  which  Alice’s  link  and  Bob’s  are  correlated,  as  in  Eq. 
(14).  This  gauge-invariant  operator  does  not  depend  on  how 
the  basepoint  x0  is  chosen;  we  are  free  to  slide  the  basepoint 
along  the  path  connecting  Alice’s  link  and  Bob’s  however  we 
please. 

The communication protocol explained in Sec. III can be described this way: Alice and Bob apply the local operators $H_{l_A}(\alpha)$ and $H_{l_B}(\beta)$ to create link excitations that are uncorrelated with one another. Then the Wilson loop measurement operation $\mathcal{E}_{W(C)}$ is applied, where the loop C contains the links $l_A$ and $l_B$. This operation establishes a correlation between the links. It transforms a state that can be created by local operators to a state that can be created only by a nonlocal operator like that in Eq. (48). Such a transformation cannot occur on a time slice in a theory that respects relativistic causality. We conclude that the nondemolition measurement of the non-Abelian Wilson loop operator is not physically realizable.

Now, the operator $H_l(\alpha)$ is a gauge-invariant local operator, but it is not unitary, so we should clarify what it means to say that Alice or Bob applies this operator to a state. In fact, if A is any bounded operator that does not annihilate the state $|\psi\rangle$, we can apply the operation

$$|\psi\rangle \to \frac{A|\psi\rangle}{\sqrt{\langle\psi|A^\dagger A|\psi\rangle}} \tag{49}$$

with a nonzero probability of success by making a suitable measurement. First note that we may assume without loss of generality that the eigenvalues of $A^\dagger A$ are no larger than one; if not, we merely rescale A without modifying the operation in Eq. (49). Then let $\{|0\rangle, |1\rangle\}$ be an orthonormal basis for a two-dimensional “ancilla” space, and consider the transformation

$$U:\ |0\rangle\otimes|\psi\rangle \to |0\rangle\otimes A|\psi\rangle + |1\rangle\otimes B|\psi\rangle, \tag{50}$$

where

$$A^\dagger A + B^\dagger B = I. \tag{51}$$

This transformation is norm-preserving and so has a unitary extension. Hence we apply the unitary U to $|0\rangle\otimes|\psi\rangle$ and then measure the ancilla by projecting onto the basis $\{|0\rangle, |1\rangle\}$. The outcome $|0\rangle$ is obtained with probability $\langle\psi|A^\dagger A|\psi\rangle$, in which case Eq. (49) is applied. If the outcome $|1\rangle$ is found, then Alice may discard the state and make another attempt. As long as this probability is nonzero, Alice can repeat the procedure until she gets the desired outcome.
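
The ancilla construction of Eqs. (49)-(51) can be illustrated numerically. The sketch below is only an illustration under stated assumptions: A is a random complex matrix rescaled to unit spectral norm, B is taken to be the positive square root of $I - A^\dagger A$ (one of many valid choices), and only the isometry on the ancilla-$|0\rangle$ sector is built rather than a full unitary extension. Variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 4

# A hypothetical bounded, non-unitary operator, rescaled so that A^dagger A <= I.
A = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
A /= np.linalg.norm(A, 2)  # spectral norm becomes 1

# B = sqrt(I - A^dagger A), so that A^dagger A + B^dagger B = I as in Eq. (51).
evals, evecs = np.linalg.eigh(np.eye(d) - A.conj().T @ A)
B = evecs @ np.diag(np.sqrt(np.clip(evals, 0.0, None))) @ evecs.conj().T

# Isometry |psi> -> |0> (x) A|psi> + |1> (x) B|psi>, stored as a (2d x d) matrix
# with the ancilla as the first tensor factor.
V = np.vstack([A, B])

psi = rng.normal(size=d) + 1j * rng.normal(size=d)
psi /= np.linalg.norm(psi)

out = V @ psi
branch0 = out[:d]                      # component with the ancilla in |0>
p_success = np.vdot(branch0, branch0).real
post_state = branch0 / np.sqrt(p_success)

# Target of Eq. (49): A|psi> normalized.
target = A @ psi / np.linalg.norm(A @ psi)
```

The columns of `V` are orthonormal, which is the norm-preservation property used in the text, and the post-measurement state on outcome $|0\rangle$ agrees with the normalized $A|\psi\rangle$.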

Clearly, the gauge-invariant bounded operator analogous to $H_l(\alpha)$ can also be constructed in the case where G is a Lie group. For example, if the gauge group is SU(2), then associated with the group element $g(\hat n,\theta) = \exp[-i(\theta/2)\,\hat n\cdot\vec\sigma]$ is the transformation

$$H_l(g(\hat n,\theta)):\ |h\rangle_l \to |g(\hat n,\theta)\,h\rangle_l, \tag{52}$$

where $h \in SU(2)$. This transformation can be expressed as

$$H_l(g(\hat n,\theta)) = e^{-i\theta\,\hat n\cdot\vec E_l}, \tag{53}$$

where the electric field $\vec E_l$ is the angular momentum conjugate to the SU(2) rotor h at link l. The bounded operator


$$H_l(f) = \int d\theta\, f(\theta) \int d\hat n\; e^{i\theta\,\hat n\cdot\vec E_l} \tag{54}$$

(where f is any integrable function) is gauge-invariant. Indeed, it is a function of the gauge-invariant observable $\vec E_l^{\,2}$ that acts on the link l.

As we have also explained in Sec. III, the common-sense reason that the state created by the nonlocal operator in Eq. (48) cannot be created with local operators is that Alice’s link and Bob’s carry (correlated) nonzero electric charges. In quantum chromodynamics, as in a discrete gauge theory, Wilson loop measurement can cause color charge to be transferred to a flux tube. This color charge is surely detectable; like any color charge, it acts as a source for a measurable color electric field.

V. MATTER FIELDS AND THE DESTRUCTIVE MEASUREMENT OF WILSON LOOPS

We have shown that the nondemolition measurement of a non-Abelian Wilson loop conflicts with relativistic causality. But there are further questions that we wish to address. Can the Wilson loop be measured destructively? What about the Abelian case? To formulate our answers, we will continue to use the formalism of lattice gauge theory. Furthermore, to ensure that the agents who are to perform measurements are as well equipped as possible, we will include in the theory matter fields that couple to the gauge fields.

Our matter fields reside on the sites of the lattice, and like the link variables, take values in the group algebra. The basis for the Hilbert space at a site x will be denoted $\{|\phi\rangle_x, \phi\in G\}$. Under the local gauge transformation $U_x(g)$ acting at the site x, the matter variable transforms as the regular representation of G (which contains all irreducible representations of G),

$$U_x(g):\ |\phi\rangle_x \to |g\phi\rangle_x. \tag{55}$$

In addition to the gauge symmetry, the matter field at site x also transforms under a global symmetry transformation $V_x(h)$, acting on $\phi$ from the right, that commutes with gauge transformations:

$$V_x(h):\ |\phi\rangle_x \to |\phi h^{-1}\rangle_x. \tag{56}$$

The interpretation of this global symmetry is that our matter fields have both “color” and “flavor” degrees of freedom. The regular representation of G decomposes into irreducible representations, with the dimension-$n_R$ representation (R) occurring $n_R$ times. Gauge transformations mix the $n_R$ states that span (R) (the colors), while global transformations mix the $n_R$ copies of (R) (the flavors).

Let xy denote a link connecting the neighboring sites x and y on the lattice, with orientation pointing from y to x, and let $U_{xy}\in G$ be the gauge-field variable associated with this oriented link. We may also assign to this link the gauge-invariant variable

$$u_{xy} = \phi_x^{-1}\,U_{xy}\,\phi_y, \tag{57}$$


the “covariant derivative” of the matter field. Without changing the physical content of the theory, we can replace the link variables $\{U_l\}$ by the new gauge-invariant variables $\{u_l\}$. But after this replacement, the physical Hilbert constraint can be trivially constructed: at each site x, the state of the matter field is required to be the gauge-invariant uniform superposition state

$$\frac{1}{\sqrt{|G|}}\sum_{\phi\in G} |\phi\rangle_x. \tag{58}$$

Since the matter fields are completely constrained by gauge invariance, they have no role in dynamics and they too can be eliminated, leaving only the gauge-invariant local variables $\{u_l\}$.

Although the new variables are gauge invariant, they transform nontrivially under the global transformations, according to

$$V_x(h):\ |u\rangle_{xy} \to |hu\rangle_{xy}, \tag{59}$$

$$V_y(k):\ |u\rangle_{xy} \to |uk^{-1}\rangle_{xy}. \tag{60}$$


Thus  physical  states  can  carry  global  G  charges. 

A gauge-invariant unitary operator acting on the link xy can be defined as

$$H_{xy}(a):\ |u\rangle_{xy} \to |au\rangle_{xy}. \tag{61}$$

Acting on the weakly coupled vacuum, this operator produces a flux tube excitation at the link. The flux tube at xy has Cheshire charge that is exactly compensated by charge localized at the site x. An operator $H_{xy}(\alpha)$ that creates an excitation with trivial Cheshire charge can be constructed and applied as described in Sec. IV.

Since the variables $\{u_l\}$ are local and preserve the physical Hilbert space, it is reasonable to postulate that they are observable. Physically, the measurement of $u_{xy}$ has a simple interpretation in terms of the effect of parallel transport of a colored object from site y to the neighboring site x. Of course, we are free to adopt arbitrary color conventions at each site, and the way we describe the effect of parallel transport depends on these conventions. However, if we have multiple flavors at our disposal, we can use the flavors to record our conventions, so that in effect (gauge-dependent) statements about color transport can be translated into (gauge-invariant) statements about flavors.

To  be  concrete,  suppose  that  (R)  is  a  three-dimensional 
irreducible  representation;  our  “quarks”  come  in  three  colors 
(red,  yellow,  blue)  and  three  flavors  (up,  down,  strange).  At 
each  site,  we  adopt  conventions  for  color  and  flavor,  and  we 
prepare  standard  quarks  in  three  mutually  orthogonal  colors 
and  three  mutually  orthogonal  flavors  that  lock  these  conven¬ 
tions  together:  the  up  quark  is  red,  the  down  quark  is  yellow, 
the  strange  quark  is  blue.  Then  standard  quarks  prepared  at 
site  y  are  covariantly  transported  to  site  x,  and  compared  to 
the  standard  quarks  that  have  been  prepared  at  that  site.  Thus 
the  effect  of  the  transport  can  be  equivalently  described  as 
either a rotation in the color space ($U_{xy}$) or in the flavor space ($u_{xy}$). Performing this experiment for each irreducible representation of G assigns a unique group element $u_{xy}$ to the link xy, for these particular conventions. A modification of the conventions can be interpreted as a rotation in the flavor space, under which the variable $u_{xy}$ transforms.

Now consider a large loop C on the spatial lattice, and suppose that many parties distributed along the loop are to measure the Wilson loop

$$W^{(R)}(C) = \chi^{(R)}\Big(\prod_{l\in C} U_l\Big) = \chi^{(R)}\Big(\prod_{l\in C} u_l\Big) \tag{62}$$

in representation (R). Since all the matter fields cancel out, the Wilson loop can be expressed in terms of the gauge-invariant variables $\{u_l\}$. Each party has access to a single link along the loop, and using $n_R$ flavors of quarks in representation (R), determines the value of the $n_R\times n_R$ matrix $D^{(R)}(u)$ at that link, for a particular choice of flavor conventions. Each party then reports her value of $D^{(R)}(u)$ to the central authority for post-processing, the matrices are multiplied together, and the trace is evaluated. The result, which does not depend on the local flavor conventions, is the value of the Wilson loop.
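
The cancellation of the matter fields in Eq. (62), and the independence of the result from local flavor conventions, can be sketched numerically. This is a minimal illustration, assuming random $3\times 3$ unitary matrices as stand-ins for the representation matrices of the link and matter variables (the text works with a finite group; the telescoping argument is the same). The helper names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)

def random_unitary(n):
    """Random n x n unitary from the QR decomposition of a complex Gaussian matrix."""
    z = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    q, _ = np.linalg.qr(z)
    return q

n_links, n = 6, 3
# Link k points from site k+1 to site k (indices mod n_links), closing into a loop.
U = [random_unitary(n) for _ in range(n_links)]
phi = [random_unitary(n) for _ in range(n_links)]  # stand-ins for matter variables

# Gauge-invariant link variables u_xy = phi_x^{-1} U_xy phi_y, as in Eq. (57).
u = [phi[k].conj().T @ U[k] @ phi[(k + 1) % n_links] for k in range(n_links)]

def ordered_product(mats):
    out = np.eye(n, dtype=complex)
    for m in mats:
        out = out @ m
    return out

wilson_from_U = np.trace(ordered_product(U))
wilson_from_u = np.trace(ordered_product(u))

# A change of local flavor conventions acts as u_xy -> h_x u_xy h_y^{-1},
# following Eqs. (59) and (60); the trace of the product is unchanged.
h = [random_unitary(n) for _ in range(n_links)]
u_rotated = [h[k] @ u[k] @ h[(k + 1) % n_links].conj().T for k in range(n_links)]
wilson_rotated = np.trace(ordered_product(u_rotated))
```

Each party would report one matrix from `u` (or `u_rotated`); only the trace of the ordered product is convention independent.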

Thus, distributed parties, each acting locally, can measure the Wilson loop operator. But in doing so, they collect much additional information aside from the value of the Wilson loop. In particular, an eigenstate of $W^{(R)}(C)$ need not have a definite value of each $D^{(R)}(u_l)$ along the loop. Therefore, the localized measurement procedure typically disturbs the quantum state of the field, even if the initial state before the measurement is a Wilson loop eigenstate. Rather than a localized nondemolition measurement (which we have already seen is impossible) it is a localized destructive measurement. Note also that distributed parties can measure destructively each of several Wilson loop operators $W^{(R_i)}(C_i)$, $i = 1,2,3,\ldots,n$, all on the same time slice, and hence the product $\prod_i W^{(R_i)}(C_i)$. In this respect, the destructive measurement is compatible with the Wilson loop algebra. Of course, we may not be able to measure more than one of the $W^{(R_i)}(C_i)$ if the $C_i$ are on different time slices, since a measurement on an earlier slice may interfere with a measurement on a later slice.

We have assumed that the matter fields transform as the regular representation Eq. (55) of the gauge group G. What about more general choices for the representation content of the matter? Provided that the matter transforms as a faithful representation of the gauge group, one can show that the destructive measurement of $W^{(R)}(C)$ is still possible in any representation (R).

We reach this conclusion by noting that matter in the regular representation can be simulated using matter that transforms faithfully, augmented by ancilla degrees of freedom. We will give only a brief sketch of the argument. First we recall that if $(R_m)$ is a faithful representation, and (R) is any irreducible representation, then (R) is contained in $(R_m)^{\otimes n}$ for some n. Therefore, if our fundamental matter fields transform as $(R_m)$, then we can build composite objects that transform as (R) from n fundamental constituents.

Next  we  observe  that  if  the  theory  contains  only  a  single 
matter  field  that  transforms  as  ( R ),  we  can  use  ancilla  vari¬ 
ables  to  attach  an  effective  flavor  index  to  the  field.  To  un¬ 
derstand  the  point  heuristically,  consider  the  case  of 
“quarks”  that  come  in  three  colors  but  only  one  flavor. 
Rather  than  using  “natural”  flavors  to  keep  track  of  our  color 
conventions,  we  can  use  “artificial”  flavors  instead,  labeling 
red,  yellow,  and  blue  quarks  with  the  three  mutually  orthogo¬ 
nal  states  (up,  down,  strange)  of  the  ancilla.  When  quarks  are 
transported  from  one  site  to  a  neighboring  site,  the  attached 
value  of  the  ancilla  is  transported  along  with  the  color;  hence 
artificial  flavors,  just  like  natural  flavors,  allow  us  to  describe 
local  gauge  transport  in  terms  of  gauge-invariant  quantities. 
Since we can construct composite matter fields in any representation (R) of G, and we can use ancillas to ensure that matter transforming as (R) comes in $n_R$ flavors, our simulated matter transforms as the regular representation; thus we can measure destructively a Wilson loop in any representation.

What  about  the  case  of  a  pure  gauge  theory  (one  contain¬ 
ing  no  charged  matter  at  all)?  The  gauge  variables  them¬ 
selves  can  simulate  matter  that  transforms  according  to  the 
adjoint  representation 

$$D(g):\ |h\rangle \to |ghg^{-1}\rangle, \tag{63}$$

which is a faithful representation of G/Z(G), where Z(G) denotes the center of G. Thus, by building composite fields and manipulating ancillas, we can simulate matter that transforms as the regular representation of G/Z(G). Therefore, $W^{(R)}(C)$ can be measured destructively for any representation (R) of G/Z(G), or equivalently for any representation of G that represents the center of G trivially.
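
The statement that the adjoint action of Eq. (63) is faithful only on G/Z(G) can be checked on a small example. The sketch below assumes the dihedral group of order 8 (an illustrative choice not made in the text), realized as permutations of the corners of a square; the function names are hypothetical.

```python
def compose(p, q):
    """Group product p*q: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, pi in enumerate(p):
        inv[pi] = i
    return tuple(inv)

def generate(gens):
    """Close a set of generators under multiplication (finite group)."""
    identity = tuple(range(len(gens[0])))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            new = compose(s, g)
            if new not in group:
                group.add(new)
                frontier.append(new)
    return sorted(group)

# Dihedral group of the square: a 90-degree rotation and a reflection.
G = generate([(1, 2, 3, 0), (0, 3, 2, 1)])

def adjoint_action(g):
    """The map h -> g h g^{-1}, recorded as a tuple of images over all h in G."""
    g_inv = inverse(g)
    return tuple(compose(compose(g, h), g_inv) for h in G)

center = [z for z in G if all(compose(z, h) == compose(h, z) for h in G)]
kernel = [g for g in G if adjoint_action(g) == tuple(G)]
distinct_actions = {adjoint_action(g) for g in G}
```

Here the elements acting trivially by conjugation are exactly the center, and the number of distinct adjoint actions equals the order of G/Z(G).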

We have seen that the nondemolition measurement of a non-Abelian Wilson loop is an example of an acausal measurement that can be made causal (and in fact localizable) if additional information is collected simultaneously. Other examples were noted in [11].

VI. NONDEMOLITION MEASUREMENT OF ABELIAN WILSON LOOPS IS LOCALIZABLE

The  causality  problem  arose  for  the  nondemolition  mea¬ 
surement  of  non-Abelian  Wilson  loops  because  multiplica¬ 
tion  of  conjugacy  classes  is  ill-defined.  Since  this  problem 
does  not  arise  if  G  is  Abelian,  one  might  expect  that  a  space¬ 
like  Wilson  loop  operator  should  be  measurable  in  an  Abe¬ 
lian  gauge  theory  (or  more  generally,  if  the  Wilson  loop  is 
evaluated  in  a  one-dimensional  irreducible  representation  of 
the  gauge  group).  We  will  see  that  this  is  the  case. 

To be concrete, consider a lattice theory (containing charged matter) with gauge group G = U(1). Gauge variables $U_l \in U(1)$ reside at each link l of the lattice, and matter variables $\phi_x \in U(1)$ reside at each site x. As we have seen, the gauge and matter variables can be eliminated in favor of gauge-invariant variables $u_{xy} = \phi_x^{-1}U_{xy}\phi_y$, and the Wilson loop operator is

$$W(C) = \prod_{l\in C} U_l = \prod_{l\in C} u_l. \tag{64}$$


To perform a destructive measurement of W(C), parties distributed along the loop C could each measure the local value of u; then the results can be multiplied together later to determine the value of the Wilson loop.

To perform a nondemolition measurement of W(C), the procedure must be modified so that only the value of the Wilson loop, and no further information, is collected. Imagine, then, that n parties have been distributed along the loop C, each with access to one of the links of C. And suppose the party who resides at link l can manipulate not only the gauge-invariant field variable $u_l$, but also a gauge-invariant ancilla variable $\tilde u_l \in U(1)$ that will be used to assist with the measurement. Some time ago, the parties prepared an entangled state of their ancilla variables,

$$|\text{initial}\rangle_{\text{anc}} = \int \prod_{l=1}^{n} (d\tilde u_l)\; |\tilde u_1,\tilde u_2,\ldots,\tilde u_n\rangle\; \delta\Big(\prod_{l=1}^{n}\tilde u_l - 1\Big). \tag{65}$$

This state is a coherent superposition of all possible states for the ancilla variables, subject only to one global constraint on the product of all the $\tilde u_l$’s. Now each party applies a local unitary transformation to her lattice field variable and her part of the ancilla:

$$|u_l,\tilde u_l\rangle \to |u_l, u_l\tilde u_l\rangle, \tag{66}$$

a rotation of the ancilla rotor controlled by the value of the lattice rotor. This is achieved by turning on a Hamiltonian that couples $u_l$ and $\tilde u_l$.

The operation Eq. (66) modifies the constraint on the ancilla variables, which becomes

$$\prod_l \tilde u_l = W(C). \tag{67}$$

Now each party can measure the value of her $\tilde u_l$, and broadcast the result to the central authority. The measurement outcomes are random, so that each individual measurement reveals no information about the state of the lattice variables. When the results are accumulated, the value of W(C) can be inferred by evaluating $\prod_l \tilde u_l$, but no further information about the field configuration is acquired. (This type of local measurement making use of a shared entangled ancilla was described in [7], and was shown to be the basis of a separation between classical and quantum multiparty communication complexity in [18].)

Of course, the transformation equation (66) that couples the ancilla to the field variables can also be described in a conjugate basis, which may clarify its meaning. We may write $u = e^{-i\theta}$, $\tilde u = e^{-i\tilde\theta}$, and define the angular momentum Q conjugate to $\tilde\theta$ by

$$e^{-iQ\xi}\,|\tilde\theta\rangle = |\tilde\theta + \xi\rangle. \tag{68}$$

Then Eq. (66) becomes

$$|\theta, Q\rangle \to (e^{-i\theta})^{Q}\,|\theta, Q\rangle. \tag{69}$$

Thus  we  may  regard  Q  as  a  fictitious  electric  charge,  whose 
transport  properties  are  governed  by  the  connection  u — the 
parties  implement  Eq.  (69)  by  “parallel  transporting”  their 
ancilla  charges  by  one  lattice  spacing  in  the  effective  gauge 
field defined by u. The (unnormalizable) initial state of the ancilla can be written

$$|\text{initial}\rangle_{\text{anc}} = \sum_{Q=-\infty}^{\infty} |Q,Q,Q,\ldots,Q\rangle, \tag{70}$$

which is transformed to

$$|\text{final}\rangle_{\text{anc}} = \sum_{Q=-\infty}^{\infty} [W(C)]^{Q}\,|Q,Q,Q,\ldots,Q\rangle. \tag{71}$$

Since the charges held by the parties are perfectly correlated, only the global information about transport around the entire loop C becomes imprinted on the ancilla state. This information, encoded in relative phases in the Q-basis, can be read out via measurements in the conjugate $\tilde\theta$-basis. Note that it is
important  that  the  ancilla  variables  carry  fictitious  rather  than 
genuine  electric  charges — otherwise  states  with  different  val¬ 
ues  of  the  total  charge  would  reside  in  distinct  superselection 
sectors  and  the  relative  phases  in  Eq.  (71)  would  be  unob¬ 
servable.  We  also  note  that  while  to  measure  the  Wilson  loop 
perfectly  we  must  prepare  the  ancilla  in  the  unnormalizable 
(and  hence  unphysical)  state  Eq.  (70),  a  measurement  with 
arbitrarily  good  precision  can  be  achieved  using  a  normaliz¬ 
able  approximation  to  this  state. 
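
The outcome statistics of the protocol in Eqs. (65)-(67) can be mimicked by a classical sampling sketch. The assumptions here are substantial and should be read as such: U(1) is replaced by the discrete phases $e^{2\pi i k/N}$, the entangled ancilla is replaced by uniformly sampled values obeying the product constraint (which reproduces only the statistics of measuring each $\tilde u_l$, not the coherence-preserving character of the quantum measurement), and all names are hypothetical.

```python
import random

N = 12           # phases are multiples of 2*pi/N, a discrete stand-in for U(1)
n_parties = 5
random.seed(0)

def shared_ancilla():
    """Sample ancilla exponents uniformly, subject to sum = 0 mod N (product = 1)."""
    vals = [random.randrange(N) for _ in range(n_parties - 1)]
    vals.append((-sum(vals)) % N)
    return vals

def run_protocol(link_exponents):
    """Each party shifts her ancilla by her link variable, Eq. (66), and reports it."""
    ancilla = shared_ancilla()
    return [(a + u) % N for a, u in zip(ancilla, link_exponents)]

links = [3, 7, 1, 10, 4]            # exponents of the u_l along the loop
wilson_exponent = sum(links) % N    # exponent of W(C)

trials = [run_protocol(links) for _ in range(2000)]

# The product of the reported values always reproduces W(C), Eq. (67) ...
all_products_match = all(sum(r) % N == wilson_exponent for r in trials)
# ... while any single report ranges over every possible value.
first_party_values = {r[0] for r in trials}
```

In this toy version the central authority learns the exponent of W(C) from every run, while an individual report carries no information about the corresponding link value.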

The key to this procedure for measuring the Wilson loop is that W(C) can be expressed in terms of the local gauge-invariant variables $\{u_l\}$ as in Eq. (64). This property has a clear physical interpretation. The matter field represents a medium laid out along the loop C that becomes superconducting on the time slice where the measurement of W(C) is to be carried out: $\phi = e^{-i\theta}$ is a superconducting order parameter with phase $\theta$. Though the phase and the gauge field A are not locally observable, the covariant derivative

$$D_\mu\theta = \partial_\mu\theta + A_\mu \tag{72}$$

is observable; it is proportional to the local current density. By coupling the local current to our entangled ancilla, we have modified the state of the ancilla in a manner that is sensitive to the value of the quantity

$$\exp\Big(i\oint_C D_\mu\theta\, dx^\mu\Big) = W(C); \tag{73}$$

the equality is obtained from the property that $\phi = e^{-i\theta}$ is a single-valued function.

Even without the entangled ancilla, parties distributed along the loop could determine the value of W(C) by measuring the local value of $D_\mu\theta$, and broadcasting their results. In that case, not just W(C) but also the covariant derivative of $\theta$ would be determined by their measurement outcomes. By invoking the entangled ancilla, we have emphasized that it is possible to measure W(C) without learning anything else about the state of the lattice system, that is, to perform a nondemolition measurement of W(C).

It is clear that the technique we have described could be applied in principle to perform a nondemolition measurement of the Wilson loop operator in any one-dimensional representation of the gauge group. But as we have shown must be so, it fails in the non-Abelian case. We can introduce matter fields such that $u_{xy} = \phi_x^{-1}U_{xy}\phi_y$ is a gauge-invariant quantity, but since the $u$’s do not commute with the $\tilde u$’s, the transformation equation (66) will not in that case simply modify the constraint on the ancilla variables as in Eq. (67).


VII. WILSON LOOPS IN THE PURE ABELIAN GAUGE THEORY

Our procedure for the nondemolition measurement of an Abelian Wilson loop uses charged matter coupled to the gauge fields. Let us now consider whether the nondemolition measurement is possible in the pure Abelian gauge theory. When there is no charged matter, we cannot replace the gauge variables on links by gauge-invariant variables that are locally measurable.

A. Homologically trivial loops

Consider first the case of a homologically trivial loop C, the boundary of a two-dimensional surface S. In the Abelian gauge theory, the Wilson loop operator W(C) can be interpreted as $e^{i\Phi}$ where $\Phi$ is the magnetic flux linking the loop. In the lattice formulation of the theory, the surface S is the union of elementary cells that tessellate the surface. Suppose there are N such cells, labeled by an index $\Sigma$ taking values $\Sigma = 1,2,3,\ldots,N$. Then the Wilson loop operator can be expressed as

$$W(C) = \prod_{l\in C} U_l = \prod_{\Sigma\in S} U_\Sigma, \tag{74}$$

where $U_\Sigma$ is the value of the Wilson operator $W(\partial\Sigma)$ for the boundary $\partial\Sigma$ of the cell $\Sigma$. Therefore, a destructive measurement of W(C) can be carried out by a collection of parties occupying the surface S. Each party measures the local “magnetic field” $U_\Sigma$ and reports her result to the central authority. The results can then be accumulated to determine the value of W(C).

This destructive measurement differs from a nondemolition measurement of W(C) in that too much information is collected: not just the total flux through the surface, but also the local distribution of magnetic flux is determined by the measurement. In a nondemolition measurement of W(C), a superposition of two different magnetic field configurations with the same value of W(C) would not decohere, but if the local field is measured this superposition does decohere. Yet as in our previous discussion, a nondemolition measurement can be achieved if ancilla variables are prepared in an appropriate state that is distributed to the parties in advance. Suppose that each of the N parties can access both the gauge-invariant local magnetic field variable $U_\Sigma \in U(1)$ and an ancilla variable $\tilde U_\Sigma \in U(1)$. The ancilla has been prepared in the shared initial state

$$|\text{init}\rangle_{\text{anc}} = \int \prod_{\Sigma=1}^{N} (d\tilde U_\Sigma)\; |\tilde U_1,\tilde U_2,\ldots,\tilde U_N\rangle\; \delta\Big(\prod_{\Sigma=1}^{N}\tilde U_\Sigma - 1\Big), \tag{75}$$

a coherent superposition of all possible states for the ancilla variables, subject only to one global constraint on the product of all the $\tilde U_\Sigma$’s. To perform the nondemolition measurement, each party applies a local unitary transformation to her magnetic flux variable and her part of the ancilla:

$$|U_\Sigma,\tilde U_\Sigma\rangle \to |U_\Sigma, U_\Sigma\tilde U_\Sigma\rangle, \tag{76}$$

a  rotation  of  the  ancilla  rotor  controlled  by  the  value  of  the 
lattice  rotor.  This  operation  modifies  the  constraint  on  the 
ancilla  variables,  which  has  become 

n  t/2=n  us=w(o.  (77) 

2  2 

Now  each  party  can  measure  the  value  of  her  U x ,  and 
broadcast  the  result  to  the  central  authority.  The  measure¬ 
ment  outcomes  are  random,  so  that  each  individual  measure¬ 
ment  reveals  no  information  about  the  state  of  the  lattice 
variables.  When  the  results  are  accumulated,  the  value  of 
W(C)  can  be  inferred  by  evaluating  Hx&x ,  but  no  further 
information  about  the  gauge  field  configuration  is  acquired. 
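
The bookkeeping of this protocol can be illustrated with a short classical sketch. It assumes the lattice is in a flux eigenstate, so each cell carries a definite phase, and it samples the ancilla outcomes directly instead of simulating the quantum state. The function names (`random_phase`, `run_protocol`) are hypothetical and chosen only for this illustration.

```python
import cmath
import math
import random


def random_phase(rng):
    """Return a uniformly random element of U(1)."""
    return cmath.exp(1j * rng.uniform(0.0, 2.0 * math.pi))


def product(values):
    """Multiply a sequence of complex numbers."""
    result = 1 + 0j
    for v in values:
        result *= v
    return result


def run_protocol(cell_fluxes, rng):
    """One run of the ancilla-assisted measurement for a flux eigenstate.

    cell_fluxes: unit-modulus complex numbers, one flux value per cell of S.
    Returns the broadcast outcomes and the value of W(C) inferred from them.
    """
    n = len(cell_fluxes)
    # Sample the ancilla: N-1 free phases, the last fixed so the product is 1.
    ancilla = [random_phase(rng) for _ in range(n - 1)]
    ancilla.append(product(ancilla).conjugate())
    # Controlled rotation: each ancilla rotor is multiplied by the local flux.
    outcomes = [u * a for u, a in zip(cell_fluxes, ancilla)]
    # The central authority multiplies the broadcast values together.
    return outcomes, product(outcomes)
```

Each broadcast value is individually a random phase, while the product of all of them reproduces the product of the cell fluxes, which is the only quantity the central authority learns.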

B.  Homologically  nontrivial  loops 

Now consider the case of a homologically nontrivial loop C, which is not the
boundary of any surface. For example, suppose that the theory lives on a
d-dimensional spatial torus (a rectangular box with opposite sides
identified), and that the loop C is a nontrivial cycle that winds around the
torus.

The gauge-invariant local operators of the theory are the magnetic flux
operators $U_\Sigma$ acting on the elementary lattice cells, and the “electric
field” operators that act on elementary links. The electric field $E_l$ at the
link l is the “angular momentum” conjugate to the link rotor variable $U_l$;
it generates rotations of $U_l$:

$$\exp(-i\theta E_l):\ |U\rangle_l \to |e^{-i\theta} U\rangle_l. \qquad (78)$$

Each party residing on the lattice is empowered to apply or measure the local
operators in her vicinity.

But the homologically nontrivial Wilson loop operator is not included in the
algebra generated by these local operations. Hence W(C), where C is a
nontrivial cycle, is completely inaccessible to the local residents of the
lattice. They cannot measure this operator, either destructively or
nondestructively, nor can they apply it to a state. The homologically
nontrivial Wilson loop is not an observable of the pure gauge theory.

Although the inhabitants of this world are unable to measure W(C), they are
able to change its value. The link rotation $e^{-i\theta E_l}$ has a
nontrivial commutation relation with W(C) if $l \in C$:

$$e^{-i\theta E_l}\, W(C) = e^{-i\theta}\, W(C)\, e^{-i\theta E_l}. \qquad (79)$$

(Here the orientation of the link l used to define $E_l$ is assumed to be
aligned with the orientation of C at link l.) Thus any party with access to a
link l of C can rotate the value of W(C), whether or not C is the boundary of
a surface.

Like Wilson loop operators, electric field operators are of two types with
differing locality properties. If C is a fundamental nontrivial cycle, we can
construct an electric field operator $E_C$ that rotates W(C) but has no effect
on homologically trivial Wilson loops. Associated with the cycle C of the
torus is a closed orientable hypersurface S that crosses C exactly once; dual
to this surface is a set of oriented lattice links $S^*$, as illustrated in
Fig. 6. The electric field conjugate to W(C) is

$$E_C = \sum_{l \in S^*} E_l. \qquad (80)$$

This nonlocal operator generates a rotation of the homologically nontrivial
Wilson loop W(C), but since any homologically trivial closed loop crosses S as
many times with a + orientation as with a − orientation, homologically
trivial Wilson loop operators commute with $E_C$.

The “nonlocal electric field” $E_C$ can be measured: all parties residing at
links contained in $S^*$ can measure the local electric field and the results
can be summed. But while the inhabitants of the lattice are able to measure
$E_C$, they are unable to change its value. The Hilbert space of the theory
divides into superselection sectors, each labeled by the values of
$E_{C_i} \in \mathbb{Z}$, where the $C_i$'s are the cycles that generate the
homology group of the spatial manifold.


FIG. 6. The nonlocal electric field operator dual to a homologically
nontrivial Wilson loop operator W(C), in an Abelian lattice gauge theory in
two spatial dimensions. Here a two-torus is represented as a square with
opposite sides identified, C is a nontrivial oriented cycle that winds around
the torus, and $S^*$ is the set of oriented links dual to a closed “surface”
that crosses C once. Any homologically trivial closed loop (like C′) crosses
$S^*$ as many times with a + orientation as with a − orientation. Thus the
electric field operator on $S^*$ commutes with W(C′), but has a nontrivial
commutation relation with W(C).

It  is  obvious  that  similar  conclusions  apply  to  any  Abelian 
pure  gauge  theory.  If  the  theory  is  defined  on  a  manifold  with 
nontrivial  homology,  then  the  algebra  of  observables  has  a 
different  structure  in  the  theory  with  charged  matter  than  in 
the  pure  gauge  theory  without  matter.  In  the  pure  gauge 
theory,  the  homologically  nontrivial  Wilson  loops  are  not 
observables  at  all,  and  consequently,  the  theory  divides  into 
sectors  with  different  values  of  the  nonlocal  electric  field. 


VIII.  SUPERLUMINAL  CHARGE  TRANSPORT 

The main conclusion of this paper is that the observables of Abelian and
non-Abelian gauge theories have fundamentally different properties; in
particular, the nondemolition measurement of a Wilson loop is acausal in the
non-Abelian case and localizable in the Abelian case. We can further
appreciate the distinction between Abelian and non-Abelian gauge theories by
thinking about not what operators can be measured, but rather what operators
can be applied to a state by a group of parties each of whom acts locally.

To dramatize the question, imagine two parties Alice and Bob, many light years
apart, who share a “superluminal charge transport line” (SCTL). Alice places a
single electrically charged particle, an electron, at her end of the SCTL (the
point y); then her charge mysteriously disappears, and in an instant reappears
at Bob’s end of the SCTL (the point x). The electron has been transmitted
through the SCTL far more rapidly than Alice could send a light signal to Bob.
Is such a device physically possible?

Yes. We can understand how the SCTL works by characterizing it with a
gauge-invariant unitary operator that it applies to a state. In our lattice
formulation of an Abelian gauge theory with matter, consider a connected path
of links P that begins at y and ends at x. Associated with this
path is the gauge-invariant operator

$$\phi_x^{-1} \Big(\prod_{l \in P} U_l\Big)\, \phi_y = \prod_{l \in P} u_l. \qquad (81)$$

FIG. 7. (a) “Sawtooth protocol” for superluminal transmission of an electron
from Alice to Bob, assisted by many intervening parties. Each party (except
Alice and Bob) produces an electron-positron pair and keeps the positron, and
each (except Bob) passes an electron to the party on her right. Then all pairs
annihilate. Thus a charged particle sent by Alice is received by Bob almost
instantaneously, even though Bob is many light years away. (b) The protocol
fails to achieve superluminal transport of non-Abelian charge. All intervening
parties produce color-singlet pairs of charges, but when each party unites her
antiparticle with the particle created by her neighbor, the pairs fail to
annihilate completely. Though the procedure conserves color, the color of the
charge received by Bob is uncorrelated with the color of the particle that had
been in Alice’s possession. In both the Abelian and non-Abelian cases, no
information is transmitted from Alice to Bob, so that causality is not
violated.


Acting on the weakly coupled ground state of the theory, this operator creates
a pair of equal and opposite charges at the sites x and y. Acting on a state
with a charged particle at site y, it annihilates the particle at y while
creating a particle of like charge at x, in effect transporting the particle
from y to x. The applied operator factorizes as a product of gauge-invariant
unitary operators $u_l$, each acting on a single lattice link. Therefore, many
parties acting simultaneously, each manipulating only the link in her own
vicinity, are able to operate the SCTL.

More physically, we can envision the operation of the SCTL as in Fig. 7. Many
parties are distributed along the SCTL. At a pre-arranged time, each party
creates an electron-positron pair. Retaining the positron, she passes the
electron to her right, while receiving an electron from the party on her left.
Then she brings electron and positron together to annihilate. Claire, the
party closest to Alice, receives an electron from Alice and annihilates it
with Claire’s positron, while Diane, the party closest to Bob, hands her
electron to Bob. After all pairs annihilate, the sole remaining electron,
initially in Alice’s hands, has been delivered to Bob. The closer the parties
are to one another, the faster the procedure can be completed.

Even though the charge transfer is virtually instantaneous, the Gauss law is
satisfied at all times. If we draw surfaces around Alice and Bob, then while
the SCTL is operating one unit of charge leaves Alice’s surface and one unit
enters Bob’s. Furthermore, even though the charge moves superluminally, the
process does not violate causality, since no information is transmitted from
Alice to Bob. Indeed, if Felicity, a party in the middle of the SCTL, were to
disobey orders and fail to create an electron-positron pair, then Felicity
would “intercept” the charge sent by Alice, and Felicity’s neighbor on the
right would share a distantly separated electron-positron pair with Bob. When
Bob receives the electron, all he learns is that his left neighbor has
performed as expected, but he learns nothing about the activities of Alice.

While an Abelian charge really carries no information, non-Abelian charge is
much more interesting: its orientation in a representation space can encode a
message. Thus it is easy to see that a non-Abelian SCTL, were one to exist,
would violate causality. To be explicit, consider the following protocol that
enables Bob to send classical information to Alice (based on ideas similar to
those used to show that Wilson loop measurement is acausal). First, Alice
produces a particle-antiparticle pair, where the particle transforms as
representation (R) of G and the antiparticle as representation (R*). The total
charge of the pair is trivial. If $\{|e_i\rangle,\ i = 1, 2, 3, \ldots, n_R\}$
denotes a basis for the representation (R), and $\{|e_i^*\rangle\}$ denotes
the conjugate basis for (R*), then the singlet state prepared by Alice is

$$\frac{1}{\sqrt{n_R}} \sum_i |e_i\rangle \otimes |e_i^*\rangle. \qquad (82)$$

Alice keeps the antiparticle, and sends the particle through the SCTL to Bob.
Bob has a loop of magnetic flux that he has prepared in the charge-zero state
$|\alpha, 0\rangle$ associated with conjugacy class $\alpha$ of G, as in
Eq. (6). To convey a bit of information to Alice, Bob either does nothing to
the charged particle he received from Alice (sending 0) or lassoes it with his
flux tube (sending 1), and returns the charge through the SCTL to Alice. Now
if Bob did nothing, Alice recovers a singlet pair, but if Bob lassoed the
charge, then the state of the pair has become entangled with the state of
Bob’s tube:

$$\frac{1}{\sqrt{n_R}} \frac{1}{\sqrt{|\alpha|}} \sum_{a \in \alpha} \sum_{i,j} |e_i\rangle \otimes |e_j^*\rangle\, D^{(R)}_{ij}(a) \otimes |a\rangle. \qquad (83)$$

Alice then unites the particles and observes whether the pair annihilates. In
the state Eq. (83), the probability of annihilation is determined by the
overlap of the pair’s state with the singlet state, and is readily seen to be

$$\mathrm{Prob} = \left| \frac{1}{n_R}\, \chi^{(R)}(\alpha) \right|^2, \qquad (84)$$

where $\chi^{(R)}(\alpha)$ is the character of class $\alpha$ in
representation (R). As long as the representation (R) is not one-dimensional,
the class $\alpha$ can be chosen so that this probability is less than one.
Therefore, Alice observes annihilation with certainty if Bob sends 0 and
observes annihilation with probability less than unity if Bob sends 1; thus
Bob can signal Alice.
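
As a concrete check of the annihilation probability, the sketch below evaluates the overlap with the singlet for a specific example that is not taken from the text: G is assumed to be the permutation group S_3 and (R) its two-dimensional irreducible representation, realized by rotations and reflections of the plane. The amplitude for the flux tube to be left in the state labeled by class element a is proportional to the trace of the representation matrix of a divided by n_R, so summing the squared amplitudes over the class gives the probability.

```python
import math


def rotation(theta):
    """2x2 rotation matrix by angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, -s), (s, c))


def reflection(theta):
    """2x2 reflection matrix across the line at angle theta / 2."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, s), (s, -c))


def trace(m):
    return m[0][0] + m[1][1]


# Conjugacy classes of S_3 in its two-dimensional irreducible representation.
S3_CLASSES = {
    "identity": [rotation(0.0)],
    "three_cycles": [rotation(2 * math.pi / 3), rotation(4 * math.pi / 3)],
    "transpositions": [reflection(0.0), reflection(2 * math.pi / 3),
                       reflection(4 * math.pi / 3)],
}


def annihilation_probability(class_elements, n_r=2):
    """Sum over class elements a of |tr D(a) / n_R|^2 / |class|."""
    size = len(class_elements)
    return sum((trace(d) / n_r) ** 2 for d in class_elements) / size
```

In this example the pair always annihilates for the identity class, while a flux tube in the class of transpositions (character zero) prevents annihilation entirely, so Alice can distinguish Bob's two choices.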

The capacity of the SCTL is easily estimated. Suppose that Alice will signal
Bob by transmitting N particles (where N is even) each transforming as the
representation (R) or its conjugate representation (R*). She can prepare and
send a state of N/2 particles and N/2 antiparticles, in any one of $A_N$
distinct singlet states. These states are mutually orthogonal and in principle
they can be readily distinguished by Bob. Therefore, Alice is able to send
$\log_2 A_N$ bits to Bob by using the SCTL N times. But the number of singlets
is

$$A_N \sim \frac{(n_R)^N}{P(N)}, \qquad (85)$$

where P(N) grows no faster than a polynomial in N. Thus, asymptotically Alice
can send $\log_2 n_R$ bits of information per transmission. This rate is just
what we would have guessed naively, ignoring that observables must be gauge
invariant.
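
The asymptotic rate can be checked in one case where the singlet count is known in closed form. The sketch below assumes G = SU(2) with (R) the doublet, a choice made here only for illustration; in that case (R) and (R*) are equivalent, and the number of singlets formed from N doublets is the Catalan number with index N/2. The function names are hypothetical.

```python
import math


def num_singlets_su2(n):
    """Number of independent singlets built from n SU(2) doublets (n even)."""
    if n < 0 or n % 2:
        raise ValueError("n must be a non-negative even integer")
    half = n // 2
    # Catalan number C_half = binom(n, half) / (half + 1).
    return math.comb(n, half) // (half + 1)


def bits_per_transmission(n):
    """log2 of the singlet count, divided by the number of uses of the line."""
    return math.log2(num_singlets_su2(n)) / n
```

For large N the rate approaches one bit per transmission from below, which matches $\log_2 n_R$ with $n_R = 2$; the shortfall reflects the polynomial factor P(N).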

Since the non-Abelian SCTL is acausal, it ought not to be physically
realizable. What goes wrong if we try the same procedure that succeeded in the
Abelian case? The trouble is that if Claire produces a singlet pair, and Diane
does the same, then when Claire’s particle unites with Diane’s antiparticle,
the charges might be unable to annihilate. In fact, if Claire’s particle
transforms as the representation (R) and Diane’s as (R*), then the probability
that the pair annihilates, determined by its overlap with the singlet state,
is $1/n_R^2$. Thus, while in the Abelian case the outcome of the procedure is
that only a single electron survives, which is in Bob’s possession, in the
non-Abelian case many relic charges remain strewn along the path of the
would-be SCTL. Though the procedure conserves charge, the orientation in the
representation space of the charge that Bob receives is actually uncorrelated
with the orientation of the charge that Alice sent, and no information is
transmitted.
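
The annihilation probability for a particle and an antiparticle drawn from two different singlet pairs can be verified by direct enumeration. The sketch below is an illustration under the assumption that each pair is in the singlet state built from matching basis labels; it projects Claire's particle and Diane's antiparticle onto the singlet and sums the squared amplitudes of what remains. The function name is hypothetical.

```python
from itertools import product


def cross_pair_annihilation_probability(n_r):
    """Probability that a particle from one singlet pair annihilates with
    the antiparticle from a different, independently prepared singlet pair."""
    norm = n_r ** -0.5

    def amplitude(c, c_bar, d, d_bar):
        # Product state of two singlets: (c, c_bar) is Claire's pair,
        # (d, d_bar) is Diane's pair.
        first = norm if c == c_bar else 0.0
        second = norm if d == d_bar else 0.0
        return first * second

    prob = 0.0
    # Leftover labels: Claire's antiparticle and Diane's particle.
    for c_bar, d in product(range(n_r), repeat=2):
        # Project Claire's particle and Diane's antiparticle onto the singlet.
        projected = sum(norm * amplitude(k, c_bar, d, k) for k in range(n_r))
        prob += projected * projected
    return prob
```

For a one-dimensional (Abelian) representation the pair always annihilates, while for larger representations the probability falls off as the inverse square of the dimension.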

Finally, in the non-Abelian theory as in the Abelian theory, the operator that
propagates a charged particle from y to x can be factorized as in Eq. (81)
into local factors. So why can’t this operator be applied by many parties,
each acting locally? We must recall that the operators of the theory are not
the group elements $u_l \in G$ themselves, but rather the matrix elements
$D^{(R)}(u_l)$ of representations of the group. In the Abelian case, the
character of a product of group elements can be written as a product of
characters, where each character is a unitary operator. But in the non-Abelian
case, the “factorized” operator is really a product of matrices. The
contraction of indices in this matrix product cannot be achieved by many
parties acting locally; rather it requires a nonlocal conspiracy.


IX.  CONCLUSIONS 

In the standard formulation of algebraic relativistic quantum field theory
[12], an algebra of “local” operators on Hilbert space is associated with each
bounded open region of spacetime, such that two local operators commute if
they are associated with regions that are spacelike separated. A local
operator is designated as a “local observable” if it preserves the
superselection sectors of the theory. One might be tempted to postulate that a
quantum operation is physically possible in principle if and only if it can be
expanded in terms of these local observables.

We find this viewpoint untenable, because causality places more stringent
constraints on the allowed operations [11]. The problem of characterizing
which quantum operations are compatible with causality is especially subtle,
interesting, and physically relevant in relativistic quantum field theories
with local gauge symmetry.

One form of the question is: what operators can justifiably be called
“observables?” We have focused our attention on the measurability of the
Wilson loop because of its prominent place in the operator algebra of a gauge
theory. The answer we have found is rather elaborate. In a gauge theory that
includes charged matter that transforms faithfully, a destructive measurement
of a spacelike Wilson loop $W^{(R)}(C)$ is physically possible for any
representation (R) of the gauge group. The term “destructive” means that many
cooperating parties acting together can ascertain the value of the Wilson
loop, but only by collecting additional information in the process, and at the
price of damaging Wilson loop eigenstates. In a pure gauge theory (one with no
charged matter), the destructive measurement of $W^{(R)}(C)$ is possible for
any (R) that represents the center of the gauge group trivially. A
nondemolition measurement of the Wilson loop (one that leaves Wilson loop
eigenstates intact) is possible in an Abelian gauge theory but not in a
non-Abelian gauge theory.

Nondemolition measurement of a non-Abelian Wilson loop is impossible because
it would conflict with relativistic causality. Two distantly separated parties
(Alice and Bob) can each produce excitations locally (magnetic flux tubes),
preparing a state that is not an eigenstate of the Wilson loop operator W(C),
where C is a loop that passes through both excitations. Projecting onto a
Wilson loop eigenstate, whatever the outcome, entangles Alice’s excitation
with Bob’s, modifying the excitations in a manner that either party can
discern locally. Such instantaneous preparation of quantum entanglement would
enable spacelike-separated Alice and Bob to communicate.

In  quantum  field  theory  in  general,  and  in  gauge  theories 
in  particular,  characterizing  the  physically  allowed  quantum 
operations  seems  to  be  an  open  problem.  Further  progress  on 
this  question  is  bound  to  elucidate  the  physical  content  of 
relativistic quantum theory.

We describe a general technique that allows one to induce and control strong
interaction between spin states of neighboring atoms in an optical lattice. We
show that the properties of spin exchange interactions, such as magnitude,
sign, and anisotropy, can be designed by adjusting the optical potentials. We
illustrate how this technique can be used to efficiently “engineer” quantum
spin systems with desired properties, for specific examples ranging from
scalable quantum computation to probing a model with complex topological order
that supports exotic anyonic excitations.


DOI: 10.1103/PhysRevLett.91.090402

Recent observations of the superfluid to Mott insulator transition in a system
of ultracold atoms in an optical lattice open fascinating prospects for
studying many-body phenomena associated with strongly correlated systems in a
highly controllable environment [1-4]. For instance, the recent studies have
shown that, with spinor bosonic or fermionic atoms in optical lattices, it may
be possible to observe complex quantum phase transitions [5,6], to probe novel
superfluidity mechanisms [7,8], or to demonstrate the spin-charge separation
predicted from the Luttinger liquid model [9].

This Letter describes a general technique to control many-body spin
Hamiltonians using ultracold atoms. Specifically, we show that, when two-state
bosonic or fermionic atoms are confined in an optical lattice, the interaction
between spins of the particles can be controlled by adjusting the intensity,
frequency, and polarization of the trapping light. The essential idea is to
induce and control virtual spin-dependent tunneling between neighboring atoms
in the lattice that results in a controllable Heisenberg exchange interaction.
By combining this simple experimental technique with the design of the lattice
geometry, it is possible to engineer many interesting spin Hamiltonians
corresponding to strongly correlated systems.

Such techniques are of particular significance since quantum magnetic
interactions are central to understanding complex orders and correlations
[10]. We illustrate this with several examples: (i) We show that one of the
generated Hamiltonians provides us an easy way to realize the so-called
cluster states in two or three dimensions [11], which are useful for an
implementation of scalable quantum computation with neutral atoms; (ii) we
show that the realized Hamiltonian has a rich phase diagram, opening up the
possibility to observe various quantum magnetic phase transitions in a
controllable way; (iii) finally, we show how to implement an exactly solvable
spin Hamiltonian recently proposed by Kitaev [12], which supports Abelian and
non-Abelian anyonic excitations with exotic fractional statistics. Abelian
anyons could also exist in a fast rotating condensate [13].


PACS  numbers:  03.75.Nt,  03.67.-a,  42.50.-p,  73.43.-f 

We consider an ensemble of ultracold bosonic or fermionic atoms confined in an
optical lattice formed by several standing-wave laser beams. We are interested
in the Mott insulator regime, and the atomic density of roughly one atom per
lattice site. Each atom is assumed to have two relevant internal states, which
are denoted with the effective spin index $\sigma = \uparrow, \downarrow$,
respectively. We assume that the atoms with spins
$\sigma = \uparrow, \downarrow$ are trapped by independent standing-wave laser
beams through polarization (or frequency) selection. Each laser beam creates a
periodic potential $V_{\mu\sigma} \sin^2(\vec k_\mu \cdot \vec r)$ in a
certain direction $\mu$, where $\vec k_\mu$ is the wave vector of light. For
sufficiently strong periodic potential and low temperatures, the atoms will be
confined to the lowest Bloch band, as has been confirmed by experiments [1],
and the low energy Hamiltonian is then given by

$$H = -\sum_{\langle ij \rangle \sigma} \left( t_{\mu\sigma}\, a^\dagger_{i\sigma} a_{j\sigma} + \mathrm{H.c.} \right) + \frac{1}{2} \sum_{i,\sigma} U_\sigma\, n_{i\sigma} (n_{i\sigma} - 1) + U_{\uparrow\downarrow} \sum_i n_{i\uparrow} n_{i\downarrow}. \qquad (1)$$

Here $\langle i, j \rangle$ denotes the near neighbor sites in the direction
$\mu$, $a_{i\sigma}$ are bosonic (or fermionic) annihilation operators,
respectively, for bosonic (or fermionic) atoms of spin $\sigma$ localized on
site i, and $n_{i\sigma} = a^\dagger_{i\sigma} a_{i\sigma}$.

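For the cubic lattice ($\mu = x, y, z$) and using a harmonic approximation
around the minima of the potential [3], the spin-dependent tunneling energies
and the on-site interaction energies are given by
$t_{\mu\sigma} \approx (4/\sqrt{\pi})\, E_R^{1/4} (V_{\mu\sigma})^{3/4} \exp[-2 (V_{\mu\sigma}/E_R)^{1/2}]$
and
$U_{\uparrow\downarrow} \approx (8/\pi)^{1/2} (k a_{s\uparrow\downarrow}) (E_R \bar V_{1\uparrow\downarrow} \bar V_{2\uparrow\downarrow} \bar V_{3\uparrow\downarrow})^{1/4}$.
Here
$\bar V_{\mu\uparrow\downarrow} = 4 V_{\mu\uparrow} V_{\mu\downarrow} / (V_{\mu\uparrow}^{1/2} + V_{\mu\downarrow}^{1/2})^2$
is the spin average potential in each direction, $E_R = \hbar^2 k^2 / 2m$ is
the atomic recoil energy, and $a_{s\uparrow\downarrow}$ is the scattering
length between the atoms of different spins. For bosonic atoms
$U_\sigma \approx (8/\pi)^{1/2} (k a_{s\sigma}) (E_R V_{1\sigma} V_{2\sigma} V_{3\sigma})^{1/4}$
($a_{s\sigma}$ are the corresponding scattering lengths). For fermionic atoms,
$U_\sigma$ is on the order of the Bloch band separation
$\sim 2\sqrt{V_{\mu\sigma} E_R}$, which is typically much larger than
$U_{\uparrow\downarrow}$ and can be taken to be infinite. In writing Eq. (1),
we have neglected overall energy shifts
$\sum_{i\mu} (\sqrt{E_R V_{\mu\uparrow}} - \sqrt{E_R V_{\mu\downarrow}}) (n_{i\uparrow} - n_{i\downarrow})/2$,
which can be easily compensated by a homogeneous external magnetic field
applied in the z direction.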
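
To get a feeling for how strongly the tunneling depends on the lattice depth, the harmonic-approximation expressions can be evaluated numerically. The sketch below is an illustration only: it assumes the forms $t \approx (4/\sqrt{\pi}) E_R^{1/4} V^{3/4} \exp[-2 (V/E_R)^{1/2}]$ and $U \approx (8/\pi)^{1/2} (k a_s)(E_R V_1 V_2 V_3)^{1/4}$, together with the spin-averaged depth $4 V_\uparrow V_\downarrow / (V_\uparrow^{1/2} + V_\downarrow^{1/2})^2$, and works in units of the recoil energy. The function and parameter names are hypothetical.

```python
import math


def tunneling(depth, e_r=1.0):
    """Tunneling energy for a lattice of the given depth (harmonic approx.)."""
    prefactor = 4.0 / math.sqrt(math.pi)
    return (prefactor * e_r ** 0.25 * depth ** 0.75
            * math.exp(-2.0 * math.sqrt(depth / e_r)))


def spin_averaged_depth(depth_up, depth_down):
    """Effective depth entering the interaction between unlike spins."""
    denom = (math.sqrt(depth_up) + math.sqrt(depth_down)) ** 2
    return 4.0 * depth_up * depth_down / denom


def onsite_interaction(k_times_a, depths, e_r=1.0):
    """On-site interaction; depths is a 3-tuple, k_times_a is k * a_s."""
    v1, v2, v3 = depths
    return math.sqrt(8.0 / math.pi) * k_times_a * (e_r * v1 * v2 * v3) ** 0.25
```

Doubling the depth from 10 to 20 recoil energies suppresses the tunneling by roughly an order of magnitude in this approximation, whereas the on-site interaction changes only by the factor $2^{3/4}$.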

From the above expressions, we observe that $t_{\mu\sigma}$ depend sensitively
(exponentially) upon the ratios $V_{\mu\sigma}/E_R$, while
$U_{\uparrow\downarrow}$ and $U_\sigma$ exhibit only weak dependence. We can
easily introduce spin-dependent tunneling $t_{\mu\sigma}$ by varying the
potential depths $V_{\mu\uparrow}$ and $V_{\mu\downarrow}$ with control of the
intensity of the trapping laser. We now show that this simple experimental
method provides us a powerful tool to engineer many-body Hamiltonians. We are
interested in the regime where
$t_{\mu\sigma} \ll U_\sigma, U_{\uparrow\downarrow}$ and
$\langle n_{i\uparrow} \rangle + \langle n_{i\downarrow} \rangle \simeq 1$,
which corresponds to an insulating phase. In this regime, the terms
proportional to tunneling $t_{\mu\sigma}$ can be considered via perturbation
theory. We use a simple generalization of the Schrieffer-Wolff transformation
[14] (see another method in [8]) and, to the leading order in
$t_{\mu\sigma}/U_{\uparrow\downarrow}$, Eq. (1) is equivalent to the following
effective Hamiltonian:

$$H = \sum_{\langle i,j \rangle} \left[ \lambda_{\mu z}\, \sigma_i^z \sigma_j^z \pm \lambda_{\mu\perp} \left( \sigma_i^x \sigma_j^x + \sigma_i^y \sigma_j^y \right) \right]. \qquad (2)$$


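Here $\sigma_i^z = n_{i\uparrow} - n_{i\downarrow}$,
$\sigma_i^x = a^\dagger_{i\uparrow} a_{i\downarrow} + a^\dagger_{i\downarrow} a_{i\uparrow}$,
and
$\sigma_i^y = -i (a^\dagger_{i\uparrow} a_{i\downarrow} - a^\dagger_{i\downarrow} a_{i\uparrow})$
are the usual spin operators. The + and − signs before $\lambda_{\mu\perp}$ in
Eq. (2) correspond, respectively, to the cases of fermionic and bosonic atoms.
The parameters $\lambda_{\mu z}$ and $\lambda_{\mu\perp}$ for the bosonic
atoms are given by

$$\lambda_{\mu z} = \frac{t_{\mu\uparrow}^2 + t_{\mu\downarrow}^2}{2 U_{\uparrow\downarrow}} - \frac{t_{\mu\uparrow}^2}{U_\uparrow} - \frac{t_{\mu\downarrow}^2}{U_\downarrow}, \qquad \lambda_{\mu\perp} = \frac{t_{\mu\uparrow} t_{\mu\downarrow}}{U_{\uparrow\downarrow}}. \qquad (3)$$

For fermionic atoms, the expression for $\lambda_{\mu\perp}$ is the same as in
(3), but in the expression for $\lambda_{\mu z}$ the last two terms vanish
since $U_\sigma \gg U_{\uparrow\downarrow}$. In writing Eq. (2), we neglected
a term linear in $\sigma_i^z$, which can be easily compensated by an applied
external magnetic field.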

The Hamiltonian (2) represents the well-known anisotropic Heisenberg model
(XXZ model), which arises in the context of various condensed matter systems
[10]. However, the approach involving ultracold atoms has a unique advantage
in that the parameters $\lambda_{\mu z}$ and $\lambda_{\mu\perp}$ can be
easily controlled by adjusting the intensity of the trapping laser beams. They
can also be changed within a broad range by tuning the ratio between the
scattering lengths $a_{s\uparrow\downarrow}$ and $a_{s\sigma}$
($\sigma = \uparrow, \downarrow$) by adjusting an external magnetic field
through Feshbach resonance [15]. Therefore, even with bosonic atoms alone, it
is possible to realize the entire class of Hamiltonians in the general form
(2) with an arbitrary ratio $\lambda_{\mu z}/\lambda_{\mu\perp}$. This is
important since bosonic atoms are generally easier to cool. In Fig. 1(a), we
show the phase diagram of the Hamiltonian (2) on a bipartite lattice as a
function of $\beta_t = t_\uparrow/t_\downarrow + t_\downarrow/t_\uparrow$ and
$U_{\uparrow\downarrow}/U_\sigma$ [16] for the case when
$U_\uparrow = U_\downarrow$ and $t_{\mu\sigma}$ is independent of the spatial
direction $\mu$. Certain lines on this phase diagram correspond to well-known
spin systems: When $U_{\uparrow\downarrow}/U_\sigma = 1/2$ we have an XY
model; when $\beta_t = \infty$ ($t_\uparrow$ or $t_\downarrow$ is zero) we
have an Ising model; for
$\beta_t = \pm (1/2 - U_{\uparrow\downarrow}/U_\sigma)^{-1}$ we have SU(2)
symmetric antiferromagnetic or ferromagnetic systems, respectively.

FIG. 1 (color online). Phase diagrams of the Hamiltonians (2) for bosonic and
fermionic atoms: (a) at zero magnetic field, (b) with a longitudinal field
$h_z$, and (c) (for bosons only) with a transverse field $h_x$. Each phase is
characterized by the following order parameter: I, z-Néel order; II,
z-ferromagnetic order; III, xy-Néel order for fermionic atoms and
xy-ferromagnetic order for bosonic atoms; IV and V, spin polarization in the
direction of applied field, z and x, respectively.
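
The special lines of the phase diagram follow from simple algebra on the bosonic exchange constants, which can be checked numerically. The sketch below assumes the bosonic forms $\lambda_z = (t_\uparrow^2 + t_\downarrow^2)/(2U_{\uparrow\downarrow}) - t_\uparrow^2/U_\uparrow - t_\downarrow^2/U_\downarrow$ and $\lambda_\perp = t_\uparrow t_\downarrow / U_{\uparrow\downarrow}$; the function names and the numerical values used to exercise them are hypothetical.

```python
def exchange_couplings(t_up, t_down, u_up, u_down, u_updown):
    """Return (lambda_z, lambda_perp) for bosonic atoms."""
    lam_z = ((t_up ** 2 + t_down ** 2) / (2.0 * u_updown)
             - t_up ** 2 / u_up - t_down ** 2 / u_down)
    lam_perp = t_up * t_down / u_updown
    return lam_z, lam_perp


def beta_t(t_up, t_down):
    """Tunneling asymmetry parameter t_up/t_down + t_down/t_up."""
    return t_up / t_down + t_down / t_up


def anisotropy(t_up, t_down, u_same, u_updown):
    """lambda_z / lambda_perp when U_up = U_down = u_same."""
    lam_z, lam_perp = exchange_couplings(t_up, t_down, u_same, u_same, u_updown)
    return lam_z / lam_perp
```

With equal same-spin interactions the ratio reduces to $\beta_t (1/2 - U_{\uparrow\downarrow}/U_\sigma)$, so it vanishes on the XY line and equals ±1 on the SU(2) symmetric lines.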

Before proceeding, we estimate the typical energy scales and discuss the
influence of imperfections and noise. For Rb atoms with a lattice constant
$\pi/|\vec k| \sim 426$ nm, the typical tunneling rate $t/\hbar$ can be chosen
from zero to a few kHz [1]. The on-site interaction $U/\hbar$ corresponds to a
few kHz at zero magnetic field, but can be much larger near the Feshbach
resonance. The energy scale for magnetic interaction is about
$t^2/(\hbar U) \sim 0.1$ kHz (corresponding to a time scale of 10 ms) with a
conservative choice of $U \sim 2$ kHz and $(t/U)^2 \sim 1/20$. These energy
scales are clearly compatible with current experiments [1]. We further note
that the present system should be quite robust to realistic noise and
imperfections. First of all, the next order correction to the Hamiltonian (2)
is proportional to $(t/U)^2$, which is small in the Mott regime. Second, since
the atoms only virtually tunnel to the neighboring sites with a small
probability $(t/U)^2$, the dephasing rate and the inelastic decay rate are
significantly reduced compared with the cold collision scheme [17,18].
Finally, the spontaneous emission noise rate can be made very small by using a
blue-detuned optical lattice or by increasing the detuning. In a blue-detuned
lattice, even with a moderate detuning $\Delta \sim 5$ GHz, the effective
spontaneous emission rate is estimated to be of the order of Hz, which is
significantly smaller than $t^2/(\hbar U)$.
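
The quoted magnetic energy scale follows from a one-line estimate, reproduced here as a small sketch. It takes the values stated in the text (U of about 2 kHz and a squared ratio of tunneling to interaction of about 1/20) and, following the text's convention, converts a rate in kHz to a time scale in ms by simple inversion. The function names are hypothetical.

```python
def exchange_rate_khz(u_khz, t_over_u_squared):
    """Magnetic exchange scale t^2 / U, written as (t/U)^2 * U, in kHz."""
    return t_over_u_squared * u_khz


def time_scale_ms(rate_khz):
    """Time scale associated with a rate in kHz (1 / kHz = ms)."""
    return 1.0 / rate_khz


rate = exchange_rate_khz(u_khz=2.0, t_over_u_squared=1.0 / 20.0)
period = time_scale_ms(rate)
```

With these inputs `rate` is about 0.1 kHz and `period` about 10 ms, in agreement with the figures in the paragraph above.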

We  now  illustrate  the  ability  to  engineer  many-body 
spin  Hamiltonians  with  specific  examples.  For  the  first 
example,  we  set  V^/V^  »  1,  so  that  becomes  negli¬ 
gible  while  remains  finite.  In  this  case,  the  Hamil¬ 
tonian  (2)  reduces  to  the  Ising  model  H  =  ^^•  /->A/Ctz(X- <Xy, 
with  A^z  =  ^/(0.5/f/||  —  1/t/j).  Though  this  Hamil¬ 
tonian  has  quite  trivial  properties  for  its  ground  states 
and  excitations,  its  realization  in  optical  lattices  can  be 
very  useful  for  a  dynamical  generation  of  the  so-called 
cluster  states  [11].  Specifically,  we  note  that  this  Ising 
interaction  can  be  easily  turned  on  and  off  by  adjusting 
the  potential  depth  V  f.  If  we  first  prepare  each  atom  in 
the  lattice  into  the  superposition  state  (||)  +  ||))/V2,  and 
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then  lower  Ur  for  a  time  T  with  A^ZT  =  tt/A  mod  tt/2, 
the final state is a cluster state with its dimension determined
by the dimension of the lattice [11]. The
d-dimensional ($d \geq 2$) cluster states have important applications
for implementation of scalable quantum computation
with neutral atoms: After their preparation, one
can implement universal quantum computation simply
via a series of single-bit measurements [11].
The  use  of  such  cluster  states  can  significantly  alleviate 
the  stringent  requirements  on  separate  addressing  of  the 
neighboring  atoms  in  the  proposed  quantum  computation 
schemes [17,19]. Although the present approach is somewhat
slower than the cold collision scheme [17], it allows
one  to  take  advantage  of  its  simplicity  and  the  reduced 
dephasing  rate. 
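
The entangling step described above can be checked numerically for the smallest case of two neighboring spins. The sketch below is only an illustration: it assumes the evolution operator has the form exp(-i λT σ^z σ^z) with ħ = 1, and the function names are hypothetical. It shows that a phase λT = π/4 turns the product state of two superpositions into a maximally entangled pair (the two-site building block of a cluster state, up to single-spin rotations), whereas λT = π/2 returns a product state.

```python
import numpy as np

# Pauli Z and the two-spin Ising coupling sigma_z (x) sigma_z.
Z = np.diag([1.0, -1.0])
ZZ = np.kron(Z, Z)

def ising_evolve(state, phase):
    """Apply exp(-i * phase * ZZ); phase stands for lambda_z * T (assumed hbar = 1)."""
    # ZZ is diagonal, so the exponential acts entry-wise on its diagonal.
    return np.exp(-1j * phase * np.diag(ZZ)) * state

def entanglement_entropy_bits(state):
    """Von Neumann entropy (in bits) of one spin of a two-spin pure state."""
    schmidt = np.linalg.svd(state.reshape(2, 2), compute_uv=False)
    probs = schmidt ** 2
    probs = probs[probs > 1e-12]          # drop numerically zero Schmidt weights
    return float(-np.sum(probs * np.log2(probs)))

plus = np.array([1.0, 1.0]) / np.sqrt(2)  # (|up> + |down>)/sqrt(2)
initial = np.kron(plus, plus)             # both atoms prepared in the superposition

entropy_quarter_pi = entanglement_entropy_bits(ising_evolve(initial, np.pi / 4))
entropy_half_pi = entanglement_entropy_bits(ising_evolve(initial, np.pi / 2))
print(entropy_quarter_pi, entropy_half_pi)
```

The entropy of one bit at λT = π/4 is the signature of maximal two-spin entanglement; on a full lattice the same pairwise phase acts on every link simultaneously.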

As our second example, we explore the rich phase
diagram of the Hamiltonian (2) in the presence of magnetic
fields. For simplicity, we assume a bipartite lattice
and identical spin exchange constants for all links.
Figure 1(b) shows the mean-field phase diagram for bosonic
particles in the presence of a longitudinal field $h_z$.
This diagram was obtained by comparing energies of the
variational wave functions of two kinds: (i) the Neel state
in the z direction, $\langle\vec{\sigma}_i\rangle = (-1)^i\vec{e}_z$; (ii) the canted phase with
ferromagnetic order in the xy plane and finite polarization
in the z direction, $\langle\vec{\sigma}_i\rangle = \vec{e}_x\cos\theta + \vec{e}_z\sin\theta$. Here, $\theta$ is a
variational parameter, and $\vec{e}_{z,x}$ are unit vectors in the
directions z, x. The transition between the z-Neel and the
canted phases is a first order spin-flop transition [20] at
$h_z = Z(\lambda_z^2 - \lambda_\perp^2)^{1/2}$ (Z is the number of neighboring
atoms of each lattice site), and the transition between the
xy-Neel phase and the z polarized phase is a second order
transition of the XY type at $h_z = Z(\lambda_z + \lambda_\perp)$. In the
absence of a transverse magnetic field, one can use the
existence of two sublattices to change the sign of $\lambda_\perp$
using the transformation $\sigma_i^{x,y} \to (-1)^i\sigma_i^{x,y}$. Hence, fermionic
atoms in the longitudinal magnetic field have the
same phase diagram as shown in Fig. 1(b), except that
their canted phase has transverse Neel rather than transverse
ferromagnetic order. Results of a similar mean-field
analysis of the Hamiltonian (2) for bosonic atoms with a
transverse magnetic field $h_x$ are shown in Fig. 1(c). For
fermionic atoms in a transverse field, there is one more
phase with a Neel order along the y direction.
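
The two critical fields quoted above are simple enough to evaluate directly. The following sketch only transcribes those two expressions; the function names are hypothetical, and it assumes positive coupling magnitudes with λ_z ≥ λ_⊥ so that the square root is real.

```python
import math

def spin_flop_field(z, lam_z, lam_perp):
    """First-order spin-flop field h_z = Z * sqrt(lam_z^2 - lam_perp^2)."""
    if abs(lam_z) < abs(lam_perp):
        raise ValueError("spin-flop field requires |lam_z| >= |lam_perp|")
    return z * math.sqrt(lam_z ** 2 - lam_perp ** 2)

def polarization_field(z, lam_z, lam_perp):
    """Second-order (XY-type) transition field h_z = Z * (lam_z + lam_perp)."""
    return z * (lam_z + lam_perp)

# Example: square lattice (Z = 4) with couplings in arbitrary energy units.
h_spin_flop = spin_flop_field(4, 1.0, 0.6)
h_polarized = polarization_field(4, 1.0, 0.6)
print(h_spin_flop, h_polarized)
```

For any admissible pair of couplings the spin-flop field lies below the polarization field, so the canted phase occupies the window between them.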

The third example involves the anisotropic spin model
on a 2D hexagonal lattice proposed recently by Kitaev
[12]. In this model, interactions between nearest neighbors
are of the XX, the YY, or the ZZ type, depending on
the direction of the link:

$$H = \sum_{\nu=x,y,z}\ \sum_{\langle i,j\rangle \in D_\nu} \lambda_\nu \sigma_i^\nu \sigma_j^\nu, \qquad (4)$$

where the symbol $\langle i,j\rangle \in D_\nu$ denotes the neighboring
atoms in the $D_\nu$ ($\nu = x, y, z$) direction [see Fig. 2(b)].

FIG. 2 (color online). (a) The contours with the three potentials
in the form of Eq. (5). The minima are at the centers of the
triangles when $\varphi_0 = \pi/2$. (b) The illustration of the model
Hamiltonian (4). (c) The schematic atomic level structure and
the laser configuration to induce spin-dependent tunneling.

To implement this model using ultracold atoms, we first
raise the potential barriers along the vertical direction Z
in the three-dimensional optical lattice so that the tunneling
and the spin exchange interactions in the Z direction
are completely suppressed [1,9]. In this way, we get an
effective 2D configuration with a set of independent
identical 2D lattices in the X-Y plane. We then apply in
the X-Y plane three trapping potentials (identical for all
spin states) of the forms

$$V_j(x,y) = V_0\sin^2[k_\parallel(x\cos\theta_j + y\sin\theta_j) + \varphi_0], \qquad (5)$$

where j = 1, 2, 3, and $\theta_1 = \pi/6$, $\theta_2 = \pi/2$, $\theta_3 = -\pi/6$.
Each of the potentials is formed by two blue-detuned
interfering traveling laser beams above the X-Y plane
with an angle $\varphi_\parallel = 2\arcsin(1/\sqrt{3})$, so that the wave
vector $k_\parallel$ projected onto the X-Y plane has the value
$k_\parallel = k\sin(\varphi_\parallel/2) = k/\sqrt{3}$. We choose the relative phase
$\varphi_0 = \pi/2$ in Eq. (5) so that the maxima of the three potentials
overlap. In this case, the atoms are trapped at the minima
of the potentials, which form a hexagonal lattice as
shown by the centers of the triangles in Fig. 2(a). We
assume that there is one atom per lattice site, and
this atom interacts with the three neighbors in different
directions through virtual tunneling with a potential
barrier given by $V_0/4$.
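
The barrier height V_0/4 can be checked by summing the three potentials of Eq. (5) numerically. The sketch below is an illustration in units V_0 = 1 and k_∥ = 1; the function name and the two site coordinates are assumptions worked out for these units, and the midpoint between neighboring minima is taken, by symmetry, as the saddle point of the tunneling path.

```python
import numpy as np

THETAS = (np.pi / 6, np.pi / 2, -np.pi / 6)   # theta_1, theta_2, theta_3 of Eq. (5)

def lattice_potential(x, y, v0=1.0, k_par=1.0, phi0=np.pi / 2):
    """Sum of the three standing-wave potentials V_j(x, y) of Eq. (5)."""
    total = 0.0
    for theta in THETAS:
        phase = k_par * (x * np.cos(theta) + y * np.sin(theta)) + phi0
        total = total + v0 * np.sin(phase) ** 2
    return total

# Coarse grid search for the global minimum of the summed potential.
xs = np.linspace(0.0, 2 * np.pi, 601)
ys = np.linspace(0.0, 2 * np.pi, 601)
X, Y = np.meshgrid(xs, ys)
v_min_grid = float(lattice_potential(X, Y).min())

# Two neighboring minima (derived by hand for these units) and their midpoint.
site_a = np.array([np.pi / np.sqrt(3), np.pi / 3])
site_b = np.array([2 * np.pi / np.sqrt(3), 2 * np.pi / 3])
midpoint = 0.5 * (site_a + site_b)

v_site = float(lattice_potential(*site_a))
v_saddle = float(lattice_potential(*midpoint))
barrier = v_saddle - v_site                    # in units of V0
print(v_min_grid, v_site, v_saddle, barrier)
```

In these units each minimum sits at 3V_0/4 and the midpoint at V_0, so the difference reproduces the quoted barrier of V_0/4.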

In such a hexagonal lattice, we wish to engineer anisotropic
Heisenberg exchange for each tunneling direction
(denoted by $D_x$, $D_y$, and $D_z$, respectively). To this end, we
apply three blue-detuned standing-wave laser beams in
the X-Y plane along these tunneling directions:

$$V_{\nu\sigma}(x,y) = V_{\nu\sigma}\sin^2[k(x\cos\theta'_\nu + y\sin\theta'_\nu)], \qquad (6)$$

where $\nu = x, y, z$, and $\theta'_x = -\pi/3$, $\theta'_y = \pi$, $\theta'_z = \pi/3$. In
general, we require that the potential depth $V_{\nu\sigma}$ depend on
the atomic spin state as

$$V_{\nu\sigma} = V_{\nu+}\,|+\rangle_\nu\langle +| + V_{\nu-}\,|-\rangle_\nu\langle -|, \quad (\nu = x, y, z), \qquad (7)$$

where $|+\rangle_\nu$ ($|-\rangle_\nu$) is the eigenstate of the corresponding
Pauli operator $\sigma^\nu$ with the eigenvalue +1 (−1).

The spin-dependent potentials in the form of Eqs. (6)
and (7) can be realized, for instance, with the specific
atomic level configuration shown in Fig. 2(c). Here, $\sigma = \uparrow, \downarrow$
denote two hyperfine levels of the atom with different
energies. They are coupled to the common excited level
$|e\rangle$ with a blue detuning $\Delta$, respectively, through the laser
beams $L_1$ and $L_2$ with frequencies matching the corresponding
transitions. The quantization axis is chosen to
be perpendicular to the X-Y plane, and the phase-locked
laser beams $L_1$ and $L_2$ are both polarized along this
direction. In the tunneling direction $D_z$, we only apply
the $L_1$ laser beam, which induces the potential $V_{z\sigma}(x,y)$
with the desired form (7) of its depth $V_{z\sigma}$. In the tunneling
direction $D_x$ or $D_y$, we apply both lasers $L_1$ and $L_2$,
but with different relative phases, which realize the desired
potential depth $V_{x\sigma}$ or $V_{y\sigma}$ of the form (7) in the
corresponding direction.

The potentials (6) and (7) do not have influence on the
equilibrium positions of the atoms, but they change the
potential barrier between the neighboring atoms in the $D_\nu$
direction from $V_0/4$ to $V'_{\nu\sigma} = V_0/4 + V_{\nu\sigma}$. The parameters
$V_{\nu+}$ and $V_{\nu-}$ in Eq. (7) can be tuned by varying the
laser intensity of $L_1$ and $L_2$ in the $D_\nu$ direction, and one
can easily find their appropriate values so that, in the $D_\nu$
direction, the atom can virtually tunnel with a rate $t_{+\nu}$
only when it is in the eigenstate $|+\rangle_\nu$. Hence, it follows
from Eqs. (2) and (3) that the effective Hamiltonian for
our system is given by Eq. (4) with $\lambda_\nu \approx -t_{+\nu}^2/(2U)$ for
bosonic atoms with $U_{\uparrow\downarrow} \approx U_\uparrow \approx U_\downarrow \approx U$. After compensating
effective magnetic fields, we find exactly the model
described by the Hamiltonian (4).

The model (4) is exactly solvable due to the existence of
many conserved operators, and it has been shown to
possess very interesting properties [12]. In particular, it
supports both Abelian and non-Abelian anyonic excitations,
depending on the ratios between the three parameters
$\lambda_\nu$. In the region where $2\lambda_\nu/(\lambda_x + \lambda_y + \lambda_z) < 1$
($\nu = x, y, z$), the excitation spectrum of the Hamiltonian (4)
is gapless, but a gap opens when perturbation
magnetic fields are applied in the x, y, z directions, and
the excitations in this case obey non-Abelian fractional
statistics. Out of this region, except at some trivial points
with $\lambda_x\lambda_y\lambda_z = 0$, the Hamiltonian (4) has gapped excitations
which satisfy Abelian fractional statistics. Thus,
the present implementation opens up an exciting possibility
to realize experimentally the exotic Abelian and
non-Abelian anyons.
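
The criterion above amounts to a triangle-type inequality on the three couplings, which the following sketch makes explicit. The function name and return labels are hypothetical; taking absolute values of the couplings is an assumption (it leaves the ratios unchanged when all three couplings share a sign), and couplings exactly on the boundary are treated as outside the region, following the strict inequality in the text.

```python
def kitaev_phase(lam_x, lam_y, lam_z):
    """Classify couplings by the criterion 2*lam_nu/(lam_x + lam_y + lam_z) < 1."""
    lams = (abs(lam_x), abs(lam_y), abs(lam_z))   # assumption: compare magnitudes
    if min(lams) == 0.0:
        return "trivial"                          # lam_x * lam_y * lam_z = 0
    total = sum(lams)
    if all(2.0 * lam < total for lam in lams):
        return "gapless"                          # non-Abelian once a field opens a gap
    return "gapped"                               # Abelian fractional statistics

print(kitaev_phase(1.0, 1.0, 1.0))
print(kitaev_phase(1.0, 1.0, 3.0))
print(kitaev_phase(0.0, 1.0, 1.0))
```

Since each λ_ν is set by a tunneling rate t_{+ν} along one lattice direction, tuning the three laser intensities independently would in principle move the system between these regions.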

Now we briefly discuss the techniques for probing the
resulting states. To detect the quantum phase transitions
in the XXZ model with magnetic fields or in Kitaev’s
model, one can probe the excitation spectra via Bragg or
Raman spectroscopy. In general, different quantum
phases are characterized by specific dispersion relations
(for instance, in Kitaev’s model, one phase is gapped
while the other is gapless). If the two probe light beams
have the momentum and frequency differences which
match those of the dispersion relation in the corresponding
phase, a resonant absorption of the probe light could
be observed [21]. The direct observation of the fractional
statistics in Kitaev’s model can be based on atomic interferometry
with a procedure similar to that described in
Ref. [13]: One generates a pair of anyonic excitations with
a spin-dependent laser focused on two lattice sites, rotates
one anyon around the other, and then brings them together
for fusion which gives different results depending on the
anyonic statistics. Other methods for detecting complex
quantum states of atoms have also been developed recently
[22].

In summary, we have described a general technique to
engineer many-body spin Hamiltonians.

Near-field scanning optical microscopy was used to observe high-resolution images of confined
modes and photonic bands of planar photonic crystal (PPC) nanocavities fabricated in active
InGaAsP material. We have observed the smallest optical cavity modes, which are intentionally
produced by fractional edge dislocation high-Q cavity designs. The size of the detected mode was
roughly  four  by  three  lattice  spacings.  We  have  also  observed  extended  dielectric-band  modes  of  the 
bulk  PPC  surrounding  the  nanocavity  by  geometrically  altering  the  bands  in  emission  range  and 
eliminating localized modes out of the emission range. © 2003 American Institute of Physics.

Photonic crystals,1 and planar photonic crystals (PPC) in
particular, have recently attracted attention as a promising
platform for realization of compact and efficient
nanocavities2,3 and lasers.4-8 In most of these reports,
microphotoluminescence was used to characterize the structures.
On the other hand, near-field scanning optical microscopy
(NSOM) has recently been used as a powerful alternative
method to analyze local electromagnetic field distributions in
fabricated nanophotonic structures.9-17 Gerard et al.14
reported NSOM measurements of active PPC with spectral
emission in the infrared region and Shin et al.15 reported the
near-field investigation of the lasing modes in PPC lasers.
However, in both studies, uncoated optical fibers were used
and, therefore, it was not possible to obtain near-field images
of the field distribution inside the cavity with high spatial
resolution. Also, both studies analyzed large hexagonal cavities
(empty lattice cavities), which support many modes with
rather large mode volumes. In this letter, we report the results
of NSOM of very small PPC cavities based on fractional
edge dislocations.2,3,8 The metal-coated fiber tip enables us to
distinguish between localized cavity modes and propagating
far-field modes, and to obtain more precise mode profiles
when the tip probes into holes of PPCs. The best resolution
in our system is as small as 50 nm.

The experimental setup for the NSOM measurement is
shown in Fig. 1. We used a twin-SNOM system manufactured
by OMICRON, capable of both illumination mode
(I mode) and collection mode (C mode) measurements. For
the I mode, continuous-wave light from a He-Ne laser (633
nm) was used to pump the structures through the optical fiber
tip. The photoluminescence (PL) signal was detected through
the reflective objective lens. The excitation power of the
He-Ne laser, before coupling into the optical fiber, was 1
mW. For the C mode, a 780 nm diode laser, operated with 20
ns long pulses of 2 μs periodicity, was focused on the sample
through the refractive objective lens and the optical fiber tip
was used to detect the PL signal. The excitation pump beam
spot was several tens of μm². In both modes, the PL signals
were distinguished from the reflected light of the excitation
laser by using the colored glass filter with a cutoff wavelength
of 850 nm, and detected with a high-sensitivity (fW)
InGaAs photodetector. The optical fiber tip was metal coated
and the aperture size at the end of the tip was 150 nm. The
fiber tip is positioned at the dither piezodevice and shear-force
detector in order to control the distance between the tip
and the sample surface (~10 nm) and to obtain a topographic
image of the sample.

The PPC nanocavities described in this work are very
similar to those used to realize low-threshold lasers described
in our previous publication.8 The most important difference
from the cavities analyzed in Ref. 8 is the omission of the central
defect hole, and therefore Q factors are limited to about
1000, according to our theoretical predictions. Optical emission
in our structures was obtained from four 9 nm thick
InGaAsP quantum well (QW) layers (Eg = 1.55 μm) separated
by 20 nm thick InGaAsP barrier layers (Eg = 1.22 μm),
and placed in the center of a 330 nm thick InGaAsP slab,
grown on the top of an InP substrate. The emission from the QWs
was found to be in the range of 1300 to 1650 nm. The PPC
structure is a free-standing membrane patterned with a triangular
lattice of holes, within which a cavity based on a fractional
edge dislocation is defined. Details of the fabrication procedures
are presented in Ref. 8.

Figure 2(a) shows the topographic image of the entire
structure obtained by the shear-force microscopy. The PPC
structure in the center of the membrane as well as the unpatterned
edges of the membrane can be seen. In Fig. 2(b), we
show the near-field optical image of the same sample obtained
using NSOM I mode. A bright region corresponds to
the light localized in the Fabry-Perot (FP) resonator formed
between the edge of the membrane and the edge of the PPC
region in Fig. 2(b). We have confirmed this by conducting
micro-PL measurements on this ≈2.1 μm long resonator,
and FP resonances were detected in the spectrum when the
structure was pumped close to the edge [Fig. 2(c)].

FIG. 1. Experimental setup for the NSOM measurement
with illumination mode and collection mode.


Close inspection of Fig. 2(b) also reveals the presence of
light localized at the center of the PPC structure. In order to
investigate the origin of this signal, we have increased the
spatial resolution of our NSOM and analyzed only the central
region of the structure, where a nanocavity based on
fractional edge dislocations exists. In Fig. 3(a), we show the
scanning electron microscope (SEM) image of this central
region of the device shown in Fig. 2. The periodicity of the
lattice is a = 420 nm, the radius of the holes is r = 135 nm, and
the thickness of the slab is d = 330 nm. This PPC geometry, with
r/a = 0.32 and d/a = 0.79, has a band gap in the frequency
range approximately a/λ ∈ (0.25, 0.33), that is in the wavelength
range λ ∈ (1270 nm, 1680 nm). The elongation in this
cavity was p/a = 15%.2,8 The cavity based on fractional edge
dislocations supports two prominent resonances. These resonances
correspond to doubly degenerate dipole modes of the
simple single defect cavity,4 and the introduced asymmetry
due to the dislocation lifts the degeneracy. The two dipole
modes are linearly but orthogonally polarized, and the mode
positioned at a longer wavelength can have very high Q.
Figure 3(b) shows the results of micro-PL analysis of this
structure. Two peaks positioned around λ = 1450 nm correspond
to the localized dipole modes, whereas peaks above
λ = 1600 nm correspond to dielectric band modes.
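
The normalized quantities quoted above follow directly from the stated dimensions, and the conversion can be sketched as below. The variable and function names are hypothetical; the only inputs are the values a = 420 nm, r = 135 nm, d = 330 nm and the approximate gap edges a/λ = 0.25 and 0.33 given in the text.

```python
a_nm = 420.0                      # lattice periodicity a
r_nm = 135.0                      # hole radius r
d_nm = 330.0                      # slab thickness d
gap_low, gap_high = 0.25, 0.33    # approximate band-gap edges in units of a/lambda

def normalized_to_wavelength_nm(a_over_lambda, lattice_nm):
    """Convert a normalized frequency a/lambda into a free-space wavelength."""
    return lattice_nm / a_over_lambda

def in_band_gap(wavelength_nm, lattice_nm=a_nm):
    """True if the wavelength falls inside the approximate band gap."""
    return gap_low < lattice_nm / wavelength_nm < gap_high

r_over_a = r_nm / a_nm
d_over_a = d_nm / a_nm
lambda_short_nm = normalized_to_wavelength_nm(gap_high, a_nm)   # upper gap edge
lambda_long_nm = normalized_to_wavelength_nm(gap_low, a_nm)     # lower gap edge
print(round(r_over_a, 2), round(d_over_a, 2), lambda_short_nm, lambda_long_nm)
```

The dipole-mode peaks near 1450 nm fall well inside this window, which is consistent with their assignment to localized defect modes.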

Figures 3(c) and 3(d) show an enlargement of the central
region from Figs. 2(a) and 2(b), respectively. The bright spot
seen in the NSOM-PL image [Fig. 3(d)] is located at the center
of the PPC structure, matching the position of the defect
cavity, as shown in the topographic image [Fig. 3(c)]. We have
attributed this optical signal to two dipole eigenmodes of our
cavity. The size of the bright spot is roughly 4a by 3a.
This small spot size is an indication of a small mode volume,
as expected from the localized cavity modes. The NSOM
images should be a superposition of two orthogonal dipole
modes. These NSOM-PL results are obtained by using I
mode and, therefore, the size of the bright spot is expected to
contain information on the diffusion properties of free carriers
excited by the pump beam, in addition to the information
on the optical mode size. Therefore, we believe that by using
I mode, we actually overestimate the size of the optical mode
due to the free-carrier diffusion. However, the small size of
the detected light signal is a clear indication of the presence of
well-confined modes in the center of our cavity. In Figs. 3(e)
and 3(f), we show NSOM images without and with a cutoff
colored glass filter, respectively, this time for a cavity with
elongation p/a = 20%. When the filter was not used [Fig.
3(e)], an interference pattern was observed. We have attributed
this to the reflection of the pump He-Ne laser light
from the sample surface. On the other hand, when a filter
was used, a very different result was obtained and, clearly,
localized defect modes could be observed. Figure 4 shows
the polarization dependence of the optical modes detected in
a cavity with p/a = 25%, obtained using both micro-PL and
NSOM approaches. As predicted by theory, two dipole
modes are linearly polarized, with orthogonal polarizations.
The intensities of the NSOM images with 0°, 60°, and 90°
(±15°) polarizers are in very good agreement with spectra
obtained using microphotoluminescence. Therefore, we conclude
that optical modes detected with NSOM correspond to
confined cavity modes.

FIG. 2. (a) Topographic image of the whole structure by the shear-force
microscopy. (b) Near-field PL image. (c) FP resonances detected using
microphotoluminescence when structure was pumped close to the edge. Inset
shows pump spot on the structure.

FIG. 3. (Color) (a) SEM image of the tested cavity with p/a = 15%. (b)
Resonances detected using micro-PL setup. Confined modes (around 1450
nm) and extended dielectric band modes (above 1600 nm) can be seen. (c)
Topographic and (d) near-field optical image. Detected optical field
corresponds to the confined cavity modes. (e) Near-field image of the cavity
with p/a = 20% obtained without and (f) with a colored glass filter.

FIG. 4. (a) Micro-PL spectra and (b) near-field images taken from a sample
with a parameter p/a = 25% with and without a polarization plate. The
polarization parameter θ is the clockwise angle from the x axis defined in Fig. 3(a).

Figure 5 shows the topographic images and the corresponding
NSOM images for different PPC structures, this
time with slightly larger holes and with the central defect
hole present. Therefore, we expect dipole eigenmodes of this
cavity to be moved toward shorter wavelengths. In the tested
sample, they were completely pushed outside the emission
range of QWs, and no localized cavity modes could be observed
in the micro-PL experiment. However, we could observe
several peaks that correspond to the dielectric bands of
bulk PPC [Figs. 5(c) and 5(f)]. The NSOM images obtained
using C mode and shown in Figs. 5(a) and 5(b) show light
localization in the dielectric region between the PPC holes.
Also, positions of the air holes appear dark in this NSOM
image. Therefore, we have attributed this result to the existence
of the dielectric band modes in the emission region of
QW material. Similar results were predicted by theory.10
Also, spectra obtained using microphotoluminescence [Fig.
5(c)] show the presence of dielectric bands at λ = 1555 nm.
We would like to point out that dielectric band modes, also
observed in Fig. 3(b), were not detected using NSOM I mode
[Figs. 3(d) and 3(f)]. We believe that this is due to localized
pumping in the case of I mode (as opposed to the large
pumping spot in C mode), so that dielectric band modes that
extend over large areas in PPC could not be excited. We have
tested other geometries in PPC structures, and the NSOM
results are shown in Figs. 5(d) and 5(e). This time, strong light
intensity is observed at the positions of the air holes.
This phenomenon could be attributed to the presence of air-band
modes in the emission region of the QW material.
However, we were not able to observe air-band modes in our
micro-PL experiments, and only dielectric band modes were
observed [Fig. 5(f)]. At present, experiments are underway to
explain this phenomenon.

FIG. 5. (Color) (a) Topographic image and (b) near-field PL image (C
mode) of PPC structure with bigger holes (p/a = 25%). (d) Topographic and
(e) near-field image of another structure. (c) and (f) show spectra obtained
by microphotoluminescence from structures shown in (a) and (d), respectively.

In  conclusion,  we  have  observed  localized  defect  modes 
of  the  compact  PPC  nanocavities.  In  addition  to  localized 
cavity  modes,  we  have  experimentally  observed  dielectric 
band modes in bulk PPCs. We conclude that NSOM is a
powerful tool for the investigation of local profiles of confined
modes in nanocavities.

Abstract

We study the ±J random-plaquette Z2 gauge model (RPGM) in three spatial dimensions, a
three-dimensional analog of the two-dimensional ±J random-bond Ising model (RBIM). The
model is a pure Z2 gauge theory in which randomly chosen plaquettes (occurring with concentration
p) have couplings with the “wrong sign” so that magnetic flux is energetically favored
on these plaquettes. Excitations of the model are one-dimensional “flux tubes” that terminate
at “magnetic monopoles” located inside lattice cubes that contain an odd number of wrong-sign
plaquettes. Electric confinement can be driven by thermal fluctuations of the flux tubes,
by the quenched background of magnetic monopoles, or by a combination of the two. Like the
RBIM, the RPGM has enhanced symmetry along a “Nishimori line” in the p-T plane (where
T is the temperature). The critical concentration pc of wrong-sign plaquettes at the
confinement-Higgs phase transition along the Nishimori line can be identified with the accuracy
threshold for robust storage of quantum information using topological error-correcting codes:
if qubit phase errors, qubit bit-flip errors, and errors in the measurement of local check operators
all occur at rates below pc, then encoded quantum information can be protected perfectly
from damage in the limit of a large code block. Through Monte-Carlo simulations, we measure
pc0, the critical concentration along the T = 0 axis (a lower bound on pc), finding
pc0 = .0293 ± .0002. We also measure the critical concentration of antiferromagnetic bonds
in the two-dimensional RBIM on the T = 0 axis, finding pc0 = .1031 ± .0001. Our value of
pc0 is incompatible with the value of pc = .1093 ± .0002 found in earlier numerical studies
of the RBIM, in disagreement with the conjecture that the phase boundary of the RBIM is
vertical (parallel to the T axis) below the Nishimori line. The model can be generalized to a
rank-r antisymmetric tensor field in d dimensions, in the presence of quenched disorder.

©  2002  Elsevier  Science  (USA).  All  rights  reserved. 


1.  Introduction 

Spin systems with quenched randomness have been extensively studied, leading to
valuable insights that apply to (for example) spin glass materials, quantum Hall systems,
associative memory, error-correcting codes, and combinatorial optimization
problems [1-3]. Gauge systems with quenched randomness, which have received
comparatively little attention, will be studied in this paper.

The gauge models we consider are intrinsically interesting because they provide
another class of simple systems with disorder-driven phase transitions. But our investigation
of these models has a more specific motivation connected to the theory of
quantum error correction.

In practice, coherent quantum states rapidly decohere due to uncontrollable interactions
with the environment. But in principle, if the quantum information is cleverly
encoded [6,7], it can be stabilized and preserved using fault-tolerant recovery protocols
[8]. Kitaev [4,5] proposed a particularly promising class of quantum error-correcting
codes (surface codes) in which the quantum processing required for error
recovery involves only local interactions among qubits arranged in a two-dimensional
block, and the protected information is associated with global topological
properties of the quantum state of the block. If the error rate is small, then the topological
properties of the code block are well protected, and error recovery succeeds
with a probability that rapidly approaches one in the limit of a large code block. But
if the error rate is above a critical value, the accuracy threshold, then quantum error
correction is ineffective.

In [9], a precise connection was established between the accuracy threshold
achievable with surface codes and the confinement-Higgs transition in a three-dimensional
Z2 lattice gauge model with quenched randomness. The model has
two parameters: the temperature T and the concentration p of “wrong-sign” plaquettes.
On wrong-sign plaquettes (which are analogous to antiferromagnetic
bonds in a spin system) it is energetically favorable for the Z2 magnetic flux to
be nontrivial. In the mapping between quantum error recovery and the gauge model,
the quenched fluctuations correspond to the actual errors introduced by the environment;
these impose sites of frustration, magnetic monopoles, corresponding to
an “error syndrome” that can be measured by executing a suitable quantum circuit.
Thermally fluctuating magnetic flux tubes, which terminate at magnetic
monopoles, correspond to the ensemble of possible error patterns that could generate
a particular error syndrome. (The temperature T is tied to the strength p of
the quenched fluctuations through a Nishimori relation [10].) When the disorder is
weak and the temperature low (corresponding to a small error rate), the system is
in a magnetically ordered Higgs phase. In the surface code, magnetic order means
that all likely error patterns that might have produced the observed error syndrome
are topologically equivalent, so that the topologically encoded information
resists damage. But at a critical value pc of the disorder strength (and a temperature
determined by Nishimori’s relation), magnetic flux tubes condense and the system
enters the magnetically disordered confinement phase. In the surface code,
magnetic disorder means that the error syndrome cannot point to likely error patterns
belonging to a unique topological class; therefore topologically encoded information
is vulnerable to damage.
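
The relation between wrong-sign plaquettes and magnetic monopoles can be illustrated with a short sampling sketch. It is not the simulation code used in this work: the array layout, the function names, and the periodic boundary conditions are assumptions made for illustration. Each plaquette is flagged as wrong-sign independently with probability p, and a cube hosts a monopole when an odd number of its six faces are wrong-sign.

```python
import numpy as np

def sample_wrong_sign_plaquettes(size, p, rng):
    """Boolean array wrong[d, x, y, z]: plaquette with normal direction d at site (x, y, z)."""
    return rng.random((3, size, size, size)) < p

def monopole_cubes(wrong):
    """Boolean array marking cubes that contain an odd number of wrong-sign faces."""
    parity = np.zeros(wrong.shape[1:], dtype=int)
    for d in range(3):
        # Each cube has two faces normal to d: one at the site itself and one
        # displaced by a lattice unit along d (periodic boundaries assumed).
        parity += wrong[d].astype(int)
        parity += np.roll(wrong[d], -1, axis=d).astype(int)
    return parity % 2 == 1

rng = np.random.default_rng(0)
wrong = sample_wrong_sign_plaquettes(size=8, p=0.03, rng=rng)
monopoles = monopole_cubes(wrong)
print(int(wrong.sum()), int(monopoles.sum()))
```

Because every plaquette is shared by exactly two cubes, monopoles on a periodic lattice always appear in even numbers, which is the lattice counterpart of flux tubes terminating on monopole pairs.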

Although the code block is two dimensional, the gauge model is three dimensional
because one dimension represents time. Time enters the analysis of recovery
because measurements of the error syndrome might themselves be faulty; therefore
measurements must be repeated on many successive time slices if they are
to provide reliable information about the errors that have afflicted the code block.
If qubit phase errors, qubit bit-flip errors, and errors in the measurement of local
check operators all occur at rates below pc, then encoded quantum information
can be protected perfectly from damage in the limit of a large code block. As
we consider more and more reliable measurements of the syndrome, the corresponding
three-dimensional gauge model becomes more and more anisotropic, reducing
in the limit of perfect measurements to the two-dimensional random-bond
Ising model.

The numerical value $p_c$ of the accuracy threshold is of considerable interest, since
it characterizes how reliably quantum hardware must perform in order for a
quantum memory to be robust. In the three-dimensional $Z_2$ gauge model, $p_c$ is the value of
the wrong-sign plaquette concentration where the confinement-Higgs boundary
crosses the Nishimori line in the p-T plane. A lower bound on $p_c$ is provided by
the critical concentration $p_{c0}$ on the T = 0 axis. In [9], an analytic argument
established that $p_{c0} \ge .0114$. In this paper we report on a numerical calculation that finds
$p_{c0} = .0293 \pm .0002$.

In the case where the error syndrome can be measured flawlessly, the critical error
rate is given by the critical antiferromagnetic bond concentration on the Nishimori
line of the two-dimensional random-bond Ising model (RBIM). Numerical
calculations performed earlier by other authors [11,12] have established $p_c = .1093 \pm .0002$.
According to a conjecture of Nishimori [13] and Kitatani [14], this value of $p_c$ should
agree with the critical bond concentration $p_{c0}$ of the 2D RBIM on the T = 0 axis.
The same reasoning that motivates this conjecture for the RBIM indicates that
$p_c = p_{c0}$ for the 3D random-plaquette gauge model (RPGM) as well. However, we
have calculated $p_{c0}$ in the 2D RBIM numerically, finding $p_{c0} = .1031 \pm .0001$. Our
value of $p_{c0}$ agrees with an earlier numerical calculation by Kawashima and Rieger
[23], but disagrees with the conjecture that $p_c = p_{c0}$.

In  Section  2  we  describe  in  more  detail  the  properties  of  the  2D  RBIM  and  the  3D 
RPGM,  emphasizing  the  importance  of  the  Nishimori  line  and  the  inferences  that 
can  be  made  about  the  behavior  of  order  parameters  on  this  line.  Section  3  reviews 
the connection between the models and error recovery using surface codes. Our
numerical results for $p_{c0}$ and for the critical exponent $\nu_0$ at the T = 0 critical point are
presented in Section 4. Section 5 summarizes our conclusions.

2.  Models


2.1.  Random-bond  Ising  model 


The two-dimensional ±J random-bond Ising model (RBIM) has a much studied
multicritical point at which both the temperature and the strength of quenched
disorder are nonzero. This model is an Ising spin system on a square lattice, with a
variable $S_i = \pm 1$ residing at each lattice site i. Its Hamiltonian is

$$ H = -J \sum_{\langle ij \rangle} \tau_{ij} S_i S_j, \tag{1} $$

where J is the strength of the coupling between neighboring spins, and $\tau_{ij} = \pm 1$ is a
quenched random variable. (That is, $\tau_{ij}$ depends on what sample of the system is
selected from a certain ensemble, but is not subject to thermal fluctuations.) The $\tau_{ij}$’s
are independently and identically distributed, with the antiferromagnetic choice
$\tau_{ij} = -1$ (favoring that neighboring spins antialign) occurring with probability p, and
the ferromagnetic choice $\tau_{ij} = +1$ (favoring that neighboring spins align) occurring
with probability 1 - p. We refer to p as the concentration of antiferromagnetic
bonds, or simply the bond concentration.

The free energy F of the model at inverse temperature $\beta$, averaged over samples, is

$$ [\beta F(K,\tau)]_{K_p} = -\sum_{\tau} P(K_p,\tau) \ln Z(K,\tau), \tag{2} $$

where

$$ Z(K,\tau) = \sum_{S} \exp\Big( K \sum_{\langle ij \rangle} \tau_{ij} S_i S_j \Big) \tag{3} $$

is the partition function for sample $\tau$ (with $K = \beta J$), and

$$ P(K_p,\tau) = (2\cosh K_p)^{-N_B} \exp\Big( K_p \sum_{\langle ij \rangle} \tau_{ij} \Big) \tag{4} $$

is the probability of the sample $\tau$; here

$$ \frac{p}{1-p} = e^{-2K_p} \tag{5} $$

and $N_B$ is the number of bonds.
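As a quick consistency check on Eqs. (4) and (5), the sketch below confirms numerically that the exponential form of the sample probability reproduces the independent-bond weight (p per antiferromagnetic bond, 1 - p per ferromagnetic bond). The function names and the example sample are illustrative assumptions, not part of the original text.

```python
import math
from typing import Sequence


def nishimori_coupling(p: float) -> float:
    """K_p defined by Eq. (5): p / (1 - p) = exp(-2 K_p)."""
    return 0.5 * math.log((1.0 - p) / p)


def sample_probability(tau: Sequence[int], p: float) -> float:
    """Eq. (4): P(K_p, tau) = (2 cosh K_p)^(-N_B) * exp(K_p * sum of tau)."""
    k_p = nishimori_coupling(p)
    n_bonds = len(tau)
    return (2.0 * math.cosh(k_p)) ** (-n_bonds) * math.exp(k_p * sum(tau))


def independent_bond_probability(tau: Sequence[int], p: float) -> float:
    """Product form: factor p for each tau = -1, factor 1 - p for each tau = +1."""
    n_minus = sum(1 for t in tau if t == -1)
    return p ** n_minus * (1.0 - p) ** (len(tau) - n_minus)


# Hypothetical sample with two antiferromagnetic bonds out of six.
tau_example = [1, -1, 1, 1, -1, 1]
print(sample_probability(tau_example, 0.11))
print(independent_bond_probability(tau_example, 0.11))
```

The two printed values agree because exp(K_p) / (2 cosh K_p) = 1 - p and exp(-K_p) / (2 cosh K_p) = p when K_p satisfies Eq. (5).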

The partition function $Z(K,\tau)$ is invariant under the change of variable

$$ S_i \to \sigma_i S_i, \quad \tau_{ij} \to \sigma_i \sigma_j \tau_{ij}, \tag{6} $$

where $\sigma_i = \pm 1$. Thus $\tau$ itself has no invariant meaning: samples $\tau$ and $\tau'$ that differ
by the change of variable have equivalent physics. The only invariant property of $\tau$
that cannot be modified by such a change of variable is the distribution of frustration
that $\tau$ determines. If an odd number of the bonds contained in a specified plaquette
have $\tau = -1$ then that plaquette is frustrated: an Ising vortex resides at the


C.  Wang  et  al.  /  Annals  of  Physics  303  (2003)  31-58 



plaquette. For purposes of visualization, we sometimes will find it convenient to
define the spin model on the dual lattice so that the spins reside on plaquettes and the
Ising vortices reside on sites. Then excited bonds with $\tau_{ij} S_i S_j = -1$ form
one-dimensional chains that terminate at the frustrated sites.
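The gauge invariance of the frustration can be illustrated with a small sketch. It assumes an L x L lattice with periodic boundary conditions and a hypothetical storage layout `tau[d][x][y]` for the bond leaving site (x, y) in direction d; it locates the frustrated plaquettes and checks that the change of variable Eq. (6) leaves them in place.

```python
import random


def random_bonds(L, p, rng):
    """tau[d][x][y] = -1 with probability p, else +1 (d = 0: +x bond, d = 1: +y bond)."""
    return [[[-1 if rng.random() < p else 1 for _ in range(L)]
             for _ in range(L)] for _ in range(2)]


def vortices(tau):
    """Plaquettes (x, y) whose four bonds multiply to -1 (odd number of tau = -1)."""
    L = len(tau[0])
    frustrated = set()
    for x in range(L):
        for y in range(L):
            prod = (tau[0][x][y] * tau[0][x][(y + 1) % L]
                    * tau[1][x][y] * tau[1][(x + 1) % L][y])
            if prod == -1:
                frustrated.add((x, y))
    return frustrated


def gauge_transform(tau, sigma):
    """Eq. (6): tau_ij -> sigma_i * sigma_j * tau_ij for site variables sigma = +/-1."""
    L = len(sigma)
    new = [[[0] * L for _ in range(L)] for _ in range(2)]
    for x in range(L):
        for y in range(L):
            new[0][x][y] = sigma[x][y] * sigma[(x + 1) % L][y] * tau[0][x][y]
            new[1][x][y] = sigma[x][y] * sigma[x][(y + 1) % L] * tau[1][x][y]
    return new


rng = random.Random(0)
L = 6
tau = random_bonds(L, 0.3, rng)
sigma = [[rng.choice((1, -1)) for _ in range(L)] for _ in range(L)]
print(sorted(vortices(tau)))
print(vortices(tau) == vortices(gauge_transform(tau, sigma)))
```

Each site variable multiplies two bonds of every plaquette it touches, so the plaquette products are unchanged. On the torus every bond lies in exactly two plaquettes, so the vortices always come in an even number.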

Changes of variable define an equivalence relation on the set of $2^{N_B}$ $\tau$
configurations: there are $2^{N_S}$ elements of each equivalence class (the number of changes of
variable, where $N_S$ is the number of sites) and there are $2^{N_S}$ classes (the number of
configurations for the Ising vortices; note that $N_B = 2N_S$ for a square lattice on
the 2-torus, and that the number of plaquettes is $N_P = N_S$). Denote a distribution
of Ising vortices, or equivalently an equivalence class of $\tau$’s, by $\eta$. The probability
$P(K_p, \eta)$ of $\eta$ is found by summing $P(K_p, \tau)$ over all the representatives of the class;
hence

$$ (2\cosh K_p)^{N_B} P(K_p,\eta) = (2\cosh K_p)^{N_B} \sum_{\tau \in \eta} P(K_p,\tau) = \sum_{\sigma} \exp\Big( K_p \sum_{\langle ij \rangle} \tau_{ij} \sigma_i \sigma_j \Big) = Z(K_p,\eta). \tag{7} $$

Apart from a normalization factor, the probability of a specified distribution of
frustration is given by the partition function of the model, but with $K = \beta J$ replaced
by $K_p$.

In this model, we can define an order parameter that distinguishes the
ferromagnetic and paramagnetic phases. Let

$$ m^2(K,K_p) = \lim_{|i-j| \to \infty} \big[ \langle S_i S_j \rangle_K \big]_{K_p}, \tag{8} $$

where $\langle \cdot \rangle_K$ denotes the average over thermal fluctuations, $[\cdot]_{K_p}$ denotes the average
over samples, and |i - j| denotes the distance between site i and site j; then in the
ferromagnetic phase $m^2 > 0$ and in the paramagnetic phase $m^2 = 0$. But the
two-point correlation function $\langle S_i S_j \rangle_K$ is not invariant under the change of variable Eq.
(6), so how should $m^2$ be interpreted?

Following [9], denote by E the set of bonds that are antiferromagnetic ($\tau_{ij} = -1$),
denote by E' the set of excited bonds with $\tau_{ij} S_i S_j = -1$, and denote by D the set of
bonds with $S_i S_j = -1$ (those such that the neighboring spins antialign); see Fig. 1.
Then D = E + E' is the disjoint union of E and E' (containing bonds in E or E'
but not both). Furthermore, D contains an even number of the bonds that meet at
any given site; that is, D is a cycle, a chain of bonds that has no boundary points.
The quantity $S_i S_j$ just measures whether a line connecting i and j crosses D an even
number ($S_i S_j = 1$) or an odd number ($S_i S_j = -1$) of times.
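The relation D = E + E' and the cycle property of D can be checked directly on a random configuration. The sketch below works on the original lattice of an L x L torus (an assumption made for simplicity), where "an even number of bonds of D meet at each dual site" becomes "each plaquette contains an even number of antialigned bonds". All names are hypothetical.

```python
import random


def bonds(L):
    """All bonds (d, x, y): d = 0 joins (x, y)-(x+1, y); d = 1 joins (x, y)-(x, y+1)."""
    return [(d, x, y) for d in range(2) for x in range(L) for y in range(L)]


def endpoints(bond, L):
    d, x, y = bond
    return (x, y), (((x + 1) % L, y) if d == 0 else (x, (y + 1) % L))


def plaquette_bonds(x, y, L):
    """The four bonds on the boundary of the plaquette with lower-left corner (x, y)."""
    return {(0, x, y), (0, x, (y + 1) % L), (1, x, y), (1, (x + 1) % L, y)}


rng = random.Random(1)
L, p = 6, 0.2
S = {(x, y): rng.choice((1, -1)) for x in range(L) for y in range(L)}
E = {b for b in bonds(L) if rng.random() < p}  # antiferromagnetic bonds, tau = -1


def spin_product(bond):
    i, j = endpoints(bond, L)
    return S[i] * S[j]


# D: antialigned neighbors; E_prime: excited bonds with tau * S_i * S_j = -1.
D = {b for b in bonds(L) if spin_product(b) == -1}
E_prime = {b for b in bonds(L) if (-1 if b in E else 1) * spin_product(b) == -1}

print(E_prime == E ^ D)  # symmetric difference: bonds in E or D but not both
print(all(len(plaquette_bonds(x, y, L) & D) % 2 == 0
          for x in range(L) for y in range(L)))
```

Because D has no boundary, E and E' = E + D contain the same parity of bonds in every plaquette, which is the statement that both chains end on the same Ising vortices.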

Now D consists of disjoint “domain walls” that form closed loops. If loops that
are arbitrarily large appear with appreciable weight in the thermal ensemble, then the
two-point function $\langle S_i S_j \rangle_K$ decays like $\exp(-|i-j|/\xi)$, since fluctuations far from the
sites i and j contribute to the correlation function. Thus the spins are disordered
and $m^2 = 0$. But if large loops occur only with negligible probability, then only
fluctuations localized near i and j contribute significantly; the spin correlation persists at
large distances and $m^2 > 0$. Thus, the order parameter probes whether the chain E' of
excited bonds can wander far from the chain E of antiferromagnetic bonds; that is,
whether D = E + E' contains arbitrarily large connected closed loops, for typical
thermal fluctuations and typical samples.

Fig. 1. The chain E of antiferromagnetic bonds (darkly shaded) and the chain E' of excited bonds (lightly
shaded), in the two-dimensional random-bond Ising model. Ising spins taking values in {±1} reside on
plaquettes; Ising vortices (boundary points of E) are located on the sites marked by filled circles. The
bonds of E' comprise a one-dimensional defect that connects the vortices. The cycle D = E + E' encloses
a domain of spins with the value -1.

Nishimori [10] observed that the model has enhanced symmetry properties along a
line in the p-T plane (the Nishimori line) defined by $K = K_p$ or $\exp(-2\beta J) = p/(1-p)$.
In this case, the antiferromagnetic bond chain E and the excited bond
chain E' are generated by sampling the same probability distribution, subject to
the constraint that both chains have the same boundary points. This feature is
preserved by renormalization group transformations, so that renormalization group
flow preserves Nishimori’s line [15]. The Nishimori point $(p_c, T_c)$, where the Nishimori
line crosses the ferromagnetic-paramagnetic phase boundary, is a renormalization
group fixed point, the model’s multicritical point.

When the temperature T is above the Nishimori line, excited bonds have a higher
concentration than antiferromagnetic bonds, so we may say that thermal
fluctuations play a more important role than quenched randomness in disordering the spins.
When T is below the Nishimori line, antiferromagnetic bonds are more common
than excited bonds, and the quenched randomness dominates over thermal
fluctuations. Right on the Nishimori line, the effects of thermal fluctuations and quenched
randomness are in balance [16].
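For orientation, the Nishimori line can be evaluated numerically. The sketch below solves exp(-2J/T) = p/(1 - p) for T, assuming units with Boltzmann's constant set to one; the function name is hypothetical.

```python
import math


def nishimori_temperature(p: float, J: float = 1.0) -> float:
    """Temperature on the Nishimori line exp(-2J/T) = p/(1-p), for 0 < p < 1/2."""
    if not 0.0 < p < 0.5:
        raise ValueError("the Nishimori line has positive T only for 0 < p < 1/2")
    return 2.0 * J / math.log((1.0 - p) / p)


# Temperature on the line at the bond concentration p_c = .1093 quoted in the text.
print(nishimori_temperature(0.1093))
```

The temperature on the line rises monotonically with p, so a point (p, T) lies above the line when T exceeds `nishimori_temperature(p)` and below it otherwise.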

By invoking the change of variable Eq. (6), various properties of the model on the
Nishimori line can be derived [3,10]. For example, the internal energy density (or
“average bond”) can be computed analytically,

$$ \big[ \tau_{ij} \langle S_i S_j \rangle_{K_p} \big]_{K_p} = 1 - 2p, \tag{9} $$

where i and j are neighboring sites; averaged over thermal fluctuations and samples,
the concentration of excited bonds is p as one would expect (and the internal energy
has no singularity at the Nishimori point). Furthermore, after averaging over
disorder, the (2m - 1)st power of the k-spin correlator has the same value as the
(2m)th power, for any positive integer m:

$$ \big[ \langle S_{i_1} S_{i_2} \cdots S_{i_k} \rangle_{K_p}^{2m-1} \big]_{K_p} = \big[ \langle S_{i_1} S_{i_2} \cdots S_{i_k} \rangle_{K_p}^{2m} \big]_{K_p}. \tag{10} $$

It follows in particular that the spin-glass order parameter

$$ q^2(K_p,K_p) \equiv \lim_{|i-j| \to \infty} \big[ \langle S_i S_j \rangle_{K_p}^2 \big]_{K_p} \tag{11} $$

coincides with the ferromagnetic order parameter $m^2(K_p,K_p)$ along the Nishimori
line, reflecting the property that thermal fluctuations and quenched randomness have
equal strength on this line.
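The identities (9) and (10) rest only on the change of variable Eq. (6), so they hold on any finite graph and can be checked by exact enumeration. The sketch below uses a single four-site plaquette with open boundaries (a deliberately tiny stand-in for the full lattice) and the m = 1 case of Eq. (10); the graph, the names, and the value p = 0.2 are illustrative assumptions.

```python
import itertools
import math

BONDS = [(0, 1), (1, 2), (2, 3), (3, 0)]  # one square plaquette
N_SITES = 4


def thermal_average(observable, tau, K):
    """<observable>_K for the sample tau, by summing over all 2^4 spin states."""
    num = den = 0.0
    for S in itertools.product((1, -1), repeat=N_SITES):
        weight = math.exp(K * sum(t * S[i] * S[j] for t, (i, j) in zip(tau, BONDS)))
        num += observable(S) * weight
        den += weight
    return num / den


def sample_average(f, p):
    """[f]_{K_p}: average of f(tau) over independent bonds with P(tau = -1) = p."""
    total = 0.0
    for tau in itertools.product((1, -1), repeat=len(BONDS)):
        n_minus = tau.count(-1)
        total += p ** n_minus * (1 - p) ** (len(BONDS) - n_minus) * f(tau)
    return total


p = 0.2
K_p = 0.5 * math.log((1 - p) / p)  # Nishimori line: K = K_p

# Eq. (9): average bond for the neighboring sites 0 and 1.
average_bond = sample_average(
    lambda tau: tau[0] * thermal_average(lambda S: S[0] * S[1], tau, K_p), p)

# Eq. (10) with m = 1 for the two-spin correlator of the opposite corners 0 and 2.
first_power = sample_average(
    lambda tau: thermal_average(lambda S: S[0] * S[2], tau, K_p), p)
second_power = sample_average(
    lambda tau: thermal_average(lambda S: S[0] * S[2], tau, K_p) ** 2, p)

print(average_bond, 1 - 2 * p)
print(first_power, second_power)
```

Moving K away from K_p in the calls above breaks both equalities, which is one way to see that they are special to the Nishimori line.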

Comparing Eqs. (2) and (7), we see that for $K = K_p$ the free energy of the model
coincides with the Shannon entropy of the distribution of vortices, apart from a
nonsingular additive term:

$$ [\beta F(K_p,\tau)]_{K_p} = -\sum_{\eta} P(K_p,\eta) \ln P(K_p,\eta) - N_B \ln(2\cosh K_p). \tag{12} $$


Since the free energy is singular at the Nishimori point $(p_c, T_c)$, it follows that the
Shannon entropy of frustration (which does not depend on the temperature) is
singular at $p = p_c$ [13]. This property led Nishimori to suggest that the boundary
between the ferromagnetic and paramagnetic phases occurs at $p = p_c$ at sufficiently
low temperature, and thus that the phase boundary is vertical in the p-T plane below
the Nishimori point, as in Fig. 2a. Later, Kitatani [14] arrived at the same conclusion
by a different route, showing that the verticality of the phase boundary follows from
an “appropriate condition.” These arguments, while suggestive, do not seem
compelling to us. There is no known rigorous justification for Kitatani’s condition, and
no rigorous reason why the ferro-para boundary must coincide with the singularity
in the entropy of frustration, even at low temperature. Hence we regard the issue of
the verticality of the phase boundary as still unsettled. Nishimori did argue
convincingly that the phase boundary cannot extend to any value of p greater than $p_c$
[10], and Le Doussal and Harris argued that the tangent to the phase boundary is
vertical at the Nishimori point [15], but these results leave open the possibility of a
“reentrant” boundary that slopes back toward the T axis below the Nishimori point,
as in Fig. 2b.

Fig. 2. The phase diagram of the random-bond Ising model (shown schematically), with the temperature
T on the vertical axis and the concentration p of antiferromagnetic bonds on the horizontal axis. The solid
line is the boundary between the ferromagnetic (ordered) phase and the paramagnetic (disordered) phase.
The dotted line is the Nishimori line $e^{-2\beta J} = p/(1-p)$, which crosses the phase boundary at the Nishimori
point (the heavy black dot). It has been conjectured, but not proven, that the phase boundary from the
Nishimori point to the p axis is vertical, as in (a). The numerics reported in Section 4 favor the reentrant
phase diagram shown in (b). The deviation of the critical bond concentration $p_c$ on the Nishimori line from
the critical bond concentration $p_{c0}$ on the T = 0 axis has been exaggerated in (b) for clarity.

The RBIM can also be defined in d dimensions. Much of the above discussion still
applies, with minor modifications. Consider, for example, d = 3. On the dual lattice,
spins reside on lattice cubes and the bonds become plaquettes shared by two
neighboring cubes. The set of antiferromagnetic bonds E is dual to a two-dimensional
surface, and its boundary $\partial E$ consists of one-dimensional loops, the Ising strings where
the spins are frustrated. The set of excited bonds E' is dual to another
two-dimensional surface that is also bounded by the Ising strings: $\partial E' = \partial E$. The spins are
disordered if the two-cycle D = E + E' contains arbitrarily large closed connected
surfaces for typical thermal fluctuations and typical samples. Similarly, in d
dimensions, frustration is localized on closed surfaces of dimension d - 2, and the
thermally fluctuating defects are dimension-(d - 1) surfaces that terminate on the
locus of frustration. For any d, the model has enhanced symmetry along the
Nishimori line $K = K_p$, where antiferromagnetic bonds and excited bonds are drawn from
the same probability distribution.

In  the  absence  of  quenched  disorder,  the  two-dimensional  Ising  model  is  mapped 
to  itself  by  a  duality  relation  that  can  be  used  to  infer  properties  of  the  critical  theory. 
When  quenched  disorder  is  introduced,  however,  the  two-dimensional  random  bond 
Ising  model  is  mapped  under  duality  to  a  model  with  Boltzmann  weights  that  are  not 
positive  definite  [17],  so  that  it  is  not  easy  to  draw  any  firm  conclusions. 

2.2.  Random-plaquette  gauge  model

In the d-dimensional RBIM, excitations have codimension 1 and terminate on a
closed surface of codimension 2. The $Z_2$ random-plaquette gauge model (RPGM) is
defined in an entirely analogous manner, except that the excitations are
codimension-2 objects (“magnetic flux tubes”) that terminate on codimension-3 objects
(“magnetic monopoles”).

More concretely, the variables of the model are $U_\ell = \pm 1$ residing on each link $\ell$ of
the lattice, and the Hamiltonian is

$$ H = -J \sum_{P} \tau_P U_P, \tag{13} $$

where J is the coupling strength,

$$ U_P = \prod_{\ell \in P} U_\ell \tag{14} $$

is the $Z_2$-valued “magnetic flux” through the plaquette P, and $\tau_P = \pm 1$ is a quenched
random variable. The $\tau_P$’s are independently and identically distributed, with the
“wrong-sign” choice $\tau_P = -1$ (favoring nontrivial flux) occurring with probability p,
and the “right-sign” choice $\tau_P = +1$ (favoring trivial flux) occurring with probability
1 - p. We refer to p as the concentration of wrong-sign plaquettes, or simply the
plaquette concentration.

The free energy F of the model at inverse temperature $\beta$, averaged over samples, is

$$ [\beta F(K,\tau)]_{K_p} = -\sum_{\tau} P(K_p,\tau) \ln Z(K,\tau), \tag{15} $$

where

$$ Z(K,\tau) = \sum_{U} \exp\Big( K \sum_{P} \tau_P U_P \Big) \tag{16} $$

is the partition function for sample $\tau$ (with $K = \beta J$), and

$$ P(K_p,\tau) = (2\cosh K_p)^{-N_P} \exp\Big( K_p \sum_{P} \tau_P \Big) \tag{17} $$

is the probability of the sample $\tau$; here

$$ \frac{p}{1-p} = e^{-2K_p} \tag{18} $$

and $N_P$ is the number of plaquettes.

The partition function $Z(K,\tau)$ is invariant under the change of variable

$$ U_\ell \to \sigma_\ell U_\ell, \quad \tau_P \to \sigma_P \tau_P, \tag{19} $$

where $\sigma_\ell = \pm 1$ and $\sigma_P = \prod_{\ell \in P} \sigma_\ell$. While $\tau$ itself has no invariant meaning, $\tau$
determines a distribution of frustration that cannot be altered by a change of variable. If
an odd number of the plaquettes contained in a specified cube have $\tau = -1$ then that
cube is frustrated: a $Z_2$ magnetic monopole resides in the cube. For purposes of
visualization, we will sometimes find it convenient to define the gauge model on the
dual lattice so that the gauge variables $U_\ell$ reside on plaquettes, the magnetic flux on
bonds, and the magnetic monopoles on sites. Then excited bonds with $\tau_P U_P = -1$
form one-dimensional strings that terminate at monopoles.
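The monopole content of a sample can be computed in the same spirit as the Ising vortices. The sketch below assumes an L x L x L periodic cubic lattice and a hypothetical labeling in which plaquette (n, r) has normal direction n and base site r; it finds the frustrated cubes and checks that the change of variable Eq. (19) does not move them.

```python
import itertools
import random


def shift(r, d, L):
    """Site one step from r in direction d, with periodic boundaries."""
    r = list(r)
    r[d] = (r[d] + 1) % L
    return tuple(r)


def sites(L):
    return list(itertools.product(range(L), repeat=3))


def monopoles(tau, L):
    """Cubes (labeled by base site) whose six faces have an odd number of tau = -1."""
    found = set()
    for r in sites(L):
        prod = 1
        for n in range(3):
            prod *= tau[(n, r)] * tau[(n, shift(r, n, L))]
        if prod == -1:
            found.add(r)
    return found


def gauge_transform(tau, sigma, L):
    """Eq. (19): tau_P -> sigma_P * tau_P, sigma_P = product of sigma on the 4 links of P."""
    new = {}
    for (n, r), t in tau.items():
        a, b = [d for d in range(3) if d != n]
        sigma_P = (sigma[(a, r)] * sigma[(b, r)]
                   * sigma[(a, shift(r, b, L))] * sigma[(b, shift(r, a, L))])
        new[(n, r)] = sigma_P * t
    return new


rng = random.Random(2)
L, p = 3, 0.25
tau = {(n, r): (-1 if rng.random() < p else 1) for n in range(3) for r in sites(L)}
sigma = {(d, r): rng.choice((1, -1)) for d in range(3) for r in sites(L)}

print(sorted(monopoles(tau, L)))
print(monopoles(tau, L) == monopoles(gauge_transform(tau, sigma, L), L))
```

Every link of a cube borders exactly two of its faces, so each link variable cancels in the product over the six faces; this is why the monopole positions are the gauge-invariant content of the sample.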

We can define an order parameter that distinguishes the Higgs (magnetically
ordered) phase and the confinement (magnetically disordered) phase. Consider the
Wilson loop operator associated with a closed loop C (on the original lattice, not the
dual lattice):

$$ W(C) = \prod_{\ell \in C} U_\ell \tag{20} $$

and consider the behavior of the expectation value of W(C), averaged over thermal
fluctuations and over samples. In the Higgs phase, for a large loop C the Wilson loop
operator decays exponentially with the perimeter of the loop,

$$ \big[ \langle W(C) \rangle_K \big]_{K_p} \sim \exp[ -\mu \cdot \mathrm{Perimeter}(C) ], \tag{21} $$

while in the confinement phase it decays exponentially with the area of the minimal
surface bounded by C,

$$ \big[ \langle W(C) \rangle_K \big]_{K_p} \sim \exp[ -\kappa \cdot \mathrm{Area}(C) ]. \tag{22} $$

The interpretation is that on the dual lattice the wrong-sign plaquettes correspond to
a one-chain E bounded by magnetic monopoles, and the excited plaquettes
correspond to another one-chain E' with the same boundary; hence D = E + E' is a cycle,
a sum of disjoint closed “flux tubes.” If arbitrarily large loops of flux appear with
appreciable weight in the thermal ensemble for typical samples, then magnetic
fluctuations spanning the entire surface bounded by C contribute to the expectation
value of W(C), and the area-law decay results. If large flux tubes are suppressed, then
only the fluctuations localized near the loop are important, and the perimeter-law
decay applies. Thus, the Wilson-loop order parameter probes whether the chain E' of
excited plaquettes can wander far from the chain E of wrong-sign plaquettes; that is,
whether D = E + E' contains arbitrarily large connected closed loops.

The one-chain E bounded by the magnetic monopoles is analogous to a $Z_2$-valued
Dirac string: the change of variable Eq. (19) deforms the strings while leaving
invariant the boundary of E (the locations of the monopoles). One should notice that
these strings are not invisible to our Wilson loop operator; that is, W(C) is not
invariant under the change of variable. It is possible to modify W(C) to obtain an invariant
object [18], but that would not be appropriate if the order parameter is supposed to
probe the extent to which the thermally fluctuating defects (the excited plaquettes)
depart from the quenched disorder (the Dirac strings).

Like the RBIM, the RPGM has enhanced symmetry on the Nishimori line
$K = K_p$, and the change of variable Eq. (19) may be invoked to derive properties
of the model on this line. The Nishimori line is preserved by renormalization group
flow, and crosses the confinement-Higgs boundary at a multicritical point $(p_c, T_c)$.
The internal energy (or average plaquette) can be computed on this line,

$$ \big[ \tau_P \langle U_P \rangle_{K_p} \big]_{K_p} = 1 - 2p \tag{23} $$

(excited plaquettes have concentration p) and for each positive integer m, the
(2m - 1)st power of W(C) and the (2m)th power are equal when averaged over
samples,

$$ \big[ \langle W(C) \rangle_{K_p}^{2m-1} \big]_{K_p} = \big[ \langle W(C) \rangle_{K_p}^{2m} \big]_{K_p}. \tag{24} $$

Furthermore, the free energy on the Nishimori line, apart from a nonsingular
additive term, is equal to the Shannon entropy of the distribution of magnetic
monopoles, so that the latter is singular at $p = p_c$.

In principle, the RPGM could have what might be called a “gauge glass” phase.
In this phase, the Wilson loop, averaged over thermal and quenched fluctuations, has
area-law behavior,

$$ \big[ \langle W(C) \rangle_K \big]_{K_p} \sim \exp[ -\kappa \cdot \mathrm{Area}(C) ], \tag{25} $$

but the square of its thermal expectation value, averaged over quenched fluctuations,
has perimeter-law behavior:

$$ \big[ \langle W(C) \rangle_K^2 \big]_{K_p} \sim \exp[ -\mu \cdot \mathrm{Perimeter}(C) ]. \tag{26} $$

This means that thermal fluctuations do not induce magnetic disorder for each
typical sample, but that the magnetic fluctuations are large when we compare one
sample to another. However, the identity Eq. (24) shows that, along the Nishimori
line $K = K_p$, there can be no gauge glass phase. Since $\langle W(C) \rangle$ and $\langle W(C) \rangle^2$ have the
same average over samples, both order parameters cross from perimeter to area law
at the same point on the Nishimori line. (Nishimori [10] used the analogous
argument to show that there is no spin glass behavior in the RBIM along the Nishimori
line.)

Another useful identity that can be derived using the change of variable is

$$ \big[ \langle W(C) \rangle_K \big]_{K_p} = \big[ \langle W(C) \rangle_K \langle W(C) \rangle_{K_p} \big]_{K_p}. \tag{27} $$

Since $-1 \le W(C) \le 1$, it follows that

$$ \Big| \big[ \langle W(C) \rangle_K \big]_{K_p} \Big| \le \Big[ \big| \langle W(C) \rangle_{K_p} \big| \Big]_{K_p}. \tag{28} $$

From this inequality, we may infer that if the point on the Nishimori line with
concentration p is in the confinement phase, then the point (p, T) is in the
confinement phase for any temperature T. (Again, the reasoning is exactly analogous to
Nishimori’s argument for the RBIM [10].) Since there is no gauge-glass behavior on
the Nishimori line, if a point on the Nishimori line is in the confinement phase, then
$\langle W(C) \rangle_{K_p}$ already exhibits area-law decay before averaging over samples. Therefore
the right-hand side of Eq. (28) shows area-law decay and so must the left-hand side.
We conclude that, as for the RBIM, the phase boundary of the RPGM below the
Nishimori line must either be vertical (parallel to the T axis as in Fig. 2a) or reentrant
(tipping back toward the T axis as T decreases as in Fig. 2b).
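Equations (27) and (28) follow from the change of variable alone, so they can be verified exactly on a very small system. The sketch below uses a single cube (12 links, 6 plaquettes) as a stand-in for the lattice and takes C to be the boundary of one face, so that W(C) is the flux through that face; the geometry, names, and parameter values are illustrative assumptions.

```python
import itertools
import math
from collections import Counter

VERTS = list(itertools.product((0, 1), repeat=3))
LINKS = [(a, b) for a in VERTS for b in VERTS
         if a < b and sum(x != y for x, y in zip(a, b)) == 1]   # 12 edges
FACES = [[k for k, (a, b) in enumerate(LINKS) if a[axis] == side and b[axis] == side]
         for axis in range(3) for side in (0, 1)]               # 6 faces, 4 links each

# Plaquette fluxes U_P for all 2^12 link configurations, grouped with multiplicity.
FLUX_COUNTS = Counter(
    tuple(math.prod(U[k] for k in face) for face in FACES)
    for U in itertools.product((1, -1), repeat=len(LINKS)))


def wilson_average(tau, K):
    """Thermal average of W(C) = U_P for the face FACES[0], in sample tau."""
    num = den = 0.0
    for flux, mult in FLUX_COUNTS.items():
        weight = mult * math.exp(K * sum(t * u for t, u in zip(tau, flux)))
        num += flux[0] * weight
        den += weight
    return num / den


def check_identities(p, K):
    """Return the two sides of Eq. (27) and the right-hand side of Eq. (28)."""
    K_p = 0.5 * math.log((1 - p) / p)
    lhs = rhs = bound = 0.0
    for tau in itertools.product((1, -1), repeat=len(FACES)):
        n_minus = tau.count(-1)
        prob = p ** n_minus * (1 - p) ** (len(FACES) - n_minus)
        w_K, w_Kp = wilson_average(tau, K), wilson_average(tau, K_p)
        lhs += prob * w_K
        rhs += prob * w_K * w_Kp
        bound += prob * abs(w_Kp)
    return lhs, rhs, bound


lhs, rhs, bound = check_identities(p=0.1, K=0.7)
print(lhs, rhs, bound)
```

The coupling K is arbitrary here while the samples are weighted with K_p, which is the feature that lets the inequality constrain the whole vertical line at fixed p.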


2.3.  Further  generalizations 

In  d  dimensions,  the  magnetic  order  parameter  of  the  RBIM  explores  whether  a 
thermally  excited  chain  E'  of  codimension  1  (domain  walls)  deviates  far  from  a 
quenched  codimension- 1  chain  E  (antiferromagnetic  bonds),  where  both  E  and  E ' 
have  the  same  codimension-2  boundary  (the  Ising  vortices).  Similarly,  the  RPGM 
can  be  defined  in  d  dimensions,  and  its  Wilson-loop  order  parameter  probes  whether 
a  thermally  excited  chain  E'  of  codimension  2  (flux  tubes)  deviates  far  from  a 
quenched  codimension-2  chain  E  (Dirac  strings),  where  both  E  and  E'  have  the  same 
codimension-3  boundary  (the  magnetic  monopoles). 

This concept admits further generalizations. In d dimensions, we may consider
the lattice theory of a “rank-r antisymmetric tensor field” with quenched disorder.
Then variables reside on the r-cells of the lattice, and the Hamiltonian is expressed
in terms of a field strength defined on (r + 1)-cells. The sign of the coupling is
determined by a random variable $\tau$ taking values ±1 on (r + 1)-cells; cells with the
“wrong sign” have concentration p. On the dual lattice, $\tau$ corresponds to a
codimension-(r + 1) chain E, and the excited cells to a codimension-(r + 1) chain E', where E
and E' are bounded by the same codimension-(r + 2) chain of frustration. An
operator analogous to the Wilson loop can be defined that detects the flux through the
dimension-(r + 1) “surface” bounded by a dimension-r “loop” C; this operator
serves as the order parameter for an order-disorder transition. The order parameter
probes whether the thermally fluctuating codimension-(r + 1) chain E' deviates far
from the quenched codimension-(r + 1) chain E.

For any d and r, the model has enhanced symmetry on the Nishimori line, where
$K = K_p$. Properties of the model on this line can be derived, analogous to those
discussed above for the RBIM and the RPGM.


3.  Accuracy  threshold  for  quantum  memory 

How  the  RBIM  and  RPGM  relate  to  the  performance  of  topological  quantum 
memory  was  extensively  discussed  in  [9].  Here  we  will  just  briefly  reprise  the  main 
ideas. 


3.1.  Toric  codes 


Quantum  information  can  be  protected  from  decoherence  and  other  possible 
sources  of  error  using  quantum  error-correcting  codes  [6,7]  and  fault-tolerant  error 
recovery  protocols  [8].  Topological  codes  (or  surface  codes)  are  designed  so  that  the 
quantum  processing  needed  to  control  errors  has  especially  nice  locality  properties 
[4,5]. 

Specifically, consider a system of $2L^2$ qubits (a qubit is a two-level quantum system),
with each qubit residing at a link of an L x L square lattice drawn on a
two-dimensional torus. (Other examples of surface codes, including codes defined on planar
surfaces, are discussed in [9].) This system can encode two qubits of quantum information
that are well protected from noise if the error rate is low enough. The two-qubit code
space, where the protected information resides, can be characterized as a simultaneous
eigenspace with eigenvalue one of a set of check operators (or “stabilizer generators”);
check operators are associated with each site and with each elementary cell (or
“plaquette”) of the lattice, as shown in Fig. 3. We use the notation

$$ I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad X = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \tag{29} $$

$$ Y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad Z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{30} $$

for the 2 x 2 identity and Pauli matrices. The check operator at site i acts nontrivially
on the four links that meet at the site; it is the tensor product

$$ X_i = \bigotimes_{\ell \ni i} X_\ell \tag{31} $$

acting on those four qubits, times the identity acting on the remaining qubits. The
check operator at plaquette P acts nontrivially on the four links contained in the
plaquette, as the tensor product

$$ Z_P = \bigotimes_{\ell \in P} Z_\ell, \tag{32} $$

times the identity on the remaining links.

Fig. 3. The check operators of the toric code. Each plaquette operator is a tensor product of Z’s acting on
the four links contained in the plaquette. Each site operator is a tensor product of X’s acting on the four
links that meet at the site.

The check operators can be simultaneously diagonalized, and the toric code is the
space in which each check operator acts trivially. Because of the periodic boundary
conditions on the torus, the product of all $L^2$ site operators or all $L^2$ plaquette
operators is the identity: each link operator occurs twice in the product, and
$X^2 = Z^2 = I$. There are no further relations among these operators; therefore, there
are $2 \cdot (L^2 - 1)$ independent check operators constraining the $2L^2$ qubits in the code
block, and hence two encoded qubits (the code subspace is four dimensional).
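The counting argument can be confirmed by linear algebra over GF(2). In the sketch below each check operator is represented by the bitmask of links on which it acts; the link indexing `link(d, x, y, L)` and the function names are hypothetical conventions chosen for this example.

```python
def link(d, x, y, L):
    """Index of the link leaving site (x, y) in direction d (0 = +x, 1 = +y)."""
    return d * L * L + (x % L) * L + (y % L)


def site_check(x, y, L):
    """Bitmask of the four links meeting at site (x, y): support of X_i."""
    return (1 << link(0, x, y, L) | 1 << link(0, x - 1, y, L)
            | 1 << link(1, x, y, L) | 1 << link(1, x, y - 1, L))


def plaquette_check(x, y, L):
    """Bitmask of the four links around plaquette (x, y): support of Z_P."""
    return (1 << link(0, x, y, L) | 1 << link(0, x, y + 1, L)
            | 1 << link(1, x, y, L) | 1 << link(1, x + 1, y, L))


def gf2_rank(rows):
    """Rank over GF(2) of a list of bitmask rows, by Gaussian elimination."""
    rows = list(rows)
    rank = 0
    while rows:
        pivot = rows.pop()
        if pivot:
            rank += 1
            low = pivot & -pivot  # lowest set bit of the pivot row
            rows = [r ^ pivot if r & low else r for r in rows]
    return rank


def encoded_qubits(L):
    sites = [site_check(x, y, L) for x in range(L) for y in range(L)]
    plaquettes = [plaquette_check(x, y, L) for x in range(L) for y in range(L)]
    # X-type and Z-type checks commute when their supports share an even number of links.
    assert all(bin(s & q).count("1") % 2 == 0 for s in sites for q in plaquettes)
    return 2 * L * L - gf2_rank(sites) - gf2_rank(plaquettes)


for L in (2, 3, 4, 5):
    print(L, encoded_qubits(L))
```

Each family of $L^2$ checks has rank $L^2 - 1$ because the only relation is that the product of all of them is the identity, which leaves two encoded qubits for every lattice size tried.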

Since the check operators are spatially local, it is useful to think of a site or
plaquette where the check operator has the eigenvalue -1 as the position of a localized
excitation or “defect.” The code space contains states with no defects, which are
analogous to vacuum states of a $Z_2$ gauge theory on the torus: $Z_P = 1$ means that
there is no $Z_2$ magnetic flux at plaquette P, and $X_i = 1$ means that there is no $Z_2$
electric charge at site i. (This $Z_2$ gauge theory on the two-torus should not be confused
with the three-dimensional $Z_2$ gauge theory, described in Section 3.3, that arises in
the analysis of the efficacy of error correction!)

Consider applying to the vacuum state an operator that is a tensor product of Pauli matrices $\{Z_\ell\}$ acting on each of a set of links forming a connected chain $\{\ell\}$. This operator creates isolated site defects at the ends of the chain. Similarly, if we apply to the vacuum a tensor product of Pauli matrices $\{X_\ell\}$ acting on a connected chain of the dual lattice, isolated plaquette defects are created at the ends of the chain, as in Fig. 4. A general “Pauli operator” (tensor product of Pauli matrices) can be expressed as a tensor product of $X_\ell$'s and $I_\ell$'s times a tensor product of $Z_\ell$'s and $I_\ell$'s; this operator preserves the code space if and only if the links acted upon by Z's comprise a cycle of the lattice (a chain with no boundary) and the links acted upon by X's comprise a cycle of the dual lattice.

Fig. 4. Site defects and plaquette defects in the toric code. Applied to the code space, Z's acting on a connected chain of links (darkly shaded) create site defects (electric charges) at the ends of the chain. Similarly, X's applied to a connected chain of dual links (lightly shaded) create plaquette defects (magnetic fluxes) at the ends of the chain.


Cycles on the torus are of two types. A homologically trivial cycle is the boundary of a region that can be tiled by plaquettes. A product of Z's acting on the links of the cycle can be expressed as a product of the enclosed plaquette operators, which acts trivially on the code space. A homologically nontrivial cycle wraps around the torus and is not the boundary of anything. A product of Z's acting on the links of the cycle preserves the code space, but acts nontrivially on the encoded quantum information. Associated with the two fundamental nontrivial cycles of the torus are encoded operations $\bar{Z}_1$ and $\bar{Z}_2$ acting on the two encoded qubits. Similarly, associated with the two dual cycles of the dual lattice are the corresponding encoded operations $\bar{X}_1$ and $\bar{X}_2$, as shown in Fig. 5.

A general error acting on the code block can be expanded in terms of Pauli operators. Therefore, we can characterize the efficacy of error correction by considering how well we can protect the encoded state against Pauli operator errors. With the toric code, X errors (bit flips) and Z errors (phase flips) can be corrected independently; this suffices to protect against general Pauli errors, since a Y error is just a bit flip and a phase flip acting on the same qubit. We may therefore confine our attention to Z errors; the X errors may be dealt with in essentially the same way, but with the lattice replaced by its dual.

Fig. 5. Basis for the operators that act on the two encoded qubits of the toric code. (a) The encoded $\bar{Z}_1$ is a tensor product of Z's acting on lattice links comprising a cycle of the torus, and the encoded $\bar{X}_1$ is a tensor product of X's acting on dual links comprising the complementary cycle. (b) $\bar{Z}_2$ and $\bar{X}_2$ are defined similarly.

3.2. Perfect measurements and the random-bond Ising model

To be concrete, suppose that the Z errors are independently and identically distributed, occurring with probability p on each qubit. Noise produces an error chain E, a set of qubits acted upon by Z. To diagnose the errors, the code's local check operators are measured at each lattice site, the measurement outcomes providing a “syndrome” that we may use to diagnose errors. However, the syndrome is highly ambiguous. It does not completely characterize where the errors occurred; rather it only indicates whether the number of damaged qubits adjacent to each site is even or odd. That is, the measurement determines the boundary $\partial E$ of the error chain E.

To recover from the damage, we choose a recovery chain E' that has the same boundary as the measured boundary of E, and apply Z to all the qubits of E'. Since $\partial E = \partial E'$, the chain D = E + E' is a cycle with $\partial D = 0$. Now, if D is homologically trivial, then the recovery successfully protects the two encoded qubits: the effect of the errors together with the recovery step is just to apply a product of check operators, which has trivial action on the code space. But if D is homologically nontrivial, then recovery fails, and the encoded quantum information suffers an error.
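The success criterion can be made concrete with a short sketch. It is illustrative only and rests on assumptions not fixed by the text: links are labeled by a hypothetical helper `link_index`, chains are Python sets of link labels (so the mod-2 sum E + E' is a symmetric difference), and the homology class of a cycle is read off from the parity with which it crosses one fixed cut in each direction.

```python
def link_index(L, x, y, direction):
    """Index of the link leaving site (x, y) along x (direction 0) or y (direction 1)."""
    return 2 * ((y % L) * L + (x % L)) + direction


def boundary(L, chain):
    """Sites touched by an odd number of links of the chain."""
    sites = set()
    for k in chain:
        x, y, direction = (k // 2) % L, (k // 2) // L, k % 2
        far_end = ((x + 1) % L, y) if direction == 0 else (x, (y + 1) % L)
        sites ^= {(x, y)}      # toggle membership of each endpoint
        sites ^= {far_end}
    return sites


def homology_class(L, cycle):
    """Winding parities of a cycle around the two directions of the torus."""
    if boundary(L, cycle):
        raise ValueError("chain has a boundary, so it is not a cycle")
    # Count crossings of the cut between columns x = 0 and x = 1 ...
    wraps_x = sum(link_index(L, 0, y, 0) in cycle for y in range(L)) % 2
    # ... and of the cut between rows y = 0 and y = 1.
    wraps_y = sum(link_index(L, x, 0, 1) in cycle for x in range(L)) % 2
    return wraps_x, wraps_y


def recovery_succeeds(L, error_chain, recovery_chain):
    """True if D = E + E' is a homologically trivial cycle."""
    D = set(error_chain) ^ set(recovery_chain)
    return homology_class(L, D) == (0, 0)
```

For example, on a 4 × 4 torus an error chain running from site (0, 0) to site (2, 0) is corrected by any recovery chain that joins the same two defects without winding around the torus, whereas a recovery chain that joins them "the long way around" completes a nontrivial cycle and the recovery fails.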

Error recovery succeeds, then, if we can guess the homology class of the randomly generated chain E, knowing only its boundary $\partial E$; we succeed if our guess E' = E + D differs from E by a homologically trivial cycle D. If the error rate p is below a certain critical value $p_c$ called the accuracy threshold, it is possible to guess correctly, with a probability of failure that approaches zero for a sufficiently large linear size L of the lattice. But if p is above $p_c$, the failure probability approaches a nonzero constant as $L \to \infty$. The numerical value of $p_c$ is of considerable interest, since it characterizes how reliably quantum hardware must perform for a quantum memory to be robust.

Let prob(E) denote the probability that the error chain is E, and let prob[(E + D)|E] denote the normalized conditional probability for error chains E' = E + D that have the same boundary as E. Then, the probability of error per qubit lies below threshold if and only if, in the limit $L \to \infty$,

$$\sum_E \mathrm{prob}(E) \cdot \sum_{D\ \mathrm{nontrivial}} \mathrm{prob}[(E + D)\,|\,E] = 0. \tag{33}$$

Eq. (33) says that error chains that differ from the actual error chain by a homologically nontrivial cycle have probability zero. Therefore, the outcome of the measurement of the check operators is sure to point to the correct homology class, in the limit of an arbitrarily large code block.

This criterion is identical to the criterion for long-range order in the two-dimensional RBIM, along the Nishimori line. The error chain E can be identified with the chain of antiferromagnetic bonds of a sample, bounded by Ising vortices that are pinned down by the measurement of the local check operators. The ensemble of all the chains {E'} with a specified boundary can be interpreted as a thermal ensemble. If the temperature T and the error rate p obey Nishimori's relation, then the chain E' and the chain E have the same bond concentration. At low temperature along the Nishimori line, the cycle D = E + E' contains no large connected loops for typical samples and typical thermal fluctuations: the spin system is magnetically ordered and error recovery succeeds with high probability. But at higher temperature, the quenched chain E and the thermal chain E' fluctuate more vigorously. At the Nishimori point, D contains loops that “condense,” disordering the spins and compromising the effectiveness of error correction. Thus, the critical concentration $p_c$ at the Nishimori point of the two-dimensional RBIM coincides with the accuracy threshold for quantum memory using toric codes (where $p_c$ is the largest acceptable probability for either an X error or a Z error).

The optimal recovery procedure is to choose a recovery chain E' that belongs to the most likely homology class, given the known boundary of the chain $\partial E' = \partial E$. For $p < p_c$, the probability distribution has support on a single class in the limit $L \to \infty$, and the optimal recovery procedure is sure to succeed. In the language of the RBIM, for a given sample with antiferromagnetic chain E, a chain E' of excited bonds can be classified according to the homology class to which the cycle D = E + E' belongs, and a free energy can be defined for each homology class. For $p < p_c$ along the Nishimori line, the trivial homology class has lowest free energy, and the free energy cost of choosing a different class diverges as $L \to \infty$.

An alternative recovery procedure is to choose the single most likely recovery chain E', rather than a chain that belongs to the most likely class. In the language of the RBIM, this most likely recovery chain E' for a given sample is the set of excited links that minimizes energy rather than free energy. This energy minimization procedure is sure to succeed if the error rate is $p < p_{c0}$, where $p_{c0}$ is the critical bond concentration of the RBIM at T = 0. Since minimizing energy rather than free energy need not be optimal, we see that $p_{c0} \le p_c$. However, the energy minimization procedure has advantages: it can be carried out efficiently using the Edmonds perfect matching algorithm [19,20], and without any prior knowledge of the value of p.
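To illustrate what the energy minimization computes, the sketch below pairs up defects on an L × L torus so that the total length of the connecting chains is minimal. It is an exhaustive search with memoization, used here only as a stand-in for the Edmonds algorithm: its cost grows exponentially with the number of defects, so it is suitable only for a handful of them. The function names and the representation of defects as (x, y) tuples are assumptions made for the example.

```python
from functools import lru_cache


def torus_distance(L, a, b):
    """Length of the shortest lattice path between sites a and b on an L x L torus."""
    dx = abs(a[0] - b[0])
    dy = abs(a[1] - b[1])
    return min(dx, L - dx) + min(dy, L - dy)


def min_weight_pairing(L, defects):
    """Return (total_length, pairs) for a minimum-weight pairing of the defects."""
    defects = sorted(defects)
    n = len(defects)
    if n % 2:
        raise ValueError("defects are created in pairs, so their number must be even")

    @lru_cache(maxsize=None)
    def best(mask):
        # mask has bit k set if defect k is still unpaired
        if mask == 0:
            return 0, ()
        i = (mask & -mask).bit_length() - 1      # lowest unpaired defect
        rest = mask & ~(1 << i)
        result = None
        for j in range(i + 1, n):
            if (rest >> j) & 1:
                cost, pairs = best(rest & ~(1 << j))
                cost += torus_distance(L, defects[i], defects[j])
                if result is None or cost < result[0]:
                    result = (cost, pairs + ((defects[i], defects[j]),))
        return result

    return best((1 << n) - 1)
```

The recovery chain E' is then obtained by joining each pair with a shortest path; its total number of links equals the returned length. Note that no knowledge of p enters the computation, as remarked in the text.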

3.3.  Imperfect  measurement  and  the  random-plaquette  gauge  model 

But  the  RBIM  applies  only  to  an  unrealistic  situation  in  which  the  outcomes  of 
measurements  of  check  operators  are  known  with  perfect  accuracy.  Since  these  are 
four-qubit  measurements,  they  must  be  carried  out  with  a  quantum  computer  and 
are  themselves  degraded  by  noise.  To  obtain  reliable  information  about  the  positions 
of  the  Ising  vortices,  we  must  repeat  the  measurements  many  times,  assembling  a 
measurement  history  from  which  we  can  infer  the  world  lines  of  the  vortices  in 
three-dimensional  spacetime. 

To visualize the world lines in three dimensions, consider a three-dimensional simple cubic lattice on $T^2 \times R$, where $T^2$ is the two-torus and R is the real line. The error operation acts at each integer-valued time t, and check operators are measured between each t and t + 1. Qubits in the code block are associated with timelike plaquettes, those lying in the tx and ty planes. A qubit error that occurs at time t is associated with a horizontal (spacelike) bond that lies in the time slice labeled by t. An error in the measurement of a check operator at site j between time t and time t + 1 is associated with the vertical (timelike) bond connecting site j at time t and site j at time t + 1. Qubit errors on horizontal bonds occur with probability p, and measurement errors on vertical links occur with probability q. The set of all errors, both horizontal and vertical, defines a one-chain E, shown darkly shaded in Fig. 6. The set of all syndrome measurements with nontrivial outcomes (those where the observed value of the check operator is $-1$ rather than $+1$) defines a (vertical) one-chain S, shown lightly shaded in Fig. 6. The chains E and S share the same boundary; therefore the (possibly faulty) measurements of the check operators reveal the boundary of the error chain E.
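The statement that E and S share the same boundary can be checked in a small simulation. The sketch below is one possible bookkeeping, under assumptions not spelled out in the text: spacetime points are labeled (x, y, t), the vertical bond above (x, y, t) carries the measurement made after the errors of time t, the history starts from an error-free state, and the open end of the history at the final time is ignored for both chains.

```python
import random


def simulate_history(L, T, p, q, rng):
    """Simulate T rounds of Z errors and noisy site-check measurements.

    Returns (boundary_E, boundary_S), each a set of spacetime points (x, y, t).
    """
    true_syndrome = [[0] * L for _ in range(L)]   # parity of accumulated errors at each site
    last_measured = [[0] * L for _ in range(L)]   # syndrome bit reported in the previous round
    last_faulty = [[0] * L for _ in range(L)]     # 1 if the previous measurement was wrong
    boundary_E, boundary_S = set(), set()
    for t in range(T):
        # Fresh qubit errors at time t: each spacelike bond fails with probability p
        # and toggles the parity at both of its endpoints.
        fresh = [[0] * L for _ in range(L)]
        for y in range(L):
            for x in range(L):
                if rng.random() < p:              # bond (x, y) -> (x + 1, y)
                    fresh[y][x] ^= 1
                    fresh[y][(x + 1) % L] ^= 1
                if rng.random() < p:              # bond (x, y) -> (x, y + 1)
                    fresh[y][x] ^= 1
                    fresh[(y + 1) % L][x] ^= 1
        for y in range(L):
            for x in range(L):
                true_syndrome[y][x] ^= fresh[y][x]
                faulty = 1 if rng.random() < q else 0   # timelike error bond above (x, y, t)
                measured = true_syndrome[y][x] ^ faulty
                # (x, y, t) is in the boundary of E if an odd number of error bonds
                # (spacelike at time t, timelike just below or just above) end there.
                if fresh[y][x] ^ faulty ^ last_faulty[y][x]:
                    boundary_E.add((x, y, t))
                # It is in the boundary of S if the reported syndrome bit changed.
                if measured ^ last_measured[y][x]:
                    boundary_S.add((x, y, t))
                last_faulty[y][x] = faulty
                last_measured[y][x] = measured
    return boundary_E, boundary_S
```

With these conventions the two returned sets coincide for every noise realization, which is the content of the statement above: the recorded syndrome history fixes the boundary of E even when individual measurements are wrong.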

Error recovery succeeds if we can guess the homology class of the error chain E, given knowledge of its boundary $\partial E$; that is, we succeed if our guess E' = E + D differs from E by a cycle D that is homologically trivial on $T^2 \times R$. Thus, the accuracy threshold can be mapped to the confinement-Higgs transition of the RPGM. The error one-chain E on the dual lattice becomes the set of wrong-sign plaquettes on the lattice; its boundary points are magnetic monopoles, whose locations are determined by the measurements of local check operators. Since q need not equal p, the gauge model can be anisotropic: on the original lattice, the concentration of spacelike wrong-sign plaquettes is q (spacelike plaquettes are dual to timelike bonds) and the concentration of timelike wrong-sign plaquettes is p (timelike plaquettes are dual to spacelike bonds). The ensemble of error chains {E'} that have the same boundary as E becomes the thermal ensemble determined by an anisotropic Hamiltonian, with the coupling $K_{\rm space}$ on spacelike plaquettes obeying the Nishimori relation $K_{\rm space} = K_q$ and the coupling $K_{\rm time}$ on timelike plaquettes the relation $K_{\rm time} = K_p$.

Fig. 6. An error history shown together with the syndrome history that it generates, for the toric code. For clarity, the three-dimensional history of the two-dimensional code block has been compressed to two dimensions. Qubits reside on plaquettes, and four-qubit check operators are measured at each vertical link. Links where errors have occurred are darkly shaded, and links where the syndrome is nontrivial are lightly shaded. Errors on horizontal links indicate where a qubit flipped between successive syndrome measurements, and errors on vertical links indicate where the syndrome measurement was wrong. Vertical links that are shaded both lightly and darkly are locations where a nontrivial syndrome was found erroneously. The chain S of lightly shaded links (the syndrome) and the chain E of darkly shaded links (the errors) both have the same boundary.

For small p and q, the cycle D = E + E' contains no large connected loops for typical samples and typical thermal fluctuations: the gauge system is magnetically ordered and error recovery succeeds with high probability. But there is a critical curve in the (p,q) plane where the magnetic flux tubes “condense,” magnetically disordering the system and compromising the effectiveness of error correction. For the sort of error model described in [9], the qubit error rate and the measurement error rate are comparable, so the isotropic model with p = q provides useful guidance. For that case, the critical concentration $p_c$ at the Nishimori point of the three-dimensional RPGM coincides with the accuracy threshold for quantum memory using toric codes (where $p_c$ is the largest acceptable probability for an X error, a Z error, or a measurement error). In the extreme anisotropic limit $q \to 0$, flux on spacelike plaquettes is highly suppressed, and the timelike plaquettes on each time slice decouple, with each slice described by the RBIM.

For both the 2D RBIM and the 3D (isotropic) RPGM, we may infer (as Nishimori argued for the RBIM [10]) that the phase boundary lies in the region $p \le p_c$, i.e., does not extend to the right of the Nishimori point. From the perspective of the error recovery procedure, this property reflects that the best hypothesis about the error chain, when its boundary is known, is obtained by sampling the distribution prob[(E + D)|E]. Thus, for each value of p, the fluctuations of D are best controlled (the spins or gauge variables are least disordered) by choosing the temperature on the Nishimori line. For $p > p_c$ the magnetization of the 2D RBIM vanishes on the Nishimori line, and so must vanish for all T. A similar remark applies to the Wilson-loop order parameter of the 3D RPGM.

In particular, the critical value of p on the T = 0 axis (denoted $p_{c0}$) provides a lower bound on $p_c$. Rigorous arguments in [9] established that $p_{c0} \ge .0373$ in the 2D RBIM and $p_{c0} \ge .0114$ in the 3D RPGM. (A similar lower bound for the 2D RBIM was derived by Horiguchi and Morita many years ago [21].) We have estimated the value of $p_{c0}$ using numerical simulations that we will now describe.


4.  Numerics 

4.1.  Method 

For the RBIM in two dimensions (but not in higher dimensions), and for the RPGM in three dimensions (but not in higher dimensions), it is numerically tractable to study the phase transition on the T = 0 axis. Specifically, for the RBIM, we proceed as follows: Consider an L × L lattice on the torus, and generate a sample by choosing a random $\tau_{ij}$ at each bond (where $\tau_{ij} = -1$ occurs with probability p). Consider, for this sample, the one-chain E on the dual lattice containing bonds with $\tau_{ij} = -1$, and compute its boundary $\partial E$ to locate the Ising vortices.

Then, to find the ground state of the Hamiltonian for this sample, construct the one-chain E' of the dual lattice, bounded by the Ising vortices, with the minimal number of bonds. This minimization can be carried out in a time polynomial in L using the Edmonds perfect matching algorithm [19,20]. (If the ground state is not unique, choose a ground state at random.) Now examine the one-cycle D = E + E' on the torus and compute whether its homology class is trivial. If so, we declare the sample a “success”; otherwise the sample is a “failure.” Repeat for many randomly generated samples, to estimate the probability of failure $P_{\rm fail}(p)$.

We expect $P_{\rm fail}(p)$ to be discontinuous at $p = p_{c0}$ in the infinite volume limit. For $p < p_{c0}$, large loops in D are heavily suppressed, so that $P_{\rm fail}$ falls exponentially to zero for L sufficiently large compared to the correlation length $\xi$. But for $p > p_{c0}$, arbitrarily large loops are not suppressed, so we anticipate that the homology class is random. Since there are four possible classes, we expect $P_{\rm fail}$ to approach 3/4 as $L \to \infty$.

This expectation suggests a finite-size scaling ansatz for the failure probability. Let the critical exponent $\nu_0$ characterize the divergence of the correlation length $\xi$ at the critical point $p = p_{c0}$:

$$\xi \sim |p - p_{c0}|^{-\nu_0}. \tag{34}$$

For a sufficiently large linear size L of the sample, the failure probability should be controlled by the ratio $L/\xi$; that is, it is a function of the scaling variable

$$x = (p - p_{c0})\, L^{1/\nu_0}. \tag{35}$$

Thus the appropriate ansatz is

$$P_{\rm fail} \simeq \frac{3}{4} f(x), \tag{36}$$

where the function f has the properties

$$\lim_{x \to -\infty} f(x) = 0, \qquad \lim_{x \to \infty} f(x) = 1. \tag{37}$$

Though the scaling ansatz should apply asymptotically in the limit of large L, there are systematic corrections for finite L that are not easily estimated.

According to Eq. (36), the failure probability at $p = p_{c0}$ has a universal value (3/4)f(0) that does not depend on L. Thus, by plotting $P_{\rm fail}$ vs. p for various values of L, we can estimate $p_{c0}$ by identifying the value of p where all the curves cross. To find $\nu_0$, we observe that

$$\log\left( \frac{\partial P_{\rm fail}}{\partial p} \bigg|_{p = p_{c0}} \right) = \frac{1}{\nu_0} \log L + \mathrm{constant}. \tag{38}$$

Hence, if we estimate the slope of $P_{\rm fail}$ at $p = p_{c0}$, we can extract $\nu_0$ from a linear fit to a plot of log(slope) vs. log L.
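The extraction of $\nu_0$ from Eq. (38) amounts to an ordinary least-squares fit of a straight line in log-log coordinates. A minimal sketch follows; the function name `fit_nu0` is hypothetical, and the numbers in the usage example are synthetic values generated from an exact power law, not data from our simulations.

```python
import math


def fit_nu0(sizes, slopes):
    """Estimate nu_0 from the slopes dP_fail/dp at p = p_c0 measured at several sizes L.

    Fits log(slope) = (1 / nu_0) * log(L) + constant by least squares, as in Eq. (38).
    """
    xs = [math.log(L) for L in sizes]
    ys = [math.log(s) for s in slopes]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    gradient = sxy / sxx          # this is 1 / nu_0
    return 1.0 / gradient


# Synthetic check: slopes that follow the power law exactly return the input exponent.
sizes = [8, 16, 32, 64]
slopes = [2.0 * L ** (1 / 1.46) for L in sizes]
nu0_estimate = fit_nu0(sizes, slopes)
```

In practice the slopes carry statistical errors and finite-size corrections, so a weighted fit restricted to the larger sizes would be the more careful choice.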

The three-dimensional RPGM can be analyzed by the same method. A sample is generated by randomly choosing $\tau_P$ on each plaquette of an $L^3$ cubic lattice on the 3-torus. The wrong-sign plaquettes define a one-chain E on the dual lattice, whose boundary defines the locations of the magnetic monopoles. The ground state of the sample is constructed by finding the one-chain E' with the same boundary that has the minimal length, and the one-cycle D = E + E' is examined to determine if it is homologically trivial. Since there are eight homology classes on the 3-torus, the scaling ansatz becomes

$$P_{\rm fail} \simeq \frac{7}{8} f(x), \tag{39}$$

and $p_{c0}$ and $\nu_0$ are estimated as described above.

For  the  RBIM  in  three  dimensions,  or  the  RPGM  in  four  dimensions,  E  and  E' 
become  two-chains.  To  construct  the  ground  state,  then,  we  must  find  the  minimal 
two-dimensional  surface  that  has  a  specified  boundary.  Unfortunately,  this 
problem  is  known  to  be  NP-hard  [22]  and  so  appears  to  be  computationally 
intractable. 

Detailed  numerical  studies  of  the  two-dimensional  RBIM  in  the  vicinity  of  the 
Nishimori  point  have  been  done  earlier  by  other  authors  [11,12],  using  methods 
that  are  not  very  effective  at  low  temperature.  The  T  =  0  phase  transition  has  been 
studied  using  methods  related  to  ours  [20,23],  but  with  less  numerical  accuracy.  As 
far  as  we  know,  numerical  studies  of  the  RPGM  have  not  been  previously 
attempted. 

4.2. Random-bond Ising model

We measured $P_{\rm fail}$ by generating $10^6$ samples for each value of L from 2 to 36, and for each value of p increasing in increments of .001 from .100 to .107; in addition we generated $10^6$ samples at L = 37, 38, 40, 42 for p = .102, .103, .104. Values of $P_{\rm fail}$ for even L lie slightly but systematically above the values for odd L at the same p; therefore we analyzed the data for even and odd L separately. Data for L = 16, 20, 24, 28, 32, 36 are shown in Fig. 7, and data for L = 15, 19, 23, 27, 31, 35 are shown in Fig. 8. Crudely, the point of concordance of the data sets provides an estimate of $p_{c0}$, while the trend of the data with L determines the exponent $\nu_0$.

We did a global fit of the data to the form

$$P_{\rm fail} = A + Bx + Cx^2, \tag{40}$$

where $x = (p - p_{c0}) L^{1/\nu_0}$, adopting a quadratic approximation to the scaling function f(x) in the vicinity of x = 0. (In the range of x we considered, the quadratic term is small but not quite negligible.) For even L ranging from 22 to 42, our fit found

$$p_{c0} = .10330 \pm .00002, \qquad \nu_0 = 1.49 \pm .02, \tag{41}$$

where the quoted errors are one-sigma statistical errors. For odd L ranging from 21 to 37, our fit found

$$p_{c0} = .10261 \pm .00003, \qquad \nu_0 = 1.46 \pm .02. \tag{42}$$

Fig. 7. The failure probability $P_{\rm fail}$ as a function of the error probability p for linear size L = 16, 20, 24, 28, 32, 36, in the two-dimensional random-bond Ising model. Each data point was generated by averaging $10^6$ samples.

Fig. 8. The failure probability $P_{\rm fail}$ as a function of the error probability p for linear size L = 15, 19, 23, 27, 31, 35, in the two-dimensional random-bond Ising model. Each data point was generated by averaging $10^6$ samples.

The discrepancy between the values of $p_{c0}$ for even and odd L indicates a nonnegligible finite-size effect.

On closer examination, we see evidence for small but detectable violations of our scaling ansatz in both the even and odd data sets. These violations are very well accounted for by the modified ansatz

$$P_{\rm fail} = A + Bx + Cx^2 + \begin{cases} D_{\rm even} \cdot L^{-1/\mu_{\rm even}} & (L\ \mathrm{even}), \\ D_{\rm odd} \cdot L^{-1/\mu_{\rm odd}} & (L\ \mathrm{odd}), \end{cases} \tag{43}$$

which includes a nonuniversal additive correction to $P_{\rm fail}$ at criticality, different for even and odd sizes. Fitting the modified ansatz to the data for even L ranging from 2 to 42, we find

$$\begin{aligned} p_{c0} &= .10309 \pm .00003, \\ \nu_0 &= 1.461 \pm .008, \\ D_{\rm even} &= 0.165 \pm .002, \\ \mu_{\rm even} &= 0.71 \pm .01. \end{aligned} \tag{44}$$

Fitting to the data for odd L ranging from 3 to 37, we find

$$\begin{aligned} p_{c0} &= .10306 \pm .00008, \\ \nu_0 &= 1.463 \pm .006, \\ D_{\rm odd} &= -.053 \pm .003, \\ \mu_{\rm odd} &= 2.1 \pm .3. \end{aligned} \tag{45}$$

In Fig. 9 we show the data for all values of L and p; using the values of $p_{c0}$, $\nu_0$, D, and $\mu$ found in our fits, we have plotted $P_{\rm fail}$, with the nonuniversal correction of Eq. (43) subtracted away, as a function of the scaling variable $x = (p - p_{c0}) L^{1/\nu_0}$. All of the data lie on a single line, indicating that residual scaling violations are quite small. Furthermore, the agreement between the values of $p_{c0}$ and $\nu_0$ extracted from the even and odd data sets, which were fit independently, indicates that our extrapolation to large L is reasonable, and that the statistical errors in Eqs. (44) and (45) do not seriously underestimate the actual errors in our measurement. A plausible conclusion is that

$$p_{c0} = .1031 \pm .0001, \qquad \nu_0 = 1.46 \pm .01. \tag{46}$$

Fig. 9. The failure probability $P_{\rm fail}$, with the nonuniversal correction of Eq. (43) subtracted away, as a function of the scaling variable $x = (p - p_{c0}) L^{1/\nu_0}$ for the two-dimensional random-bond Ising model, where $p_{c0}$ and $\nu_0$ are determined by the best fit to the data. A two-sigma error bar is shown for each point. The data for values of L from 2 to 42 lie on a single line, indicating that the (small) scaling violations are well accounted for by our ansatz.

An earlier measurement reported by Kawashima and Rieger found [23]

$$p_{c0} = .104 \pm .001, \qquad \nu_0 = 1.30 \pm .02; \tag{47}$$

their value of $p_{c0}$, but not of $\nu_0$, is compatible with ours. An important reason why our value of $p_{c0}$ has a smaller statistical error than theirs is that they computed a different observable (the domain wall energy) for which the finite-size scaling analysis is more delicate than for the failure probability (another critical scaling exponent is involved). In a recent study of the Nishimori point, Merz and Chalker found [12]

$$p_c = .1093 \pm .0002, \qquad \nu = 1.50 \pm .03. \tag{48}$$

There is a clear discrepancy between the values of $p_c$ and $p_{c0}$, in disagreement with the conjecture of Nishimori [13] and Kitatani [14]. Evidence for a reentrant phase diagram has also been found by Nobre [24], who reported

$$p_{c0} = .1049 \pm .0003. \tag{49}$$

In principle, the phase transitions at T = 0 and at the Nishimori point could be in different universality classes, so that the critical exponents $\nu_0$ and $\nu$ could have different values. However, our measurement of $\nu_0$ at T = 0 is consistent with the value of $\nu$ at the Nishimori point reported by Merz and Chalker [12].

4.3.  Random-plaquette  gauge  model 

We measured $P_{\rm fail}$ by generating $10^6$ samples for each value of L from 9 to 14, and for each value of p increasing in increments of .0004 from .02805 to .03005; in addition we generated $10^6$ samples at L = 15, 16 for p = .02845, .02925, .03005. Values of $P_{\rm fail}$ for even L lie slightly but systematically above the values for odd L at the same p; therefore we analyzed the data for even and odd L separately. Data for even L are shown in Fig. 10. Crudely, the point of concordance of the data sets provides an estimate of $p_{c0}$, while the trend of the data with L determines the exponent $\nu_0$.

We did a global fit of the data to the form

$$P_{\rm fail} = A + Bx + Cx^2, \tag{50}$$

where $x = (p - p_{c0}) L^{1/\nu_0}$, adopting a quadratic approximation to the scaling function f(x) in the vicinity of x = 0. For L ranging from 9 to 16, our fit found

$$\begin{aligned} p_{c0} &= .02937 \pm .00002, & \nu_0 &= 0.974 \pm .026 & (L\ \mathrm{even}), \\ p_{c0} &= .02900 \pm .00001, & \nu_0 &= 1.025 \pm .016 & (L\ \mathrm{odd}), \end{aligned} \tag{51}$$

where the quoted errors are one-sigma statistical errors. The results for even and odd L are incompatible, indicating a nonnegligible finite-size effect.

We believe that our analysis for even L is likely to be more reliable; finite size effects are enhanced for odd L, the case in which the failure probability is smaller. All of the even L data are shown in Fig. 11, with $P_{\rm fail}$ plotted as a function of $x = (p - p_{c0}) L^{1/\nu_0}$, where $p_{c0}$ and $\nu_0$ are determined by our fit. The data fit a single curve, indicating that scaling violations are small. (Scaling violations are more discernible in the odd L data set.) A reasonable conclusion is that

$$p_{c0} = .0293 \pm .0002, \qquad \nu_0 = 1.00 \pm .05. \tag{52}$$

Fig. 10. The failure probability $P_{\rm fail}$ as a function of the error probability p (horizontal axis: error rate p) for linear size L = 10, 12, 14, 16, in the three-dimensional random-plaquette gauge model. Each data point was generated by averaging $10^6$ samples.

Fig. 11. The failure probability $P_{\rm fail}$ as a function of the scaling variable $x = (p - p_{c0}) L^{1/\nu_0}$ for the random-plaquette gauge model, where $p_{c0}$ and $\nu_0$ are determined by the best fit to the data. A two-sigma error bar is shown for each point. The data for all even values of L from 10 to 16 lie on a single curve, indicating that scaling violations are small.

4.4.  The  failure  probability  at  finite  temperature 

Our numerical studies of the RBIM and the RPGM were restricted to the T = 0 axis. We calculated the failure probability to estimate the critical disorder strength $p_{c0}$ and the critical exponent $\nu_0$. Here we will describe how the calculation of the failure probability could be extended to nonzero temperature.

To calculate the failure probability in the zero-temperature RBIM, we generate a sample by specifying a one-chain E of antiferromagnetic links, and then we construct the one-chain E' of minimal energy with the same boundary as E. Failure occurs if the cycle D = E + E' is homologically nontrivial.

At nonzero temperature we should construct E' to belong to the homology class that minimizes free energy rather than energy. For a given sample with antiferromagnetic one-chain E, the free energy F(E,h) of homology class h is found by summing over domain wall one-chains {E'} such that $E + E' \in h$:

$$\exp[-\beta F(E,h)] = Z(E,h) = \sum_{E'\,|\,E + E' \in h} e^{-\beta H_E}, \tag{53}$$

where $H_E$ denotes the Hamiltonian Eq. (1) with antiferromagnetic chain E. If the trivial homology class h = e has the lowest free energy, then the sample is a “success”; otherwise it is a “failure.” We can estimate the failure probability $P_{\rm fail}(p, T)$ by randomly generating many samples, and determining for each whether it is a success or a failure.

For the random bond Ising model on a torus, the sum Eq. (3) includes only the chains E' such that E + E' is in the trivial homology class. To sum over the class h, we can augment E by adding to it a representative of h. For each h, we can compute

$$\frac{Z(E,h)}{Z(E,e)} = \exp[-\beta(F(E,h) - F(E,e))]; \tag{54}$$

the sample E is a success if this ratio of partition functions is less than one for each $h \ne e$.

The ratio is the thermal expectation value $\langle O_h \rangle_K$ of an observable $O_h$ that "inserts a domain wall" wrapping around a cycle C representing h. That is, the effect of $O_h$ is to flip the sign of the bond variable $\tau_{ij}$ for each bond ⟨ij⟩ in C:

$$ O_h = \exp\Big[-2K \sum_{\langle ij \rangle \in C} \tau_{ij} S_i S_j\Big]. \qquad (55) $$
In principle, we could measure $\langle O_h \rangle_K$ by the Monte Carlo method, generating typical configurations in the thermal ensemble of $H_E$, and evaluating $O_h$ in these configurations. Unfortunately, this method might not produce an accurate measurement, because the configurations that dominate $\langle O_h \rangle_K$ may be exponentially rare in the thermal ensemble: a configuration with excited bonds on C can have an exponentially large value of $O_h$ that overcomes exponential Boltzmann suppression.

One solution to this problem is to express Z(E,h)/Z(E,e) as a product of quantities, each of which can be evaluated accurately by Monte Carlo. Let $\{e = P_0, P_1, \ldots, P_{k-1}, P_k = C\}$ be a sequence of open chains interpolating between the empty chain and the cycle C, where $P_{j+1} - P_j$ contains just a single bond. We may write

$$ \frac{Z(E,h)}{Z(E,e)} = \frac{Z(E,P_1)}{Z(E,P_0)} \cdot \frac{Z(E,P_2)}{Z(E,P_1)} \cdots \frac{Z(E,P_k)}{Z(E,P_{k-1})}. \qquad (56) $$

Each ratio $Z(E,P_{j+1})/Z(E,P_j)$ is the expectation value of an operator that acts on a single bond, evaluated in the thermal ensemble of the Hamiltonian with antiferromagnetic bonds on the chain $E + P_j$; this expectation value can be evaluated by Monte Carlo with reasonable computational resources. (For an application of this trick in a related setting, see [25].)
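The telescoping identity can be checked on a system small enough to enumerate exactly. The sketch below is illustrative only: it replaces Monte Carlo sampling by brute-force enumeration on a 3 x 3 torus, takes the weight of a configuration to be exp(K Σ τ_ij S_i S_j) with K = βJ, and all function names are hypothetical. Each factor is the expectation value of a single-bond operator exp(-2K τ_b S_i S_j) in the ensemble where the bonds inserted so far have already been flipped.

```python
import itertools
import math


def bonds_torus(L):
    """Nearest-neighbour bonds of an L x L torus (L >= 3) as (site_i, site_j) pairs."""
    idx = lambda x, y: (x % L) * L + (y % L)
    bonds = []
    for x in range(L):
        for y in range(L):
            bonds.append((idx(x, y), idx(x + 1, y)))  # bond in the x direction
            bonds.append((idx(x, y), idx(x, y + 1)))  # bond in the y direction
    return bonds


def boltzmann_weight(bonds, tau, K, S):
    """Weight exp(K * sum_b tau_b S_i S_j) of spin configuration S."""
    return math.exp(K * sum(t * S[i] * S[j] for (i, j), t in zip(bonds, tau)))


def partition_function(bonds, tau, K, n):
    """Exact Z by summing over all 2**n spin configurations."""
    return sum(boltzmann_weight(bonds, tau, K, S)
               for S in itertools.product((-1, 1), repeat=n))


def single_bond_ratio(bonds, tau, K, n, b):
    """<exp(-2 K tau_b S_i S_j)> in the ensemble of tau.

    This equals Z(tau with bond b flipped) / Z(tau).
    """
    i, j = bonds[b]
    num = den = 0.0
    for S in itertools.product((-1, 1), repeat=n):
        w = boltzmann_weight(bonds, tau, K, S)
        den += w
        num += w * math.exp(-2.0 * K * tau[b] * S[i] * S[j])
    return num / den


def telescoped_ratio(bonds, tau, K, n, cycle):
    """Product of single-bond ratios along the bonds of `cycle`, flipping as we go."""
    tau = list(tau)
    product = 1.0
    for b in cycle:
        product *= single_bond_ratio(bonds, tau, K, n, b)
        tau[b] = -tau[b]  # bond b now belongs to the interpolating chain
    return product
```

With `bonds = bonds_torus(3)`, the bond indices `[0, 6, 12]` form a loop that wraps the torus in the x direction, and `telescoped_ratio` agrees with the directly computed ratio of partition functions. In a real simulation each factor would be estimated by sampling rather than enumeration, which is where the single-bond form helps: no individual factor is dominated by exponentially rare configurations.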

Using this method, we can determine whether Z(E,h)/Z(E,e) exceeds one for any h ≠ e and hence whether the sample E is a success or a failure. Generating many samples, we can estimate $P_{\rm fail}(p,T)$. In principle, then, we can calculate the failure probability for the optimal recovery scheme, in which p and T obey Nishimori's relation. By a similar method, we can calculate the failure probability for the RPGM. However, we have not attempted this calculation.


5.  Conclusions 

The three-dimensional random-plaquette gauge model, and the analogous antisymmetric tensor models in higher dimensions, provide new examples of multicritical points with strong disorder. These models have phase diagrams that qualitatively resemble the phase diagram of the two-dimensional random-bond Ising model.

Our results indicate that the boundary between the ferromagnetic and paramagnetic phases of the RBIM is reentrant rather than vertical below the Nishimori line. If the disorder strength p satisfies $p_{c0} < p < p_c$, then the ground state of the spin system does not have long-range order. As the temperature T increases with p fixed, long-range order is first restored, and then lost again as the temperature increases further. At T = 0 the spins are frozen in a disordered state driven by quenched randomness. But apparently this ground state is entropically unfavorable: at low but nonzero temperature, typical states in the thermal ensemble have long-range ferromagnetic order.
This behavior seems less remarkable when considered from the viewpoint of our error recovery protocol. For given p and a specified error syndrome, the recovery method with optimal success probability proceeds by inferring the most likely homology class of errors consistent with the syndrome. There is no a priori reason for the most likely single error pattern (the ground state) to belong to the most likely error homology class (the class with minimal free energy) even in the limit of a large sample. Our numerical results indicate that for error probability p such that $p_{c0} < p < p_c$, the probability that the ground state does not lie in the most likely homology class remains bounded away from zero as $L \to \infty$.

In our numerical studies of the RBIM and RPGM at zero temperature, we have computed a homological observable, the failure probability. This observable has advantages over, say, the domain wall energy, because it obeys a particularly simple finite-size-scaling ansatz. Therefore, we have been able to determine the critical disorder strength $p_{c0}$ and the critical exponent $\nu_0$ to good accuracy with relatively modest computational resources.

Not surprisingly, our numerical values for $p_{c0}$ are notably larger than rigorous lower bounds derived using crude combinatoric arguments in [9]: $p_{c0} \approx 0.1031$ compared with the bound $p_{c0} \ge 0.0373$ in the RBIM, and $p_{c0} \approx 0.0293$ compared with $p_{c0} \ge 0.0114$ in the RPGM.

The zero-temperature critical disorder strength $p_{c0}$ is a lower bound on the value of the critical disorder strength $p_c$ along the Nishimori line, and of special interest because of its connection with the accuracy threshold for robust storage of quantum information. Our result means that stored quantum data can be preserved with arbitrarily good fidelity if, in each round of syndrome measurement, qubit errors and syndrome measurement errors are independently and identically distributed, with error probability per qubit and per syndrome bit both below 2.9%. For qubit errors and measurement errors occurring at differing rates, an accuracy threshold could be inferred by analyzing an anisotropic random-plaquette gauge model, with differing disorder strength for horizontal and vertical plaquettes. Relating these threshold error rates to fidelity requirements for quantum gates requires further analysis of the sort discussed in [9].

We have also measured the critical exponent $\nu_0$ that controls the divergence of the correlation length as p approaches $p_{c0}$, finding $\nu_0 \approx 1.46$ in the RBIM and $\nu_0 \approx 1.0$ in the RPGM. The value of $\nu_0$ is also relevant to the efficacy of quantum error correction: through its connection with finite-size scaling, $\nu_0$ determines how large the code block should be to achieve a specified storage fidelity, for p less than but close to $p_{c0}$.
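To make the role of $\nu_0$ concrete, here is a minimal sketch of how the required linear block size scales. It assumes the one-parameter scaling form in which the failure probability depends on p and L only through x = (p - p_c0) L^(1/ν_0); the function name and the target value of |x| are hypothetical inputs, and the scaling function itself is not modelled.

```python
def block_size_for_scaling_variable(p, p_c0, nu0, x_target):
    """Linear size L at which |p - p_c0| * L**(1/nu0) equals x_target.

    Assumes the failure probability is a function of the single scaling
    variable x = (p - p_c0) * L**(1/nu0); x_target is a chosen value of |x|
    corresponding to the desired failure probability (illustrative input).
    """
    if p == p_c0:
        raise ValueError("L diverges at the critical point")
    return (x_target / abs(p - p_c0)) ** nu0
```

Under this assumption, halving the distance to threshold multiplies the required L by 2^ν_0: a factor of 2 for ν_0 = 1.0 and about 2.75 for ν_0 = 1.46.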

Quantum computers are believed to be more powerful than classical computers; classical computers are unable to simulate quantum computers efficiently. The accuracy threshold for quantum memory is a fascinating phase transition, separating a low-noise quantum phase from a high-noise classical phase. In this paper, we have described one way to analyze this phase transition using methods from traditional statistical physics. Furthermore, the connection with quantum memory provides an enlightening new perspective on local spin and gauge systems with strong quenched disorder.
We prove the security of the Bennett-Brassard (BB84) quantum key distribution protocol for an arbitrary source whose averaged states are basis independent, a condition that is automatically satisfied if the source is suitably designed. The proof is based on the observation that, to an adversary, the key extraction process is equivalent to a measurement in the $\sigma_x$ basis performed on a pure $\sigma_z$-basis eigenstate. The dependence of the achievable key length on the bit error rate is the same as that established by Shor and Preskill [Phys. Rev. Lett. 85, 441 (2000)] for a perfect source, indicating that the defects in the source are efficiently detected by the protocol.
Quantum key distribution is an ingenious application
of  quantum  mechanics,  in  which  two  remote  parties 
(Alice  and  Bob)  establish  a  shared  secret  key  through 
the  transmission  of  quantum  signals.  In  the  Bennett- 
Brassard  (BB84)  protocol  [1],  Alice  sends  a  key  bit  to 
Bob  by  preparing  a  qubit  in  one  of  two  conjugate  bases 
and  Bob  measures  the  qubit  in  one  of  the  two  bases;  the 
eavesdropper  Eve,  who  does  not  know  the  basis  chosen 
by  Alice  or  by  Bob,  cannot  collect  information  about  the 
key  without  producing  a  detectable  disturbance.  This 
protocol,  when  suitably  augmented  by  classical  error 
correction  and  privacy  amplification,  is  provably  secure 
against  any  attack  by  Eve  [2-5]. 

Though security can be proven without imposing any restriction on Eve's attack (other than the requirement that she has no a priori information about the basis used), it is necessary to place conditions on the performance of the source and detector employed in the protocol. In the Shor-Preskill proof [5], it is assumed that any flaws in the source and detector can be absorbed into Eve's basis-independent attack. The proof by Mayers [2], however, applies to a more general setting: although the source is perfect, the detector has never been tested and is completely uncharacterized. Indeed, the detector could be under the control of Eve's collaborator Fred. Fred is unable to send messages to Eve, but he knows Bob's basis and can adjust the measurement performed by the detector accordingly. Still, as Mayers showed, Fred cannot fool Alice and Bob into accepting a key that Eve knows, as long as the efficiency of the detector is basis independent. Since a real device could have an indefinite number of degrees of freedom, no test can fully characterize it; therefore, proving security in the case of an uncharacterized apparatus provides comfort to a highly suspicious user of the key distribution scheme.

In this Letter, we present a simple proof of the security of the BB84 protocol that applies to a setting opposite to that considered by Mayers: the detector is perfect and Fred controls the source. We, however, place one important restriction on Fred's attack: the source must not leak any information to Eve about the basis chosen by Alice. That is, the state emitted by the source, averaged over the values of Alice's key bit, is required to be independent of Alice's basis. Our proof applies to faulty sources that are notably more general than those encompassed by the Shor-Preskill proof; to give just one example, it applies to a source that performs perfectly when Alice chooses the $\sigma_x$ basis but that rotates the qubit when Alice chooses the $\sigma_z$ basis. Nevertheless, our proof shows that a secure key can be extracted from a sifted key at the same rate established by Shor and Preskill.

Our proof combines insights gleaned from both the Mayers proof and the Shor-Preskill proof. Following Mayers, we analyze the information about (Bob's) key collected by Eve in the case where Alice and Bob are using different bases. Following Shor and Preskill, we bound Eve's information by observing that Bob could have performed an error correction to remove any entanglement with Eve's probe before executing the measurement that extracts his final key. The core of our proof is the observation that a single quantum circuit computes Bob's final key in the $\sigma_x$ basis and reverses the damage inflicted by Eve if the error rate is small in the $\sigma_z$ basis. Using the same method, we can also prove security for the case of an uncharacterized detector.

Before proceeding to the proof, let us specify in more detail our models of the source and detector. Alice prepares a physical system with Hilbert space $\mathcal{H}_A$, which has an arbitrary size, in one of four states $\rho(a,g)$ with probability $p_{a,g}$; a = 0, 1 labels Alice's basis choice and g = 0, 1 is the value of her key bit. The choice of a is assumed to be completely random: $p_{a,0} + p_{a,1} = 1/2$. We assume that the states satisfy

$$ p_{0,0}\,\rho(0,0) + p_{0,1}\,\rho(0,1) = p_{1,0}\,\rho(1,0) + p_{1,1}\,\rho(1,1), \qquad (1) $$
which is vital in the security proof. A convenient way to prepare such an ensemble is to introduce an auxiliary system A' with Hilbert space $\mathcal{H}_{A'}$. Alice first prepares $\mathcal{H}_A \otimes \mathcal{H}_{A'}$ in an entangled state $\rho_{AA'}$ and then performs a measurement $M_a$ on system A' alone. The measurement $M_a$ gives a binary outcome, determining g. Equation (1) is then satisfied because the choice of the measurement, $M_0$ or $M_1$, does not affect the marginal state of $\mathcal{H}_A$. Hence, if the source is realized in this way, there is no need to test its performance.

As noted in [6], if A' is a qubit, $M_0$ is a measurement of $\sigma_z$, and $M_1$ is a measurement of $\sigma_x$, then security can be established by the method of Shor and Preskill. But our security proof invokes only the condition (1); no further properties of $\rho_{AA'}$, $M_0$, or $M_1$ need be specified.

At the end of the transmission channel $\mathcal{H}_A \to \mathcal{H}_B$, Bob switches between two measurements on $\mathcal{H}_B$. We assume that the two measurements are modeled by a common quantum channel $\mathcal{H}_B \to \mathcal{H}_2$, where $\dim \mathcal{H}_2 = 2$, followed by the measurement of the Pauli operator $\sigma_z$ or $\sigma_x$. In the security proof, we include the common quantum channel in the transmission channel between Alice and Bob, so that Bob receives a qubit at the end of the channel.

The protocol that we shall prove to be secure is the following: Let $\Omega = \{1, \ldots, 4N(1+\epsilon)\}$. The variable denoted by $\bar{a}$ takes the value opposite to a.

Protocol 1 (BB84): (1) Alice creates random bit sequences $\{a_i\}$ and $\{g_i\}$ for $i \in \Omega$. Alice randomly chooses a subset $R \subset \Omega$ with size $|R| = 2N(1+\epsilon)$. (2) Bob creates a random bit sequence $\{b_i\}$. (3) When $i \in R$, Alice sends $\rho(\bar{a}_i, g_i)$. When $i \in \bar{R}$ $(= \Omega - R)$, Alice sends $\rho(a_i, g_i)$. (4) Bob measures $\sigma_z$ when $b_i = 0$ and measures $\sigma_x$ when $b_i = 1$. For either case, he sets bit $h_i$ according to the outcome ($h_i = 0$ for outcome 1 and $h_i = 1$ for outcome $-1$). (5) Bob announces $\{b_i\}$. Alice announces $\{a_i\}$ and R. If the size of $T = \{i \in \bar{R} \mid a_i = b_i\}$ is less than N, the protocol aborts. Bob decides randomly on a subset $S \subset \{i \in R \mid \bar{a}_i = b_i\}$ with $|S| = N$ and announces (if he cannot do this, the protocol aborts). (6) Alice and Bob compare $g_i$ and $h_i$ for $i \in T$ and determine the error rate $\delta$. If $\delta$ is too large, the protocol aborts. (7) Bob randomizes the positions of the N qubits in S by a permutation $\pi$ and announces $\pi$. Bob announces a linear code C with $|C| = 2^r$ that corrects $N(\delta + \epsilon)$ errors occurring in random positions with probability exponentially close to unity. (8) The sifted key $\kappa_{\rm sif}$ of length N is defined as the sequence $\{h_i\}_{i \in S}$. The final key is the coset $\kappa_{\rm sif} + C^\perp$. (9) Alice obtains $\kappa_{\rm sif}$ by applying an error correction scheme to $\{g_i\}_{i \in S}$ via encrypted communication with Bob, consuming $\tau$ bits of the previously shared secret key. Then Alice obtains the final key.
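The basis-sifting and error-estimation steps can be illustrated with a purely classical toy simulation. This sketch makes strong simplifying assumptions that are not part of the protocol as stated: the source is ideal, there is no eavesdropper, channel noise is an independent bit flip with probability `flip_prob`, and the subset R, the basis flip $\bar{a}_i$, and the code C are all omitted. The function name and parameters are hypothetical.

```python
import random


def bb84_sift(n_signals, flip_prob=0.0, seed=0):
    """Toy BB84 run: return (indices with matching bases, observed error rate).

    Simplifications (assumptions of this sketch): ideal source, no Eve,
    independent bit-flip noise, and no test/key split or error correction.
    """
    rng = random.Random(seed)
    a = [rng.randrange(2) for _ in range(n_signals)]  # Alice's basis choices
    g = [rng.randrange(2) for _ in range(n_signals)]  # Alice's key bits
    b = [rng.randrange(2) for _ in range(n_signals)]  # Bob's basis choices
    h = []
    for ai, gi, bi in zip(a, g, b):
        if ai == bi:
            # Same basis: Bob recovers the bit, up to channel noise.
            h.append(gi ^ int(rng.random() < flip_prob))
        else:
            # Conjugate basis: Bob's outcome is uniformly random.
            h.append(rng.randrange(2))
    kept = [i for i in range(n_signals) if a[i] == b[i]]
    errors = sum(g[i] != h[i] for i in kept)
    rate = errors / len(kept) if kept else 0.0
    return kept, rate
```

About half of the signals survive sifting, and the observed error rate on the kept positions estimates `flip_prob`; in the protocol this estimate plays the role of $\delta$.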

Protocol 1 is the standard BB84 protocol, except for the use of $\bar{a}_i$ in place of $a_i$ in steps (3) and (5), which we have adopted for later convenience in the proof. The random permutation $\pi$ in step (7) is redundant, since it suffices to choose the code C randomly instead of doing the permutation. In the limit of large N, the achievable r/N reaches $1 - h(\delta)$ [where $h(\delta) = -\delta \log_2 \delta - (1-\delta)\log_2(1-\delta)$], and $\tau/N$ in step (9) approaches $h(\delta)$, resulting in the rate of key generation $1 - 2h(\delta)$.
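The asymptotic rate $1 - 2h(\delta)$ is easy to evaluate numerically. The helper names below are illustrative; the formula is the one quoted in the text.

```python
import math


def binary_entropy(delta):
    """h(delta) = -delta log2(delta) - (1 - delta) log2(1 - delta), with h(0) = h(1) = 0."""
    if delta <= 0.0 or delta >= 1.0:
        return 0.0
    return -delta * math.log2(delta) - (1.0 - delta) * math.log2(1.0 - delta)


def key_rate(delta):
    """Asymptotic key generation rate 1 - 2 h(delta) per sifted bit."""
    return 1.0 - 2.0 * binary_entropy(delta)
```

The rate equals 1 at $\delta = 0$ and falls to zero near $\delta \approx 0.11$, beyond which this bound yields no key.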

Our proof uses some basic properties of (classical) error-correcting codes. The linear code C appearing in step (7) is an r-dimensional subspace of the binary vector space $F_2^N$. The code $C^\perp$ appearing in step (8) is the orthogonal complement of C, called the dual of C. We can specify a linear coding function $G : F_2^r \to F_2^N$, which assigns a distinct codeword of C to each binary sequence of length r. We have assumed in the protocol that C corrects $N(\delta + \epsilon)$ errors occurring in random positions with probability exponentially close to unity. More specifically, there exists a set of correctable errors $\mathcal{E} \subset F_2^N$ and a decoding function $f : F_2^N \to F_2^r$, satisfying

$$ f(G(y) + x) = y \qquad (2) $$

for any $y \in F_2^r$ and any $x \in \mathcal{E}$. A random error with weight at most $N(\delta + \epsilon)$ belongs to $\mathcal{E}$ with probability exponentially close to unity. The function f is not necessarily linear and may be hard to compute, but we will need only its existence for the proof of security; Bob does not compute f in the actual protocol.

What Bob actually calculates is the coset $\kappa_{\rm sif} + C^\perp$ in step (8). One way to do this is to use the function $G^T : F_2^N \to F_2^r$, which is the adjoint (matrix transpose) of G satisfying $G^T(x) \cdot y = x \cdot G(y) \pmod 2$ for any $x \in F_2^N$ and $y \in F_2^r$. Since the kernel of $G^T$ is $C^\perp$, the final key is the r-bit sequence $G^T(\kappa_{\rm sif})$. The duality between G and $G^T$ will play an important role in the proof below.
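The duality relation and the statement that the kernel of $G^T$ is $C^\perp$ can be checked on a toy code. The generator matrix below (N = 3, r = 2) is a hypothetical example chosen only for illustration; it is not a code from the paper.

```python
import itertools


def mat_vec_gf2(M, v):
    """Matrix-vector product over GF(2); M is a list of rows."""
    return [sum(m * x for m, x in zip(row, v)) % 2 for row in M]


def transpose(M):
    return [list(col) for col in zip(*M)]


def dot_gf2(u, v):
    return sum(a * b for a, b in zip(u, v)) % 2


# Hypothetical toy generator: G maps r = 2 bits to N = 3 bits, G(y) = G_MAT y.
G_MAT = [[1, 0],
         [0, 1],
         [1, 1]]
GT_MAT = transpose(G_MAT)  # G^T maps N = 3 bits to r = 2 bits


def final_key(kappa_sif):
    """The r-bit key G^T(kappa_sif), which labels the coset kappa_sif + C_perp."""
    return mat_vec_gf2(GT_MAT, kappa_sif)


def duality_holds():
    """Check G^T(x) . y == x . G(y) (mod 2) for all x in F_2^N, y in F_2^r."""
    return all(
        dot_gf2(mat_vec_gf2(GT_MAT, x), y) == dot_gf2(x, mat_vec_gf2(G_MAT, y))
        for x in itertools.product((0, 1), repeat=3)
        for y in itertools.product((0, 1), repeat=2)
    )
```

For this toy code $C^\perp = \{000, 111\}$, so the sifted keys 100 and 011 lie in the same coset and `final_key` returns the same two bits for both.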

In order to prove that protocol 1 is secure, we need to show that Eve's maximum knowledge $I_1$ about the final key is negligible. Note that Bob's final key is determined at step (8); step (9), which assures that Alice's key agrees with Bob's and leaks no information to Eve, is not relevant to $I_1$. Let us compare protocol 1 with a modified one.

Protocol 2: (3)' When $i \in R$, Alice sends $\rho(a_i, g_i)$. When $i \in \bar{R}$, Alice also sends $\rho(a_i, g_i)$. The other steps are the same as in protocol 1.

This modification follows Mayers's argument [2] except for the exchanged roles of the sender and the receiver. The only difference between protocols 1 and 2 is a flip in Alice's basis for $i \in R$. But the bits $\{g_i\}$ for $i \in R$ are kept secret by Alice. Hence, for Eve and Bob only the state averaged over $\{g_i\}$ is relevant, and this state is identical for the two protocols by the condition Eq. (1). Therefore, Eve's maximum knowledge $I_2$ about Bob's final key in protocol 2 is the same as $I_1$.

Next, let us further modify protocol 2 in favor of Eve, by allowing Eve to control Alice's source. Now Eve knows $\{a_i\}$ and $\{g_i\}$ and is free to prepare the states measured by Bob however she pleases. Since the states $\rho(a_i, g_i)$ have been removed from the protocol and Bob's measurements are symmetric in $b_i$, the protocol is completely symmetric in $\{a_i\}$ and $\{g_i\}$. Therefore we may assume $a_i = g_i = 0$ without loss of generality. The resulting protocol is as follows.

Protocol 3: (1) Alice randomly chooses a subset $R \subset \Omega$ with size $|R| = 2N(1+\epsilon)$. (2) Bob creates a random bit sequence $\{b_i\}$. (3) Eve prepares Bob's qubits and her ancilla system in a state. (4) Bob measures $\sigma_z$ when $b_i = 0$ and measures $\sigma_x$ when $b_i = 1$. For either case, he sets the bit $h_i$ according to the outcome. (5) Bob announces $\{b_i\}$. Alice announces R. If the size of $T = \{i \in \bar{R} \mid b_i = 0\}$ is less than N, the protocol aborts. Bob decides randomly on a subset $S \subset \{i \in R \mid b_i = 1\}$ with $|S| = N$ and announces (if he cannot do this, the protocol aborts). (6) Bob counts the number n of bits with $h_i = 1$ for $i \in T$ and determines the error rate $\delta = n/|T|$. If $\delta$ is too large, the protocol aborts. (7) Bob randomizes the positions of the N qubits in S by a permutation $\pi$ and announces $\pi$. Bob announces a linear code C. (8) The sifted key $\kappa_{\rm sif}$ of length N is defined as $\{h_i\}_{i \in S}$. The final key is the coset $\kappa_{\rm sif} + C^\perp$.

Since the modifications in the protocol favor Eve, Eve's maximum knowledge $I_3$ about the final key in protocol 3 is no less than $I_2$: thus $I_1 = I_2 \le I_3$. To complete the proof, we will show that $I_3$ is small: Eve cannot predict Bob's key accurately because Bob is measuring in the "wrong" basis.

Let us denote the Hilbert space of Eve's system as $\mathcal{H}_E$ and that of the N qubits belonging to S as $\mathcal{H}_S$. We may imagine that Bob's measurement on set S is delayed until step (8) and denote by $\rho$ the state over $\mathcal{H}_S \otimes \mathcal{H}_E$ after the verification test on the set T is done, but before the qubits in S are measured. The test on T finds that the rate of error ($\sigma_z = -1$) over N (or more) randomly chosen qubits is $\delta$. If the qubits in the set S were also measured in the $\sigma_z$ basis, then the joint probability of finding an error rate less than $\delta$ in T and finding more than $N(\delta + \epsilon)$ errors in S would be asymptotically less than $\exp[-\epsilon^2 N / 4(\delta - \delta^2)]$ for any strategy by Eve. Ignoring any inefficient strategy that has only an exponentially small probability of giving an error rate less than $\delta$ in T, we conclude that for the state $\rho$, the probability of finding more than $N(\delta + \epsilon)$ errors in S is exponentially small.
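To get a feel for how quickly the sampling bound decays, it can be evaluated directly. This sketch assumes the exponent is read as $\epsilon^2 N / [4(\delta - \delta^2)]$, that is, with the whole factor $4\delta(1-\delta)$ in the denominator; the function name is illustrative.

```python
import math


def sampling_bound(delta, eps, N):
    """Evaluate exp(-eps**2 * N / (4 * (delta - delta**2))).

    Assumes the denominator of the exponent is 4 * delta * (1 - delta),
    which is one reading of the expression in the text.
    """
    return math.exp(-(eps ** 2) * N / (4.0 * (delta - delta ** 2)))
```

For $\delta = 0.05$ and $\epsilon = 0.01$, the bound is about $5 \times 10^{-3}$ at $N = 10^4$ and shrinks exponentially as N grows.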

Let $\{|v\rangle_z,\ v \in F_2^N\}$ denote the "Z basis" of $\mathcal{H}_S$, where the value of the jth bit of v corresponds to the eigenvalue of $\sigma_z$ on the jth qubit, and let $\{|v\rangle_x = H_N |v\rangle_z\}$ denote the "X basis," where $H_N$ is the Hadamard transformation acting on the N qubits. The announcement of $\pi$ in step (7) can be described as the transmission from Bob to Eve of a particle J in one of N! orthogonal states $\{|\pi\rangle_J\}$. The symmetrized state held by Bob and Eve after transmission of the particle is

$$ \rho_s = (N!)^{-1} \sum_\pi |\pi\rangle_J \langle \pi| \otimes (U_\pi \otimes 1_E)\, \rho\, (U_\pi^\dagger \otimes 1_E). \qquad (3) $$

Let $P_\mathcal{E}$ be the projection of $\mathcal{H}_S$ onto the subspace spanned by the states $|e\rangle_z$ such that $e \in \mathcal{E}$. The successful verification test ensures that the probability of finding an error pattern that is not in $\mathcal{E}$ is exponentially small: $\mathrm{Tr}[(P_\mathcal{E} \otimes 1_E)\rho_s] \ge 1 - \eta$, where $\eta$ is an exponentially small number. (We are now regarding the particle J as part of Eve's system E.) If we define $\rho'$ as

$$ \rho' = \frac{(P_\mathcal{E} \otimes 1_E)\, \rho_s\, (P_\mathcal{E} \otimes 1_E)}{\mathrm{Tr}[(P_\mathcal{E} \otimes 1_E)\rho_s]}, \qquad (4) $$

its fidelity [7] to $\rho_s$, $F(\rho', \rho_s) = [\mathrm{Tr}(\sqrt{\rho'}\, \rho_s \sqrt{\rho'})^{1/2}]^2$, is given by

$$ F(\rho', \rho_s) = \mathrm{Tr}[(P_\mathcal{E} \otimes 1_E)\rho_s] \ge 1 - \eta. \qquad (5) $$

In what follows, we will show that if the state $\rho'$ instead of $\rho_s$ were used, Eve would have no information about the final key ($I_3 = 0$). Then we will infer that any actual strategy by Eve (that passes the verification test with a probability that is not exponentially small) gives her exponentially small information.

In protocol 3, Bob measures in the Z basis for the verification test, and in the X basis to generate the key; we need to show that if the error rate is low in the Z basis, then the key is random and private. Our proof invokes a quantum circuit that outputs the same r-bit final key as Bob finds in protocol 3, and that also expunges Eve's entanglement with the key bits. Though Bob might not have actually executed this circuit, it would be all the same to Eve if he had, which is sufficient to ensure privacy.

The circuit, shown in Fig. 1, uses an auxiliary system Q of r qubits initially prepared in the state $|0\rangle_x$ and is a composition $U = U_2 U_1$ of two unitary operators $U_1$ and $U_2$. The operator $U_1$, which calculates the final key, acts in the X basis as

$$ U_1 : |x\rangle_x \otimes |y\rangle_x \to |x\rangle_x \otimes |y + G^T(x)\rangle_x. \qquad (6) $$

Using the duality between G and $G^T$, we easily see that $U_1$ acts in the Z basis as

$$ U_1 : |x\rangle_z \otimes |y\rangle_z \to |x + G(y)\rangle_z \otimes |y\rangle_z. \qquad (7) $$

The operator $U_2$ is defined in the Z basis as

$$ U_2 : |x\rangle_z \otimes |y\rangle_z \to |x\rangle_z \otimes |y + f(x)\rangle_z, \qquad (8) $$

and in the X basis acts as

$$ U_2 : |x\rangle_x \otimes |y\rangle_x \to |\Psi_{x,y}\rangle \otimes |y\rangle_x. \qquad (9) $$

FIG. 1. (a) A quantum circuit calculating $\kappa_{\rm fin} = G^T(\kappa_{\rm sif})$ in the X basis. (b) The same circuit in the Z basis. When $x \in \mathcal{E}$, the final state of system Q is $|0\rangle_z$.
Here $|\Psi_{x,y}\rangle$ is a rather complicated state of $\mathcal{H}_S$, but its exact form is not relevant here.
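The claim that the same unitary $U_1$ adds $G(y)$ to the first register in the Z basis and adds $G^T(x)$ to the second register in the X basis can be verified numerically for a small case. The sketch below uses NumPy and a hypothetical toy generator matrix with N = 3 and r = 2 (an assumption for illustration, not a code from the paper). It builds $U_1$ as a permutation matrix in the Z basis, conjugates by Hadamards on all qubits, and compares with the permutation expected in the X basis.

```python
import itertools
import numpy as np

N, r = 3, 2
# Hypothetical toy generator matrix (N x r), used only for illustration.
G_MAT = np.array([[1, 0],
                  [0, 1],
                  [1, 1]])


def index(bits):
    """Integer index of a bit tuple, most significant bit first."""
    i = 0
    for b in bits:
        i = 2 * i + int(b)
    return i


dim = 2 ** (N + r)

# U_1 in the Z basis: |x>|y> -> |x + G(y)>|y>.
U1_z = np.zeros((dim, dim))
for x in itertools.product((0, 1), repeat=N):
    for y in itertools.product((0, 1), repeat=r):
        x_out = (np.array(x) + G_MAT @ np.array(y)) % 2
        U1_z[index(tuple(x_out) + y), index(x + y)] = 1.0

# Hadamard on every qubit; |v>_x = H_all |v>_z.
H1 = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2.0)
H_all = np.array([[1.0]])
for _ in range(N + r):
    H_all = np.kron(H_all, H1)

# Matrix elements of the same operator between X-basis states.
U1_x = H_all @ U1_z @ H_all

# Expected action in the X basis: |x>|y> -> |x>|y + G^T(x)>.
expected = np.zeros((dim, dim))
for x in itertools.product((0, 1), repeat=N):
    for y in itertools.product((0, 1), repeat=r):
        y_out = (np.array(y) + G_MAT.T @ np.array(x)) % 2
        expected[index(x + tuple(y_out)), index(x + y)] = 1.0

duality_verified = bool(np.allclose(U1_x, expected))
```

Here `duality_verified` comes out `True`, which is the matrix form of the familiar fact that conjugating a CNOT by Hadamards exchanges control and target.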

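If the initial state of the ancilla Q is $|0\rangle_x$, then from Eqs. (6) and (9) we have

$$ U(|\kappa_{\rm sif}\rangle_x \otimes |0\rangle_x) = |\Psi_{\kappa_{\rm sif},\kappa_{\rm fin}}\rangle \otimes |G^T(\kappa_{\rm sif})\rangle_x; \qquad (10) $$

the final key $\kappa_{\rm fin} = G^T(\kappa_{\rm sif})$ is obtained by measuring the system Q in the X basis after execution of the circuit. Meanwhile, Eqs. (7) and (8) with $|0\rangle_x = 2^{-r/2} \sum_y |y\rangle_z$ lead to

$$ U(|x\rangle_z \otimes |0\rangle_x) = 2^{-r/2} \sum_y |x + G(y)\rangle_z \otimes |y + f(x + G(y))\rangle_z; \qquad (11) $$

if $x \in \mathcal{E}$, the final state of Q is $|0\rangle_z$, due to Eq. (2). Then, Eq. (4) ensures that if the initial state of $\mathcal{H}_S \otimes \mathcal{H}_E$ is $\rho'$, the final marginal state of Q is still $|0\rangle_z$. Therefore, the final state of Q obtained when we start from the actual state $\rho_s$ is exponentially close to $|0\rangle_z$:

$$ {}_z\langle 0|\rho_Q|0\rangle_z \ge F(\rho', \rho_s) \ge 1 - \eta. \qquad (12) $$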

Equation (12) establishes that the final key can be obtained from a complete X-basis measurement on the state $\rho_Q$, whose fidelity to the Z-basis eigenstate $|0\rangle_z$ is exponentially close to unity. From this, we conclude the following: (a) The mutual information $I_3$ between the final key and Eve, who may conduct any measurement on her system, is upper bounded by the von Neumann entropy $S(\rho_Q)$ [8]. Since $\rho_Q$ has an eigenvalue greater than or equal to $1 - \eta$, we have $I_1 \le I_3 \le S(\rho_Q) \le h(\eta) + \eta \log_2(2^r - 1) < h(\eta) + r\eta$. (b) The probability distribution $p_y$ over the $2^r$ final keys is very close to uniform. In fact, the fidelity to the uniform distribution cannot be lower than the fidelity in Eq. (12). Thus we have $2^{-r}(\sum_y \sqrt{p_y})^2 \ge 1 - \eta$. Using the inequality $-2\log_2 x \le r(1-x)^2 + (1-x^2)/(\ln 2)$, which holds for $x \ge 2^{-r/2}$ when r is large, the Shannon entropy of $\{p_y\}$ is bounded as $H(\{p_y\}) = r + 2\sum_y p_y \log_2 (2^r p_y)^{-1/2} \ge r(2\sqrt{1-\eta} - 1) \ge r(1 - 2\eta)$.
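The entropy bound in (b) can be unpacked step by step. The following is a reconstruction, assuming the inequality is applied with $x_y = (2^r p_y)^{-1/2}$ and that every $p_y > 0$; this substitution is our reading of the argument and is not spelled out in the text.

```latex
% Assumption: x_y \equiv (2^r p_y)^{-1/2}, so that x_y \ge 2^{-r/2} because p_y \le 1.
\begin{align*}
H(\{p_y\}) &= r + \sum_y p_y \, 2\log_2 x_y
   \;\ge\; r - \sum_y p_y \Big[ r(1-x_y)^2 + \frac{1-x_y^2}{\ln 2} \Big], \\
\sum_y p_y \,(1-x_y^2) &= \sum_y p_y - \sum_y 2^{-r} = 1 - 1 = 0, \\
\sum_y p_y \,(1-x_y)^2 &= 1 - 2 \cdot 2^{-r/2} \sum_y \sqrt{p_y} + 1
   \;\le\; 2 - 2\sqrt{1-\eta}, \\
H(\{p_y\}) &\ge r\big( 2\sqrt{1-\eta} - 1 \big) \;\ge\; r(1 - 2\eta).
\end{align*}
```

The third line uses $2^{-r}(\sum_y \sqrt{p_y})^2 \ge 1 - \eta$, and the last step uses $\sqrt{1-\eta} \ge 1 - \eta$ for $0 \le \eta \le 1$.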

The two imperfections of the final key derived in (a) and (b) can be combined into a single parameter by the following argument. Let us assume that Bob randomly chooses and announces a bit sequence $w \in F_2^r$ and produces a new key w + y which is truly uniformly distributed. If Eve's information about y is $I_1$, then her information about w + y is

$$ I = r - H(\{p_y\}) + I_1 \le 3r\eta + h(\eta), \qquad (13) $$

which is also exponentially small, concluding the proof.

Finally, suppose that Bob uses a detector with imperfect efficiency, which has a "null" outcome (signifying a detection failure) in addition to the valid binary outcome. Our proof remains valid, provided that the efficiency (probability of obtaining a valid outcome) is the same for the two bases, and the size of $\Omega$ is increased.


Our proof of security applies to an uncharacterized source with basis-independent averaged states. By interchanging the roles of sender and receiver, the same proof can be applied to the case of an uncharacterized detector, considered by Mayers [2]. Indeed, in that case our proof allows a more general source (one triggered by a perfect measurement on half of an entangled state, as opposed to a perfect source) and a higher rate of key generation [$1 - 2h(\delta)$ rather than $1 - h(\delta) - h(2\delta)$] than established by Mayers. In either case, by exploiting the duality between the operation that encodes a message using C and the operation that computes a $C^\perp$ coset, our proof illuminates the connection between a low error rate and successful privacy amplification.
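The two rate expressions can be compared by locating the error rate at which each one reaches zero. The sketch below uses simple bisection; the function names and the search interval are choices made for this illustration.

```python
import math


def binary_entropy(delta):
    """h(delta) = -delta log2(delta) - (1 - delta) log2(1 - delta)."""
    if delta <= 0.0 or delta >= 1.0:
        return 0.0
    return -delta * math.log2(delta) - (1.0 - delta) * math.log2(1.0 - delta)


def rate_this_proof(delta):
    return 1.0 - 2.0 * binary_entropy(delta)


def rate_mayers(delta):
    return 1.0 - binary_entropy(delta) - binary_entropy(2.0 * delta)


def zero_crossing(rate, lo=1e-6, hi=0.25, iters=60):
    """Bisection for the error rate where `rate` changes sign.

    Assumes rate(lo) > 0 > rate(hi) and that rate decreases on [lo, hi].
    """
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if rate(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

With these definitions, `zero_crossing(rate_this_proof)` is close to 0.11, while `zero_crossing(rate_mayers)` is close to 0.076, so the improved rate tolerates a noticeably higher error rate.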

It is also interesting to consider characterized imperfect sources and detectors that have limited basis-dependent flaws. One important case of a characterized defective source, recently analyzed in [9], is a source that occasionally emits two identical copies of a qubit, one of which can be intercepted by Eve. In this case, our proof does not apply because Eq. (1) is not satisfied. Security criteria for characterized sources and detectors are further discussed in [10].
Optimal sizes of dielectric microspheres for cavity QED with strong coupling

J.  R.  Buck  and  H.  J.  Kimble 

Norman  Bridge  Laboratory  of  Physics  12-33,  California  Institute  of  Technology,  Pasadena,  California  91125 
(Received  18  October  2002;  published  21  March  2003) 

The whispering gallery modes (WGMs) of quartz microspheres are investigated for the purpose of strong coupling between single photons and atoms in cavity quantum electrodynamics (cavity QED). Within our current understanding of the loss mechanisms of the WGMs, the saturation photon number $n_0$ and critical atom number $N_0$ cannot be minimized simultaneously, so that an "optimal" sphere size is taken to be the radius for which the geometric mean $\sqrt{n_0 N_0}$ is minimized. While a general treatment is given for the dimensionless parameters used to characterize the atom-cavity system, detailed consideration is given to the $D_2$ transition in atomic cesium at $\lambda_0 = 852$ nm using fused-silica microspheres, for which the maximum coupling coefficient $g_a/(2\pi) \approx 750$ MHz occurs for a sphere radius a = 3.63 μm corresponding to the minimum for $n_0 \approx 6.06 \times 10^{-6}$. By contrast, the minimum for $N_0 \approx 9.00 \times 10^{-6}$ occurs for a sphere radius of a = 8.12 μm, while the optimal sphere size for which $\sqrt{n_0 N_0}$ is minimized occurs at a = 7.83 μm. On an experimental front, we have fabricated fused-silica microspheres with radii a ~ 10 μm and consistently observed quality factors $Q \ge 0.8 \times 10^7$. These results for the WGMs are compared with corresponding parameters achieved in Fabry-Perot cavities to demonstrate the significant potential of microspheres as a tool for cavity QED with strong coupling.
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I.  INTRODUCTION 

Motivated by the pioneering work of Braginsky and
Ilchenko [1], some of the highest-quality optical resonators
to date have been achieved with the whispering gallery
modes (WGMs) of quartz microspheres [2,3]. Over the
wavelength range 630-850 nm, quality factors Q ≈ 8 × 10^9
have been realized, and cavity finesse F = 2.3 × 10^6
demonstrated [2,3]. Such high quality factors make the WGMs of
small dielectric spheres a natural candidate for use in cavity
QED [1,4-20].

While much of the work regarding quartz microspheres
has centered around achieving the ultimate quality factors
[2,3], the quality factor of the resonator is only one of the factors
that determines the suitability of the WGMs for investigations
of cavity quantum electrodynamics in a regime of
strong coupling. In this case, the coherent coupling coefficient
g for a single atom interacting with the cavity mode
must be much larger than all other dissipative rates, including
the cavity decay rate κ and the rate of atomic spontaneous
emission γ; namely, g ≫ (κ, γ). Note that 2g = Ω gives
the Rabi frequency associated with a single quantum of excitation
shared by the atom-cavity system [21,22]. The atom-field
interaction can be characterized by two important dimensionless
parameters: the saturation photon number n0 ∝ γ²/g²
and the critical atom number N0 ∝ κγ/g². Since
these parameters correspond, respectively, to the number of
photons required to saturate an intracavity atom and the
number of atoms required to have an appreciable effect on
the intracavity field, strong coupling requires that (n0, N0)
≪ 1. Ideally one would hope to minimize both of these
parameters in any particular resonator. Unfortunately, within
the context of our current understanding of the loss mechanisms
of the WGMs [3], the critical parameters (n0, N0) cannot
be minimized simultaneously in a microsphere.

Motivated by these considerations, in this paper we explore
possible limits for the critical parameters (n0, N0) for
the WGMs of quartz microspheres. Following the analysis of
Refs. [4,5,18], we study the particular case of a single atom
coupled to the external field of a WGM near the sphere’s
surface. We show that there are radii that minimize (n0, N0)
individually, and that there is an “optimal” sphere size that
minimizes the geometric mean, √(n0 × N0), of these two
cavity-QED parameters and allows both parameters to be
near their respective minima. We also report our progress in
the fabrication of small microspheres with radii a ~ 10 μm,
and compare our experimental results for Q with those from
our theoretical analysis. Finally, we present a detailed comparison
of the state of the art and future prospects for
achieving strong coupling in cavity QED for both microsphere
and Fabry-Perot cavities. Throughout the presentation,
we attempt to develop a general formalism that can be
applied to diverse systems. However, for definiteness we also
present results for a particular system of some interest,
namely, an individual cesium atom coupled to the WGMs of
quartz microspheres.


II.  MODES  OF  A  MICROSPHERE 

Solving for the mode structure of the resonances of a
dielectric sphere in vacuum is a classic problem in electricity
and magnetism, and the resulting field distributions have
been known for some time [23]. The electric fields of the TM
(electric-type) modes inside and outside a sphere of refractive
index n at free-space wavelength λ0 are, respectively,

FIG. 1. The magnitude of the normalized mode function as a
function of radius for the TM mode of a 5.305 μm radius sphere
(p = 1, l = m = 50) with θ = π/2 and φ = 0 for a wavelength of
λ0 = 852.359 nm and index of refraction n = 1.452 46. In our case, the
function is normalized to have a maximum value of unity. Note that
there is a discontinuity at the surface.

Throughout this paper, we normalize the mode functions
such that their maximum value is unity. This condition then
yields for the l = m modes of the sphere

where a is the radius of the sphere, k = 2πn/λ0 is the wave
vector inside the sphere, j_l(x) is the spherical Bessel function,
h_l^(1)(x) is the spherical Hankel function, (r̂, θ̂, φ̂) are
unit vectors, and the ′ refers to differentiation with respect to
the argument. Note that the TM modes have a predominantly
radial electric-field vector.

In order to satisfy the boundary conditions at the surface
of the microsphere, the tangential components of the mode
function immediately inside and outside the sphere must be
equal. However, there is a discontinuity in the radial component
of the electric field at the dielectric boundary (as can be
seen from Fig. 1). The eigenmodes are determined by solving
for the roots of a characteristic equation [23], which can
be reduced to

and N is the normalization factor. Because we will require
the field outside the sphere to be as large as possible, we will
choose the p = 1 modes. Also, because the coherent coupling
constant g ∝ 1/√(V_P), where V_P is the cavity mode volume, we
choose the l = m modes, since they yield the smallest
electromagnetic mode volume, as will be explained in the
following section.


III.  ELECTROMAGNETIC  MODE  VOLUME 

The effective mode volume V_P associated with the
electromagnetic field distribution Ψ(r, θ, φ) [4] is given by

$$ \epsilon(r) = \begin{cases} n^2 & \text{if } r < a \\ 1 & \text{if } r > a \end{cases} \qquad (10) $$

and P corresponds to the (p, l, m) mode. V_Q is the quantization
volume discussed in Ref. [4]. As long as a radius r_Q is
chosen large enough to include the effects of the evanescent
field, the mode volume is relatively insensitive to the particular
choice of quantization radius [24]. As discussed more
extensively in Refs. [21,22], the interaction between the
internal atomic degrees of freedom and the intracavity field is
characterized by the coherent coupling constant g(r, θ, φ),
where

$$ g(r,\theta,\phi) = g_0\, \Psi^{(p,l,m)}(r,\theta,\phi) \qquad (11) $$

and

$$ g_0 \propto \frac{1}{\sqrt{V_P}}. \qquad (12) $$

Note that in the absence of damping, 2g(r) gives the
frequency for Rabi nutation associated with a single photon in
the cavity for an atom initially in the ground state located at
position r within the mode. Therefore, in order to maximize
the coupling strength, one must endeavor to minimize the
cavity mode volume.

In order to derive an answer that can be applied to different
wavelengths, one can define a dimensionless mode volume
parameter V and plot it as a function of a dimensionless
sphere size parameter x, defined as

$$ V = V_P \left( \frac{2\pi n}{\lambda_0} \right)^{3} \qquad (13) $$

and

$$ x = \frac{2\pi n a}{\lambda_0}, \qquad (14) $$

where V_P is the cavity mode volume, n is the index of
refraction at the free-space wavelength λ0, and a is the sphere
radius. The plots then only depend on the index of refraction
(see Fig. 2).
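The conversion between the dimensionless quantities (V, x) and the physical mode volume and radius can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function names are hypothetical, and the scaling V = V_P (2πn/λ0)³ is an assumption, adopted here because it reproduces the fused-silica minimum of about 28.4 μm³ from the dimensionless minimum V = 34 883.4 quoted in Fig. 2(a).

```python
import math

def length_unit_um(wavelength_um: float, n: float) -> float:
    """Length scale lambda0 / (2*pi*n) implied by x = 2*pi*n*a / lambda0."""
    return wavelength_um / (2.0 * math.pi * n)

def radius_from_x(x: float, wavelength_um: float, n: float) -> float:
    """Invert the size parameter x to a sphere radius in micrometers."""
    return x * length_unit_um(wavelength_um, n)

def mode_volume_from_v(v: float, wavelength_um: float, n: float) -> float:
    """Convert the dimensionless volume V to V_P in cubic micrometers.

    ASSUMPTION: V = V_P * (2*pi*n / lambda0)**3.
    """
    return v * length_unit_um(wavelength_um, n) ** 3

# Fused silica at the cesium D2 line, using the minimum quoted in Fig. 2(a).
a_um = radius_from_x(39.9469, 0.852359, 1.45246)
vp_um3 = mode_volume_from_v(34883.4, 0.852359, 1.45246)
print(f"a = {a_um:.3f} um, V_P = {vp_um3:.2f} um^3")
```

Because both conversions depend only on λ0/(2πn), the same two helpers apply to any wavelength and index once the dimensionless curves are known.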

Naively, one might assume that the sphere should be
made as small as possible in order to minimize the
electromagnetic mode volume, and hence to provide a maximum
for g0 and hence globally for g(r). However, as shown in
Figs. 2 and 3, the mode volume for the TM modes of a
quartz microsphere actually passes through a minimum at
some particular radius a0. This behavior can be understood
by noting that for a < a0, the intrinsic, radiative losses are
increasing rapidly and ultimately cause the mode to no
longer be well confined by the sphere, with a concomitant
increase of the mode volume. Note that in Fig. 2 and
subsequent figures, we give results for n = 1.45 corresponding to
fused silica, as well as for n = 2.00 and n = 3.00. These latter
cases serve to illuminate the role of n as well as being
applicable to other materials (e.g., the index of refraction for GaAs
is n = 3.4 for λ = 1550 nm [25]). For a very low-OH fused
silica microsphere at λ0 = 852 nm (the wavelength of the D2
transition in atomic cesium) with index of refraction n
= 1.452 46, the minimum mode volume V_P^min ≈ 28.4 μm³
occurs for radius a ≈ 3.73 μm corresponding to mode numbers
p = 1, l = m = 34 (see Fig. 3). One might at first believe that
this value for the radius represents the optimal sphere size for
use as a cavity with single atoms. However, while the mode
volume V_P plays an important role in determining the
coupling constant [Eq. (12)], it is not the only parameter relevant
to cavity QED with single atoms in a regime of strong
coupling. As discussed in the following sections, the quality
factor Q of a WGM has a strong dependence on the sphere
radius, and must also be considered in an attempt to optimize
the critical atom and saturation photon numbers.

FIG. 2. (a) The dimensionless volume parameter V [defined by
Eq. (13)], as a function of the dimensionless size parameter x
[defined by Eq. (14)]. The solid line is for an index of refraction n
= 1.452 46, the index of refraction for fused silica at λ0
= 852 nm, with a minimum of V = 34 883.4 for x = 39.946 9 (l
= m = 34). The dotted line is for an index of refraction n = 2.00,
with a minimum of V = 15 596.2 for x = 18.986 4 (l = m = 14). The
dashed line is for an index of refraction n = 3.00, with a minimum
of V = 11 546.4 for x = 10.274 8 (l = m = 6). (b) Because the index
of refraction for fused silica varies from n = 1.444 at λ0
= 1550 nm to n = 1.458 for λ0 = 600 nm (see Fig. 4), this plot of
the dimensionless volume parameter V as a function of the
dimensionless size parameter x is made for that range of values. The solid
line is for an index of refraction n = 1.44, with a minimum of V
= 36 247.5 for x = 40.981 2 (l = m = 35). The dotted line is for an
index of refraction n = 1.45, with a minimum of V = 35 161.1 for
x = 41.003 6 (l = m = 35). The dashed line is for an index of
refraction n = 1.46, with a minimum of V = 34 129.1 for x = 39.963 1
(l = m = 34).

FIG. 3. The electromagnetic mode volume V_P for the TM
modes of a very low-OH fused silica microsphere as a function of
sphere radius at the wavelength λ0 = 852 nm for the D2 line of
atomic cesium. The minimum, 28.4 μm³, occurs for radius a0
≈ 3.73 μm corresponding to mode numbers p = 1 and l = m = 34.

IV.  LOSSES  IN  DIELECTRIC  SPHERES 

For fused silica spheres with radius a ≳ 15 μm, the effect
of intrinsic radiative losses can be safely neglected, since
they allow quality factor Q ≳ 10^21, as illustrated in Fig. 5.
Such large values of Q greatly exceed those imposed by
technical constraints of material properties, such as bulk
absorption and surface scattering.

However, as one moves to very small spheres with radius
a ≲ 10 μm, the intrinsic radiative Q falls steeply enough to
become the dominant loss mechanism even in the face of
other technical imperfections. When assessing the usefulness
of microspheres for cavity QED, one must account for the
entire set of loss mechanisms to determine the optimal size
for the microsphere, which is the subject to which we now
turn our attention.

The quality factors of the WGMs of fused silica microspheres
are determined by several different loss mechanisms.


FIG.  4.  The  index  of  refraction  of  very  low-OH  fused  silica  as  a 
function  of  wavelength. 


FIG. 5. (a) Semilog plot of the radiative quality factor Q_rad for
various indices of refraction as a function of the dimensionless size
parameter x = 2πna/λ0. (b) Semilog plot of the radiative quality
factor Q_rad as a function of sphere radius for a wavelength of λ0
= 852.359 nm (index of refraction is n = 1.452 46).

The overall quality factor can then be calculated by adding
the different contributions in the following way [2]:

$$ Q^{-1} = Q_{\rm rad}^{-1} + Q_{\rm mat}^{-1}, \qquad (15) $$

$$ Q_{\rm mat}^{-1} = Q_{\rm s.s.}^{-1} + Q_{w}^{-1} + Q_{\rm bulk}^{-1}, \qquad (16) $$

where Q_rad is due to purely radiative losses for an ideal
dielectric sphere and Q_mat results from nonideal material
properties. The principal mechanisms contributing to Q_mat are
scattering losses from residual surface inhomogeneities
(Q_s.s.), absorption losses due to water on the surface of the
sphere (Q_w), and bulk absorption in the fused silica (Q_bulk).
The intrinsic material losses are known very accurately, since
they arise from absorption in the material at the wavelength
of concern [26]. Considerably greater uncertainty is associated
with the losses due to surface scattering and absorption
due to adsorbed material on the surface of the sphere, of
which water is likely the principal component. We will adopt
the models for these losses presented in Refs. [2,3], extrapolated
to the regime of small spheres of interest here.

A.  Intrinsic  radiative  losses 

The contribution to the quality factor for purely radiative
effects Q_rad can be derived by following the arguments
presented in Ref. [27]. These losses are due to the leakage of
light from the resonator due to its finite dielectric constant
and radius of curvature. The results can then be compared to
numerical results obtained by Lorenz-Mie theory [28]. We
find from Ref. [27] that

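$$ Q_{\rm rad} = \tfrac{1}{2}\left(l + \tfrac{1}{2}\right) n^{1-2b} \left(n^2 - 1\right)^{1/2} e^{2T_l}, \qquad (17) $$

where

$$ T_l = \left(l + \tfrac{1}{2}\right)\left(\eta_l - \tanh \eta_l\right), \qquad (18) $$

$$ \eta_l = \operatorname{arccosh}\left\{ n \left[ 1 - \frac{1}{l + \tfrac{1}{2}} \left( t_p^0\, \xi + \frac{n^{1-2b}}{\sqrt{n^2 - 1}} \right) \right]^{-1} \right\}, \qquad (19) $$

$$ \xi = \left[ \tfrac{1}{2}\left(l + \tfrac{1}{2}\right) \right]^{1/3}, \qquad (20) $$

and

$$ b = \begin{cases} 0 & \text{TE modes} \\ 1 & \text{TM modes.} \end{cases} \qquad (21) $$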


Also, n is the index of refraction and t_p^0 is the pth zero of the
Airy function Ai. This p corresponds to the mode number
(p, l, m). In our case, we are only interested in the p = 1
modes of the sphere to maximize the electromagnetic field
outside the sphere while maintaining a small mode volume.
Note that these expressions for Q_rad become invalid in the
limit of small l mode numbers. The error in the mode
functions used to derive these results reaches 1% for l = 18.
However, the error is less than 0.2% for l = 76 (this is the
optimal sphere size discussed in Sec. VI). Fortunately, the
expressions are valid in the regimes with which we are
concerned. This has been confirmed by making comparisons
with numerical values obtained using Lorenz-Mie scattering
theory.

From Fig. 5, we see that the radiative Q falls approximately
exponentially as the radius a is decreased, and can
become quite important as the sphere size is decreased below
10 μm. For example, for a 15 μm radius sphere and a
wavelength λ0 = 852.359 nm, Q_rad ≈ 2 × 10^21. Therefore, the net
quality factor would most certainly be dominated by other
loss mechanisms in Eq. (15). However, for a 7 μm radius
sphere, Q_rad ≈ 4 × 10^8, and the radiative losses can play a
crucial role in the characteristics of the spheres that are
optimal for use in cavity QED.
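As a rough numerical illustration of how steeply the radiative quality factor depends on mode number, the asymptotic expressions of Eqs. (17)-(21) can be evaluated directly. The sketch below is not the authors' code: the function name `q_rad` is hypothetical, and the formulas are written out under the assumption that they take the standard asymptotic form of Ref. [27], with Q_rad = ½(l + ½) n^(1-2b) (n² - 1)^(1/2) exp(2T_l), b = 1 for TM modes, and the first (negative) zero of the Airy function for p = 1. The mode number l is supplied by hand; relating l to a sphere radius requires solving the characteristic equation, which is not attempted here.

```python
import math

# First zero of the Airy function Ai(t); it is negative. Used for p = 1 modes.
AIRY_ZERO_1 = -2.338107410

def q_rad(l: int, n: float, mode: str = "TM", airy_zero: float = AIRY_ZERO_1) -> float:
    """Radiative Q from the asymptotic expressions, assumed form of Eqs. (17)-(21).

    Only meaningful for large l; for very small l the arccosh argument
    drops below 1 and math.acosh raises ValueError.
    """
    b = 1 if mode == "TM" else 0                      # Eq. (21)
    nu = l + 0.5
    xi = (0.5 * nu) ** (1.0 / 3.0)                    # Eq. (20)
    p_term = n ** (1 - 2 * b) / math.sqrt(n * n - 1.0)
    eta = math.acosh(n / (1.0 - (airy_zero * xi + p_term) / nu))   # Eq. (19)
    t_l = nu * (eta - math.tanh(eta))                 # Eq. (18)
    return 0.5 * nu * n ** (1 - 2 * b) * math.sqrt(n * n - 1.0) * math.exp(2.0 * t_l)  # Eq. (17)

if __name__ == "__main__":
    n_silica = 1.45246  # fused silica near 852 nm, as quoted in the text
    for l in (40, 76, 150):
        print(f"l = {l:3d}: Q_rad ~ {q_rad(l, n_silica):.2e}")
```

The exponential factor exp(2T_l) dominates, so modest changes in l shift Q_rad by many orders of magnitude, which is the behavior shown in Fig. 5.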


B.  Material  loss  mechanisms 

The quality factor due to bulk absorption Q_bulk in fused
silica is actually known very well, since this depends only on
the absorption of the material at the wavelength of concern
[2]:

$$ Q_{\rm bulk} = \frac{2\pi n}{\alpha \lambda_0}, \qquad (22) $$

where n is the index of refraction, and α is the absorption
coefficient of the material. From Fig. 6 we see that for very
low-OH fused silica, the absorption coefficient at 852 nm is
α ≈ 4.5 × 10^-4 m^-1 [26]. This would correspond to a quality
factor of Q_bulk ≈ 2.4 × 10^10. Fused silica has a minimum in its
absorption coefficient of α ≈ 1.5 × 10^-5 m^-1 at 1550 nm,
which yields a quality factor of Q_bulk ≈ 3.8 × 10^11.

FIG. 6. The quality factor, Q_bulk, from Eq. (22) for a very
low-OH fused silica microsphere as a function of wavelength.
Because fused silica has a minimum in absorption at 1550 nm, there is
a maximum for the quality factor due to bulk absorption of Q_bulk
≈ 3.8 × 10^11. At 852 nm, the quality factor due to bulk absorption is
Q_bulk ≈ 2.4 × 10^10.

The quality factor due to surface scattering Q_s.s. and
absorption by adsorbed water Q_w has also been studied and
modeled, albeit for larger spheres with a ~ 600 μm. For
losses due to surface scattering, we follow the work of Refs.
[2,3] and take

$$ Q_{\rm s.s.} \sim \frac{3\epsilon(\epsilon + 2)^2}{(4\pi)^3 (\epsilon - 1)^{5/2}}\, \frac{\lambda_0^{7/2} (2a)^{1/2}}{(\sigma B)^2}, \qquad (23) $$

where ε = n² is the dielectric constant and σB ~ 5 nm² is an
empirical parameter determined by the size and correlation
length of the distribution of residual surface inhomogeneities.
This quantity was reported in Ref. [3] based upon
atomic force microscopy measurements of a microsphere.

The quality factor due to water adsorbed on the surface,
Q_w, is given by [3]

$$ Q_{w} \sim \sqrt{\frac{\pi}{8 n^3}}\, \frac{(2a)^{1/2}}{\delta\, \lambda_0^{1/2}\, \beta_w}, \qquad (24) $$

where δ ~ 0.2 nm is an estimated thickness for the water
layer, and β_w ~ 4.33 m^-1 is the absorption coefficient of water
at 852 nm.
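The material-loss models above can be combined numerically in a short sketch. The Python below is illustrative only: the function names are hypothetical, and the formulas are written under the assumption that Eqs. (22)-(24) read Q_bulk = 2πn/(αλ0), Q_s.s. ~ [3ε(ε+2)²/((4π)³(ε-1)^(5/2))] λ0^(7/2)(2a)^(1/2)/(σB)², and Q_w ~ √(π/(8n³)) (2a)^(1/2)/(δ λ0^(1/2) β_w), with all lengths in meters. The default parameter values are the ones quoted in the text for 852 nm.

```python
import math

def q_bulk(n: float, alpha_per_m: float, wavelength_m: float) -> float:
    """Bulk-absorption Q, assumed form of Eq. (22)."""
    return 2.0 * math.pi * n / (alpha_per_m * wavelength_m)

def q_surface_scattering(n: float, radius_m: float, wavelength_m: float,
                         sigma_b_m2: float = 5e-18) -> float:
    """Surface-scattering Q, assumed form of Eq. (23); sigma*B ~ 5 nm^2 by default."""
    eps = n * n
    prefactor = 3.0 * eps * (eps + 2.0) ** 2 / ((4.0 * math.pi) ** 3 * (eps - 1.0) ** 2.5)
    return prefactor * wavelength_m ** 3.5 * math.sqrt(2.0 * radius_m) / sigma_b_m2 ** 2

def q_water(n: float, radius_m: float, wavelength_m: float,
            delta_m: float = 0.2e-9, beta_w_per_m: float = 4.33) -> float:
    """Adsorbed-water Q, assumed form of Eq. (24)."""
    return (math.sqrt(math.pi / (8.0 * n ** 3)) * math.sqrt(2.0 * radius_m)
            / (delta_m * math.sqrt(wavelength_m) * beta_w_per_m))

def q_material(n: float, radius_m: float, wavelength_m: float, alpha_per_m: float) -> float:
    """Combine the three material contributions as inverse sums, Eq. (16)."""
    inverse = (1.0 / q_surface_scattering(n, radius_m, wavelength_m)
               + 1.0 / q_water(n, radius_m, wavelength_m)
               + 1.0 / q_bulk(n, alpha_per_m, wavelength_m))
    return 1.0 / inverse

def q_total(q_rad: float, q_mat: float) -> float:
    """Overall Q from radiative and material parts, Eq. (15)."""
    return q_rad * q_mat / (q_rad + q_mat)

if __name__ == "__main__":
    n, lam, alpha = 1.45246, 852.359e-9, 4.5e-4
    print(f"Q_bulk = {q_bulk(n, alpha, lam):.2e}")
    for a_um in (5.0, 8.0, 10.0, 15.0):
        print(f"a = {a_um:4.1f} um: Q_mat ~ {q_material(n, a_um * 1e-6, lam, alpha):.2e}")
```

Since Q_s.s. and Q_w both scale as a^(1/2), the material limit varies slowly with radius compared with the exponential variation of Q_rad.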

Combining these various results, we display in Fig. 7 a
curve for the quantity Q_mat as a function of sphere radius a
for a wavelength λ0 = 852 nm. This same figure shows the
quality factor Q_rad, set by intrinsic radiative losses [Eq.
(17)], as well as the overall quality factor Q
= Q_rad Q_mat/(Q_rad + Q_mat). From this plot, we see that the
radiative losses dominate the overall quality factor below a
radius of a ≈ 8 μm, while the losses due to material properties
are most significant for a ≳ 8 μm. Because of the
extremely steep dependence of Q_rad on sphere size, the point of
transition from material to radiative dominated loss should
be reasonably insensitive to details of the models employed
to describe the material losses. Although we focus our
attention here on the wavelength appropriate to the particular case
of the D2 transition in atomic cesium, a similar analysis
could be carried out for other wavelengths of interest using
the above formalism, as for example the 2S → 2P transition
at 1.083 μm in metastable helium.

FIG. 7. (a) Semilog plot of the quality factors due to the various
loss mechanisms discussed in Sec. IV for a very low-OH fused
silica microsphere as a function of sphere radius for the l = m, TM
modes at a wavelength of λ0 = 852 nm. In particular, traces are
shown for the quality factor due to purely radiative losses (Q_rad),
the bulk absorption of fused silica (Q_bulk), both radiative losses and
bulk absorption, the three loss mechanisms comprising Q_mat
(Q_bulk, Q_s.s., Q_w), and the predicted Q due to all four loss
mechanisms. (b) This linear plot zooms in on the region of interest at the
transition where the radiative losses become the dominant loss
mechanism. The plot contains the quality factor due to purely
radiative losses (Q_rad), the three loss mechanisms comprising Q_mat
(Q_bulk, Q_s.s., Q_w), and the predicted Q due to all four loss
mechanisms.

V.  THE  STRONG  COUPLING  REGIME 

The ultimate goal that we consider here is to employ the
WGMs of quartz microspheres as cavity modes for achieving
strong coupling to atoms within the setting of cavity QED.
The atom of choice in this paper is cesium; in particular, we
take the D2 (F = 4 → F′ = 5) transition in cesium at λ0
= 852.359 nm as an illustrative example. Such an analysis
allows a direct comparison with the state of the art in
Fabry-Perot cavities [29].

The coupling coefficient g(r) is the coupling frequency of
a single atom to a particular cavity mode and corresponds to
one-half the single-photon Rabi frequency [21,22]. For an
atom located just at the outer surface of the microsphere (i.e.,
in vacuum) and interacting with a whispering gallery mode
P = (p, l, m), the coupling coefficient is given by [4]

$$ g(a) \equiv g_a = \gamma_\perp\, |\Psi_{\rm out}(a)|\, \sqrt{\frac{V_0}{V_P}}, \qquad (25) $$

where a is the sphere radius, γ⊥/2π = 2.61 MHz is the
transverse spontaneous decay rate for our transition in cesium,
V0 = 3cλ0²/(4πγ⊥) is the effective volume of the atom for
purely radiative interactions, and V_P is the electromagnetic
mode volume of the whispering gallery mode designated by
P = (p, l, m).

Armed with a knowledge of g, we are now able to determine
certain dimensionless parameters relevant to the strong
coupling regime of cavity QED. In particular, we consider an
atom-cavity system to be in the strong coupling regime when
the single-photon Rabi frequency 2g for a single intracavity
atom dominates the cavity field decay rate κ, the atomic dipole
decay rate γ⊥, and the inverse atomic transit time T^-1
[21,22]. We will defer further discussion of T^-1; however,
this requirement relates to the need for atomic localization
[4,5]. In the strong coupling regime, important parameters
for characterizing the atom-cavity system are the two
dimensionless parameters: the saturation photon number n0 and
the critical atom number N0. The saturation photon number,
given by

$$ n_0 \equiv \frac{\gamma_\perp^2}{2 g^2}, \qquad (26) $$

corresponds to the number of photons required to saturate an
intracavity atom [21,22]. The critical atom number, defined
by

$$ N_0 \equiv \frac{2 \gamma_\perp \kappa}{g^2}, \qquad (27) $$

corresponds to the number of atoms required to have an
appreciable effect on the intracavity field [21,22]. Ideally, one
hopes to minimize simultaneously both the critical atom
number N0 and the saturation photon number n0, which
corresponds to simultaneous maxima for both g²/(κγ⊥) and
g²/γ⊥².

The saturation photon number and critical atom number
are useful because of their physical meaning. However, one
can define a new dimensionless parameter

$$ \beta \equiv \frac{8\pi^2 V_P}{3 \lambda_0^3}\, \frac{1}{|\Psi_{\rm out}(a)|^2}, \qquad (28) $$

that corresponds to the cavity mode volume in units of λ0³
weighted by the inverse of the strength of the mode function
at the atomic position. This enables the equations for the
saturation photon number and critical atom number to be
expressed as

$$ n_0 = \frac{\beta}{4 Q_{\rm atom}} \qquad (29) $$

and

$$ N_0 = \frac{\beta}{Q_{\rm cavity}}, \qquad (30) $$

where

$$ Q_{\rm atom} = \frac{\pi c}{\lambda_0 \gamma_\perp} \qquad (31) $$

and

$$ Q_{\rm cavity} = \frac{\pi c}{\lambda_0 \kappa}. \qquad (32) $$

This parameter β then also determines the coupling coefficient
in the following manner:

$$ g(a) = \sqrt{\frac{2\pi c\, \gamma_\perp}{\beta \lambda_0}}. \qquad (33) $$

Therefore, we see that one can use a single parameter β
combined with the properties of the atom to be used (λ0 and
γ⊥) and the quality factor of the resonator, Q_cavity, to determine
the three parameters (n0, N0, g0) of importance in
determining the quality of an atom-cavity system.
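The way the single parameter β fixes all three quantities can be checked with a few lines of Python. This is a hedged sketch rather than the authors' calculation: the function names are hypothetical, and the relations used are Q_atom = πc/(λ0γ⊥), g = √(2πcγ⊥/(βλ0)), n0 = β/(4Q_atom), and N0 = β/Q_cavity, which are assumed forms of Eqs. (29)-(33) chosen to be mutually consistent with n0 = γ⊥²/(2g²). The inputs are the cesium D2 values and the fused-silica minimum β = 1 632.01 quoted in the captions; the cavity Q passed to `critical_atom_number` is an arbitrary illustrative number.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def q_atom(wavelength_m: float, gamma_perp: float) -> float:
    """Quality factor of the atomic resonance, assumed form of Eq. (31)."""
    return math.pi * C / (wavelength_m * gamma_perp)

def coupling_from_beta(beta: float, wavelength_m: float, gamma_perp: float) -> float:
    """Coupling coefficient g in rad/s, assumed form of Eq. (33)."""
    return math.sqrt(2.0 * math.pi * C * gamma_perp / (beta * wavelength_m))

def saturation_photon_number(beta: float, wavelength_m: float, gamma_perp: float) -> float:
    """n0 = beta / (4 Q_atom), assumed form of Eq. (29)."""
    return beta / (4.0 * q_atom(wavelength_m, gamma_perp))

def critical_atom_number(beta: float, q_cavity: float) -> float:
    """N0 = beta / Q_cavity, assumed form of Eq. (30)."""
    return beta / q_cavity

# Cesium D2 line and the minimum of beta quoted for fused silica.
lam = 852.359e-9                  # m
gamma = 2.0 * math.pi * 2.61e6    # rad/s
beta_min = 1632.01

g = coupling_from_beta(beta_min, lam, gamma)
g_mhz = g / (2.0 * math.pi) / 1e6
n0 = saturation_photon_number(beta_min, lam, gamma)
print(f"Q_atom = {q_atom(lam, gamma):.4e}")
print(f"g/2pi  = {g_mhz:.2f} MHz")
print(f"n0     = {n0:.4e}")
print(f"N0 for an illustrative Q_cavity of 1e9: {critical_atom_number(beta_min, 1e9):.3e}")
```

Only N0 depends on the resonator losses; g and n0 follow from β and the atomic constants alone, which is why the minima of n0 and N0 need not coincide.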

Figures 8 and 9 are plots of this dimensionless parameter
β and of 1/√β as functions of the dimensionless size parameter
x = 2πna/λ0 for a few values of index of refraction.
Because the index of refraction for fused silica varies from
n = 1.444 at λ0 = 1550 nm to n = 1.458 for λ0 = 600 nm (see
Fig. 4), Figs. 8(b) and 9(b) are made for that range of values.
From Figs. 8 and 9 one sees that there is a minimum for β
and a maximum for 1/√β that depends on the index of
refraction.


VI.  STRONG  COUPLING  WITH  CESIUM 

The results of the previous section can now be used to
determine the saturation photon number n0, the critical atom
number N0, and the coupling coefficient g(a) for any atomic
transition. In our case, we are concerned with the D2 transition
in cesium (λ0 = 852.359 nm). For this transition, the
spontaneous transverse decay rate is γ⊥/2π = 2.61 MHz.
Also, at this wavelength the index of refraction for fused
silica is n = 1.452 46. This allows one to compute the coupling
coefficient, g(a) = √(2πcγ⊥/(βλ0)). Figure 10 shows
that there is a maximum of g/2π = 749.986 MHz for a radius
a = 3.63 μm (l = m = 33). Interestingly, because we are
restricted to having the atom couple to the external field of the
microsphere, the maximum in the coupling coefficient g(a)
does not coincide with the minimum for the mode volume,
V_P (see Figs. 3 and 10).

The saturation photon number n0 is proportional to the
dimensionless parameter β as shown in Eq. (29). Since the
factor of proportionality is a constant that depends only on
the properties of the particular atom of concern, the curve is
determined by that of β along with the quality factor of the
atomic resonance (in our case cesium), which is given by Eq.
(31) to be Q_atom = 6.738 × 10^7. Figure 11 is a plot of the
saturation photon number for the D2 transition in cesium as a
function of sphere size. Figure 11 shows that there is a minimum
for the saturation photon number of n0 = 6.055 27
× 10^-6 for a sphere radius of a = 3.631 63 μm (l = m
= 33).

FIG. 8. (a) The dimensionless parameter β as a function of the
dimensionless size parameter x = 2πna/λ0. For an index of refraction
n = 1.452 46 (i.e., the index of refraction for fused silica at λ0
= 852 nm), there is a minimum of β = 1 632.01 for x = 38.883 3
(l = m = 33). For an index of refraction n = 2.00, there is a minimum
of β = 221.124 for x = 17.876 3 (l = m = 13). For an index of
refraction n = 3.00, there is a minimum of β = 45.374 4 for x
= 10.274 8 (l = m = 6). (b) Because the index of refraction for
fused silica varies from n = 1.444 at λ0 = 1550 nm to n = 1.458 for
λ0 = 600 nm (see Fig. 4), this plot is made for that range of values.
For an index of refraction n = 1.44, there is a minimum of β
= 1 753.92 for x = 39.918 8 (l = m = 34). For an index of refraction
n = 1.45, there is a minimum of β = 1 653.7 for x = 38.877 8 (l
= m = 33). For an index of refraction n = 1.46, there is a minimum
of β = 1 561.45 for x = 37.834 8 (l = m = 32).

The  critical  atom  number  N{)  is  also  proportional  to  the 
dimensionless  parameter  f3  as  shown  in  Eq.  (30).  However, 
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FIG.  9.  (a)  The  dimensionless  parameter  l/V/3  as  a  function  of 
the  dimensionless  size  parameter  x  =  2i Tna/X0.  For  an  index  of 
refraction  n  —  1.452  46  (i.e.,  the  index  of  refraction  for  fused  silica 
at  X0  =  852  nm),  there  is  a  maximum  of  1/^/3  =  0.024  753  6  for  x 
=  38.883  3,  (l  =  m  =  33).  For  an  index  of  refraction  n  =  2.00,  there 
is  a  maximum  of  l/V/3  =  0.067  248  4  for  v=  17.876  3,  ( l  =  m 
=  13).  For  an  index  of  refraction  n  =  3.00,  there  is  a  maximum  of 
1/V^=  0.148  455  for  1=10.274  8,  (/  =  m  =  6).  (b)  Because  the  in¬ 
dex  of  refraction  for  fused  silica  varies  from  n=  1.444  at  X0 
=  1550  nm  to  n=  1.458  for  \0  =  600  nm  (see  Fig.  4),  this  plot  is 
made  for  that  range  of  values.  For  an  index  of  refraction  n=  1.44, 
there  is  a  maximum  of  l/V/3=  0.023  877  9  for  v  =  39.918  8,  (/ 
=  m  =  34).  For  an  index  of  refraction  n  =  1.45,  there  is  a  minimum 
of  l/V/3  =  0.024  590  8  for  *  =  38.877  8,  (/  =  m  =  33).  For  an  index 
of  refraction  n  =  1.46,  there  is  a  minimum  of  l/V/3=  0.025  306  8  for 
x  =  37.834  8,  (l  =  m  =  32). 

its factor of proportionality is the quality factor of the resonator, Q_cavity, which has a very strong dependence on the sphere radius a in the region below 10 μm (see Fig. 7). Therefore, the minimum for the critical atom number does not occur for the same sphere size as for the saturation photon number. Figure 12 is a plot of the critical atom number as a function of sphere size. Using for Q_cavity the model that incorporates all of the loss mechanisms discussed in Sec. IV (radiative losses, bulk absorption, surface scattering, and absorption due to water on the surface), we find that the minimum for the critical atom number N0 = 8.99935 × 10^-6 occurs for a sphere radius of a = 8.12015 μm (l = m = 79). At this radius, the coupling coefficient is g/(2π) = 304.16 MHz.
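As a cross-check on the radii quoted in this section, the size parameter x = 2πna/λ0 of Fig. 9 can be evaluated directly. The sketch below is illustrative only: the function name is hypothetical, and it assumes the fused-silica index n = 1.45246 together with the cesium D2 wavelength λ0 = 852.359 nm quoted in the figure captions.

```python
import math

# Assumed inputs, taken from the figure captions of this section.
N_FUSED_SILICA = 1.45246      # index of refraction near 852 nm
LAMBDA0_UM = 0.852359         # cesium D2 vacuum wavelength, in micrometres


def size_parameter(radius_um, refractive_index=N_FUSED_SILICA,
                   wavelength_um=LAMBDA0_UM):
    """Dimensionless size parameter x = 2*pi*n*a/lambda0."""
    return 2.0 * math.pi * refractive_index * radius_um / wavelength_um


# Radii quoted in the text: minimum of n0, minimum of the geometric mean,
# and minimum of N0.
for radius in (3.63163, 7.83038, 8.12015):
    print(f"a = {radius:.5f} um -> x = {size_parameter(radius):.4f}")
```

For a = 3.63163 μm this gives x close to the value 38.8833 quoted in Fig. 9(a), which suggests the captions and the text use the same index and wavelength.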

Unfortunately, as illustrated in Fig. 13, the minima for the two parameters, n0 and N0, do not occur for the same sphere


FIG. 10. The coupling coefficient, g/2π, as a function of sphere size for the D2 transition in cesium (λ0 = 852.359 nm). There is a maximum of g/2π = 749.986 MHz for a sphere radius of a = 3.63163 μm (l = m = 33). Note that the maximum for g/2π does not coincide with the minimum for the cavity mode volume, V_P (see Fig. 3).

radius. However, if one uses the minimum of the geometric mean of the two parameters, each can have a value near its respective minimum. The minimum of the geometric mean occurs for a sphere radius a = 7.83038 μm (l = m = 76). For this sphere size, the coupling coefficient is g/2π = 318.333 MHz, the saturation photon number is n0 = 3.36107 × 10^-5, and the critical atom number is N0 = 9.27834 × 10^-6. Therefore, each cavity QED parameter can be made to achieve simultaneously a value near its respective minimum.
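To make "near its respective minimum" concrete, the following sketch compares the values at the compromise radius with the individual minima and evaluates the geometric mean. It uses only numbers quoted in this section; the variable names are illustrative.

```python
import math

# Individual minima quoted in the text (Figs. 11 and 12).
N0_SAT_MIN = 6.05527e-6    # minimum saturation photon number, at a = 3.63163 um
N0_ATOM_MIN = 8.99935e-6   # minimum critical atom number, at a = 8.12015 um

# Values at the radius that minimizes the geometric mean, a = 7.83038 um.
n0_compromise = 3.36107e-5
N0_compromise = 9.27834e-6

geometric_mean = math.sqrt(n0_compromise * N0_compromise)
n0_ratio = n0_compromise / N0_SAT_MIN      # how far n0 sits above its minimum
N0_ratio = N0_compromise / N0_ATOM_MIN     # how far N0 sits above its minimum

print(f"sqrt(n0*N0) = {geometric_mean:.3e}")
print(f"n0 / n0_min = {n0_ratio:.2f}")
print(f"N0 / N0_min = {N0_ratio:.3f}")
```

On these numbers the critical atom number lies within a few percent of its minimum, while the saturation photon number is several times larger than its own minimum, so the compromise radius favors N0.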

VII.  PROGRESS  IN  SMALL  SPHERE  MANUFACTURE 

Much of the work on microspheres has aimed to push the quality factors of the spheres to record levels [2,3]. This effort has produced some of the highest finesse (F = 2.3 × 10^6) optical cavities to date, with quality factors Q ~ 10^10 [2,3]. However, we have seen that Q is not the only relevant factor in determining the suitability of the WGMs for cavity QED in a regime of strong coupling. In


FIG. 11. The saturation photon number n0 as a function of sphere size for the D2 transition in cesium (λ0 = 852.359 nm). There is a minimum n0 = 6.05527 × 10^-6 for a sphere radius of a = 3.63163 μm (l = m = 33). At this radius, the coupling coefficient is g/2π = 749.986 MHz.

FIG. 12. The critical atom number N0 as a function of sphere size for the D2 transition in cesium (λ0 = 852.359 nm). There is a minimum N0 = 8.99935 × 10^-6 for a sphere radius of a = 8.12015 μm (l = m = 79). At this radius, the coupling coefficient is g/2π = 304.16 MHz. This plot of the critical atom number incorporates the model for the quality factor of the resonator, Q_cavity, outlined in Sec. IV, for the four loss mechanisms: bulk absorption, surface scattering, absorption due to water on the surface, and radiative losses. The dark gray region is bounded by the effects of purely radiative losses. The light gray region is bounded by the effects of both radiative losses and bulk absorption.

general, the preceding analysis demonstrates the requirement to push to microspheres of small radius, a ≲ 10 μm. Unfortunately, the experiments that have achieved the highest quality factors and which have investigated certain material loss mechanisms are of rather larger size, and hence not optimal for cavity QED in a regime of strong coupling. For example, the experiment of Ref. [3] achieved a quality factor of Q = 7.2 × 10^9 at 850 nm in a sphere of radius a = 340 μm.

FIG. 13. This plot shows the two parameters, (n0, N0), of importance to cavity QED as a function of sphere radius. The geometric mean of these two parameters is also plotted. The solid line represents the saturation photon number n0, the dashed line gives the critical atom number N0, and the dotted line shows the geometric mean of the two parameters, √(n0 N0). The minimum of each plot corresponds to the following dimensionless parameters: n0 = 6.05527 × 10^-6 for a = 3.63163 μm (l = m = 33), and N0 = 8.99935 × 10^-6 at a = 8.12015 μm (l = m = 79). The two curves cross at a = 7.03 μm with n0 = N0 = 2.56 × 10^-5. The geometric mean of these two parameters, √(n0 × N0), is minimized for a = 7.83038 μm (l = m = 76). For this radius, the parameters are: n0 = 3.36107 × 10^-5 and N0 = 9.27834 × 10^-6. Note that the curve for N0 assumes the model for the Q discussed in this paper, and that the coupling coefficient g(r) is evaluated at the maximum of the mode function for r = a.

To explore the possibilities of cavity QED with strong coupling in substantially smaller spheres, we have undertaken a program to study fabrication techniques for quartz microspheres with a ≲ 30 μm, while still maintaining high quality factors. We have been able to fabricate 10 μm radius spheres using an oxygen-hydrogen microtorch to melt the ends of very low-OH fused silica rods to form a sphere on the end of a stem. Light is then coupled to the sphere using frustrated total internal reflection of a prism, as in Refs. [3,4,31]. Our observations demonstrate that spheres of this size can be made consistently to have quality factors Q ≈ 0.8 × 10^7. While this is encouraging progress, the resulting Q is two orders of magnitude smaller than the theoretical maximum of Q ≈ 1.3 × 10^9 for this size based upon the model discussed in Sec. IV.

One possible reason for this discrepancy could be the importance of minimizing the ellipticity of the small spheres. Because the small resonators fabricated by our technique have a stem protruding out of them, they are far from spherical. When coupling to an l = m mode in spheres with a ≳ 100 μm and hence large l, the mode is tightly confined to the equator; therefore, the poles do not have an appreciable impact on the mode structure or quality factor. In this case, it is not of critical importance to have the best sphere possible, but rather the best great circle possible to achieve large quality factors. However, this is not the case in small spheres with a ≲ 10 μm. As a decreases, the l = m modes occupy an increasingly larger proportion of the sphere in polar angle, and the ellipticity of the sphere becomes increasingly important in determining the mode structure as well as the Q. However, while there is certainly room for improvement in our fabrication technique and in the resulting mode structures and quality factors, we shall see in the following section that the current results have promising implications.

VIII.  COMPARING  MICROSPHERES  AND  FABRY-PEROT 
CAVITIES 

Figure 14 offers a comparison of the state of the art for Fabry-Perot and microsphere cavities for cavity QED, as well as projections of likely limits for each. It is interesting to note that in our projections for the limiting cases of each, microspheres allow for a significant improvement in the critical atom number N0 relative to Fabry-Perot cavities. On the other hand, a principal advantage of Fabry-Perot cavities relative to microspheres would seem to be significant improvements in the saturation photon number n0. The specific task at hand would then dictate which technology to apply.

As shown in Fig. 14, there has already been some progress in coupling atoms to the external fields of a microsphere [31]. The sphere employed for the work of Ref. [31] had a radius of a ≈ 60 μm, and quality factor Q ~ 5 × 10^7, corresponding to a mode volume of V_P ≈ 3.7 × 10^3 μm^3, coupling coefficient g_a/(2π) ≈ 24 MHz, saturation photon

FIG. 14. The solid line gives a parametric plot of the critical atom number N0 and the saturation photon number n0, for fused-silica microspheres and the D2 transition of atomic cesium, incorporating the loss mechanisms outlined in Sec. IV. The dark gray region is bounded by the effects of radiative losses. The light gray region is bounded by the effects of bulk absorption and radiative losses. This plot also offers a comparison of experimental and theoretical cavity QED parameters for microsphere and Fabry-Perot cavities. ■ represents the current state of the art for cavity QED in Fabry-Perot cavities as in Ref. [30]; □ is a projection of the practical limit for Fabry-Perot cavities based upon Ref. [29]. ▲ represents the 60 μm radius sphere implemented for cavity QED in Ref. [31]. ▼ is the current state of the art in 10 μm microspheres based upon the results presented in Sec. VII. △ is the currently achievable Q with the optimal sphere size of 7.83 μm based upon the analysis of Secs. IV and V. ▽ is the theoretically achievable Q ≈ 9.76 × 10^8 at the optimal sphere size, a ≈ 7.83 μm.

number n0 = 5.54 × 10^-3, and critical atom number N0 = 2.99 × 10^-2. If instead this experiment were to be implemented with a smaller sphere with 10 μm radius and with quality factor Q ~ 0.8 × 10^7 such as we have manufactured and described in Sec. VII, the following parameters would be achieved: a mode volume of V_P ≈ 1.4 × 10^2 μm^3, coupling coefficient g_a/(2π) ≈ 233 MHz, saturation photon number n0 ≈ 6.27 × 10^-5, and critical atom number N0 ≈ 2.11 × 10^-3. Therefore, we see that currently achievable quality factors in spheres of radius 10 μm already would allow for impressive results in cavity QED with single atoms.

By comparison, the state of the art for Fabry-Perot cavities has already achieved the following results for the TEM00 modes [30]: a cavity finesse of F = 4.8 × 10^5, a mode volume of V_m ≈ 1.69 × 10^3 μm^3, coupling coefficient g0/(2π) ≈ 110 MHz, saturation photon number n0 ≈ 2.82 × 10^-4, and critical atom number N0 ≈ 6.13 × 10^-3. If one then looks at possible limits of Fabry-Perot technology for cavity QED as analyzed in Ref. [29], the following may be possible: a cavity of length λ0/2 with a cavity finesse of F = 7.8 × 10^6 yields coupling coefficient g0/(2π) ≈ 770 MHz, saturation photon number n0 ≈ 5.7 × 10^-6, and critical atom number N0 ≈ 1.9 × 10^-4.

It is encouraging that the currently achievable results for small sphere manufacture would already allow the WGMs to compete favorably with the current state of the art in Fabry-Perot cavity QED. However, if one were able to manufacture and couple to spheres at the optimal size a ≈ 7.83 μm with a Q ~ 9.76 × 10^8 (the theoretical maximum predicted from the analysis of Sec. IV), the following results could be achieved: a mode volume of V_P ≈ 90 μm^3, coupling coefficient g_a/(2π) ≈ 318 MHz, saturation photon number n0 ≈ 3.36 × 10^-5, and critical atom number N0 ≈ 9.28 × 10^-6. This would represent a significant improvement over the current Fabry-Perot technology and be competitive with the likely limits of Fabry-Perot technology. However, even short of achieving this stated maximum Q for the WGMs, impressive results can already be attained. With a quality factor Q ≈ 0.8 × 10^7 at the optimal sphere radius a ≈ 7.83 μm, one would obtain these same results (i.e., g_a/(2π) ≈ 318 MHz and saturation photon number n0 ≈ 3.36 × 10^-5), except that the critical atom number N0 would increase to N0 ≈ 1.13 × 10^-3. This is still an impressive gain over the current capabilities of Fabry-Perot cavities for the saturation photon number, with room for improvement in the critical atom number.
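The parameter values quoted above can be approximately reproduced from g and Q alone. The sketch below is not taken from this section: it assumes the commonly used cavity QED definitions n0 = γ⊥²/(2g²) and N0 = 2γ⊥κ/g², with field decay rate κ = ω/(2Q), and an assumed transverse decay rate γ⊥/(2π) = 2.61 MHz for the cesium D2 line. All function and constant names are hypothetical.

```python
# All rates are expressed as frequencies (rate / 2*pi) in MHz; the factors
# of 2*pi cancel in the dimensionless ratios n0 and N0.
C_M_PER_S = 299_792_458.0
LAMBDA0_M = 852.359e-9      # cesium D2 wavelength quoted in the captions
GAMMA_PERP_MHZ = 2.61       # ASSUMED transverse atomic decay rate / 2*pi


def cavity_decay_rate_mhz(quality_factor, wavelength_m=LAMBDA0_M):
    """Field decay rate kappa/(2*pi) = nu/(2Q), assuming kappa = omega/(2Q)."""
    optical_frequency_hz = C_M_PER_S / wavelength_m
    return optical_frequency_hz / (2.0 * quality_factor) / 1e6


def saturation_photon_number(g_mhz, gamma_perp_mhz=GAMMA_PERP_MHZ):
    """n0 = gamma_perp^2 / (2 g^2) (assumed definition)."""
    return gamma_perp_mhz ** 2 / (2.0 * g_mhz ** 2)


def critical_atom_number(g_mhz, quality_factor, gamma_perp_mhz=GAMMA_PERP_MHZ):
    """N0 = 2 gamma_perp kappa / g^2 (assumed definition)."""
    kappa_mhz = cavity_decay_rate_mhz(quality_factor)
    return 2.0 * gamma_perp_mhz * kappa_mhz / g_mhz ** 2


# (label, g/(2*pi) in MHz, Q) for the microsphere cases discussed in the text.
cases = [
    ("10 um sphere, Q = 0.8e7", 233.0, 0.8e7),
    ("7.83 um sphere, Q = 0.8e7", 318.333, 0.8e7),
    ("7.83 um sphere, Q = 9.76e8", 318.333, 9.76e8),
]
for label, g_mhz, q in cases:
    n0 = saturation_photon_number(g_mhz)
    N0 = critical_atom_number(g_mhz, q)
    print(f"{label}: n0 = {n0:.3e}, N0 = {N0:.3e}")
```

Under these assumptions the output agrees with the quoted values (for example n0 ≈ 6.27 × 10^-5 and N0 ≈ 2.11 × 10^-3 for the 10 μm sphere) to within about one percent, which illustrates why lowering Q at fixed radius changes N0 but leaves n0 untouched.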

Overall, we thus find that the technologies of microspheres and Fabry-Perot resonators each have their advantages and disadvantages. However, there is one notable advantage of microspheres: they can be made cheaply and relatively simply given sufficient training and skill. By contrast, the Fabry-Perot cavities considered here require specialized coating runs with expensive equipment and considerable expertise, which is to be found at only a few locations worldwide. This alone makes microspheres an attractive alternative to Fabry-Perot cavities for cavity QED. Another unique advantage of the WGMs is the ability to control the cavity decay rate κ by controlling the coupling efficiency into and out of the microsphere (e.g., by adjusting the distance between a coupling prism and the microsphere [32]). Furthermore, as one moves to the limit of small cavities, the open geometry of microspheres offers a considerable advantage when compared to the geometry of Fabry-Perot cavities. Such possibilities combined with our projected values of the critical parameters, (n0, N0), shown in Fig. 14 point to the competitiveness of microspheres with current and future Fabry-Perot technology and demonstrate their potential as a powerful tool for cavity QED in the regime of strong coupling.

Generation of nonclassical photon pairs for scalable quantum communication with atomic ensembles

Pasadena, California 91125, USA

Quantum information science attempts to exploit capabilities from the quantum realm to accomplish tasks that are otherwise impossible in the classical domain1. Although sufficient conditions have been formulated for the physical resources required to achieve quantum computation and communication2, there is a growing understanding of the power of quantum measurement combined with the conditional evolution of quantum states for accomplishing diverse tasks in quantum information science3-5. For example, a protocol has recently been developed6 for the realization of scalable long-distance quantum communication and the distribution of entanglement over quantum networks. Here we report the first enabling step in the realization of this protocol, namely the observation of quantum correlations for photon pairs generated in the collective emission from an atomic ensemble. The nonclassical character of the fields is demonstrated by the violation of an inequality involving their normalized correlation functions. Compared to previous investigations of non-classical correlations for photon pairs produced in atomic cascades7 and in parametric down-conversion8, our experiment is distinct in that the correlated photons are separated by a programmable time interval (of about 400 nanoseconds in our initial experiments).

The theoretical proposal of ref. 6 (hereafter ‘DLCZ’) is a probabilistic scheme based upon the entanglement of atomic ensembles via detection events of single photons in which the sources are intrinsically indistinguishable, and generates entanglement over long distances via a quantum repeater architecture9. The DLCZ scheme, with built-in quantum memory and entanglement purification, is well within the reach of current experiments and accomplishes the same objectives as previous more complex protocols that require as yet unattainable capabilities9,10.

In our experiment, we demonstrate a basic primitive integral to the DLCZ scheme. Specifically, an initial ‘write’ pulse of (classical) light is employed to create a state of collective atomic excitation as heralded by photoelectric detection of a first photon 1. After a programmable delay δt, a subsequent ‘read’ pulse interrogates the atomic sample, leading to the emission of a second (delayed) photon 2. The manifestly quantum (or nonclassical) character of the correlations between the initial ‘write’ photon 1 and the subsequent ‘read’ photon 2 is verified by way of the observed violation of a Cauchy-Schwarz inequality for coincidence detection of the (1, 2) fields7. Explicitly, we find $[\tilde{g}_{1,2}^{2}(\delta t) = 5.45 \pm 0.11] \nleq [\tilde{g}_{1,1}\tilde{g}_{2,2} = 2.97 \pm 0.08]$, where $\tilde{g}_{i,j}$ are normalized correlation functions for the fields (i, j) and δt = 405 ns is the time separation between the (1, 2) emissions. The capabilities realized in our experiment provide an important initial step towards the implementation of the full DLCZ protocol, which would enable the distribution and storage of entanglement among atomic ensembles distributed over a quantum network. Extensions of these capabilities could facilitate scalable long-distance quantum communication6 and quantum state engineering11. For example, by employing spin-polarized samples in optical-dipole or magnetic traps12, it should be possible to extend the interval δt to times of several seconds.
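The size of the quoted violation can be expressed in units of the combined uncertainty. The sketch below is illustrative only: the function name is hypothetical, and it assumes the two quoted uncertainties are independent and may be combined in quadrature, which the text does not state.

```python
import math


def cauchy_schwarz_violation(g12_squared, g12_squared_err,
                             g11_g22, g11_g22_err):
    """Test the classical bound g12^2 <= g11 * g22.

    Returns (violated, n_sigma), where n_sigma is the excess of g12^2 over
    g11*g22 divided by the quadrature sum of the two uncertainties
    (ASSUMES independent errors).
    """
    excess = g12_squared - g11_g22
    combined_err = math.hypot(g12_squared_err, g11_g22_err)
    return excess > 0.0, excess / combined_err


# Values quoted in the text for delta_t = 405 ns.
violated, n_sigma = cauchy_schwarz_violation(5.45, 0.11, 2.97, 0.08)
print(f"violated: {violated}, excess = {n_sigma:.1f} combined standard deviations")
```

With the quoted numbers the left-hand side exceeds the classical bound by roughly 18 combined standard deviations under this independence assumption.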

Our experiment arises within the context of prior work on spin squeezing13,14, and in particular on atomic ensembles where significant progress has been made in the development of methods to exploit collective enhancement of atom-photon interactions provided by optically thick atomic samples15-20. Instead of homodyne or heterodyne detection of light as used in spin-squeezing experiments18-20, the DLCZ scheme involves photon-counting techniques, which present stringent requirements for broad bandwidth detection and for the suppression of stray light from the atomic ensemble.

As illustrated in Fig. 1, an optically thick sample of three-level atoms in a lambda-configuration is exploited to produce correlated photons via the following sequence. With atoms initially prepared in state |a⟩ by optical pumping, a laser pulse from the ‘write’ beam tuned near the |a⟩ → |e⟩ transition illuminates the sample and induces spontaneous Raman scattering to the initially empty level |b⟩ via the |e⟩ → |b⟩ transition at time t^(1). The ‘write’ pulse is made sufficiently weak so that the probability to scatter one Raman photon into the preferred forward-propagating mode ψ^(1)(r, t) is much less than unity for each pulse. Detection of a photon in the mode ψ^(1)(r, t) produced by the |e⟩ → |b⟩ transition results in a single excitation in the atomic level |b⟩ distributed across the sample. In the ideal case, this coherently symmetrized state is6:

$$ |\Phi_{1A}\rangle \sim \sum_{j=1}^{N} |a\rangle_1 \cdots |b\rangle_j \cdots |a\rangle_N \qquad (1) $$

Although the initial detection of photon 1 generated by the ‘write’ beam is probabilistic, the detection of photon 1 results in the conditional state |Φ_{1A}⟩ with one collective atomic ‘excitation’. This excitation can subsequently be converted into an excitation



Figure 1 A simplified schematic of the experiment is presented. a, Diagram of the apparatus; b, the timing sequence for data acquisition. Further details are as follows. a, ‘write’ and ‘read’ pulses propagate sequentially into a cloud of cold Cs atoms (MOT), generating pairs of correlated output photons (1, 2), with controlled separation δt. Fields with frequency near that of the |a⟩ ↔ |e⟩ (or |b⟩ ↔ |e⟩) transition are coloured red (or blue) here and in Figs 2 and 3. The ‘write’ and ‘read’ pulses have orthogonal polarizations, are combined into a single input at polarizing beam splitter PBS1, and are then focused into the Cs MOT with a waist of approximately 30 μm. The output fields are split by PBS2, which also serves as the first stage of filtering the (‘write’, ‘read’) beams from the (1, 2) fields. For example, field 2 is transmitted by PBS2 to be subsequently registered by detector D2 while the ‘read’ pulse itself is reflected by 90° at PBS2 and then blocked by an acousto-optical modulator that serves as Gate 1. Further filtering is achieved by passing each of the outputs from PBS2 through separate frequency filters, each of which consists of a glass cell of Cs vapour optically pumped to place atoms into either 6S_{1/2}, F = 3 or F = 4 (ref. 28). The small residual reflected (transmitted) light of the ‘write’ or ‘read’ pulse from PBS2 at frequency ω_{4,4} (or ω_{3,4}) passes through a filter cell with atoms in the F = 4 (3) level. It is thereby strongly attenuated (>10^6), while the accompanying Raman-scattered light as photons 1 (or 2) at frequency ω_{3,4} (or ω_{4,4}) is transmitted with high efficiency (≈80%). Transmission efficiencies from the MOT to detectors (D1, D2) are both about 30% for light with the spatial shape of the ‘write’ and ‘read’ beams and of the correct polarization. (D1, D2) have overall quantum efficiencies of approximately 50% (photon ‘in’ to TTL pulse ‘out’). b, Gating windows for the joint detection of photons (1, 2) are centred at times (t_j^(1), t_j^(2)) for the jth trial of the experiment during intervals when the MOT is ‘OFF’.


of the light field with high probability ‘on demand’ with a specified emission and a programmable pulse shape6,10,21,22. To achieve the conversion from atoms to field, a laser pulse from the ‘read’ beam tuned near the |b⟩ → |e⟩ transition illuminates the atomic sample, thereby effecting the transfer |b⟩ → |a⟩ for the sample with the accompanying emission of a second Raman photon 2 on the |e⟩ → |a⟩ transition. For an optically thick atomic sample, photon 2 is emitted with high probability into a specified mode ψ^(2)(r, t) offset in time by t^(1) + δt. The spatial and temporal structure of the modes ψ^(1,2)(r, t) are set by the geometry of the atomic sample and by the shape and timing of the ‘write’ and ‘read’ beams21. In our experiment, the modes of the (write, read) beams are spatially mode-matched, with measured visibility greater than 95% for the case of equal frequency and polarization. The time delay δt is limited in principle only by the coherence time between the levels |a⟩ and |b⟩, which can be long in practice.

The atomic sample for our experiment is provided by caesium atoms in a magneto-optical trap (MOT)12, where the Cs hyperfine manifolds {|6S_{1/2}, F = 4⟩, |6S_{1/2}, F = 3⟩, |6P_{3/2}, F′ = 4⟩} correspond to the levels {|a⟩, |b⟩, |e⟩}, respectively. As illustrated by the timing diagram in Fig. 1, the MOT is chopped from ON to OFF with Δt = 4 μs. In each cycle there is a ‘dark’ period of duration 1 μs when all light responsible for trapping and cooling is gated OFF, with less than 0.1% of atoms measured to remain in the F = 3 level at this stage. The jth trial of the protocol for single photon generation is initiated by a ‘write’ pulse which is resonant with the 6S_{1/2}, F = 4 → 6P_{3/2}, F′ = 4 transition at frequency ω_{4,4} and that has duration ≈51 ns (full-width at half-maximum, FWHM). A critical parameter for the experiment is the resonant optical thickness γ_{4,4} of the atomic sample21. We measure γ_{4,4} ≈ 4–5 for cw excitation, corresponding to an attenuation of intensity exp(−γ_{4,4}) in propagation through the MOT.
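For a sense of scale, the attenuation factor exp(−γ_{4,4}) can be evaluated at the two ends of the measured range. This is a minimal sketch with a hypothetical function name; it simply applies the exponential attenuation stated above.

```python
import math


def transmitted_fraction(optical_thickness):
    """Fraction of resonant intensity transmitted through the sample,
    exp(-gamma), for a resonant optical thickness gamma."""
    return math.exp(-optical_thickness)


# Ends of the measured range of the resonant optical thickness.
for gamma in (4.0, 5.0):
    print(f"gamma = {gamma:.0f}: transmitted fraction = "
          f"{transmitted_fraction(gamma):.4f}")
```

An optical thickness of 4 to 5 therefore corresponds to roughly 2% down to below 1% of resonant cw light being transmitted through the MOT.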

The ‘write’ pulse generates forward-scattered (anti-Stokes) Raman light around frequency ω_{3,4} from the F′ = 4 excited level to the F = 3 ground level (|e⟩ → |b⟩) that is directed onto a single-photon detector D1. After a variable delay δt, the ‘read’ pulse illuminates the sample, with this pulse tuned to the 6S_{1/2}, F = 3 → 6P_{3/2}, F′ = 4 transition at frequency ω_{3,4} with duration ≈34 ns (FWHM). Raman (Stokes) light generated by the ‘read’ pulse around frequency ω_{4,4} from F′ = 4 to F = 4 (|e⟩ → |a⟩) is directed onto a second single-photon detector D2.

By interchanging the frequencies for optical pumping of the filter cells described in Fig. 1, the (write, read) beams can be detected at (D1, D2) instead of the (1, 2) fields. An example of the resulting pulse profiles accumulated over many trials {j} is presented in Fig. 2, where the origin in time is set to coincide with the approximate centre of the ‘write’ pulse, with the ‘read’ pulse following after a delay of ≈415 ns determined by external control logic.

With the filter cells set to transmit the (1, 2) photons to the (D1, D2) detectors, respectively, we record histograms of the numbers (n_1(t), n_2(t)) of photoelectric events versus time, which are also displayed in Fig. 2. For the data presented here, the intensity of the ‘write’ pulse is kept low (~10^3 photons per pulse), resulting in a time lag for the onset of the n_1(t) counts in Fig. 2. As discussed in the Supplementary Information, the probability p^(1)_write to generate an anti-Stokes photon 1 within the solid angle of our imaging system is p^(1)_write ≈ 10^-2 per pulse.

The ‘read’ pulse is about 100 times more intense than the ‘write’ pulse, leading to high efficiency ζ_{3→4} ≈ 0.6 for the transfer of population |b⟩ → |a⟩, with p^(2)_read ≈ ζ_{3→4} p^(1)_write for the Stokes photon 2. Examples of the resulting detection events n_2(t) are shown in Fig. 2. In contrast to the behaviour of n_1(t), the intense ‘read’ beam generates n_2(t) counts promptly. More extensive investigations of the timing characteristics of the emitted fields (1, 2) will be part of our subsequent investigations, including the relationship to electromagnetically induced transparency (EIT)23,24.

A  virtue  of  the  DLCZ  protocol  is  its  insensitivity  to  a  variety  of 

Figure 2 Normalized singles counts n_i(t) are shown for the ‘write’, ‘read’ and (1, 2) fields. The pulses around t = 0 are from detector D1 for the ‘write’ beam n_w(t) (solid trace) and for photon 1, n_1(t) (data points). The pulses around t = 410 ns are from detector D2 for the ‘read’ beam n_r(t) (solid trace) and for photon 2, n_2(t) (points). Note that in addition to the symmetrized excitation, each ‘write’ pulse also transfers several hundred atoms into the F = 3 level owing to spontaneous emission from its near-resonant character. However, atoms transferred into F = 3 via spontaneous decay are spatially uncorrelated, so that their contribution to the signal from the ‘read’ channel is strongly suppressed (by roughly the fractional solid angle collected, δΩ/4π ≈ 4 × 10^-5) as compared to the signal from single-atom excitations of the form |Φ_{1A}⟩ (ref. 6).

loss  mechanisms,  including  inefficiencies  in  transport  and  detection 
of  the  (1,  2)  photons.  However,  in  an  actual  experiment,  various 
non-ideal  characteristics  of  the  atom-field  interaction  (as  in  our 
MOT)  do  lead  to  deterioration  of  correlation  for  the  (1,  2)  photons 
(for  example,  imperfect  filtering  and/or  background  fluorescence  as 
described  in  the  caption  of  Fig.  2  and  in  the  Supplementary 
Information).  Fortunately  there  exists  a  well-defined  border 
between  the  classical  and  quantum  domains  for  the  (1,  2)  fields 
that  can  be  operationally  accessed  via  coincidence  detection,  as  was 
first  demonstrated  in  the  pioneering  work  of  Clauser7. 

As illustrated in Fig. 1b, electronic pulses from detectors (D1, D2) are separately gated with windows of duration T = 60 ns centred on times (t_j^(1), t_j^(2)) corresponding to the approximate peaks of the (n_1(t), n_2(t)) pulses shown in Fig. 2. Photoelectric events that fall within the gate windows are directed to a time-interval analyser (TIA) configured in a standard fashion for measurement of photoelectric correlations25. For a ‘start’ event from D1 within the interval t_j^(1) ± T/2 for the jth trial of the experiment, the TIA records the times of ‘stop’ events from D2 within successive intervals t_k^(2) ± T/2. Over many repetitions of the experiment, we thereby acquire time-resolved coincidences n_{1,2}(τ) between the (1, 2) fields, both within the same trial k = j and for subsequent trials k = j + 1, j + 2, ... (that is, a ‘start’ event from trial j around time t_j and a ‘stop’ event from trial k around time t_k, where t_k = t_j + (k − j)Δt for k = j, j + 1, ...). With a 50%-50% beam splitter, the field 1 can be directed to detectors (D1, D2), and then in turn the field 2 to (D1, D2). We thus also acquire the time-resolved coincidences n_{1,1}(τ) and n_{2,2}(τ).

Figure 3 displays an example of data accumulated in this manner for coincidences n_{α,β}(τ) between the (1,2), (1,1), and (2,2) beams, with successive peaks separated by the time between trials Δt = 4 μs. Note that there is an excess of coincidence counts in each of the initial peaks for joint detections from the same trial (τ < Δt) as compared to n_{α,β}(τ) from different trials (τ > Δt). This excess is shown more clearly in the plots in the right column, which expand the time axis from the left column in Fig. 3. Here, data from successive trials k = j + 1, ..., j + 10 have been offset to τ < Δt and then averaged for comparison with n_{α,β}(τ) from the same trial j, by introducing the quantity $m_{\alpha,\beta}(\tau) = \frac{1}{10}\sum_{k=j+1}^{j+10} n_{\alpha,\beta}(\tau + (k-j)\Delta t)$. As discussed in the Supplementary Information, statistical


Figure 3 Time-resolved coincidences $n_{\alpha,\beta}(\tau)$ between the (1,1), (2,2) and (1,2) fields are
displayed versus time delay $\tau$. Left, $n_{\alpha,\beta}(\tau)$ is shown over 11 successive repetitions of the
experiment (horizontal axis in μs). Right, the time axis (in ns) is expanded to a total duration of 250 ns with $\tau = 0$ set to
the centre of the gating window $(t^{(1)}, t^{(2)}, t^{(1)})$ for $(n_{1,1}, n_{2,2}, n_{1,2})$, respectively. The larger
peak $n_{\alpha,\beta}(\tau)$ corresponds to detection pairs from the same trial $j$, while the smaller peak
$m_{\alpha,\beta}(\tau)$ is for pairs from different trials as defined in the text. Typical acquisition
parameters are as follows. Detectors (D1, D2) have average count rates of about
(400 s$^{-1}$, 250 s$^{-1}$), respectively, while background counts with no MOT present are
about 100 s$^{-1}$. Counts due to the MOT itself (with ‘write’ and ‘read’ beams blocked) are
less than (10 s$^{-1}$, 20 s$^{-1}$) for (D1, D2). Dark counts with the inputs to the fibres blocked
are less than 5 s$^{-1}$. All these numbers are for the gated-output mode of data acquisition
as in Fig. 1 with $T = 60$ ns.

independence for trials with $k \neq j$ is enforced by the experimental
protocol of reapplying the MOT and repumping beams after each
trial.

From the data in Fig. 3, we determine the total number of
coincidences $N_{\alpha,\beta} = \sum_{\{\tau_i\}} n_{\alpha,\beta}(\tau_i)$ with $(\alpha,\beta) = (1,2)$ obtained by
summing over time bins $\{\tau_i\}$ for detection within the same trial $j$,
and $M_{\alpha,\beta} = \sum_{\{\tau_k\}} m_{\alpha,\beta}(\tau_k)$ obtained from ‘start’ and ‘stop’ events
from different trials ($j \neq k$). Fields for which the Glauber-Sudarshan
phase-space function is well-behaved (that is, classical fields)
are constrained by a Cauchy-Schwarz inequality for the various
coincidence counts (Supplementary Information and ref. 25),
namely:

$$[g_{1,2}(\delta t)]^2 \leq g_{1,1}\, g_{2,2} \qquad (2)$$

where $g_{1,1} \equiv N_{1,1}/M_{1,1}$, $g_{2,2} \equiv N_{2,2}/M_{2,2}$, and $g_{1,2}(\delta t) \equiv N_{1,2}/M_{1,2}$.

For the data displayed in Fig. 3, we find $g_{1,1} = (1.739 \pm 0.020)$
and $g_{2,2} = (1.710 \pm 0.015)$, in correspondence to the expectation
that the (1,2) fields should each exhibit gaussian statistics with
$g_{1,1} = g_{2,2} = 2$ for the protocol of DLCZ in the ideal case, but here
degraded by diverse sources of background counts (see Supplementary
Information). By contrast, for the cross-correlations of the
(1,2) fields, we record $g_{1,2}(\delta t) = (2.335 \pm 0.014)$, with $\delta t = 405$ ns.
Hence the inequality of equation (2) for classical fields is strongly
violated, namely $[g_{1,2}^2(\delta t) = 5.45 \pm 0.11] \nleq [g_{1,1} g_{2,2} = 2.97 \pm 0.08]$,
where all errors indicate the statistical uncertainties. This violation
of the Cauchy-Schwarz inequality clearly demonstrates the nonclassical
character of the correlations between photons (1,2) generated
by the (write, read) beams. Moreover, as discussed in more
detail in the Supplementary Information, the measured coincidence
rates in Fig. 3 explicitly document the cooperative nature of the
emission process. Overall, we estimate that the probability for
coincidence of the (1,2) photons due to collective atomic excitation
as described by the state $|\Phi_{1A}\rangle$ is roughly $10^{-4}$ for each trial $j$,
referenced to the output of the MOT.
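The arithmetic behind the quoted violation can be checked directly from the measured correlation values. The sketch below is illustrative only: the function name `cauchy_schwarz_sides` is hypothetical, and it uses central values alone, without attempting to reproduce the quoted statistical uncertainties.

```python
def cauchy_schwarz_sides(g11, g22, g12):
    """Return (lhs, rhs, violated) for the classical bound g12**2 <= g11 * g22.

    Only central values are used; the uncertainties quoted in the text
    are not propagated here.
    """
    lhs = g12 ** 2          # squared cross-correlation
    rhs = g11 * g22         # product of the two autocorrelations
    return lhs, rhs, lhs > rhs


# Values quoted in the text for gate windows of T = 60 ns.
lhs_60, rhs_60, violated_60 = cauchy_schwarz_sides(1.739, 1.710, 2.335)
print(f"T = 60 ns: {lhs_60:.2f} vs {rhs_60:.2f}, violated = {violated_60}")
```

With these inputs the two sides round to 5.45 and 2.97, in agreement with the central values stated above.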

The temporal extent of the photon wave packet $\varphi(\vec{r}, t)$ for the
(1,2) photons is also of some interest. To investigate this issue, we
have carried out the experiment with expanded gate windows of
duration $T = 140$ ns that then encompass the entire domains over
which counts $n_1(t)$ and $n_2(t)$ are observed in Fig. 2. In this case, we
record $g_{1,1} = (1.72 \pm 0.04)$, $g_{2,2} = (1.52 \pm 0.05)$, and $g_{1,2}(\delta t) =
(2.45 \pm 0.10)$, now with $\delta t$ set to be 320 ns. The classical inequality
of equation (2) is once again not satisfied: $[g_{1,2}^2(\delta t) = 6.00 \pm 0.50] \nleq
[g_{1,1} g_{2,2} = 2.61 \pm 0.11]$. These results with $T = 140$ ns also confirm
that dead-time effects do not play a significant role in the current
experiment.

As described in the Supplementary Information, the violation
of the Cauchy-Schwarz inequality of equation (2) in the ideal
case can be much larger than we have observed, namely
$[g_{1,2}(\delta t)]^2/[g_{1,1} g_{2,2}] = [(1+p)/(2p)]^2 \gg 1$, where $p \ll 1$ is the
excitation probability. In our experiment, the size of the violation
of  the  inequality  was  limited  mostly  by  uncorrelated  fluorescence 
from  individual  atoms  in  the  atomic  sample.  This  contribution  will 
be  made  smaller  in  future  experiments  by  moving  to  off-resonant 
excitation,  which  necessitates  higher  optical  density.  There  is  also  a 
significant  limitation  due  to  the  presence  of  the  leakage  light  from 
the  ‘read’  pulse.  This  classical  pulse  is  only  9  GHz  away  from  the 
single-photon field 2 of interest, and is filtered by a factor exceeding
$10^{9}$. To achieve even stronger violation of the inequality, we must
further  improve  the  filtering  capability. 
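To give a feeling for how large the ideal-case violation could be, the expression $[(1+p)/(2p)]^2$ can be evaluated for a few small excitation probabilities. This is only a sketch: the function name and the sample values of $p$ are illustrative assumptions, not parameters taken from the experiment.

```python
def ideal_violation_ratio(p):
    """Ideal-case ratio [g12]^2 / (g11 * g22) = ((1 + p) / (2 p))**2."""
    if not 0.0 < p <= 1.0:
        raise ValueError("p must be a probability in (0, 1]")
    return ((1.0 + p) / (2.0 * p)) ** 2


# Illustrative excitation probabilities (assumed values).
for p in (0.1, 0.01, 0.001):
    print(f"p = {p}: ideal ratio = {ideal_violation_ratio(p):.1f}")
```

The ratio grows roughly as $1/(4p^2)$ for small $p$, which is why the observed ratio of order 2 is attributed above to background contributions rather than to the protocol itself.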

Our  observations  of  nonclassical  correlations  between  the  (1,2) 
photons  represent  the  first  important  step  in  the  realization  of  the 
protocol  of  DLCZ6  for  scalable  quantum  communication  with 
atomic  ensembles,  although  it  is  not  yet  sufficient  for  realization 
of  the  full  protocol.  Beyond  the  nonclassical  correlations,  our 
experiment  also  demonstrates  successful  filtering  of  the  various 
fields  and  collective  enhancement  by  the  atomic  ensemble,  all  of 
which  are  critical  for  realization  of  the  full  quantum  repeater 
protocol.  More  generally,  the  capabilities  that  we  have  demonstrated 
should  help  to  enable  other  advances  in  the  field  of  quantum 
information,  including  implementation  of  quantum  memory22,26 
and fully controllable single-photon sources27, which, when combined,
help to pave the avenue for realization of universal quantum
computation4. 

Note  added  in  proof:  We  have  been  made  aware  of  a  recently 
published  related  paper  by  van  der  Wal  et  al.29.  □ 
Many-body systems in nature exhibit complexity and self-organization
arising from seemingly simple laws. For example, the
long-range Coulomb interaction between electrical charges has a
simple form, yet is responsible for a plethora of bound states in
matter, ranging from the hydrogen atom to complex biochemical
structures. Semiconductors form an ideal laboratory for studying
many-body interactions of electronic quasiparticles among
themselves  and  with  lattice  vibrations  and  light1-4.  Oppositely 
charged  electron  and  hole  quasiparticles  can  coexist  in  an  ionized 
but  correlated  plasma,  or  form  bound  hydrogen-like  pairs  called 
excitons5,6.  The  pathways  between  such  states,  however,  remain 
elusive  in  near-visible  optical  experiments  that  detect  a  subset  of 
excitons  with  vanishing  centre-of-mass  momenta.  In  contrast, 
transitions  between  internal  exciton  levels,  which  occur  in  the 
We propose an efficient scheme to engineer multiatom entanglement by detecting cavity decay
through  single-photon  detectors.  In  the  special  case  of  two  atoms,  this  scheme  is  much  more  efficient 
than  previous  probabilistic  schemes,  and  insensitive  to  randomness  in  the  atom’s  position.  More 
There is a large current interest in generation and
engineering of quantum entanglement, with applications
for fundamental tests of quantum mechanics [1], for
high-precision measurements [2], and, in particular, for
implementation of quantum communication and computation
[3]. Although quantum entanglement is typically
fragile to practical noise and technical imperfections,
there exist elegant ways to overcome this sensitivity by
designing schemes with inherent robustness to diverse
sources of noise. Some schemes with this property have
been known for entangling two single atoms [4-9] as
well as for entangling macroscopic atomic ensembles
[10,11]. In these schemes, feedback is typically applied
to the system of interest based upon the outcome of
certain measurements. The protocols are thereby probabilistic,
succeeding only conditionally for particular
measurement results. Imperfections and noise in these
schemes decrease the success probability, but have no
influence on the fidelity of the intended state generation
for the “successful” subset of trials. In this way, a high-fidelity
entangled state can be obtained simply by repeating
the scheme successively.

Here, we propose a robust scheme to produce and
engineer entanglement between multiple atoms in optical
cavities. Compared with the previous robust schemes
[4-11], our protocol has the following favorable features.
(i) It is much more efficient in the sense that the success
probability can be close to unity, whereas in the previous
schemes [4,5,8-11], the success probability is required to
be much smaller than 1 to have the property of inherent
robustness. (ii) It is more insensitive to certain practical
sources of noise, such as randomness in the atom’s
position, atomic spontaneous emission, or detector inefficiency.
(iii) Individual addressing of atoms is not required
[6], nor are single-photon states as initial resources
[7]. (iv) Most importantly, our scheme is not limited to
generation of two-atom entanglement. Indeed, we show
that based on current experimental technology, it should
be possible to generate any superposition of the Dicke
states [12] between multiple atoms in an optical cavity.


These  Dicke  states  and  their  superpositions,  including  the 
multiparty  GHZ  states  as  special  cases,  are  typically 
highly  entangled,  with  many  applications  in  quantum 
information  science  [2,13-15].  Their  entanglement  can 
be  directly  detected  without  separate  addressing  [2]. 

As  the  scheme  here  is  inherently  robust  to  noise,  it 
works  in  principle  for  entangling  atoms  (or  ions)  both 
in free-space configurations and in high-Q cavities, albeit
in the free-space case one has a much smaller efficiency to
collect the emitted photons. In this Letter, to stay close
to the current experimental efforts [16-18],
we assume that there is a standing-wave high-Q cavity
around  the  atoms  [19,20]  to  improve  the  collection 
efficiency. 

To explain the scheme, let us start from the simplest
case with two atoms trapped in two different cavities. The
schematic setup is shown in Fig. 1(a), with the relevant
atomic levels depicted in Fig. 1(b). The states $|g\rangle$, $|0\rangle$, $|1\rangle$
correspond to the hyperfine and the Zeeman sublevels of
alkali atoms in the ground-state manifold, and $|e\rangle$ corresponds
to an excited state. The atom is initially prepared
in the state $|g\rangle$, but the basis vectors of a qubit are
represented by the states $|0\rangle$ and $|1\rangle$. The transition $|g\rangle \to
|e\rangle$ is driven adiabatically through a classical laser pulse
with the corresponding Rabi frequency denoted by $\Omega(t)$
[21]. With the driving pulse, the atom is transferred with
probability $p_c \approx 1$ to the $|0\rangle$ and $|1\rangle$ states by emitting a
photon from the transitions $|e\rangle \to |0\rangle$ or $|e\rangle \to |1\rangle$.


FIG. 1 (color online). (a) The schematic setup to generate
entanglement between two atoms in different cavities L
and R. (b) The relevant atomic level structure and the laser
configuration.
Without loss of generality, we assume that the transitions
$|e\rangle \to |0\rangle$ and $|e\rangle \to |1\rangle$ are coupled to two degenerate
cavity modes $a_h$ and $a_v$ with different polarizations $h$
and $v$. The decay pulses from the two cavities are interfered
at a polarization beam splitter (PBS), with the
outputs detected by two single-photon detectors after a
45° polarizer [denoted as P45 in Fig. 1(a)]. The small
fraction of the transmitted classical pulse can be easily
filtered based on the frequency selection as detailed in
Ref. [21]. For the decay pulse from the R cavity, a
polarization rotator $R(\pi/2)$ is inserted before the PBS
which exchanges $h$ and $v$ polarizations of the incoming
photon. Conditioned upon registering one photon from
each of the detectors, the two atoms in the cavities L and
R will be prepared into the maximally entangled state

$$|\Psi_{LR}\rangle = (|01\rangle_{LR} + |10\rangle_{LR})/\sqrt{2}. \qquad (1)$$

To see this, we write down the interaction Hamiltonian
in the rotating frame, which, for each of the cavities, has
the form (setting $\hbar = 1$)

$$H = \Omega(t)|e\rangle\langle g| + g_0|e\rangle\langle 0|a_h + g_1|e\rangle\langle 1|a_v + \mathrm{H.c.}, \qquad (2)$$

where $g_0$ and $g_1$ are the corresponding coupling rates. The
cavity outputs $a_\mu^{\mathrm{out}}$ ($\mu = h, v$) are connected with the
cavity modes $a_\mu$ through the standard input-output relations
$\dot{a}_\mu = -i[a_\mu, H] - \kappa a_\mu/2 - \sqrt{\kappa}\, a_\mu^{\mathrm{in}}(t)$ and
$a_\mu^{\mathrm{out}}(t) = a_\mu^{\mathrm{in}}(t) + \sqrt{\kappa}\, a_\mu$ [22], where $\kappa$ is the cavity decay rate, and
$a_\mu^{\mathrm{in}}(t)$, with the commutation relation
$[a_\mu^{\mathrm{in}}(t), a_\mu^{\mathrm{in}\dagger}(t')] = \delta(t - t')$,
denotes the vacuum cavity input. We are interested
in the limit for which the variation rate of $\Omega(t)$ is
significantly smaller than the cavity decay rate $\kappa$. In this
limit, we can define an effective single-mode bosonic
operator $a_\mu^{\mathrm{eff}}$ from the cavity output operator $a_\mu^{\mathrm{out}}(t)$ as
$a_\mu^{\mathrm{eff}} = \int_0^T f(t)\, a_\mu^{\mathrm{out}}(t)\, dt$ (see Refs. [21,23]), where $T$ is the
pulse duration and $f(t)$ is the output pulse shape, which
is determined by the shape of $\Omega(t)$ as
$f(t) = \sqrt{\kappa}\,\sin\theta(t)\exp[-(\kappa/2)\int_0^t \sin^2\theta(\tau)\, d\tau]$ with
$\sin\theta(t) = \Omega(t)/\sqrt{|g_0|^2 + |g_1|^2 + |\Omega(t)|^2}$. After the driving pulse,
for each of the cavities $\Lambda$ ($\Lambda = L, R$), the final state
between the atom and the corresponding cavity output
has the form

$$|\Psi\rangle_\Lambda = (g_0|0\rangle_\Lambda|h\rangle_\Lambda + g_1|1\rangle_\Lambda|v\rangle_\Lambda)/\sqrt{|g_0|^2 + |g_1|^2}, \qquad (3)$$

where $|\mu\rangle = a_\mu^{\mathrm{eff}\dagger}|\mathrm{vac}\rangle$ ($\mu = h, v$), and $|\mathrm{vac}\rangle$ denotes the
vacuum state of the optical modes.

If the driving pulses have the same shape $\Omega(t)$ for the L
and R cavities, the output single-photon pulses from the
two cavities will also have the same shape $f(t)$, and they
will interfere with high visibility at the polarization
beam splitter (PBS). If one gets a “click” from each of
the detectors at the outputs of the PBS, the two incoming
photons can be either both in $h$ polarizations or both in $v$
polarizations, and these two possibility amplitudes are
coherently superposed when the incoming photon pulses
overlap with each other with the same shape. Therefore,
the measurement in Fig. 1(a), together with the polarization
rotator $R(\pi/2)$, corresponds to projecting the whole
state $|\Psi\rangle_L \otimes |\Psi\rangle_R$ between the atoms and the photons
onto a subspace with the projection operator given by
$P_S = |hv\rangle_{LR}\langle hv| + |vh\rangle_{LR}\langle vh|$. Within this measurement
scheme, the state $|\Psi\rangle_L \otimes |\Psi\rangle_R$ is effectively equivalent to
the four-particle GHZ state

$$|\Psi_{\mathrm{eff}}\rangle \propto P_S|\Psi\rangle_L \otimes |\Psi\rangle_R \propto (|01\rangle_{LR} \otimes |hv\rangle_{LR} + |10\rangle_{LR} \otimes |vh\rangle_{LR})/\sqrt{2}. \qquad (4)$$

The 45° polarizers in Fig. 1(a) project the photon polarizations
to the $(|h\rangle + |v\rangle)/\sqrt{2}$ state. It immediately follows
from Eq. (4) that after this measurement the two
atoms will be prepared in the maximally entangled state
(1). If one rotates the angles of the polarizers in Fig. 1(a),
corresponding to a measurement of the incoming photon
polarizations either in the $\{|h\rangle, |v\rangle\}$ basis or in the
$\{(|h\rangle + |v\rangle)/\sqrt{2}, (|h\rangle - |v\rangle)/\sqrt{2}\}$ basis, one can further
demonstrate four-particle GHZ-type entanglement between
the atoms and the photons as indicated by the
effective state (4) [24]. The 45° polarizer can also be
replaced by a PBS with both of its outputs detected by
single-photon detectors. The measurement success probability
is then increased by a factor of 2 for each side, and
the overall success probability of this scheme becomes

$$p_s = 2|g_0 g_1|^2/(|g_0|^2 + |g_1|^2)^2.$$
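The projection argument can be illustrated numerically. The following is a minimal sketch, assuming ideal single-photon pulses with perfect overlap and no loss; the function names and the array layout (atom index 0/1, photon index 0 for $h$ and 1 for $v$) are conventions chosen here for illustration and are not part of the proposal.

```python
import numpy as np


def atom_photon_state(g0, g1):
    """State of Eq. (3) as a 2x2 array: rows = atom (0, 1), cols = photon (h, v)."""
    psi = np.zeros((2, 2), dtype=complex)
    psi[0, 0] = g0   # |0>|h>
    psi[1, 1] = g1   # |1>|v>
    return psi / np.sqrt(abs(g0) ** 2 + abs(g1) ** 2)


def two_cavity_projection(g0, g1):
    """Apply P_S, then project both photons onto (|h> + |v>)/sqrt(2)."""
    psi = atom_photon_state(g0, g1)
    # Axes: atom L, atom R, photon L, photon R.
    joint = np.einsum("ap,bq->abpq", psi, psi)
    # P_S keeps only the photon components |hv> and |vh>.
    mask = np.zeros((2, 2))
    mask[0, 1] = mask[1, 0] = 1.0
    projected = joint * mask
    p_s = float(np.sum(np.abs(projected) ** 2))
    # 45-degree polarizers on both photons.
    plus = np.array([1.0, 1.0]) / np.sqrt(2)
    atoms = np.einsum("abpq,p,q->ab", projected, plus, plus)
    return p_s, atoms / np.linalg.norm(atoms)


p_s, atoms = two_cavity_projection(1.0, 1.0)
print(p_s)            # weight of the state inside the P_S subspace
print(atoms.real)     # amplitudes of |00>, |01>, |10>, |11>
```

For equal couplings the weight inside the $P_S$ subspace is 1/2 and the conditional atomic state has equal amplitudes on $|01\rangle$ and $|10\rangle$ only, as in Eq. (1). Unequal real couplings lower the weight but leave the conditional atomic state unchanged in this sketch.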

Before introducing the multiatom entangling scheme,
we offer a few remarks about this two-cavity scheme.
First, it is evident that the scheme is inherently robust to
atomic spontaneous emission, output coupling inefficiency,
and detector inefficiency, all of which contribute
to loss of photons. Since a click from each of the detectors
is never recorded if one photon is lost, these processes
simply decrease the success probability $p_s$ by a factor of
$\eta^2$ (where $1 - \eta$ denotes the loss for each of the photons),
but have no influence on the fidelity of the final state
$|\Psi_{LR}\rangle$. Second, our scheme does not require localization
of the atom in the cavity to the Lamb-Dicke limit. For the
standing-wave cavity shown in Fig. 1(a) and with the
collinear pumping configuration proposed in Ref. [21],
$\Omega(t)$, $g_0$, and $g_1$ depend on the atom’s position through
approximately the same cavity mode function. The pulse
shape $f(t)$, which is determined by the ratios $\Omega(t)/g_0$ and
$\Omega(t)/g_1$, thus becomes basically independent of the random
variation in the atom’s position. For a traveling-wave
cavity or for a free-space configuration, the atom’s position
affects only the common phase of the coupling rates
$g_0$ and $g_1$, and in this case, a transverse pumping configuration
also suffices since the randomness in the common
phase of $g_0$ and $g_1$ has no influence on the final
entangled state $|\Psi_{LR}\rangle$. Finally, the success probability of
our scheme is $p_s \approx 1/2$ in the ideal case with $g_0 \approx g_1$
and $\eta \to 1$, which shows that the present scheme is significantly
more efficient than the previous schemes
[4,5,8-11], where the success probability is required to be
much smaller than 1 even if $\eta \to 1$.

We next extend our basic scheme to entangle multiple
atoms in the same optical cavity. The schematic setup is
shown by Fig. 2, with each of the $N_a$ atoms taken to have
the same level structure as depicted in Fig. 1(b) and with
the atoms not separately addressable [16-18]. The initial
state of the system has the form $|G\rangle = \bigotimes_{i=1}^{N_a} |g\rangle_i$ with all
the atoms prepared to the ancillary state $|g\rangle$. The driving
laser, incident from one side mirror, is now divided into
$M$ sequential pulses, with $M > N_a/2$. We assume that the
intensity of the pulse is controlled so that for each of the
$M$ pulses, an approximate fraction $1/M$ of the atomic
population is transferred adiabatically from the $|g\rangle$ state
to the $|0\rangle$ or $|1\rangle$ states, by emitting on average $N_a/M$
photons with $h$ or $v$ polarizations. The output photons
from the cavity decay are split by a PBS according to
their polarizations, and then registered through two
single-photon detectors (called $h$ and $v$ detectors, respectively).
For each driving pulse, we may or may not get a
click from the $h$ or $v$ detectors, which are assumed not to
distinguish one or more photons. For the whole $M$ pulses,
we can count the total number of “clicks” $(n_h, n_v)$
registered from the $(h, v)$ detectors, respectively. Of
course, $n_h + n_v \leq N_a$ since there are only $N_a$ atoms. If
it turns out that $n_h + n_v = N_a$, the following Dicke state
results for the $N_a$ atoms:

$$|N_a, n_h\rangle = c(n_h)\,(s_0^\dagger)^{n_h}(s_1^\dagger)^{N_a - n_h}|G\rangle. \qquad (5)$$

Here, the collective operators $s_\mu^\dagger$ ($\mu = 0, 1$) are defined as
$s_\mu^\dagger = \sum_{i=1}^{N_a} |\mu\rangle_i\langle g|$, and the normalization coefficient
$c(n_h) = [N_a!\, n_h!\, (N_a - n_h)!]^{-1/2}$. Except for the trivial cases
with $n_h = 0, N_a$, clearly the Dicke state $|N_a, n_h\rangle$ is entangled.
The multiatom Dicke states and the GHZ states
in general belong to different classes of entangled states,
and the Dicke states are relatively more robust to the
influence of noise [13]. The Dicke states have some interesting
applications in quantum information processing
and in high-precision measurements [14,15].
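The normalization coefficient $c(n_h)$ in Eq. (5) can be checked by brute force for a few atoms. The sketch below builds the collective operators explicitly on a three-level space per atom; the level ordering (0 for $|g\rangle$, 1 for $|0\rangle$, 2 for $|1\rangle$) and the function names are illustrative assumptions, and the construction scales as $3^{N_a}$, so it is intended only for small $N_a$.

```python
import math
from functools import reduce

import numpy as np

G, ZERO, ONE = 0, 1, 2  # assumed ordering of the levels |g>, |0>, |1>


def collective_raising(level, n_atoms):
    """s_mu^dagger = sum_i |mu>_i <g| acting on n_atoms three-level atoms."""
    single = np.zeros((3, 3))
    single[level, G] = 1.0
    eye = np.eye(3)
    dim = 3 ** n_atoms
    total = np.zeros((dim, dim))
    for i in range(n_atoms):
        factors = [single if j == i else eye for j in range(n_atoms)]
        total += reduce(np.kron, factors)
    return total


def dicke_state(n_atoms, n_h):
    """Dicke state of Eq. (5), including the coefficient c(n_h)."""
    ground = np.zeros(3 ** n_atoms)
    ground[0] = 1.0  # all atoms in |g>
    s0 = collective_raising(ZERO, n_atoms)
    s1 = collective_raising(ONE, n_atoms)
    raw = (np.linalg.matrix_power(s0, n_h)
           @ np.linalg.matrix_power(s1, n_atoms - n_h)
           @ ground)
    c = (math.factorial(n_atoms) * math.factorial(n_h)
         * math.factorial(n_atoms - n_h)) ** -0.5
    return c * raw


for n_h in range(4):
    print(n_h, np.linalg.norm(dicke_state(3, n_h)))
```

Each printed norm should equal one up to rounding, which confirms that $c(n_h)$ normalizes the state for these small cases.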

To  understand  why  a  Dicke  state  results  conditioned 
upon  the  above  type  of  measurement,  we  note  that  each 
atom  has  an  equal  probability  to  emit  a  photon  with  the 
same  pulse  shape  for  each  driving  pulse  for  the  assumed 
sequence  of  adiabatic  passages.  Hence,  each  driving  pulse 


FIG. 2 (color online). The schematic setup to generate entanglement
between multiple atoms in the same cavity. The
polarization rotator $R(\theta, \varphi)$ is only required for generation of
superpositions of the Dicke states.


involves a collective excitation of the atoms to the $|0\rangle$ or
$|1\rangle$ levels with homogeneous superposition coefficients.
For the subset of measurements for which we register $N_a$
photoelectric events in total from the $h$ and $v$ detectors
for the whole $M$ pulses, each click of the detectors should
correspond exactly to the emission of one and only one
photon by the atoms. This is the case even if there are
photon loss and detector inefficiencies, because we postselect
only the trials with exactly $N_a$ photoelectric events.
Therefore, for each click registered from the $h$ or $v$
detectors for these trials, we should apply correspondingly
the collective operators $s_0^\dagger$ or $s_1^\dagger$ to the atomic state.
After registering $n_h$ $h$-polarized photons and $(N_a - n_h)$
$v$-polarized photons, we get exactly the state of Eq. (5).
Similar to the two-cavity scheme, this multiatom entangling
scheme is also robust to practical imperfections,
such as a moderate randomness in the atoms’ positions
and various sources of photon loss. Again, photon loss
reduces the success probability instead of the state
fidelity.

To calculate the success probability of the multiatom
entangling scheme, we note that the stepwise
driving method described above is actually equivalent
to the following one-step driving method: we transfer
all the atomic population to the $|0\rangle$ and $|1\rangle$ levels with a
single driving pulse, but both of the $h$ and $v$ polarized
photons after the PBS need to be further split
equally into $M$ paths through a series of beam splitters,
with separate photoelectric detection for each path. The
state in Eq. (5) corresponds to the case when $n_h$ $h$ detectors
and $(N_a - n_h)$ $v$ detectors register a photoelectric
event. When two or more photons go to the same path,
the number of detector events is certainly less than $N_a$.
So, for overall success with $N_a$ events, we require that
each photon follow a distinct path, for which the success
probability is given by $p_{s1} = (2M)!/[(2M - N_a)!\,(2M)^{N_a}]$
(in total there are $2M$ paths; for simplicity, we have
assumed $g_0 \approx g_1$ so that one has equal probability to
get $h$ or $v$ photons). All photon loss processes simply
contribute to an undercount probability $1 - \eta$ for each
photon. Hence, the success probability to generate one of
the Dicke states of Eq. (5) is $p_{\mathrm{succ}} = \eta^{N_a} p_{s1}$, while the
probability to obtain a specific Dicke state $|N_a, n_h\rangle$ is
$p_{n_h} = p_{\mathrm{succ}}\, 2^{-N_a} N_a!/[n_h!\,(N_a - n_h)!]$. Excluding the trivial
cases with $n_h = 0, N_a$, we then find that the success
probability to obtain an entangled state from this scheme
is $p_{\mathrm{en}} = p_{\mathrm{succ}}(1 - 2^{-N_a+1})$, which tends to unity in the
case $2M \gg N_a$ if we neglect contributions from photon
loss (i.e., $\eta \to 1$). This scheme could thus be quite efficient.
For instance, with $\eta = 0.70$ (0.20) and $M = 50$ (10)
pulses, $p_{\mathrm{en}} = 0.018$ ($1.9 \times 10^{-4}$) for $N_a = 10$ (5) atoms,
so that repeating this scheme on average $1/p_{\mathrm{en}} \approx 56$
($5.4 \times 10^3$) times leads to a high-fidelity entangled state
between 10 (5) atoms. In current experimental setups
[16,17], the typical duration $\Delta t$ of the adiabatic pulse
is a few hundred nanoseconds, so that the total duration
$(M/p_{\mathrm{en}})\Delta t \approx 1$ (20) ms for entangling 10 (5) atoms with
$\eta = 0.70$ (0.20). The currently available trapping time of
atoms in high-Q cavities is about 1 sec [16].
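The success probabilities defined above are straightforward to evaluate. The sketch below implements the three expressions as written in the text, assuming $g_0 \approx g_1$ as the text does; the function names are illustrative.

```python
import math


def p_distinct_paths(n_atoms, n_pulses):
    """p_s1 = (2M)! / [(2M - N_a)! (2M)^N_a]: every photon takes a distinct path."""
    paths = 2 * n_pulses
    return math.perm(paths, n_atoms) / paths ** n_atoms


def p_success(n_atoms, n_pulses, eta):
    """p_succ = eta^N_a * p_s1, the probability of obtaining any Dicke state."""
    return eta ** n_atoms * p_distinct_paths(n_atoms, n_pulses)


def p_specific(n_atoms, n_pulses, eta, n_h):
    """Probability of the specific Dicke state |N_a, n_h>."""
    return (p_success(n_atoms, n_pulses, eta)
            * math.comb(n_atoms, n_h) / 2 ** n_atoms)


def p_entangled(n_atoms, n_pulses, eta):
    """p_en = p_succ * (1 - 2^(-N_a + 1)), excluding n_h = 0 and n_h = N_a."""
    return p_success(n_atoms, n_pulses, eta) * (1 - 2.0 ** (-n_atoms + 1))


p_en = p_entangled(n_atoms=10, n_pulses=50, eta=0.70)
print(f"p_en = {p_en:.4f}, mean number of repetitions = {1 / p_en:.0f}")
```

For the first parameter set quoted in the text ($N_a = 10$, $M = 50$, $\eta = 0.70$) this evaluates to about 0.018. Summing `p_specific` over all $n_h$ recovers `p_success`, a quick consistency check on the binomial weights.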

Finally, we would like to extend further the above
scheme to generate any superposition of the Dicke states
$|N_a, n_h\rangle$. For this purpose, we simply insert a polarization
rotator $R(\theta, \varphi)$ before the PBS as shown in Fig. 2,
which transforms the photon polarizations according to
$|h\rangle \to \cos\theta|h\rangle + \sin\theta\, e^{i\varphi}|v\rangle$ and $|v\rangle \to -\sin\theta\, e^{-i\varphi}|h\rangle +
\cos\theta|v\rangle$. We assume that the parameters $\theta, \varphi$ can be
separately controlled for each driving pulse, and are denoted
by $\theta_m, \varphi_m$ for the $m$th pulse. As before, we consider
only the subset of cases for which exactly $N_a$ photoelectric
events are registered from the whole $M$-pulse sequence.
If the $h$ (or $v$) detector clicks for the $m$th pulse
with the control parameters $\theta_m, \varphi_m$, the corresponding
atomic excitation operator $P_{m0}$ (or $P_{m1}$) is expressed by
the collective operators $s_\mu^\dagger$ as
$P_{m\mu} = \cos\theta_m\, s_\mu^\dagger - (-1)^\mu \sin\theta_m\, e^{i\varphi_m(-1)^\mu} s_{1-\mu}^\dagger$ ($\mu = 0, 1$).
So, after $N_a$ registered events, the final atomic state has the form
$|\Psi_F\rangle = \prod_{i=1}^{N_a} P_{m_i\mu_i}|G\rangle$, where $m_i$ ($i = 1, 2, \ldots, N_a$) denote the set
of driving pulses for which we register a photon. Each
operator $P_{m_i\mu_i}$ introduces two real parameters $\theta_{m_i}, \varphi_{m_i}$, so
there are $2N_a$ independently controllable real parameters
in the state $|\Psi_F\rangle$. The state $|\Psi_F\rangle$ can be written in general
in the form

$$|\Psi_F\rangle = \sum_{n_h=0}^{N_a} b(n_h)|N_a, n_h\rangle, \qquad (6)$$

where the Dicke states $|N_a, n_h\rangle$ are defined by Eq. (5), and
the complex superposition coefficients $b(n_h)$ are functions
of $\theta_{m_i}, \varphi_{m_i}$. Superpositions of the Dicke states
have $2N_a$ degrees of freedom, which exactly equals
the number of control parameters $\theta_{m_i}, \varphi_{m_i}$.

Actually, we can prove that an arbitrary superposition
of the Dicke states $|N_a, n_h\rangle$ [i.e., the state $|\Psi_F\rangle$ with any
coefficients $b(n_h)$] is obtainable by choosing an appropriate
set of control parameters $\theta_{m_i}, \varphi_{m_i}$. For the proof, we
write the state (6) in the form
$|\Psi_F\rangle = b(N_a)c(N_a) \sum_{n_h=0}^{N_a} b'(n_h)(s_0^\dagger)^{n_h}(s_1^\dagger)^{N_a - n_h}|G\rangle$,
where $b'(n_h) = c(n_h)b(n_h)/[b(N_a)c(N_a)]$, and without loss of generality
we have assumed $b(N_a) \neq 0$. Each of the atomic excitation
operators $P_{m_i\mu_i}$ can be expressed as
$P_{m_i\mu_i} \propto (s_0^\dagger - r_{m_i\mu_i} s_1^\dagger)$, where the complex coefficient $r_{m_i\mu_i}$, determined
by the real parameters $\theta_{m_i}, \varphi_{m_i}$, is the relevant
control parameter. To prepare a desired state $|\Psi_F\rangle$ with
the superposition coefficients $b'(n_h)$, we need to choose
the parameters $r_{m_i\mu_i}$ to satisfy the algebraic equation
$\prod_{i=1}^{N_a}(s_0^\dagger - r_{m_i\mu_i} s_1^\dagger) = \sum_{n_h=0}^{N_a} b'(n_h)(s_0^\dagger)^{n_h}(s_1^\dagger)^{N_a - n_h}$.
It immediately follows from this equation that the parameters
$r_{m_i\mu_i}$ should be the $N_a$ solutions of the $N_a$th-order algebraic
equation $\sum_{n_h=0}^{N_a} b'(n_h)x^{n_h} = 0$, where $x$ denotes the
variable. In the complex domain, there always exist $N_a$
solutions to the $N_a$th-order algebraic equation, and the
parameters $r_{m_i\mu_i}$ are uniquely determined from these solutions
if we do not care about the order of the excitation
operators $P_{m_i\mu_i}$ (note that they commute with each other).
This finishes the proof.
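The root-finding step of the proof can be sketched numerically. In the example below, the target coefficients $b'(n_h)$ are arbitrary illustrative values (with $b'(N_a) = 1$ as in the text), the function names are hypothetical, and NumPy's general-purpose polynomial routines stand in for whatever solver one might prefer; converting each root back into angles $\theta_{m_i}, \varphi_{m_i}$ is not attempted here.

```python
import numpy as np


def control_roots(b_prime):
    """Solve sum_n b'(n) x^n = 0 for the complex parameters r.

    b_prime[n] is the coefficient of x**n for n = 0..N_a, with b_prime[N_a] = 1.
    """
    # np.roots expects coefficients ordered from the highest power down.
    highest_first = np.asarray(b_prime, dtype=complex)[::-1]
    return np.roots(highest_first)


def expand_product(roots):
    """Coefficients of prod_i (x - r_i), returned lowest power first."""
    return np.poly(roots)[::-1]


# Assumed target coefficients for N_a = 3 (illustrative values only).
target = [1 + 2j, 0.5, -1j, 1.0]
r = control_roots(target)
print(r)                   # three complex control parameters
print(expand_product(r))   # reproduces the target coefficients
```

Because the factors $(s_0^\dagger - r\, s_1^\dagger)$ commute, expanding their product is formally the same as expanding $\prod_i (x - r_i)$, which is why recovering the target coefficients from the roots mirrors the algebraic identity used in the proof.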
We experimentally demonstrate quantum teleportation for continuous variables using squeezed-state
entanglement. The teleportation fidelity for a real experimental system is calculated explicitly,
including relevant imperfection factors such as propagation losses, detection inefficiencies and phase
fluctuations. The inferred fidelity for input coherent states is $F = 0.61 \pm 0.02$, which when corrected
for the efficiency of detection by the output observer, gives a fidelity of 0.62. By contrast,
the projected result based on the independently measured entanglement and efficiencies is 0.69.
The teleportation protocol is explained in detail, including a discussion of discrepancy between
experiment and theory, as well as of the limitations of the current apparatus.


I.  INTRODUCTION 

The No Cloning Theorem prohibits making an exact
copy of an unknown quantum state [1]. Yet, it is nevertheless
possible to transport an unknown quantum state
from one place to another without having the associated
physical object propagate through the intervening space
by way of a process termed quantum teleportation in the
landmark work by Bennett et al. [2] in 1993. This “disembodied”
transport of quantum states is made possible
by utilizing shared quantum entanglement and classical
communication between the sending and receiving locations.
In recent years, quantum teleportation has played
a central role in quantum information science and has
become an essential tool in diverse quantum algorithms
and protocols [3, 4].

By contrast, progress on the experimental front has been rather more modest in the actual
attainment of quantum teleportation [5, 6, 7, 8]. An overview of these various experiments,
as well as operational criteria for gauging laboratory success, can be found in Refs.
[9, 10]. Significantly, to date only the experiment of Furusawa et al. [7] on continuous
variables has achieved unconditional quantum teleportation [11].

The purpose of this paper is to report our progress in continuing the experiment reported
by Furusawa et al. [7] and described in Ref. [12]. We give a detailed description of our
quantum teleportation apparatus and procedures, and include recent experimental results
[13]. Notable distinctions between our current experiment and the previous one by Furusawa
et al. are improved EPR entanglement, better detection efficiencies, and ultimately a
higher fidelity between the input and teleported output states. We also investigate in some
detail the various factors that limit the quality of the teleportation procedure under
realistic conditions applicable to the experimental setup. We provide a detailed model of
the entire experiment that includes essentially all of the dominant loss mechanisms, and we
use this model to gain insight into the limitations of the current apparatus and protocols,
and thereby to discover methods of circumventing these limitations.


Our experiment is based on the continuous variable teleportation protocol first proposed in
[14], which in turn was motivated by the work of Vaidman [15]. In our realization of this
protocol, an entangled EPR state [16] is created from two independent squeezed fields. One
half of this entangled state (called EPR1) is sent to Alice, who in turn combines it at a
50/50 beamsplitter with an unknown input state that is intended for teleportation. Note
that the input quantum state is unknown to both Alice and Bob. Alice subsequently measures
the x and p quadratures of the two output fields from the beamsplitter, the x quadrature
for one beam and the p quadrature for the other. This measurement of (x, p) provides the
continuous variable analog of a Bell-state measurement for the discrete variable case [17].
In the limit of perfect EPR correlations, Alice gains no information about the input state.
The output photocurrents from Alice’s two quadrature measurements are transmitted to Bob
via classical information channels. Bob then uses them to perform a continuous phase space
displacement on the second EPR beam (EPR2), thereby generating the teleported output state.
For perfect EPR correlations, the teleported state has unit fidelity with the original
unknown input state, as can be verified by “Victor,” who both generates the original input
and measures the teleported output. Of course, the limit of this ideal case is unattainable
in any laboratory setting. This necessitates the introduction of operational criteria to
gauge the success of the protocol, as discussed in Refs. [9, 10], and as will be applied in
relation to our experiment.

The paper is organized as follows. In Sec. II we discuss the fidelity for quantum
teleportation in the presence of losses and phase fluctuations, including, importantly, for
the EPR beams. In Sec. III this model is connected to the laboratory via a detailed
discussion of the generation of our EPR resource, including specifications of the optical
parametric oscillator (OPO) parameters, the obtainable squeezing, and the characterization
of the EPR state. The technical details of the actual implementation of the quantum
teleportation protocol are discussed in depth in Sec. IV, with emphasis on the phase-lock
servo systems and the calibration of the classical information channels. Here, we also
present new data on the teleportation of coherent states of light. These experimental data
are compared to theoretical calculations based on the relevant parameters for the
experiment, with each parameter measured in absolute terms without adjustment. Finally, we
collect our conclusions in Sec. V, together with an outlook for future progress.


II.  THEORY 

In this section a theoretical description of the quantum teleportation protocol for
continuous variables is given. This is a generalization of previous work in order to
include all relevant detector inefficiencies and phase offsets for the experiment. The
discussion is divided into two parts: in Sec. II A the effect of nonideal homodyne
detectors is investigated, while Sec. II B concerns phase fluctuations due to imperfect
phase-lock servos. Both effects turn out to be of substantial importance in trying to
accurately model experimental data.


A.  Detection  inefficiencies 


FIG. 1: Main parts in the teleportation protocol for continuous variables. Indicated are
the relevant efficiencies ($\xi_{1-5}$, $\xi_{\mathrm{EPR}}$) that limit the teleportation
fidelity.


Fig. 1 shows a simplified schematic of the experiment for teleportation of an unknown
quantum state provided by the verifier Victor and characterized by a pure input state
$|\psi_{in}\rangle$. The process is as follows: Alice performs measurements of the two
quadratures x and p of the fields obtained by combining the unknown input state with EPR1.
This is done by implementing two balanced homodyne detectors, where the signal fields are
each combined with a strong coherent local oscillator (LO) and the resulting output
intensities are measured. Subtracting the two photocurrents from a given set of detectors
results in a signal proportional to the quadrature amplitude, with the relevant quadrature
selected by the phase of the LO. The efficiency of the homodyne detectors can be
characterized by the visibilities ($\xi_2$, $\xi_3$) of the overlap between the LOs and the
output beams from Alice’s beamsplitter, as well as the detectors’ quantum efficiency
($\alpha$). Furthermore, the visibility ($\xi_1$) of the overlap between the input state
and EPR1 is relevant.

Because of the nature of the EPR correlations, the effect of Alice’s quadrature
measurements is to project EPR2 onto a state that differs from the unknown input state only
by a phase space displacement. The necessary displacement, however, depends on the outcome
of Alice’s measurements. Hence the task for Bob is to perform this phase space displacement
with the classical information received from Alice by way of the photocurrents
($i_x$, $i_p$) shown in Fig. 1. In practice this is accomplished by overlapping EPR2 with a
phase- and amplitude-modulated coherent state on a 99/1 beamsplitter. This modulation is
directly driven (with suitable gain and phase compensation) by the photocurrents
($i_x$, $i_p$) from Alice’s detectors. The relevant efficiency is the visibility ($\xi_4$)
between EPR2 and the modulated coherent state. Finally, the quality of the teleportation
can be checked by a third party (Victor) who performs homodyne detection on the output
state. The visibility of Victor’s homodyne detector is denoted $\xi_5$.

As discussed in more detail in Refs. [9, 10], the performance of the teleportation protocol
can be quantified by the fidelity F, which is defined by

$$F = \langle \psi_{in} | \rho_{out} | \psi_{in} \rangle, \tag{1}$$

which is simply the overlap between the input state $|\psi_{in}\rangle$ (which is assumed
to be a pure state) and the output state characterized by a density matrix $\rho_{out}$. In
the limit of perfect detectors (unity efficiencies) but with a finite degree of EPR
correlation, the fidelity for quantum teleportation of coherent states can be shown to be
[7, 9]


$$F = \frac{2}{\sigma_Q} \exp\left[-\frac{2}{\sigma_Q}\,|\beta_{out} - \beta_{in}|^2\right], \tag{2}$$

where

$$\sigma_Q = \sqrt{(1 + \sigma_W^x)(1 + \sigma_W^p)}, \tag{3a}$$

$$\sigma_W^x = \sigma_W^p = g^2 + \frac{e^{2r_+}}{2}(1 - g)^2 + \frac{e^{-2r_-}}{2}(1 + g)^2. \tag{3b}$$

Here $\sigma_W^x$ and $\sigma_W^p$ are the variances (in the Wigner representation) of the
teleported x and p quadratures that emerge from Bob’s beamsplitter (as shown in Fig. 1 at
$\rho_{out}$). g is the gain of the classical channels, where we have assumed that the two
classical channels have the same gain and that any phase offsets have been appropriately
compensated. Furthermore, $\beta_{in}$ and $\beta_{out}$ are the amplitudes of the unknown
input field and teleported output field, respectively. Finally, $r_+$ and $r_-$ are the
anti-squeezing and squeezing parameters, respectively, for the two equally squeezed beams
used to produce the EPR correlations, as will be discussed in detail in Sec. III C.
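
As an illustration of Eqs. (2) and (3), the following minimal Python sketch evaluates the
ideal-detector fidelity for a given gain and squeezing. It is not part of the original
analysis: the function names are hypothetical, the output amplitude is assumed to be
$\beta_{out} = g\,\beta_{in}$, and the dB conversion assumes $e^{-2r_-} = 10^{-\mathrm{dB}/10}$.

```python
import math

def wigner_variance(g, r_plus, r_minus):
    """Eq. (3b): variance of a teleported quadrature, in units where vacuum = 1."""
    return (g ** 2
            + 0.5 * math.exp(2.0 * r_plus) * (1.0 - g) ** 2
            + 0.5 * math.exp(-2.0 * r_minus) * (1.0 + g) ** 2)

def fidelity(g, r_plus, r_minus, beta_in=0.0):
    """Eqs. (2) and (3a) for a coherent input (assumes beta_out = g * beta_in)."""
    sigma_w = wigner_variance(g, r_plus, r_minus)
    sigma_q = 1.0 + sigma_w  # sqrt((1 + s)(1 + s)) for equal x and p variances
    displacement = abs(g * beta_in - beta_in) ** 2
    return (2.0 / sigma_q) * math.exp(-2.0 * displacement / sigma_q)

def squeezing_db_to_r(db_value):
    """Squeezing parameter r for a noise change of db_value decibels."""
    return db_value * math.log(10.0) / 20.0

print(fidelity(1.0, 0.0, 0.0))  # no entanglement, unity gain: 0.5
r = squeezing_db_to_r(3.0)
print(round(fidelity(1.0, r, r), 3))  # pure 3 dB squeezing at unity gain
```

At unity gain the exponential factor equals one for any input amplitude, so the fidelity is
set entirely by the added noise; away from unity gain it also depends on $|\beta_{in}|$.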
Any real experiment of course suffers from finite losses in propagation and detection, with
the individual efficiencies being critical due to the fragility of quantum states of light.
It turns out that the general expression (2) for the fidelity still applies to the case
with losses, but the variances of the quadratures of the teleported field must be
generalized. In addition, we take into account the fact that we do not observe the output
state directly, but instead measure the output photocurrent from Victor’s balanced homodyne
detector. If we assume as before that the input states to Alice are coherent states, then
the quadrature variance $\sigma_V^x$ recorded by Victor for the teleported output state can
be written as

$$\sigma_V^x = 1 - r_B^2 \xi_4^2 \xi_5^2 \eta_V^2 - g_x^2 \xi_1^2
 + \frac{2 g_x^2}{\xi_2^2 \eta_{Ax}^2}
 + \frac{e^{-2r_-}}{2}\left(g_x \xi_1 + r_B \xi_4 \xi_5 \eta_V\right)^2
 + \frac{e^{2r_+}}{2}\left(g_x \xi_1 - r_B \xi_4 \xi_5 \eta_V\right)^2, \tag{4}$$

and the variance for the p quadrature is given by
$\sigma_V^p = \sigma_V^x(g_x \to g_p,\ \eta_{Ax} \to \eta_{Ap},\ \xi_2 \to \xi_3)$. The
fidelity is then obtained by replacing $\sigma_W^{x,p}$ with $\sigma_V^{x,p}$ in Eqs. (3).
The non-unit reflectivity of Bob’s beamsplitter appears as a loss factor $r_B$, where in
our experiment, $|r_B| = 0.99$. The $\eta_i$ are detector efficiency factors directly
related to the quantum efficiencies $\alpha_i$ by $\alpha_i = \eta_i^2$, where the
subscripts denote Alice x, Alice p or Victor. The $g_{x,p}$ are the suitably normalized
gains for the x and p classical channels through which Alice sends information to Bob.

In terms of the model given in Fig. 1, the gains $g_{x,p}$ are given explicitly by

$$g_x = \frac{g_{x,(0)}}{\sqrt{2}}\, t_B\, \xi_2 \xi_5\, \eta_{Ax} \eta_V, \tag{5a}$$

$$g_p = \frac{g_{p,(0)}}{\sqrt{2}}\, t_B\, \xi_3 \xi_5\, \eta_{Ap} \eta_V. \tag{5b}$$

Here, $g_{x,(0)}$ and $g_{p,(0)}$ are dimensionless gains that account for the translation
of the photocurrents $i_{x,p}$ into fields by Bob’s amplitude and phase modulation, where
the point of reference is immediately before his beamsplitter, which is taken to have
amplitude reflection and transmission coefficients ($r_B$, $t_B$). Note that the formal
limit of a phase-space displacement by Bob is achieved only for the case
($t_B \to 0$, $g_{(0)} \to \infty$), with the product $t_B\, g_{x,(0)}$ held constant.

The convention that we adopt for the normalization of the gains $g_{x,p}$ in Eqs. (4) and
(5) is such that $g_x = g_p = 1$ results in $\beta_V = \beta_{in}$, and hence reflects an
optimal reconstruction of the input state for any sensible values of the squeezing
parameters $r_\pm$. The caveat here is that since we measure Victor’s photocurrent and not
the field emerging from Bob, we effectively set $|\beta_V|^2 = |\beta_{in}|^2$ and not
$|\beta_{out}|^2 = |\beta_{in}|^2$ as required by the protocol, where it can be easily
shown that

$$|\beta_V|^2 = \xi_5^2\, \eta_V^2\, |\beta_{out}|^2. \tag{6}$$

This defect in our measurement will be discussed quantitatively when we present our
experimental data in Sec. IV D. Note that if Victor has perfect detection efficiency
($\xi_5 = \eta_V = 1$), the problem vanishes, and the result given in Eq. (4) is exact for
the teleported output field emerging from Bob’s beamsplitter.

The corresponding variances obtained by Alice’s homodyne detectors are given by

$$\sigma_A^x = 1 + \frac{1}{4}\left(e^{-2r_-} + e^{2r_+} - 2\right) \xi_1^2 \xi_2^2 \eta_{Ax}^2, \tag{7a}$$

$$\sigma_A^p = 1 + \frac{1}{4}\left(e^{-2r_-} + e^{2r_+} - 2\right) \xi_1^2 \xi_3^2 \eta_{Ap}^2. \tag{7b}$$

Several limiting cases associated with these expressions are worth noting. In the classical
case where there is no EPR entanglement ($r_+ = r_- = 0$), and with perfect homodyne
detectors ($\xi_{1-5} = \eta_i = 1$), we obtain $\sigma_V^x = \sigma_V^p = 3$,
corresponding to three units of vacuum noise in Victor’s homodyne detector. One unit stems
from the vacuum noise intrinsic to the input coherent state, while the two extra units can
be traced back as the quantum duties added in each crossing of the border between quantum
and classical domains, corresponding to Alice’s quadrature measurements and Bob’s phase
space displacement [14]. This means that for classical teleportation of coherent states,
the best achievement possible is reconstructing the input state with two extra units of
vacuum noise added [9, 10]. The three vacuum units correspond to excess noise recorded in
Victor’s homodyne detector of 4.77 dB above the vacuum-state limit for his detector. With
quantum entanglement it is possible to beat this limit and observe noise reduction below
the 4.77 dB level in Victor’s detector. The measured noise reduction can then be
transferred into a fidelity through Eq. (2). As analyzed in Refs. [9, 10], the classical
boundary for teleportation of coherent states is F = 0.5.
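
As a quick check of these limits, here is a worked instance of Eqs. (2) and (3) at unity
gain. It assumes $\beta_{out} = \beta_{in}$ and equal x and p variances, so that
$\sigma_Q = 1 + \sigma_W$, which is the form the quoted values of 4.77 dB and F = 0.5
require:

```latex
\begin{aligned}
r_\pm = 0,\ g = 1:\quad
  & \sigma_W = 1 + \tfrac{1}{2}(1 + 1)^2 = 3,
    \qquad 10\log_{10} 3 \approx 4.77\ \text{dB},\\
  & \sigma_Q = 1 + \sigma_W = 4,
    \qquad F = 2/\sigma_Q = 0.5;\\
r_- \to \infty,\ g = 1:\quad
  & \sigma_W \to 1, \qquad \sigma_Q \to 2, \qquad F \to 1.
\end{aligned}
```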

In the case of nonideal detectors, $g_x = g_p = 1$ still preserves optimal teleportation
for the normalized gains, in the sense that $\beta_V = \beta_{in}$. However, the
normalization is performed by effectively tuning the unnormalized gains $g_{(0)}$ by

$$g_{x,(0)}^{\mathrm{nonideal}} = \left(\xi_2 \xi_5 \eta_{Ax} \eta_V\right)^{-1} g_{x,(0)}^{\mathrm{ideal}}, \tag{8}$$

and similarly for p. Thus in the nonideal case, the actual gain is larger than in the ideal
case, reflecting the fact that the gain must now compensate for Alice’s and Victor’s
detection losses in order to ensure $\beta_V = \beta_{in}$. As a consequence, the fidelity
drops below F = 0.50 with no entanglement ($r_\pm = 0$). In our experiment, the detection
efficiencies are characterized by the measured visibilities and quantum efficiencies, which
in the best case are given by $\xi_1 = 0.986$, $\xi_2 = \xi_3 = 0.995$, $\xi_4 = 0.988$,
$\xi_5 = 0.985$, and $\alpha_V = \alpha_{Ax} = \alpha_{Ap} = 0.988$. With these
experimentally achievable efficiency factors, we find that $\sigma_V^{x,p} = 4.84$ dB and
F = 0.494 when $r_\pm = 0$.
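
The quoted values can be checked numerically. The sketch below evaluates Eq. (4) at
$g_x = 1$ and $r_\pm = 0$ with the best-case efficiencies listed above, then applies
Eqs. (2) and (3a). The function and variable names are hypothetical, $\eta = \sqrt{\alpha}$
is used for the detector efficiency factors, and $r_B = 0.99$ is taken as an amplitude
coefficient as stated in the text (it drops out of the result at $r_\pm = 0$).

```python
import math

def victor_variance(g, r_plus, r_minus, xi1, xi2, xi4, xi5, eta_A, eta_V, r_B):
    """Eq. (4): x-quadrature variance at Victor's detector (vacuum = 1)."""
    a = r_B * xi4 * xi5 * eta_V  # combined amplitude factor for the EPR2 path
    return (1.0 - a ** 2 - (g * xi1) ** 2
            + 2.0 * g ** 2 / (xi2 * eta_A) ** 2
            + 0.5 * math.exp(-2.0 * r_minus) * (g * xi1 + a) ** 2
            + 0.5 * math.exp(2.0 * r_plus) * (g * xi1 - a) ** 2)

def fidelity_from_variances(sigma_x, sigma_p):
    """Eqs. (2) and (3a) at unity normalized gain, where the exponential is 1."""
    return 2.0 / math.sqrt((1.0 + sigma_x) * (1.0 + sigma_p))

eta = math.sqrt(0.988)  # alpha = eta^2
params = dict(xi1=0.986, xi2=0.995, xi4=0.988, xi5=0.985,
              eta_A=eta, eta_V=eta, r_B=0.99)

sigma = victor_variance(1.0, 0.0, 0.0, **params)
noise_db = 10.0 * math.log10(sigma)
F = fidelity_from_variances(sigma, sigma)  # xi2 = xi3, so x and p agree
# Eq. (8): factor by which the unnormalized gain exceeds its ideal value
gain_ratio = 1.0 / (params["xi2"] * params["xi5"] * params["eta_A"] * params["eta_V"])

print(f"{noise_db:.2f} dB, F = {F:.3f}, gain ratio = {gain_ratio:.3f}")
```

With no squeezing the squeezed and anti-squeezed terms combine so that only
$1 + 2/(\xi_2^2 \eta_{Ax}^2)$ survives, which is why only Alice’s detection efficiencies
push the result above 4.77 dB.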

Fig. 2 shows the excess noise recorded by Victor and Alice x (or equivalently Alice p) as a
function of the amount of squeezing, both for the ideal case with perfect detection
efficiencies, and for the nonideal case with the detector efficiencies given above. With no
squeezing in the ideal case, we see from the solid curves that Victor obtains exactly
4.77 dB of excess noise, as expected and as discussed above, while Alice is
shot-noise-limited. With imperfect detection efficiencies, as shown by the dashed curves,
Alice remains shot-noise-limited, while Victor records excess noise higher than 4.77 dB. In
fact, the only relevant efficiencies that drive Victor’s recorded noise above 4.77 dB
involve Alice’s homodyne detectors, namely ($\xi_2$, $\eta_{Ax}$) for the x quadrature, and
($\xi_3$, $\eta_{Ap}$) for the p quadrature. All other detection losses can be compensated
by the gains $g_{x,p}$ when $r_\pm = 0$.

As the squeezing is increased so that now $r_\pm > 0$, Victor records noise reduction below
the $r_\pm = 0$ level. By contrast, Alice’s noise increases above the vacuum level at her
detectors, and in the limit of infinite squeezing, Alice’s noise diverges while Victor’s
excess noise is suppressed to the vacuum level. Notice that with perfect detection
efficiencies, $\sigma_V^{x,p} < 4.77$ dB for any $r_\pm > 0$. With imperfect efficiencies,
this is not true. In effect, some of the squeezing is “wasted” to compensate for the
nonideal efficiencies. Since our experimental visibilities are close to unity, this loss
can be neglected as it is below the level of other experimental uncertainties for small
values of $r_\pm$. However, with large degrees of squeezing, the disparity between the
ideal and nonideal cases increases and cannot be ignored, as can be seen from Fig. 2. The
reason for this trend is that now the visibilities $\xi_1$ and $\xi_4$ that characterize
the overlap of the EPR beams with Alice’s and Bob’s relevant beams, as well as the non-unit
reflectivity $r_B$ of Bob’s beamsplitter, become important. The losses from non-unit
$\xi_1$ and $\xi_4$ obviously cannot be compensated by the gains of the classical channels.
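
The trends described for Fig. 2 can be reproduced with a short sweep over the degree of
squeezing. This is only a sketch under stated assumptions, not the authors’ calculation: it
uses Eqs. (4) and (7a) at unity normalized gain, takes pure squeezing ($r_+ = r_- = r$),
sets $r_B = 1$ for the ideal case and $r_B = 0.99$ otherwise, and all names are
hypothetical.

```python
import math

def to_db(variance):
    """Noise power in dB relative to the vacuum level (variance = 1)."""
    return 10.0 * math.log10(variance)

def alice_variance(r, xi1, xi2, eta_A):
    """Eq. (7a) with r_+ = r_- = r (pure squeezing, an assumption of this sketch)."""
    excess = 0.25 * (math.exp(-2.0 * r) + math.exp(2.0 * r) - 2.0)
    return 1.0 + excess * (xi1 * xi2 * eta_A) ** 2

def victor_variance(r, xi1, xi2, xi4, xi5, eta_A, eta_V, r_B, g=1.0):
    """Eq. (4) with r_+ = r_- = r and normalized gain g."""
    a = r_B * xi4 * xi5 * eta_V
    return (1.0 - a ** 2 - (g * xi1) ** 2
            + 2.0 * g ** 2 / (xi2 * eta_A) ** 2
            + 0.5 * math.exp(-2.0 * r) * (g * xi1 + a) ** 2
            + 0.5 * math.exp(2.0 * r) * (g * xi1 - a) ** 2)

ETA = math.sqrt(0.988)  # amplitude efficiency factor from alpha = eta^2
IDEAL = dict(xi1=1.0, xi2=1.0, xi4=1.0, xi5=1.0, eta_A=1.0, eta_V=1.0, r_B=1.0)
REAL = dict(xi1=0.986, xi2=0.995, xi4=0.988, xi5=0.985,
            eta_A=ETA, eta_V=ETA, r_B=0.99)

def sweep(squeezing_db_values=(0.0, 1.0, 3.0, 6.0, 10.0)):
    """Return (squeezing dB, Victor ideal dB, Victor real dB, Alice real dB) rows."""
    rows = []
    for s_db in squeezing_db_values:
        r = s_db * math.log(10.0) / 20.0  # e^{-2r} = 10^{-s_db/10}
        alice = alice_variance(r, REAL["xi1"], REAL["xi2"], REAL["eta_A"])
        rows.append((s_db,
                     to_db(victor_variance(r, **IDEAL)),
                     to_db(victor_variance(r, **REAL)),
                     to_db(alice)))
    return rows

for s_db, v_ideal, v_real, a_real in sweep():
    print(f"{s_db:5.1f} dB squeezing: Victor ideal {v_ideal:5.2f} dB, "
          f"Victor real {v_real:5.2f} dB, Alice real {a_real:5.2f} dB")
```

In this sketch Alice starts at the vacuum level and rises with squeezing, while the gap
between Victor’s ideal and nonideal noise grows (in dB) as the squeezing increases, in line
with the discussion above.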

The noise reduction at Victor can be transferred into a teleportation fidelity, with the
result plotted in Fig. 3. The solid and dashed curves for the ideal and nonideal cases
mimic the conclusions discussed above for the variances $\sigma_V^{x,p}$ measured by
Victor.

Of course, the teleportation fidelity is very dependent on the detector efficiencies. We
investigate this point in more detail in Fig. 4, where the fidelity is plotted as a
function of a single global visibility $\xi$ (assuming $\xi_{1-5} = \xi$) and where the
quantum efficiencies of all the photodetectors are $\alpha = 0.988$. This figure clearly
illustrates the need for a high amount of squeezing as well as very efficient spatial
mode-matching of our optical beams to achieve high fidelity quantum teleportation.


B.  Phase  fluctuations 

Losses associated with the detection efficiencies are not the only factor limiting the
achieved fidelity for quantum teleportation. The quality of the servo-control systems that
lock various phases (e.g., the local oscillator phases at Alice’s detectors) also appears
to be of significant importance, since phase deviations due to nonideal locking turn out to
deteriorate the noise reduction measured by Victor. We will see that this mathematically
corresponds to mixing in terms proportional to the anti-squeezed quadratures of the
squeezed beams constituting the EPR state.

FIG. 2: Noise powers in dB above the vacuum-state limit for Alice’s x detector and for
Victor’s detector as a function of the degree of squeezing of each squeezed vacuum state
constituting the EPR state. The solid traces are for an ideal case where both Alice and
Victor have perfect detection efficiency and all relevant beams are perfectly overlapped,
that is, $\xi_{1-5} = \alpha_i = 1$. The dashed traces show the noise levels for a real
(nonideal) case where the visibilities correspond to the experiment described below and are
$\xi_1 = 0.986$, $\xi_2 = \xi_3 = 0.995$, $\xi_4 = 0.988$, $\xi_5 = 0.985$, and the quantum
efficiencies of photodetectors are $\alpha_i = 0.988$. The squeezing given in the figure is
the squeezing just before the EPR beamsplitter.

FIG. 3: The fidelity for Victor’s teleported output as a function of the degree of
squeezing with the same parameters as in Fig. 2. Again, the solid trace describes a perfect
case with ideal detectors, while the dashed trace describes the imperfect case as relevant
to our experiment.

FIG. 4: Fidelity as a function of visibility for different values of the degree of
squeezing. We have assumed $\xi_{1-5} = \xi$ and $\alpha = 0.988$.

In a realistic model of the experiment we include phase offsets of four servo locks: the
EPR lock, Alice’s two homodyne detectors, as well as Bob’s lock of the phase between the
second EPR field and the classical field. The analysis presented here will be a
straightforward generalization of the derivation in [18] based on the Heisenberg picture.
The quadratures of the two EPR fields (1 and 2) are obtained by combining two squeezed
fields with the angle between the squeezing ellipses equal to $\pi/2$. Although we have
investigated a more complete model, here we account for the phase deviation away from
$\pi/2$ by introducing an angle offset $\theta_E$ for field 2. In this simple nonideal
case, we obtain for the fields emerging from the EPR beamsplitter

$$x_{1,2} = \frac{1}{\sqrt{2}}\left(e^{r_+} x_1^{(0)} \pm \cos\theta_E\, e^{-r_-} x_2^{(0)} \mp \sin\theta_E\, e^{r_+} p_2^{(0)}\right), \tag{9a}$$

$$p_{1,2} = \frac{1}{\sqrt{2}}\left(e^{-r_-} p_1^{(0)} \mp \cos\theta_E\, e^{r_+} p_2^{(0)} \pm \sin\theta_E\, e^{-r_-} x_2^{(0)}\right), \tag{9b}$$

where $x_{1,2}^{(0)}$ and $p_{1,2}^{(0)}$ are vacuum state operators for (x, p) for the
input beams 1 and 2 to the beamsplitter, respectively. Further downstream, EPR beam 1 is
mixed with the unknown input state on a 50/50 beamsplitter, creating the modes
$\hat{u} = (\hat{a}_{in} - \hat{a}_1)/\sqrt{2}$ and
$\hat{v} = (\hat{a}_{in} + \hat{a}_1)/\sqrt{2}$, and Alice measures the two quadrature
amplitudes of the corresponding state in her homodyne detectors. Allowing for small phase
deviations $\theta_{Ax}$ and $\theta_{Ap}$ in the detection process, we find that the
quadratures measured by Alice now become

$$x_u(\theta_{Ax}) = x_u \cos\theta_{Ax} + p_u \sin\theta_{Ax}, \tag{10a}$$

$$p_v(\theta_{Ap}) = p_v \cos\theta_{Ap} - x_v \sin\theta_{Ap}. \tag{10b}$$

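Finally, Bob performs a phase space displacement of the second EPR beam by overlapping it
with the coherent beam containing the classical information received from Alice. Allowing
again for a phase offset $\theta_B$ in Bob’s phase-space displacement, we calculate that
the quadrature operators for the teleported state exiting the apparatus for investigation
by Victor are given by

$$x_V = x_2 \cos\theta_B + p_2 \sin\theta_B + \sqrt{2}\, x_u(\theta_{Ax}), \tag{11a}$$

$$p_V = p_2 \cos\theta_B - x_2 \sin\theta_B + \sqrt{2}\, p_v(\theta_{Ap}), \tag{11b}$$

where the normalized gains of the classical channels have been taken to be unity. Using
Eqs. (9) and (10), we arrive at expressions for the Heisenberg operators for the teleported
field received by Victor, namely

$$\begin{aligned}
\sqrt{2}\, x_V = {} & (\cos\theta_B - \cos\theta_{Ax})\, e^{r_+} x_1^{(0)}
  + (\sin\theta_B - \sin\theta_{Ax})\, e^{-r_-} p_1^{(0)} \\
 & + \left[\cos\theta_E(\cos\theta_B + \cos\theta_{Ax}) - \sin\theta_E(\sin\theta_B + \sin\theta_{Ax})\right] e^{-r_-} x_2^{(0)} \\
 & + \left[\sin\theta_E(\cos\theta_B + \cos\theta_{Ax}) + \cos\theta_E(\sin\theta_B + \sin\theta_{Ax})\right] e^{r_+} p_2^{(0)} \\
 & + \sqrt{2}\cos\theta_{Ax}\, x_{in} + \sqrt{2}\sin\theta_{Ax}\, p_{in},
\end{aligned} \tag{12a}$$

$$\begin{aligned}
\sqrt{2}\, p_V = {} & -(\sin\theta_B + \sin\theta_{Ap})\, e^{r_+} x_1^{(0)}
  + (\cos\theta_B + \cos\theta_{Ap})\, e^{-r_-} p_1^{(0)} \\
 & + \left[\sin\theta_E(\cos\theta_{Ap} - \cos\theta_B) + \cos\theta_E(\sin\theta_{Ap} - \sin\theta_B)\right] e^{-r_-} x_2^{(0)} \\
 & + \left[\sin\theta_E(\sin\theta_{Ap} - \sin\theta_B) + \cos\theta_E(\cos\theta_B - \cos\theta_{Ap})\right] e^{r_+} p_2^{(0)} \\
 & + \sqrt{2}\cos\theta_{Ap}\, p_{in} - \sqrt{2}\sin\theta_{Ap}\, x_{in}.
\end{aligned} \tag{12b}$$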


By utilizing these expressions, the variances of the two quadratures measured by Victor can
be calculated. Assuming the phase excursions are small, we expand to lowest order. We
recall that the aim of this calculation is to describe the impact of phase fluctuations in
the various servo-controls. Hence we assume that there are no static offsets (which we
believe our current procedures adequately null), so that all the phase excursions vanish on
average, $\overline{\theta} = 0$, and only deviations expressed by the second order moments
contribute. Furthermore, it is assumed that all the phase fluctuations are independent, so
that products of phases vanish on average. After some algebra, we finally arrive at

$$\sigma_V(x) = \langle \Delta x_V^2 \rangle = 1
 + \left[2 - \tfrac{1}{2}\overline{\theta_{Ax}^2} - \tfrac{1}{2}\overline{\theta_B^2} - 2\,\overline{\theta_E^2}\right] e^{-2r_-}
 + \left[\tfrac{1}{2}\overline{\theta_{Ax}^2} + \tfrac{1}{2}\overline{\theta_B^2} + 2\,\overline{\theta_E^2}\right] e^{2r_+}, \tag{13a}$$

$$\sigma_V(p) = \langle \Delta p_V^2 \rangle = 1
 + \left[2 - \tfrac{1}{2}\overline{\theta_{Ap}^2} - \tfrac{1}{2}\overline{\theta_B^2}\right] e^{-2r_-}
 + \left[\tfrac{1}{2}\overline{\theta_{Ap}^2} + \tfrac{1}{2}\overline{\theta_B^2}\right] e^{2r_+}, \tag{13b}$$

where the various $\overline{\theta_i^2}$ are meant to be associated with the residual RMS
fluctuations arising from the nonideal performance of our locking servos. Explicit
dependence on the phase $\theta_V$ of Victor’s LO is given by

$$\sigma_V[x(\theta_V)] = \sigma_V(x) \cos^2\theta_V + \sigma_V(p) \sin^2\theta_V. \tag{14}$$


These equations make quantitative the obvious intuition that the effect of the phase
fluctuations is to add extra noise in the quadratures measured by Victor through components
proportional to the anti-squeezed quadrature. In fact, relatively small phase fluctuations
(~1° RMS) can degrade the noise reduction that would otherwise have been recorded by
Victor, and consequently also the achieved fidelity.
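
To make the origin of the anti-squeezed term explicit, the lowest-order expansion can be
sketched as follows. The sketch assumes unit variance for each vacuum operator, zero-mean
independent phase excursions, and takes as its starting point the coefficient $c_{p_2}$ of
$e^{r_+} p_2^{(0)}$ in the expression for $\sqrt{2}\,x_V$:

```latex
\begin{aligned}
c_{p_2} &= \sin\theta_E(\cos\theta_B + \cos\theta_{Ax})
         + \cos\theta_E(\sin\theta_B + \sin\theta_{Ax})
         \approx 2\theta_E + \theta_B + \theta_{Ax},\\
\overline{c_{p_2}^2} &\approx 4\,\overline{\theta_E^2}
         + \overline{\theta_B^2} + \overline{\theta_{Ax}^2}
         \quad \text{(cross terms average to zero)},\\
\tfrac{1}{2}\,\overline{c_{p_2}^2}\; e^{2r_+} &\approx
         \left[2\,\overline{\theta_E^2}
         + \tfrac{1}{2}\overline{\theta_B^2}
         + \tfrac{1}{2}\overline{\theta_{Ax}^2}\right] e^{2r_+}.
\end{aligned}
```

The factor of 1/2 in the last line comes from dividing the variance of $\sqrt{2}\,x_V$ by
two. The EPR phase enters the linearized coefficient with weight two because it multiplies
both cosine terms, whereas each of the other phases appears only once.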

From these equations, we see that particular phase fluctuations contribute in quite
different ways. Phase fluctuations at Bob contribute equally to excess noise in the x and p
quadratures and will consequently be seen as a constant shift in the noise measured in
Victor’s homodyne detector while scanning the local oscillator. The same effect is found
from fluctuations in the locking of the local oscillator phases at Alice x and Alice p,
provided $\overline{\theta_{Ax}^2} = \overline{\theta_{Ap}^2}$. However, phase fluctuations
in the EPR lock are seen to modify Victor’s x and p quadratures differently and therefore
imply modulation of the noise measured by Victor. The relevant second order moments
$\overline{\theta^2}$ for the various locks can be obtained experimentally by measuring the
RMS noise of the error signals in locked operation. Typical measurements give
$\sqrt{\overline{\theta^2}} \approx 2$ to 6 degrees. From Eqs. (13) it is seen that
fluctuations in the phase with which the squeezed beams are combined to form the EPR beams
are most critical, since $\overline{\theta_E^2}$ contributes with a coefficient four times
higher than the other phase terms to the mixing with the anti-squeezing term.

Fig. 5 shows the calculated noise in Victor’s homodyne detector for different levels of
phase fluctuations in the EPR lock, employing realistic values of squeezing and
anti-squeezing for the experiment discussed in the following sections. The modulation of
Victor’s signal is seen to be up to about 0.2 dB peak to peak, which turns out to imply a
significant reduction of the achieved fidelity.
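
The size of this modulation can be estimated directly from Eqs. (13) and (14). The sketch
below is illustrative only: the function names are hypothetical, the squeezing and
anti-squeezing are taken as 3 dB and 7 dB (the values quoted for Fig. 5), and all phase
fluctuations other than the EPR lock are set to zero.

```python
import math

def r_from_db(db_value):
    """Squeezing parameter r such that e^{2r} = 10^{db_value/10}."""
    return db_value * math.log(10.0) / 20.0

def victor_noise(theta_V, r_plus, r_minus,
                 th2_Ax=0.0, th2_Ap=0.0, th2_B=0.0, th2_E=0.0):
    """Eqs. (13)-(14); th2_* are mean-square phase fluctuations in rad^2."""
    squeezed = math.exp(-2.0 * r_minus)
    anti = math.exp(2.0 * r_plus)
    mix_x = 0.5 * th2_Ax + 0.5 * th2_B + 2.0 * th2_E
    mix_p = 0.5 * th2_Ap + 0.5 * th2_B
    sigma_x = 1.0 + (2.0 - mix_x) * squeezed + mix_x * anti
    sigma_p = 1.0 + (2.0 - mix_p) * squeezed + mix_p * anti
    return sigma_x * math.cos(theta_V) ** 2 + sigma_p * math.sin(theta_V) ** 2

def modulation_db(rms_E_degrees, squeezing_db=3.0, antisqueezing_db=7.0):
    """Peak-to-peak modulation (dB) of Victor's noise as his LO phase is scanned."""
    th2_E = math.radians(rms_E_degrees) ** 2
    r_minus = r_from_db(squeezing_db)
    r_plus = r_from_db(antisqueezing_db)
    at_x = victor_noise(0.0, r_plus, r_minus, th2_E=th2_E)
    at_p = victor_noise(math.pi / 2.0, r_plus, r_minus, th2_E=th2_E)
    return 10.0 * math.log10(at_x / at_p)

for rms in (0.0, 2.0, 4.0, 6.0):
    print(f"EPR-lock RMS {rms:.0f} deg: {modulation_db(rms):.3f} dB peak to peak")
```

Because Eq. (14) interpolates between $\sigma_V(x)$ and $\sigma_V(p)$, the extremes of the
scan occur at $\theta_V = 0$ and $\pi/2$, so the ratio of those two values gives the
peak-to-peak modulation.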


FIG. 5: Noise power recorded by Victor’s balanced detector when scanning his local
oscillator for $\sqrt{\overline{\theta_E^2}} = 0, 2, 4, 6$ degrees (corresponding to the
bold, thin, dotted, and dashed curves, respectively), and for
$\overline{\theta_{Ax}^2} = \overline{\theta_{Ap}^2} = \overline{\theta_B^2} = 0$.
Realistic values of the degrees of squeezing (−3 dB) and of anti-squeezing (7 dB) have been
used.


III.  GENERATION  OF  THE  QUANTUM  RESOURCE

A more complete figure of the experimental setup is given in Fig. 6. A 10 W Verdi was used
to pump a single-frequency Ti:Sapphire laser operating at 866 nm. This laser system
provided about 1.6 W of IR. About 80% to 85% of this light was sent to an external
frequency doubler to generate an efficient 433 nm pump source for the optical parametric
oscillator (OPO). Typically about 300 mW of blue light was produced that could be
mode-matched to the OPO using a triangular ring cavity. Furthermore, the pump was divided
into two beams, which allowed pumping the OPO from two directions to produce two
independent squeezed beams. A detailed description of this setup for generation of highly
squeezed light can be found in Ref. [19]. About 10% of the IR light from the Ti:Sapphire
laser was spatially filtered in a mode-cleaning cavity and used downstream in the
experiment for locking the OPO, as local oscillators in the homodyne detectors, for Bob’s
displacement beam, and as the input coherent state for the actual teleportation. The EPR
state, which is the quantum resource necessary for the actual quantum teleportation
protocol described previously in relation to Fig. 1, was generated by combining the two
squeezed beams on a 50/50 beamsplitter with the phases locked so that the squeezing
ellipses are perpendicular to each other. In the current section a detailed description of
the generation of the EPR state is given, with a careful characterization of both classical
and quantum properties of the OPO. The actual implementation of the full teleportation
protocol follows in Sec. IV.


A.  Loss  and  gain  in  the  OPO 

The OPO cavity was a bow-tie ring configuration consisting of two curved mirrors (radius of
curvature 5 cm) and two plane mirrors. The total cavity length was 48 cm. The focus
positioned between the two curved mirrors had a waist size of 21 μm, where the 1 cm long
nonlinear potassium niobate (KNbO$_3$) crystal was positioned. The use of a-cut potassium
niobate allowed noncritical temperature phase-matching of a degenerate parametric process.
To generate a pump for the OPO, the output from the Ti:Sapphire laser was frequency doubled
in another external cavity, also using potassium niobate as the nonlinear medium [20]. In
this way 300 mW of pump light at 433 nm was generated.

FIG. 6: Sketch illustrating the basic optical parts in the quantum teleportation
experiment. See the main text for a careful discussion.

In the OPO, nonlinear down-conversion transformed energy from the pump field at 433 nm into
the parametric field at 866 nm. In the current application the OPO was only driven below
oscillation threshold, where spontaneous parametric emission can produce a squeezed vacuum
state. The parametric field was resonant in the OPO, while the pump light from the
frequency doubler was divided into two beams, each of which was used in a single pass of
the OPO crystal from counter-propagating directions.

The OPO performance can be characterized by specifying the output coupler intensity
transmission T, the effective nonlinearity $E_{NL}$, and the intracavity round-trip loss
$\mathcal{L}$. In the current experiment the output coupler transmission was fixed at
T = 10%, which was chosen to optimize squeezing. The total information about the nonlinear
interaction can be captured in the single parameter $E_{NL}$, which depends on the
focusing, length of the crystal, phase-matching and crystal properties. It can
operationally be defined as $E_{NL} = P_2/P_1^2$, where $P_2$ is the second harmonic power
generated in single-pass frequency doubling of a fundamental pump $P_1$. In the current
setup we measured $E_{NL} = 0.021\ \mathrm{W}^{-1}$.

Contributing to the intracavity loss are nonideal
antireflection coatings of the potassium niobate as well as
leakage from the three high-reflection coated cavity mirrors.
This passive loss was measured to be ℒ_p = 0.3%.
Unfortunately potassium niobate also suffers from an inherent
loss mechanism that adds to the passive losses [21, 22]. This
nonlinear loss arises in the OPO in the presence of the blue
pump beam and has been termed blue-light-induced infrared
absorption (BLIIRA). It is believed to originate from
impurities in the crystal and is found to vary substantially
from crystal sample to sample. At a high pump level of the
OPO, BLIIRA turns out to be the dominating loss mechanism and
eventually becomes the limiting factor for the amount of
squeezing obtained. The losses ℒ_b due to BLIIRA could be
monitored in the OPO by measuring the reflection dip of the
injected pump beam while scanning the cavity around
resonance. Typical measurements of the total intracavity loss
(ℒ = ℒ_p + ℒ_b) are shown in Fig. 7 as a function of the blue
pump power. In this case the OPO was only pumped along one
direction. We observe that the total loss increases up to
about 2% at the highest pump level of P_2 = 155 mW.

FIG. 7: Measured total intracavity round-trip loss ℒ as a
function of the blue pump P_2. Note that the transmission T
of the output coupler is not included.

The OPO could also be operated as a phase-sensitive
amplifier as a way to test the classical performance of the
device. For that purpose an 866 nm beam was seeded into the
OPO, and the amplification factor G was measured while slowly
scanning the injection phase. When injecting the seed beam
through the small transmission of a high-reflection mirror
and measuring the amplification of the light through the
output coupler mirror, the gain is given by

where P_{2,t} is the oscillation threshold of the OPO given
by

P_{2,t} = \frac{(T + \mathcal{L})^2}{4 E_{NL}}. \qquad (16)



Fig. 8 shows measured values of the gain as a function of
pump power for single-sided pumping of the OPO, as well as
the theoretical curve based on the measured loss and
nonlinearity. The gain diverges on approaching threshold, and
from these data we estimate P_{2,t} ≈ 190 mW. The agreement
between experiment and theory is quite good, here with no
adjustable parameters.
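
As a rough numerical illustration of Eq. (16), the following Python sketch evaluates the threshold for the quoted T = 10% and E_NL = 0.021 W^-1 at two loss values taken from the text (0.3% passive loss and about 2% total loss at high pump). The gain expression is not reproduced in this section, so the function `parametric_gain` uses the common textbook form G = 1/(1 − sqrt(P_2/P_{2,t}))^2 purely as an assumption; the function names are hypothetical.

```python
import math


def opo_threshold(T, loss, e_nl):
    """Threshold pump power in W from Eq. (16): (T + loss)^2 / (4 * E_NL)."""
    return (T + loss) ** 2 / (4.0 * e_nl)


def parametric_gain(p2, p2_t):
    """ASSUMED textbook gain below threshold: 1 / (1 - sqrt(p2 / p2_t))^2."""
    if not 0.0 <= p2 < p2_t:
        raise ValueError("pump power must lie in [0, threshold)")
    return 1.0 / (1.0 - math.sqrt(p2 / p2_t)) ** 2


T = 0.10      # output coupler transmission quoted in the text
E_NL = 0.021  # effective nonlinearity in W^-1 quoted in the text

for loss in (0.003, 0.02):
    p_t = opo_threshold(T, loss, E_NL)
    print(f"loss = {loss:.3f}: threshold = {1e3 * p_t:.0f} mW")

# Gain at a few pump powers, using the 190 mW threshold estimated in the text.
for p2_mw in (20, 42, 107, 155):
    g = parametric_gain(p2_mw * 1e-3, 0.190)
    print(f"P2 = {p2_mw:3d} mW: assumed gain G = {g:.1f}")
```

Because the threshold scales as (T + ℒ)^2, the pump-dependent BLIIRA loss raises the threshold as the pump increases, which is why a single fixed loss value only gives an approximate threshold.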


FIG.  8:  Measured  gain  (points)  and  corresponding  theoretical 
curve  based  on  the  measured  loss  and  nonlinearity. 


B.  Squeezing 

The phase-sensitive amplification in the OPO can be
exploited for generating squeezed states of light. In this
case the two input vacuum noise quadratures are amplified and
deamplified, respectively, creating a squeezed vacuum state.
Balanced homodyne detection allows phase-sensitive detection
of the quantum noise of the squeezed state. With this method
the signal field is overlapped with a strong coherent local
oscillator (LO) on a 50/50 beamsplitter. The two output beams
from the beamsplitter are measured and the corresponding
photocurrents subtracted. In this way the weak quantum noise
of the signal is amplified to achieve a signal substantially
above the thermal noise floor of the photodiodes. The
photodiodes used were a special part made by Hamamatsu with a
measured quantum efficiency α = 98.8 ± 1.0%.

Two typical squeezing traces at different pump levels are
presented in Fig. 9. They were obtained by recording Victor’s
noise power with the spectrum analyzer while scanning the
phase of the LO. We observe the phase-sensitive noise with
the maximum and minimum corresponding to measuring the
anti-squeezed and squeezed quadratures, respectively. With
the signal beam blocked, the vacuum state level was recorded,
and squeezing corresponds to noise reduction below this
level. By locking the LO phase to the squeezed quadrature, we
obtained the flat lower traces shown in Fig. 9. We infer
squeezing of 3.73 dB below the vacuum level and anti-squeezing
of 6.9 dB with pump power of 42 mW. At the higher pump level
shown (107 mW), the degree of squeezing remained at 3.73 dB
below the vacuum level while the degree of anti-squeezing
increased to 10.8 dB.
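
To make the dB figures above concrete, the short Python sketch below converts them to linear variances in units of the vacuum level and forms the product of the squeezed and anti-squeezed variances. The helper name `db_to_linear` is hypothetical; the product is shown only as a simple indicator of how far the measured state is from a minimum-uncertainty state.

```python
def db_to_linear(db):
    """Convert a noise level in dB relative to vacuum into a linear variance."""
    return 10.0 ** (db / 10.0)


# (squeezing, anti-squeezing) in dB at the two pump powers quoted in the text.
traces = {"42 mW": (-3.73, 6.9), "107 mW": (-3.73, 10.8)}

for pump, (sq_db, asq_db) in traces.items():
    s_minus = db_to_linear(sq_db)
    s_plus = db_to_linear(asq_db)
    print(f"{pump}: sigma_- = {s_minus:.3f}, sigma_+ = {s_plus:.2f}, "
          f"product = {s_minus * s_plus:.2f}")
```

A product above one at both pump levels is consistent with the losses discussed in the text, and the larger product at 107 mW reflects the growing anti-squeezing at fixed squeezing.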


FIG. 9: Typical squeezing traces recorded by homodyne
detection while scanning the phase of the LO. The measurement
frequency was 1.475 MHz within a resolution bandwidth of
30 kHz and with a video bandwidth of 300 Hz. The flat traces
at 0 dB are the respective vacuum levels Φ_0^{(1)} taken with
a 5 trace average. Also displayed are the flat traces that
correspond to the minimum noise level attained, again with a
5 trace average. These traces were obtained by locking the LO
phase to the squeezed quadrature and lead to estimates of the
squeezing and anti-squeezing of: (A) −3.73 dB and 6.9 dB with
42 mW pump power; and (B) −3.73 dB and 10.8 dB with 107 mW
pump power. In both cases, the OPO was pumped from a single
direction. Victor’s detection efficiency was characterized by
a homodyne visibility ξ = 0.972 and photodiode quantum
efficiency α = 0.988.


In  the  actual  teleportation  experiment  the  fidelity  is 
ultimately  limited  by  the  amount  of  squeezing  available. 
However,  as  discussed  in  Section  II B,  in  a  non-perfect 
experiment  the  amount  of  anti-squeezing  is  also  impor¬ 
tant.  In  that  section,  we  concluded  that  fluctuations  in 
the  servo  locks  will  degrade  the  fidelity  with  contributions 
from  the  anti-squeezed  quadratures.  For  that  reason  it 
is  important  to  find  the  optimum  operation  point  of  the 
OPO  where  the  degree  of  squeezing  is  large,  while  at  the 
same  point,  the  anti-squeezing  has  not  grown  too  large. 
Such  a  compromise  is  made  necessary  by  the  BLIIRA, 
which  limits  the  degree  of  quantum  noise  reduction  in  a 
power-dependent  fashion,  while  the  noise  from  the  anti¬ 
squeezed  quadrature  continues  to  grow. 

Fig. 10 shows the variation of the measured squeezing and
anti-squeezing with the OPO pump power as well as the
corresponding theoretical curves based on the measured
experimental parameters (loss and nonlinearity) discussed in
the previous section. The data have been corrected for the
thermal noise level of the detectors, which was 17 dB below
the vacuum noise level for an LO of 2 mW. The squeezing is
seen to level off at about −3.5 dB already at a pump of 45 mW
while the anti-squeezing increases with the pump. This
indicates that in the teleportation experiment it would be
most favorable to operate at this relatively weak pump level
of the OPO and that decreased teleportation fidelity might be
expected when increasing the pump further (e.g., due to
mixing in of noise from the anti-squeezed quadrature from the
imperfections in servo control discussed in Sec. II B).

We note that the measured squeezing is lower than predicted
from the OPO parameters discussed above; indeed we predict
about 4.7 dB squeezing at high pump level. This discrepancy
might be due to offset fluctuations in the OPO lock as well
as phase fluctuations between the local oscillator and the
squeezed beam in the homodyne detector. In favor of such an
explanation is the fact that the theory predicts the
anti-squeezed quadrature better than the squeezed quadrature,
and the broad maximum from the anti-squeezing (see Fig. 9) is
expected to be much less sensitive to phase fluctuations than
the narrow squeezing minimum.
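
The sensitivity argument above can be illustrated with a simple model that is not taken from the paper: if the LO phase deviates by an angle θ from the squeezed quadrature, the detected variance is σ_− cos²θ + σ_+ sin²θ, and for zero-mean Gaussian phase jitter with rms value θ_rms the average of cos²θ is (1 + exp(−2θ_rms²))/2. The Python sketch below, with the hypothetical function `jittered_variance`, applies this to the predicted 4.7 dB squeezing together with the 10.8 dB anti-squeezing measured at 107 mW; combining these two numbers is an assumption made only for illustration.

```python
import math


def jittered_variance(sigma_minus, sigma_plus, theta_rms):
    """Mean detected variance near the squeezed quadrature.

    ASSUMED model: zero-mean Gaussian LO phase jitter with rms theta_rms (rad),
    mixing the anti-squeezed variance into the squeezed one.
    """
    cos2_mean = 0.5 * (1.0 + math.exp(-2.0 * theta_rms ** 2))
    return sigma_minus * cos2_mean + sigma_plus * (1.0 - cos2_mean)


sigma_minus = 10.0 ** (-4.7 / 10.0)  # predicted squeezing, linear units
sigma_plus = 10.0 ** (10.8 / 10.0)   # measured anti-squeezing at 107 mW

for theta_deg in (0.0, 2.0, 5.0, 10.0):
    v = jittered_variance(sigma_minus, sigma_plus, math.radians(theta_deg))
    print(f"rms jitter {theta_deg:4.1f} deg: {10.0 * math.log10(v):+.2f} dB")
```

In this model a few degrees of rms jitter is enough to remove a noticeable fraction of the squeezing, while the same jitter changes the broad anti-squeezing maximum very little.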


FIG. 10: Measured squeezing (diamonds) and anti-squeezing
(dots) at a frequency offset of 1.475 MHz as a function of the
OPO pump power when pumping from only one direction. The
curves are the corresponding theoretical results for
nonlinearity E_NL = 0.019 W^-1 and measured cavity loss
similar to that given in Fig. 7. The measured quantum
efficiency was α = 0.988, the homodyne visibility ξ = 0.990,
and the propagation loss of the squeezed beam was 5.7%.

The above data were taken when the OPO was only pumped from
a single direction. In order to obtain the two squeezed beams
necessary to generate the EPR correlations, the OPO was
pumped in two counter-propagating directions. In this case we
expect lower squeezing than from a single-pumped OPO due to
increased BLIIRA. This reduction in squeezing was measured to
range from less than 0.3 dB at total pump powers below around
80 mW, to 0.5 dB at higher pump powers. To estimate the
degree of squeezing in the double-pumped case, we take the
degree of squeezing obtained in Fig. 9 and correct for
Victor’s homodyne visibility and finite detector thermal
noise, to find that in the single-pumped case, we have
−4.1 dB of available squeezing. Thus we estimate that we have
about −3.6 to −3.8 dB squeezing in the double-pumped case.

C.  EPR  correlations 

The EPR correlated beams were generated by combining two
independently squeezed beams with the relative phase
servo-locked to be π/2. These continuous variable EPR
correlations are of the type originally discussed by
Einstein, Podolsky, and Rosen [16]. The two output beams 1
and 2 from the EPR beamsplitter possess correlations as
expressed by the variances σ(x_1 ± x_2) = 2σ_± and
σ(p_1 ± p_2) = 2σ_∓, where σ_+ and σ_− are the variances of
the anti-squeezed and squeezed quadratures of the two input
beams, i.e., σ_+ ≥ 1, σ_− ≤ 1, and σ_+σ_− ≥ 1. Without
squeezing we obtain the vacuum noise level for two beams
(Φ_0^{(2)}), where σ(x_1 ± x_2) = σ(p_1 ± p_2) = 2. We observe
that x_1 and x_2 are correlated while p_1 and p_2 are
anti-correlated, both to a level below the vacuum noise
level. This is the same kind of quantum correlations first
recorded for the light from a nondegenerate OPO [23, 24].
While noise reduction below the vacuum level is achieved when
measuring correlations between the two EPR beams, the noise
from only one of the EPR beams is phase independent and above
the vacuum level. Indeed we find that
V(x_{1,2}) = V(p_{1,2}) = (σ_+ + σ_−)/2 ≥ 1, where unity is
the vacuum level for a single beam (Φ_0^{(1)}).
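
The relations above can be evaluated numerically. The Python sketch below takes squeezed and anti-squeezed levels in dB and returns the two-beam and single-beam noise levels relative to their respective vacuum levels (2 for two beams, 1 for one beam). The function name `epr_levels_db` and the example inputs (−3 dB and +8 dB) are illustrative assumptions rather than measured values from the experiment.

```python
import math


def epr_levels_db(sq_db, asq_db):
    """Noise levels (dB relative to the matching vacuum level) for the EPR beams.

    Uses sigma(x1 - x2) = sigma(p1 + p2) = 2*sigma_minus,
         sigma(x1 + x2) = sigma(p1 - p2) = 2*sigma_plus,
         V(single beam) = (sigma_plus + sigma_minus) / 2.
    """
    s_minus = 10.0 ** (sq_db / 10.0)
    s_plus = 10.0 ** (asq_db / 10.0)
    return {
        "correlated (x1 - x2, p1 + p2)": 10.0 * math.log10(2.0 * s_minus / 2.0),
        "uncorrelated (x1 + x2, p1 - p2)": 10.0 * math.log10(2.0 * s_plus / 2.0),
        "single EPR beam": 10.0 * math.log10((s_plus + s_minus) / 2.0),
    }


# Hypothetical input: 3 dB of squeezing and 8 dB of anti-squeezing.
for name, level in epr_levels_db(-3.0, 8.0).items():
    print(f"{name}: {level:+.2f} dB")
```

The sketch makes explicit that the correlated combinations inherit the squeezing level, while a single EPR beam is always noisier than vacuum because it carries the average of both quadrature variances.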

Experimentally the quality of the EPR state was investigated
both by measuring one EPR beam as well as the correlations
between the two beams. The noise in a single beam was
measured while scanning the EPR phase θ_EPR between the two
squeezed beams slowly compared to the scan rate of the local
oscillator in the homodyne detector. An example is presented
in Fig. 11. The rapid sweep of the local oscillator ensured
that both quadratures were measured for each value of θ_EPR
and gives rise to the fast variations in the trace. As
explained above, the noise of a single EPR beam is expected
to be phase independent and arises when overlapping the two
squeezed beams with a mutual phase difference of
θ_EPR = π/2. Hence, this allows identification of this point
in Fig. 11, and we observe excess noise in this case about
5 dB above the vacuum noise level. Furthermore, when the two
squeezed beams are combined in phase (θ_EPR = 0), two
squeezed beams exit the beamsplitter, giving noise reduction
in this case roughly 3 dB below vacuum noise.

A direct measurement of the correlations between the two
EPR beams was obtained via balanced homodyne detection of
both EPR beams and subtraction of the resulting photocurrents
from the two sets of balanced detectors. In these
measurements the EPR phase was locked at θ_EPR = π/2 and one
of the homodyne detectors was locked to measure a fixed
quadrature. The trace in Fig. 12 was recorded by scanning the
local oscillator phase θ_LO of the second homodyne detector,
thus recording the variance σ(x_1 − x_2(θ_LO)). Reduction
below the vacuum level for two beams (Φ_0^{(2)}) was observed
when the second homodyne detector measured the same
quadrature as the first homodyne detector, i.e., for
θ_LO = 0. We observe correlations of the amplitude
quadratures of about 2 dB with respect to the vacuum level.
However, these data were taken in a non-optimized situation
(e.g., inefficient OPO cavity alignment); the measured degree
of squeezing at that time was under −3 dB. In the actual
teleportation experiment inter-beam EPR correlations of more
than 3 dB were obtained.

FIG. 11: Noise of one beam from the EPR beamsplitter as
obtained by scanning the mutual phase difference between the
two squeezed beams in addition to a rapid sweep (about 5
times faster) of the LO in the homodyne detector. The flat
curve represents the vacuum state level for a single beam.


FIG. 12: Noise obtained by subtracting signals from two
homodyne detectors measuring the two EPR beams while scanning
one of the LO phases. The recorded quantity was
σ(x_1 − x_2(θ_LO)). The flat curve is the vacuum noise level
for two beams.


IV.  QUANTUM  TELEPORTATION 

Given the preceding description of how to generate the EPR
state, we will now move on to discuss the complete quantum
teleportation protocol. First the electronic servo locks
implemented in the experiment are described, then the
technique to calibrate the channels used by Alice to send
classical information to Bob is presented. This establishes
the basis for presenting the experimental results on quantum
teleportation.


A.  Phase  locking 

In the complete quantum teleportation experiment ten servo
systems in total were implemented for locking optical phases
and cavities. A summary of the locking techniques is given in
Table I. Three of the servos (numbers 1-3) are for locking
cavities and use the standard Pound-Drever-Hall technique,
where reflection or transmission of RF modulation sidebands
is used to derive an error signal. The frequency doubling
cavity was locked by observing the cavity reflection dip off
the input coupler, and generating an error signal using RF
sidebands at 26 MHz. The OPO cavity was locked by observing a
1% pick-off from the transmitted light from a weak coherent
beam injected through a high reflector. The injected signal
carried 5 MHz sidebands for locking. Finally, the
mode-cleaning cavity was locked in reflection using sidebands
at 26 MHz.

The remaining seven servos were used to keep the optical
phases properly aligned for successful teleportation [25].
Two weak injection beams were seeded into the two
counterpropagating modes of the OPO and used for several
purposes: to align the OPO cavity and the homodyne detectors,
and to lock the optical phases. The two injected signals were
phase modulated at 3 MHz (injection 1) and 5 MHz (injection
2), respectively, using an electro-optical modulator (EOM) in
each signal path. The relative phase differences between the
pump beams and the injected beams were locked using standard
lock-in techniques where the error signal is derived from the
phase-sensitive amplification (due to the parametric gain in
the OPO) of the injected signal. The phases are locked at
maximum gain, corresponding to the situation where the
direction of the phasor of the coherent injected field is
along the long axis of the squeezing ellipse of the squeezed
vacuum beams.

The third phase-lock, the EPR phase lock, keeps the two
independent squeezed beams incident on the EPR beamsplitter
with a phase difference of π/2 to ensure the production of
the EPR state. This was done by using 1% leakages from
mirrors in the EPR beam paths. The DC interference signal
between the two injected beams from the two distinct EPR
paths was subtracted to produce an error signal centered
around zero. This zero crossing corresponds to π/2 phase
difference between the two squeezed beams.

To lock the local oscillators in Alice’s two homodyne
detectors, the RF beat notes between the LO and the 3 and
5 MHz sidebands on the injected beams were demodulated to
produce the respective error signals for Alice I (3 MHz) and
II (5 MHz). As the EPR phase lock described above already
keeps the two injected beams at π/2 phase difference, the LOs
in Alice I and II will also stay at π/2 relative phase
difference. Bob’s LO phase was locked using the RF
interference fringe at 3 MHz between itself and the
modulation sidebands transported with EPR2. This means that
Alice I is locked to the same quadrature as Bob, which, by an
arbitrary convention, sets Alice I as x and Alice II as p.
Thus to complete the classical information channel, Alice I’s
measured output (photocurrent) was sent to Bob’s amplitude
modulator, and Alice II’s output was sent to Bob’s phase
modulator. The teleported state then emerges from Bob’s
beamsplitter, and is sent to Victor for verification.
Victor’s LO phase can be either scanned, or locked to either
3 or 5 MHz to check x and p separately.

NO.    BRIEF DESCRIPTION                                      LOCKING TECHNIQUE
1      Doubling cavity resonant to 866 nm                     Pound-Drever-Hall via reflection, 26 MHz
2      OPO cavity resonant to 866 nm                          Pound-Drever-Hall via transmission, 5 MHz
3      Mode cleaning cavity resonant to 866 nm                Pound-Drever-Hall via reflection, 26 MHz
4, 5   Relative phase between blue pump (433 nm) and          Lock-in via 1% pick-off of cavity transmission
       injected beams 1 & 2 (866 nm) to zero (maximum gain)
6      Relative phase between two squeezed beams to π/2       DC interference fringe
       (to create EPR state)
7, 8   Alice’s LOs to x and p quadratures                     RF interference fringe, 3 MHz → x, 5 MHz → p
9      Bob’s coherent beam to x quadrature                    RF interference fringe, 3 MHz
10     Victor’s LO to either x or p quadratures               RF interference fringe, 3 MHz → x, 5 MHz → p

TABLE I: Summary of the servo systems that were implemented. See main text for further discussion.


B.  Classical  information  channels 


A  crucial  part  of  the  teleportation  protocol  is  the  trans¬ 
mission  of  classical  information  from  Alice  to  Bob.  In 
the  present  experiment  the  classical  information  is  just 
the  photocurrents  from  Alice’s  two  homodyne  detectors. 
These  signals  have  to  be  faithfully  transmitted  to  Bob 
without  distortion  and  with  proper  phase  and  gain,  and 
for  that  reason  several  RF  amplifiers,  filters  and  de¬ 
lay  boxes  were  used  in  the  classical  channel  paths.  We 
typically  measured  Alice’s  and  Victor’s  noise  levels  at 
1.475  MHz,  thus  the  electronics  of  the  channels  were  opti¬ 
mized  at  that  frequency.  In  the  following  we  will  present 
a  method  to  perform  the  calibration  of  these  classical 
channels. 

To ensure that we are operating at a gain g_x = g_p = 1
such that β_V = β_in, we compared the photocurrents measured
by Alice and Victor when there were no EPR beams present for
the case of a coherent state of amplitude β_in sent to Alice
as the input state. In practice, this was easily achieved by
blocking the optical beam paths of the EPR state. In this
case, it can be shown that when an amplitude or phase
modulated beam is sent to Alice as the input state, the ratio
between the spectral densities measured at Victor and Alice
is given by

where we have assumed Alice and Victor are measuring the same
quadrature (either x or p), that |β_in|^2 ≫ 1, and that the
efficiencies are close to unity. We observe that Victor
records a spectral density (i.e., noise power of the RMS
photocurrent) two times (corresponding to 3 dB) higher than
Alice x (or p). This factor of two can easily be understood
since the input beam is split into two equal halves at
Alice’s 50/50 beamsplitter. Hence, this identifies a
signature for the optimum condition of the classical gain.

On the other hand, for vacuum input, the ratio between
Victor’s and Alice’s spectral densities, or equivalently,
their variances since now |β_in|^2 = 0, is found to be


CTA 


=  1  + 


(18) 


when the classical gain is optimum. This means Victor’s
output is approximately 3 times higher than Alice x (or p),
again in the limit where all detector efficiencies are close
to unity.
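
The two calibration signatures quoted above (a factor of two for a strong coherent input and a factor of three for vacuum input, both in the limit of unit efficiencies) translate into dB as in the following small Python sketch; the helper name `ratio_to_db` is hypothetical.

```python
import math


def ratio_to_db(ratio):
    """Express a power (spectral density) ratio in dB."""
    return 10.0 * math.log10(ratio)


print(f"coherent input, Victor/Alice = 2: {ratio_to_db(2.0):.2f} dB")
print(f"vacuum input,   Victor/Alice = 3: {ratio_to_db(3.0):.2f} dB")
```

These are the values, about 3 dB and about 4.8 dB, against which the measured traces are compared when checking that the classical gain is set correctly.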

Fig. 13 shows the spectral density of photocurrent
fluctuations recorded at Alice x from input beams with
modulation amplitudes corresponding to 24.9 dB and 0 dB
(vacuum), respectively. The signal recorded at Alice p
mirrors this trace, except that the coherent amplitude is
shifted in phase by π/2, demonstrating that Alice x and p are
π/2 apart in phase, as required. The corresponding traces for
the spectral density Φ_V(Ω) for Victor’s detector are shown
in Fig. 14. Here we show explicitly the π/2 phase shift when
Victor’s LO is phase-locked to the x and p quadratures,
respectively. We see that Victor records 3 dB higher spectral
density for the amplitude modulated input and 4.8 dB greater
for the vacuum input, which indicates that the gain of the
classical channels has been properly calibrated relative to
the criteria of Eqs. (2) and (4).

We now turn our attention to the phases of the RF signals,
keeping in mind the distinction between the phase of the
optical carrier and the phase of the RF signal at our
measurement frequency, 1.475 MHz. We have already discussed
optical phases in detail in Sec. IV A above, and indeed
implicitly assumed correct optical phases in the discussion
on finding optimal gain. In discussing RF phase, we will
continue to assume that our optical phase-lock servos are
working correctly.

FIG. 13: Spectral density relative to the vacuum level
recorded by Alice x while scanning the phase of the input
beam both for vacuum (flat trace), and an input beam with
amplitude modulation at 24.9 dB above the vacuum noise level.
The measured amplitude of this beam is 21.9 dB or 3 dB lower
than the actual input, as explained in the text. The
measurement frequency was 1.475 MHz, RBW 30 kHz, and VBW
1 kHz.

FIG. 14: Spectral density Φ_V(Ω) relative to the vacuum level
recorded by Victor while scanning the phase of the input
beam. Φ_0^{(1)} is Victor’s noise level for one unit of
vacuum, while Φ_0^{(3)} shows the 2 extra units making 3
total units of vacuum noise measured in an attempt to
recreate Alice’s vacuum state input without any entanglement.
Φ_V^{x} and Φ_V^{p} show the recreation of Alice’s input
coherent modulation amplitude of 24.9 dB, demonstrating that
the peak input and output amplitudes are equal, and the x and
p quadratures are indeed π/2 apart in phase. Victor’s LO was
phase-locked to the x and p quadratures respectively for
these measurements.

Our goal is quantum noise subtraction at our analysis
frequency, between the correlated beams EPR1 (measured at
Alice and sent to Bob via the classical channels) and EPR2 at
Bob’s beamsplitter. Obviously, the best way to achieve this
goal is for the relative RF phases of EPR1 and EPR2 to be
zero for perfect subtraction. However, this is quite
impractical in the current laboratory setup since EPR2
arrives at Bob’s beamsplitter directly from the EPR
beamsplitter, while EPR1 takes a very indirect route
involving electrical photocurrents that travel much slower
than light. It is sufficient, therefore, to ensure that the
relative phase difference is a multiple of 2π by implementing
delays in the classical channels between Alice and Bob. In
practice, we can also keep the phase difference at any
multiple of π and compensate for this by flipping the sign of
Bob’s optical phase-lock error signal. In this way, Bob adds
EPR1 and EPR2 instead of subtracting them, thereby optically
creating a π RF phase shift that compensates for the π phase
delay in the classical channels.

Finally,  Bob’s  two  modulators  must  provide  pure  phase 
and  amplitude  modulation,  respectively.  This  condition 
is  satisfied  by  carefully  controlling  the  input  beam  polar¬ 
izations  for  the  two  temperature  compensated  EOMs. 

Some  care  must  be  given  to  the  maximum  time  delay 
allowed  in  our  classical  channels.  This  is  set  by  the  OPO 
linewidth  (HWHM)  of  5.4  MHz,  corresponding  to  a  cor¬ 
relation  time  between  EPR1  and  EPR2  of  about  30  ns  if 
the  full  bandwidth  of  the  OPO  were  employed  for  tele¬ 
portation.  For  a  more  detailed  discussion  see  Ref.  [18]. 
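
The quoted correlation time can be checked with a one-line estimate. The Python sketch below assumes a Lorentzian cavity response, for which the field decay time is τ = 1/(2π Δν) with Δν the HWHM linewidth; this relation is an assumption on our part, and the function name `correlation_time` is hypothetical.

```python
import math


def correlation_time(hwhm_hz):
    """ASSUMED Lorentzian relation: tau = 1 / (2 * pi * HWHM)."""
    return 1.0 / (2.0 * math.pi * hwhm_hz)


tau = correlation_time(5.4e6)  # OPO linewidth (HWHM) quoted in the text
print(f"correlation time = {tau * 1e9:.1f} ns")
```

Under this assumption the result is close to the "about 30 ns" stated in the text.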

However, in our experiment, a much smaller effective
bandwidth is employed, corresponding to the detection
bandwidth for Alice, the bandwidth of the classical channel
from Alice to Bob, and the frequency range of Bob’s
modulators. Finally, there is the bandwidth employed by
Victor in his verification of the protocol. For simplicity,
here we assume that the effective detection bandwidth of our
protocol is equal to the RF bandwidth, ν, of the spectrum
analyzer employed by Victor in his analysis, typically around
ν = 30 kHz (see figures in Sec. IV C). The relevant issue is
the ratio between our analysis frequency, Ω/(2π) = 1.475 MHz,
and ν. We see that this ratio is about 50 cycles. Thus 2π of
RF phase delay in our classical channels contributes to a
roughly 2% effect on the noise subtraction quality, which is
small but not negligible.
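
The arithmetic behind the "about 50 cycles" and "roughly 2%" statements can be sketched as follows in Python. The interpretation that one RF cycle of delay corresponds to a fraction 1/(number of cycles) of the inverse detection bandwidth is our reading of the argument, and the variable names are hypothetical.

```python
analysis_freq_hz = 1.475e6  # analysis frequency Omega / (2 pi) from the text
bandwidth_hz = 30e3         # Victor's resolution bandwidth nu from the text

cycles = analysis_freq_hz / bandwidth_hz  # RF cycles per inverse bandwidth
one_cycle_fraction = 1.0 / cycles         # relative size of a 2*pi RF delay
one_cycle_delay_ns = 1e9 / analysis_freq_hz

print(f"cycles within 1/nu: {cycles:.1f}")
print(f"one RF period: {one_cycle_delay_ns:.0f} ns, "
      f"fraction of 1/nu: {100.0 * one_cycle_fraction:.1f}%")
```

One RF period at 1.475 MHz is roughly 680 ns, which is much longer than the OPO correlation time but still only about 2% of the inverse detection bandwidth.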

When the gain and RF phase of the classical channels as
well as the optical phases at Alice’s and Bob’s detectors
were suitably optimized, Victor recorded a stable output
while the phase of the unknown input state was being scanned,
independent of the input state amplitude and phase over a
wide range, as discussed below. In the case of vacuum input
we obtain the flat trace at 4.8 dB (trace Φ_0^{(3)}) in
Fig. 14. The trace remained stable for tens of minutes, with
fluctuations on the order of ±0.1 dB.

It was not a trivial task to balance the two classical
channels, for the reasons mentioned above. One practical way
that we employed to optimize the system and to judge the
effectiveness of the two classical channels was to send RF
modulated optical fields at 1.475 MHz through the two
injection ports of the OPO cavity. These modulated optical
signals were allowed to propagate to Alice and Bob, just as
the EPR beams would in the presence of blue pump. We could
then easily optimize the subtraction of the two classical
fields at Bob’s 99/1 beamsplitter. In terms of the conditions
stated above, this classical noise subtraction directly
mimics the quantum noise subtraction that we perform using
the entangled EPR state during quantum teleportation, which
means that if we obtain good classical subtraction, we should
in fact be operating at the optimal conditions for quantum
teleportation. Typically the subtraction was about 25 dB for
each channel.


Gain  of  classical  channels  g2  (dB) 


FIG. 15: Measured spectral density Φ_V(Ω) from Victor’s
balanced homodyne detector as a function of gain for
different amplitude-modulated coherent states and comparison
with theory. The parameters for the theoretical curves were
the measured visibilities: ξ_1 = 0.985, ξ_2 = ξ_3 = 0.994,
ξ_4 = 0.985, ξ_5 = 0.985, and the detector quantum efficiency
was α = 0.988. The measurement frequency was 1.475 MHz. P_in
corresponds to the actual power of the input state presented
to Alice, in dB above the vacuum state.

In  order  to  check  the  linearity  and  dynamical  range 
for  the  classical  channels,  we  measured  Victor’s  output 
noise  levels  as  a  function  of  gain  for  input  modulation 
sidebands  of  various  amplitudes  when  the  two  classical 
channels  were  balanced.  Fig.  15  shows  the  results  for 
input  beams  with  the  following  modulation  amplitudes: 
0 dB (vacuum), 7.0 dB, 14.8 dB and 24.8 dB, where in each
case the full $2\pi$ of phase variation was explored. It shows
that the linearity is very good from 0 dB to 25 dB input
modulation amplitudes, which means we can teleport any
coherent state amplitude within that range. The theoretical
traces on the figure are based on our measured
efficiencies without any adjustable parameters.


C.  Teleportation  results 

Fig. 16 shows quantum teleportation results for a coherent
input state. All traces are Victor’s measured variances
at 1.475 MHz and at pump power of 33/35 mW in the OPO
paths 1 and 2, respectively. The amplitude of the input
state was about 25.5 dB higher than the vacuum level.
Trace (a) is one unit of the vacuum noise, that is, the
vacuum-state level or shot-noise level (SNL) of Victor,
which is obtained by blocking Bob’s beam. Trace (b) marks
the 3 units of vacuum noise in the case of absent EPR beams
but with Alice and Bob engaged nonetheless in the
teleportation protocol, which is 4.8 dB above the SNL with
our efficiencies close to 1 and is obtained by blocking the
blue pumps in the experiment. Trace (c) shows the
phase-sensitive noise when the EPR beams and the AM
sidebands on the input state are present, while Victor’s LO
is phase-locked to the x quadrature. Locking Victor’s LO to
the p quadrature produces an analogous trace with the peaks
offset by $\pi/2$ in phase. Closer inspection of traces such
as (c) in Fig. 16 shows that the minimum noise level is
approximately 1.1 dB below the level of three units of the
vacuum, although it is rather difficult to get an accurate
reading because of the mismatch of scan rate and detection
bandwidths. This noise level corresponds to 2.3 vacuum
units. The peak of the trace should have the same amount of
noise reduction, that is, from 354.8 to 354.1 vacuum units
(from 25.50 dB to 25.49 dB), but this reduction is too small
to observe in the graph. Trace (d) corresponds to the vacuum
input state, which is obtained by blocking the modulated
input beam. Acquisition parameters are: resolution bandwidth
30 kHz, video bandwidth 1 kHz and sweep time 200 ms.
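
The conversions between dB and vacuum units quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original analysis; the helper names `db_to_units` and `units_to_db` are introduced here for illustration only. It reproduces the 4.8 dB level of three vacuum units, the 2.3 units at the minimum of trace (c), and the change from 354.8 to 354.1 units at the peak.

```python
import math

def db_to_units(level_db):
    """Noise power in vacuum units for a level given in dB above the vacuum level."""
    return 10 ** (level_db / 10)

def units_to_db(units):
    """Level in dB above the vacuum level for a noise power given in vacuum units."""
    return 10 * math.log10(units)

# Trace (b): three vacuum units, measured with the EPR beams blocked.
no_epr_db = units_to_db(3.0)

# Minimum of trace (c): about 1.1 dB below the three-unit level.
min_units = db_to_units(no_epr_db - 1.1)

# Peak of trace (c): the same absolute reduction applied to a 25.5 dB signal.
peak_in_units = db_to_units(25.5)
peak_out_units = peak_in_units - (3.0 - min_units)
peak_out_db = units_to_db(peak_out_units)

print(f"no EPR beams: {no_epr_db:.2f} dB; minimum of trace (c): {min_units:.2f} units")
print(f"peak: {peak_in_units:.1f} -> {peak_out_units:.1f} units ({peak_out_db:.2f} dB)")
```

Because the reduction is a fixed number of vacuum units, its size in dB shrinks as the signal grows, which is why it is visible at the minimum of the trace but not at the peak.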


FIG. 16: Spectral density $\Phi_V(\Omega)$ recorded by Victor with
the phase of the input beam scanning. Trace (a) is Victor’s
shot-noise level, trace (b) marks the 3 units of vacuum noise
measured without entanglement, trace (c) is one quadrature of
the teleported output coherent state, and trace (d) is the
teleported output vacuum state (see text for details). For
trace (c) Victor’s LO was phase-locked to the x quadrature,
while for traces (a), (b) and (d) Victor’s LO was freely
scanned, and a ten-trace average was used. The average OPO
pump power was 34 mW per beam, measurement frequency
1.475 MHz, RBW 30 kHz, and VBW 1 kHz.

The best noise reduction that we have obtained to date is
shown in detail in Fig. 17. With the EPR beams present, the
variances recorded by Victor are
$\sigma_V^x = \sigma_V^p = 3.54 \pm 0.19$ dB, while with the EPR beams
absent, $\sigma_V^x = \sigma_V^p = 4.86 \pm 0.12$ dB. The entanglement of
the EPR beams thus leads to a quantum noise reduction of
$1.32 \pm 0.16$ dB. This result was obtained with 40 mW pump
power in each OPO path. The measurement parameters are the
same as those in Fig. 16 except that the sweep time was
640 ms and we use a ten-trace average for all traces. For
this particular trace, the measured detection efficiencies
were characterized by $\xi_1 = 0.986$, $\xi_2 = \xi_3 = 0.990$,
$\xi_4 = 0.980$, $\xi_5 = 0.975$, and
$\alpha_{A_x} = \alpha_{A_p} = \alpha_V = 0.988$.

FIG. 17: Noise recorded by Victor showing in detail the
reduction in the noise level with EPR beams (entanglement)
present. With EPR beams, the measured variance at Victor
was $\sigma_V^x = \sigma_V^p = 3.54 \pm 0.19$ dB. The average OPO pump
power was 40 mW per beam, and acquisition parameters are
the same as in Fig. 16. All traces use a ten-trace average.
SNL stands for shot-noise level.

A  study  of  the  dependence  of  the  output  variance  on 
OPO  pump  power  yields  further  insight  into  the  experi¬ 
ment.  Fig.  18  shows  the  variances  of  Alice  and  Victor  as 
functions  of  pump  power.  From  the  squeezing  results  in 
Fig.  10,  we  can  see  that  as  the  pump  increases,  both  the 
squeezing  and  anti-squeezing  increase,  even  though  the 
squeezing  increases  very  slowly.  Therefore,  the  entan¬ 
glement  becomes  stronger  with  increasing  pump  power. 
This  phenomenon  is  reflected  in  the  data  shown  in  Fig. 
18  at  pump  powers  below  30  mW,  where  Alice’s  variance 
increases  with  pump  power,  whereas  Victor’s  variance 
drops  below  4.8  dB,  as  predicted.  The  best  noise  reduc¬ 
tion  for  Victor  was  around  30  mW  of  blue  pump  for  this 
particular  set  of  data.  Higher  pump  power  did  not  help 
to  reduce  the  noise;  it  instead  increased  both  Alice’s  and 
Victor’s  variances  even  though  we  expect  Victor’s  vari¬ 
ance  to  continue  to  decrease,  or  at  least  remain  stable. 
The likely culprits responsible for this degradation in
performance will be discussed in the next sections. Chief
among them is the performance of the various locking
servos. As discussed in Sec. II B, fluctuations around the
nominal ideal settings (e.g., $\pi/2$ for the squeezed beams
that form the EPR beams) allow excess noise to contaminate
the “quiet” quadratures, an effect that becomes more
important as the degree of squeezing is increased.
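
The sensitivity to phase errors can be illustrated with the textbook expression for the variance of a rotated quadrature of a squeezed state, $V(\theta) = V_- \cos^2\theta + V_+ \sin^2\theta$. The sketch below is an illustration under stated assumptions, not the analysis of Sec. II B: a static phase offset `theta_rad` stands in for the rms phase jitter of a servo, and the squeezing/anti-squeezing pairs are hypothetical values chosen only to show the trend.

```python
import math

def db_to_units(level_db):
    return 10 ** (level_db / 10)

def units_to_db(units):
    return 10 * math.log10(units)

def squeezing_with_phase_error(sq_db, antisq_db, theta_rad):
    """Observed noise (dB) when the measured quadrature is rotated by theta_rad
    away from the squeezed quadrature (assumption: static offset, no losses)."""
    v_minus = db_to_units(sq_db)      # squeezed-quadrature variance, vacuum units
    v_plus = db_to_units(antisq_db)   # anti-squeezed-quadrature variance
    v = v_minus * math.cos(theta_rad) ** 2 + v_plus * math.sin(theta_rad) ** 2
    return units_to_db(v)

theta = math.radians(3.0)  # hypothetical 3 degree phase error
# Hypothetical (squeezing, anti-squeezing) pairs in dB, weak to strong.
for sq_db, antisq_db in [(-3.0, 4.0), (-4.0, 7.0), (-6.0, 12.0)]:
    observed = squeezing_with_phase_error(sq_db, antisq_db, theta)
    print(f"{sq_db:5.1f} dB ideal -> {observed:6.2f} dB observed "
          f"(penalty {observed - sq_db:.2f} dB)")
```

For a fixed phase error, the penalty grows with the ratio of anti-squeezing to squeezing, consistent with the statement that the effect becomes more important as the degree of squeezing is increased.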


FIG.  18:  Noise  measured  by  Alice  and  Victor  as  functions  of 
the  pump  of  the  OPO.  The  data  points  have  been  connected 
to  ease  viewing. 


D.  Teleportation  fidelity 

From the measured variances reported in Sec. IV C
above, we can gauge the quality of the protocol by
inferring the fidelity F via Eqs. (2) and (4). We first
assume that the noise in the teleported output state $\rho_{out}$
obeys Gaussian statistics [7, 9]. From the data in Fig.
17, where $\sigma_V^x = \sigma_V^p = 3.54 \pm 0.19$ dB, and where the
gain  g  has  already  been  set  to  unity  by  the  techniques 
described  in  Sec.  IV  B,  direct  application  of  Eq.  (2)  gives 
F  =  0.61  ±  0.02. 
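
The step from a measured variance to a fidelity can be sketched numerically. Eq. (2) is not reproduced in this section, so the code below assumes that at unity gain, and with no displacement error, it reduces to $F = 2/\sqrt{(1+\sigma^x)(1+\sigma^p)}$ with the variances expressed in vacuum units. This assumed form is consistent with the numbers quoted in the text (F = 0.50 for three vacuum units, F = 0.61 for 3.54 dB), but it should be read as an illustration rather than as the authors' exact expression; the function name is hypothetical.

```python
import math

def db_to_units(level_db):
    """Variance in vacuum units for a level given in dB above the vacuum level."""
    return 10 ** (level_db / 10)

def fidelity_unity_gain(sigma_x_db, sigma_p_db):
    """Assumed unity-gain form of the coherent-state fidelity:
    F = 2 / sqrt((1 + sigma_x) * (1 + sigma_p)), variances in vacuum units."""
    sx = db_to_units(sigma_x_db)
    sp = db_to_units(sigma_p_db)
    return 2 / math.sqrt((1 + sx) * (1 + sp))

measured_db, error_db = 3.54, 0.19
f_central = fidelity_unity_gain(measured_db, measured_db)
f_low = fidelity_unity_gain(measured_db + error_db, measured_db + error_db)
f_high = fidelity_unity_gain(measured_db - error_db, measured_db - error_db)

# Without entanglement Victor records three vacuum units in each quadrature.
three_units_db = 10 * math.log10(3.0)
f_classical = fidelity_unity_gain(three_units_db, three_units_db)

print(f"F = {f_central:.2f} (range {f_low:.2f} to {f_high:.2f})")
print(f"no entanglement: F = {f_classical:.2f}")
```

Propagating the 0.19 dB uncertainty through the same expression gives a spread of roughly 0.02 on either side of the central value.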

Note  that  no  correction  whatsoever  has  been  applied  to 
this  result;  it  corresponds  to  the  fidelity  obtained  directly 
from  Victor’s  photocurrent.  We  can  certainly  attempt 
to  infer  the  fidelity  associated  with  the  field  emerging 
from  Bob’s  beamsplitter,  rather  than  the  photocurrent 
detected by Victor. To do so, we return to the issue
pointed out earlier that we calibrate $g_{x,p}$ by ensuring
$\beta_V = \beta_{in}$ as opposed to $\beta_{out} = \beta_{in}$. From the
discussion in Sec. II A, we see that if Victor had unit
detection efficiency ($\xi_5 = \eta_V = 1$), this issue would not
arise at all. In our experiment, where $\xi_5 < 1$ and
$\eta_V < 1$, the actual unnormalized gains $g_{x,p}$ were set too
large by a factor $(\xi_5 \eta_V)^{-1}$ relative to that necessary
for optimal reconstruction of Bob’s field, instead of
Victor’s photocurrent. We can use Eq. (4) to compare
Victor’s variances when $\xi_5 = \eta_V = 1$ and when $\xi_5 < 1$,
$\eta_V < 1$ with the same degree of squeezing as in Fig. 17.
We thus infer that if Victor had perfect detectors, the
variance of Bob’s teleported output field (or equivalently
now, Victor’s variance as measured by his photocurrent) is
given by $\sigma_V^x = \sigma_V^p = 3.47$ dB above the shot-noise level,
which corresponds to an inferred fidelity of $F_B = 0.62$.

Returning to fidelity referenced to Victor’s photocurrent,
we estimate that with 42 mW pump power in each OPO path,
where we have measured $-3.73$ dB squeezing and 6.9 dB
anti-squeezing at Victor (see Fig. 9), the EPR entanglement
at the EPR beamsplitter is characterized by the factors
$\sigma^- = -3.97$ dB and $\sigma^+ = 7.0$ dB. These numbers were
obtained by back-propagating the squeezed beams to the EPR
beamsplitter from Victor’s homodyne detectors, considering
the effects of Victor’s homodyne efficiency $\xi_5 = 0.972$,
photodiode quantum efficiency $\alpha_V = 0.988$ and the EPR
homodyne efficiency $\xi_{EPR} = 0.985$. With measured
efficiencies $\xi_1 = 0.985$, $\xi_2 = \xi_3 = 0.990$, $\xi_4 = 0.980$,
$\xi_5 = 0.975$, the predicted variance of the teleported output
state emerging from Bob’s beamsplitter is 2.82 dB. The
inferred fidelity would then be $F_p = 0.69$.
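
Back-propagation through a lossy detection path can be sketched with the standard beamsplitter loss model, in which a variance $V$ (in vacuum units) becomes $\eta V + (1 - \eta)$ after a path of power efficiency $\eta$. The code below is a hedged illustration, not the authors' calculation: exactly which of the quoted efficiencies enter, and whether as amplitudes or powers, is defined in Sec. II A, outside this section. As an assumption, the product $0.972 \times 0.988$ is used for `eta`; with that choice the result lands close to the quoted $-3.97$ dB and 7.0 dB.

```python
import math

def db_to_units(level_db):
    return 10 ** (level_db / 10)

def units_to_db(units):
    return 10 * math.log10(units)

def propagate_through_loss(variance, eta):
    """Variance after a path of power efficiency eta: the lost fraction
    is replaced by vacuum noise (one vacuum unit)."""
    return eta * variance + (1 - eta)

def back_propagate(measured_variance, eta):
    """Invert the loss model to infer the variance before the lossy path."""
    return (measured_variance - (1 - eta)) / eta

# Assumption for illustration only: overall power efficiency of Victor's detection.
eta = 0.972 * 0.988

squeezing_db = units_to_db(back_propagate(db_to_units(-3.73), eta))
antisqueezing_db = units_to_db(back_propagate(db_to_units(6.9), eta))

print(f"inferred squeezing:      {squeezing_db:.2f} dB")
print(f"inferred anti-squeezing: {antisqueezing_db:.2f} dB")
```

The correction is larger, in relative terms, for the squeezed quadrature, because the added vacuum noise is a bigger fraction of a sub-vacuum variance.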

By contrast, our best entanglement-assisted noise reduction
measured by Victor’s balanced detector is 1.32 dB below the
level with no EPR beams, which corresponds to
$\sigma_V^x = \sigma_V^p = 3.54$ dB and an inferred fidelity F = 0.61,
as has been discussed.

To  gain  more  insight  into  this  discrepancy,  we  per¬ 
formed  an  analogous  study  to  that  shown  in  Fig.  18, 
where  we  plot  the  inferred  fidelities  from  the  measured 
variances  versus  pump  power  in  Fig.  19.  The  inferred  ex¬ 
perimental  fidelities  peak  at  around  30mW  corresponding 
to  the  minimum  in  Victor’s  measured  variance.  Here  we 
also  show  the  values  of  F  that  we  might  be  able  to  reach 
with  the  current  apparatus.  First  of  all,  the  square  sym¬ 
bols  in  Fig.  19  derive  from  the  inferred  degrees  of  squeez¬ 
ing  at  the  EPR  beamsplitter  that  are  deduced  from  the 
data  in  Fig.  10.  This  was  done  from  the  measurement 
results  by  back-propagating  the  squeezed  beams  from 
Victor’s  homodyne  detector  to  the  EPR  beamsplitter  as 
previously  described.  In  addition,  the  triangles  in  Fig.  19 
correspond  to  the  inferred  fidelities  from  the  theoretically 
predicted  degrees  of  squeezing  given  by  the  solid  line  in 
Fig.  10.  It  can  be  seen  that  the  disagreement  between  the 
predicted  and  measured  fidelities  is  already  apparent  at 
low  pump  powers.  However,  the  mismatch  becomes  more 
pronounced  at  higher  pump  powers  where  the  measured 
fidelities  start  to  decrease  rather  than  increase  with  the 
pump  level. 


FIG. 19: Fidelity as a function of the OPO pump power (in
mW). Dots are the experimentally measured fidelities, squares
are the expected fidelities given the measured degrees of
squeezing shown in Fig. 10, and triangles are the expected
fidelities based on our theoretically predicted degrees of
squeezing in Fig. 10. Again, the lines connecting the data
points are to ease viewing.

There are several factors contributing to the discrepancy
between the measured and predicted variances and
fidelities. The first is phase fluctuations from different
locking systems, especially the EPR phase-lock. As
discussed in Sec. II B, phase fluctuations add extra noise to
the  quadratures,  and  this  effect  is  more  pronounced  at 
higher  pump  powers  where  the  anti-squeezed  quadrature 
is  large.  Another  factor  is  the  bandwidth  of  our  two  clas¬ 
sical  channels,  which  by  bad  design  on  our  part,  has  been 
found  to  have  excessive  phase  variation  over  the  relevant 
RF  bandwidth  for  the  undistorted  transmission  of  classi¬ 
cal  information.  In  fact,  by  an  aforementioned  technique, 
we  measured  the  actual  subtraction  of  the  RF  signal  at 
1.475  MHz  through  the  classical  channels  and  EPR2.  It 
was found that frequency offsets of about 5 kHz led to
drops in the cancellation from $-25$ dB to $-20$ dB. When
the offset was 20 kHz, the cancellation was only $-9$ dB.
Increasing  the  bandwidth  while  keeping  relatively  high 
isolation  for  the  filter  for  various  control  signals  (e.g., 
modulations  at  3  and  5  MHz)  in  the  classical  channels 
will  be  helpful.  A  third  reason  for  the  discrepancy  in  Vic¬ 
tor’s  variances  is  the  imperfect  character  of  Bob’s  EOMs, 
which  cause  coupling  between  the  x  and  p  quadratures, 
again  resulting  in  contamination  between  squeezing  and 
anti-squeezing.  In  order  to  reach  higher  fidelity,  we  are 
working  to  improve  these  aspects  of  the  experiment. 
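
The cancellation figures above can be related to a phase mismatch between the two signals being subtracted. The sketch below is an illustration under simple assumptions that the text does not state: two RF signals of equal amplitude, differing only by a phase error $\phi$, leave a residual amplitude $|1 - e^{i\phi}| = 2\sin(\phi/2)$, which is converted to dB of RF power. The function names are hypothetical.

```python
import cmath
import math

def cancellation_db(phase_error_rad, gain_ratio=1.0):
    """Residual RF power (dB, relative to one signal alone) after subtracting two
    signals with the given relative phase error and amplitude ratio.
    Perfect cancellation (zero residual) is not representable on a dB scale."""
    residual = abs(1 - gain_ratio * cmath.exp(1j * phase_error_rad))
    return 20 * math.log10(residual)

def phase_error_for(cancel_db):
    """Phase error (rad) that alone would limit the cancellation to cancel_db,
    assuming perfectly matched amplitudes."""
    return 2 * math.asin(10 ** (cancel_db / 20) / 2)

for level_db in (-25.0, -20.0, -9.0):
    phi = phase_error_for(level_db)
    print(f"{level_db:6.1f} dB cancellation <-> {math.degrees(phi):5.1f} deg phase error")
```

Under these assumptions, the drop from $-25$ dB to $-9$ dB corresponds to the phase error growing from a few degrees to roughly twenty degrees across a 20 kHz offset, which gives a sense of how flat the phase response of the classical channels would need to be.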


V.  CONCLUSION 

We  have  described  the  details  of  recent  experimental 
work  to  perform  quantum  teleportation  for  continuous 
variables.  We  have  discussed  a  real  experimental  system 
where  we  considered  many  of  the  prevalent  loss  sources 
in  our  experiment,  thus  providing  a  detailed  analysis  of 
how  the  variances  measured  at  Alice  and  Victor  during 
teleportation  vary  with  squeezing  and  ultimately  OPO 
pump  power.  Phase  fluctuations  due  to  imperfect  lock¬ 
ing  systems  were  also  discussed,  and  it  has  been  shown 
that  nonideal  detection  schemes  as  well  as  phase  fluctua¬ 
tions  eventually  degrade  the  noise  reduction  recorded  by 
Victor  and  consequently  reduce  the  teleportation  fidelity. 

We  have  discussed  how  to  prepare  experimentally  the 
entangled  EPR  beams.  Our  entangled  EPR  fields  at 
the EPR beamsplitter were typically characterized by
$\sigma^- \approx -4$ dB and $\sigma^+ \approx 7$ dB according to our measured
squeezing and efficiencies, which implies a prospective
fidelity of 0.69.

The  experimental  setup  and  procedure  was  described 
in  detail,  including  the  OPO,  the  source  of  the  entangled 
EPR  fields.  We  have  discussed  optical  phase-lock  servo 
systems  that  ensure  Alice  is  able  to  correctly  measure  the 
two  orthogonal  quadratures,  x  and  p.  Lastly,  we  have 
also  described  our  classical  channels  through  which  clas¬ 
sical  information  in  the  form  of  photocurrents  obtained 
by  Alice  can  be  sent  to  Bob  with  the  goals  of  minimal 
distortion  and  proper  phase  and  gain  in  order  to  be  sure 
that Bob can use that information to recover the original
input state. The teleportation procedure was investigated
for arbitrary unknown coherent states with any
phase and a wide range of amplitudes.

The  experiment  clearly  revealed  the  quduties  that  limit 
the  classical  performance  of  such  a  teleportation  system. 
Victor  unavoidably  measures  two  extra  units  of  vacuum 
noise if there is no entanglement. By employing the
entangled fields of the EPR state, we demonstrated that
the quduties were suppressed by $1.32 \pm 0.16$ dB, which
corresponds to an inferred fidelity of $F = 0.61 \pm 0.02$ for
coherent states; when corrected for the efficiency of
detection by Victor, this gives a fidelity of 0.62. The
apparatus was shown to succeed for arbitrary coherent states
with amplitudes up to 25 dB above the vacuum level. This
demonstrates that with entanglement, the procedure
exhibits better performance than the classical bound of
fidelity F = 0.50 [9, 10], and hence is genuinely a quantum
protocol for unconditional teleportation.

We discussed Alice’s and Victor’s measured variances
as functions of OPO pump power. The data in Fig. 18
showed that at low pump powers, Alice’s measured variance
$\sigma_A$ increased and Victor’s measured variance $\sigma_V$
decreased with pump power, as expected. However, at
higher pump powers (above $\approx 30$ mW), while $\sigma_A$
continued to increase with pump as expected, $\sigma_V$ did not
continue to decrease, but began increasing instead. This
implied that the anti-squeezed quadrature contaminates
the squeezed quadrature at high pump powers. Possible
reasons for this contamination include fluctuations
in the phase-lock servos, limited classical channel
bandwidth and impure amplitude and phase modulators at
Bob’s station.

It is encouraging to note that our high detection
efficiencies, together with the relatively high degree of
entanglement that we have achieved, show that our apparatus
is capable of producing higher fidelity between the input
and output states. In addition, a new scheme with
the OPO pumped only by a single unidirectional blue
beam to form the EPR state is being planned. We could
then hope to obtain over $-5$ dB of entanglement by
mitigating high BLIIRA, and thus reducing the intracavity
losses in the OPO. Such capabilities would be of interest
to quantum information processing with continuous
quantum variables [26].

We propose a method to implement cavity QED and quantum-information processing in high-Q cavities
with a single trapped but nonlocalized atom. The system is beyond the Lamb-Dicke limit due to the atomic
thermal motion. Our method is based on adiabatic passages, which make the relevant dynamics insensitive to
the randomness of the atom position with an appropriate interaction configuration. The validity of this method
is demonstrated from both approximate analytical calculations and exact numerical simulations. We also
discuss various applications of this method based on the current experimental technology.

I. INTRODUCTION

Trapping of single atoms in high-Q cavities opens up exciting
possibilities for the observation and manipulation of
the  dynamics  of  single  particles  and  for  control  of  their  in¬ 
teractions  with  single-mode  photons  [1,2, 4, 5].  Such  possi¬ 
bilities  could  have  wide  applications,  such  as  for  the  genera¬ 
tion  of  nonclassical  or  entangled  optical  pulses  [6,7],  for 
observing  strong  cavity-QED  effects  [4,5,8]  and,  more  re¬ 
markably,  for  implementation  of  quantum  communication 
and  computation  [9-13].  The  trapping  potential  for  confining 
single  atoms  can  be  created  by  diverse  avenues,  including  by 
the  cavity-QED  light  itself  [4,5],  by  additional  far-off- 
resonant  trapping  (FORT)  beams  [2],  and  by  combining 
single  trapped  ions  with  high-finesse  optical  cavities  [14,15]. 
In  this  paper,  we  will  direct  our  attention  principally  to  trap¬ 
ping  in  cavity  QED  by  way  of  an  additional  FORT  beam, 
although  our  results  are  applicable  to  broader  settings. 

The  first  experiment  to  achieve  strong  coupling  in  cavity 
QED  with  trapped  atoms  was  that  of  Ref.  [2],  which  em¬ 
ployed  an  intracavity  FORT  beam  and  reported  trapping  life¬ 
times  of  28  ms.  By  now,  this  experiment  has  attained  much 
longer  trapping  times,  with  recent  work  demonstrating  life¬ 
times  in  excess  of  1  s  [3,16].  By  contrast,  atomic  localization 
by  way  of  the  cavity-QED  field  itself  has  led  to  trapping 
within a single axial well with mean trapping time
$\tau \approx 340\ \mu$s [4] and to localization across many axial wells
with mean time $280\ \mu$s [5].

The  long  trapping  times  achieved  with  an  intracavity 
FORT  beam  set  the  stage  for  diverse  applications  in 
quantum-information  science,  which  motivates  the  current 
analysis.  However,  one  of  the  main  obstacles  to  the  experi¬ 
mental  demonstration  of  these  applications  is  that  the  posi¬ 
tion  of  the  trapped  atom  is  not  well  fixed  within  the  cavity. 
The coupling rate $g$ between the atomic internal levels and
the cavity mode depends on the atom’s position $\mathbf{r}$ through the
relation

$$ g(\mathbf{r}) = g_0 \chi(\mathbf{r}) \tag{1} $$

with the mode function

$$ \chi(\mathbf{r}) = \sin(k_0 z) \exp[-(x^2 + y^2)/w_0^2], \tag{2} $$

where $g_0$ is the peak coupling rate, $w_0$ and $k_0 = 2\pi/\lambda_0$ are,
respectively, the width and the wave vector of the Gaussian
cavity mode, and $z$ is assumed to be along the axis of the
cavity. Due to the randomness of the atom’s position $\mathbf{r}$, we
have an unknown randomly changing coupling rate $g(\mathbf{r})$.
Most  of  the  applications  of  this  setup  assumed  a  fixed  known 
coupling  rate  g.  Therefore,  before  the  experimental  demon¬ 
stration  of  these  schemes,  first  one  needs  to  solve  the  prob¬ 
lem  associated  with  the  random  coupling. 
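
To give a feel for how strongly the coupling varies with position, the mode function of Eq. (2) can be evaluated along a hypothetical axial oscillation. The sketch below is only an illustration: the wavelength (852 nm, the cesium D2 line), the mode waist, and the oscillation amplitude $\lambda_0/6$ are assumed values, not parameters taken from the paper, and the atom is assumed to oscillate about an antinode on the cavity axis.

```python
import numpy as np

def chi(x, y, z, w0, lambda0):
    """Mode function of Eq. (2): standing wave along z, Gaussian profile in x and y."""
    k0 = 2 * np.pi / lambda0
    return np.sin(k0 * z) * np.exp(-(x**2 + y**2) / w0**2)

LAMBDA0 = 852e-9   # assumed cavity-QED wavelength (m)
W0 = 20e-6         # assumed mode waist (m), illustrative only

z_antinode = LAMBDA0 / 4    # position of maximum coupling on the axis
d = LAMBDA0 / 6             # assumed axial oscillation amplitude

# Sample one oscillation period of the atom about the antinode.
phases = np.linspace(0.0, 2 * np.pi, 201)
z = z_antinode + d * np.sin(phases)
coupling = np.abs(chi(0.0, 0.0, z, W0, LAMBDA0))

ratio = coupling.max() / coupling.min()
print(f"|g|/g0 ranges from {coupling.min():.2f} to {coupling.max():.2f} (ratio {ratio:.2f})")
```

With an oscillation amplitude that is a sizeable fraction of the wavelength, the coupling already changes by a factor of about two over one oscillation, even before transverse motion is included.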

Intense  experimental  efforts  have  been  taken  to  localize 
the  atom  inside  the  cavity  so  as  to  fix  the  coupling  rate  g(  r), 
with  notable  recent  success  attained  via  ion  traps  [14,15].  In 
the  cavity-QED  experiments  employing  cold  atoms  and 
without  FORT  beams  [1,17,18],  atoms  were  dropped  through 
the  cavity  and  followed  random  trajectories  with  large  axial 
heating.  As  a  result,  the  magnitude  and  the  sign  of  g(r)  were 
not  well  controlled.  With  a  FORT  beam  and  with  current 
experimental  capabilities  [2,3,16],  an  atom  can  be  trapped 
inside  one  potential  well  along  the  cavity  axis  with  a  fixed 
sign  of  g(  r).  But  the  atom  still  has  appreciable  kinetic  en¬ 
ergy  and  is  not  fully  localized,  leading  to  significant  varia¬ 
tions  in  the  magnitude  of  the  coupling  rate  g(  r). 

The  randomness  of  the  coupling  rate  g(  r)  comes  from 
several  contributions:  first,  the  trapped  atom  is  still  quite  hot 
in  the  current  experimental  setup.  Its  kinetic  energy  from  the 
thermal  motion  is  typically  lower  but  not  much  lower  than 
the  depth  of  the  trapping  potential.  The  atom’s  oscillation 
amplitude $d$ in the trap is comparable to the optical
wavelength $\lambda_0$, so it does not satisfy the usually assumed
Lamb-Dicke condition $d \ll \lambda_0$. Due to the thermal motion of the
atom, the coupling rate $g(\mathbf{r})$ typically has a variation within
a  factor  of  2  with  the  current  experimental  technique.  Cer¬ 
tainly,  the  atom  will  become  better  localized  as  cooling  tech¬ 
niques  are  adapted  to  cavity  QED  and  its  energy  is  reduced 
[19,20].  However,  due  to  the  presence  of  the  cavity  and  the 
trapping  potential,  it  is  still  experimentally  hard  to  achieve 
efficient  cooling  inside  the  cavity  [19-21].  Furthermore, 
even  if  we  assume  that  the  atom  has  been  precooled  and 
localized initially to the Lamb-Dicke limit, the implemented
application protocols will still tend to heat the atom due to
photon recoils from the spontaneous emissions [22,23]. As a
result  of  the  heating,  the  atom  may  go  out  of  the  Lamb-Dicke 
limit  after  a  short  time.  Finally,  even  if  we  neglect  all  the 
motional  and  the  heating  effects  of  the  trapped  atom,  there  is 
still  some  uncertainty  of  the  coupling  rate.  The  intracavity 
field  of  the  FORT  beam  forms  many  potential  wells  inside 
the  cavity,  and  in  current  experiments,  one  cannot  control 
and  does  not  know  precisely  in  which  well  the  atom  is 
trapped. The FORT beam has a wavelength $\lambda_F$ different from
the cavity-QED wavelength $\lambda_0$, so, even if the atom is kept
very  cold  and  well  localized  at  the  bottom  of  the  trapping 
potential  well,  we  still  might  not  know  exactly  the  coupling 
rate,  since  the  bottoms  of  different  potential  wells  have  dif¬ 
ferent  coupling  rates  [24]. 
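
The well-to-well variation can be illustrated by evaluating the axial part of the mode function at the centers of successive FORT wells. The sketch below rests on several assumptions that are not taken from the text: the two wavelengths are illustrative values, both standing waves are taken to share a node at $z = 0$, and the atom is assumed to sit at an intensity maximum of the FORT standing wave (as for a red-detuned trap).

```python
import numpy as np

LAMBDA0 = 852e-9    # assumed cavity-QED wavelength (m)
LAMBDA_F = 936e-9   # assumed FORT wavelength (m), illustrative only

# Centers of the first few FORT wells: intensity maxima of sin^2(k_F z).
n = np.arange(10)
z_wells = (n + 0.5) * LAMBDA_F / 2

# Magnitude of the on-axis cavity-QED mode function at each well center.
k0 = 2 * np.pi / LAMBDA0
coupling = np.abs(np.sin(k0 * z_wells))

for index, value in zip(n, coupling):
    print(f"well {index}: |g|/g0 = {value:.2f}")
```

Because the two wavelengths are incommensurate, the well centers drift through the cavity-QED standing wave, so an atom that is perfectly cold but in an unknown well still has an unknown coupling rate.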

Here,  to  overcome  these  difficulties,  we  propose  a  method 
to  do  cavity-QED  and  quantum-information  processing  di¬ 
rectly  with  hot  atoms  with  an  inhomogeneous  distribution  in 
position and/or a time-varying location. The method is based
on  adiabatic  passages  with  a  new  interaction  configuration. 
Adiabatic  passages  have  been  studied  in  the  context  of  cavity 
QED  [6,13,26,27],  and  have  been  adopted  in  some  recent 
experiments  [17,18].  Normally,  schemes  based  on  adiabatic 
passages  are  more  insensitive  to  certain  parameter  changes 
compared  with  the  corresponding  Raman  schemes.  Some  ini¬ 
tial  indication  of  insensitivity  of  the  adiabatic  passage 
scheme  to  certain  parameter  changes  was  already  illustrated 
in  Ref.  [27]  for  a  certain  cavity-QED  scheme.  However,  to 
make  the  whole  system  dynamics  insensitive  to  variations  of 
the  coupling  rate  g(r),  the  direct  use  of  the  usual  adiabatic 
passage  schemes  is  not  enough  to  achieve  this  goal,  and  we 
also  need  to  design  a  different  and  appropriate  interaction 
configuration.  The  relevant  dynamics  of  adiabatic  passages 
are  determined  by  the  relative  ratio  between  different  cou¬ 
pling  rates,  and  are  almost  independent  of  their  absolute  val¬ 
ues.  Thanks  to  this  property,  with  an  appropriate  design  of 
the  interaction  configuration,  we  can  make  different  coupling 
rates  have  the  same  dependence  on  the  atom’s  position  r, 
and,  therefore,  the  system  dynamics,  determined  by  their 
relative  ratios,  will  become  independent  of  r.  As  a  result, 
though  the  atom’s  position  may  be  unknown  and  time  depen¬ 
dent,  the  output  signal  from  the  cavity  is  still  controllable 
and  has  definitely  known  properties.  This  is  the  difference 
between  the  scheme  here  and  the  usual  adiabatic  passage 
schemes  [13,17,18,27].  Note  that  the  method  described  here 
is  also  different  from  some  previous  quantum  computation 
schemes  with  hot  trapped  ions  [28,29],  where  the  Lamb- 
Dicke  condition  is  still  required. 

The  paper  is  arranged  as  follows:  In  Sec.  II,  we  explain 
the  basic  idea  of  the  method,  and  then  describe  and  solve  the 
model  Hamiltonian  analytically  following  some  well-known 
approach  based  on  the  adiabatic  approximation.  This  ap¬ 
proximate  analytical  approach  is  still  not  enough  to  fully 
understand the experiments, so in Sec. III, we give an exact
numerical simulation of the model, with the emphasis on
checking the validity of the introduced approximations and
calculating various kinds of noise magnitudes relevant for
the on-going experimental efforts. The calculations show that
we can get reasonably good signal-to-noise ratios with typical
experimental values for the parameters. In Sec. IV, we
briefly review some known cavity-QED and quantum-information
processing schemes, and then discuss how to
incorporate the present method into these schemes to improve
their performance against the randomness in the atom’s
position. After this incorporation, with hot nonlocalized
atoms, one can still realize many kinds of cavity-QED and
quantum-information processing schemes, including, for
instance, the controllable single-photon or entangled-photon
source, quantum communication between cavities, atomic
entanglement generation, teleportation, and Bell inequality
detection. Section V gives a synopsis of parameters relevant
to our current experiment for a single-atom trapping with a
FORT beam at Caltech [2,3,16]. We summarize the results in
the final section.

FIG. 1. Schematic setup. Left side: a single atom trapped in a
high-Q cavity, which is driven by a classical laser pulse s(t). Right
side: the relevant atomic level structure.

II.  CAVITY  QED  WITH  A  NONLOCALIZED  TRAPPED 
ATOM:  THE  SCHEME 

A.  Basic  idea 

First, we explain the basic idea of this method by considering
a single trapped atom, which has three effective levels
$|g\rangle$, $|e\rangle$, $|s\rangle$, as shown in Fig. 1. The two ground states $|g\rangle$
and $|s\rangle$ can correspond, for instance, to sub-Zeeman levels in
the $F = 3$ and $F = 4$ manifolds, respectively, for the cesium
atom. The transition $|e\rangle \to |s\rangle$ is coupled resonantly to the
cavity-QED mode $a$ with a coupling rate $g(\mathbf{r})$ in the form of
Eq. (1). A classical laser field s(t) incident from one mirror
of the cavity (see Fig. 1) drives the transition $|g\rangle \to |e\rangle$
through another cavity mode $a'$. We assume for simplicity
that $a$ and $a'$ have the same spatial mode structure with the
same frequency (for example, they can be of different
polarizations) [25]. The driving laser s(t) is resonant to the
transition $|g\rangle \to |e\rangle$, so it is far-off-resonant to the cavity mode
$a'$ with a large detuning $\omega_{gs}$, where $\omega_{gs}$ denotes the
splitting between the levels $|g\rangle$ and $|s\rangle$. Due to the off-resonant
driving by s(t), $a'$ can be described classically by its mean
value $\langle a' \rangle = \alpha(t) e^{-i\omega_{ge} t}$ ($\omega_{ge}$ is the frequency splitting
between the levels $|g\rangle$ and $|e\rangle$), which couples resonantly to
the transition $|g\rangle \to |e\rangle$ with a Rabi oscillation frequency
$\Omega(\mathbf{r},t)$. Since $a$ and $a'$ have the same spatial mode structure,
the Rabi frequency $\Omega(\mathbf{r},t)$ will depend on the atom’s position
$\mathbf{r}$ by the same mode function $\chi(\mathbf{r})$, i.e., $\Omega(\mathbf{r},t)$ can be
factorized as

$$ \Omega(\mathbf{r},t) = \Omega_0(t)\chi(\mathbf{r}) = r_0 g_0 \alpha(t) \chi(\mathbf{r}), \tag{3} $$

where $r_0$ represents the fixed ratio of the Clebsch-Gordan
coefficients for the transitions $|g\rangle \to |e\rangle$ and $|s\rangle \to |e\rangle$.

To understand the basic idea of this method, let us first
look at a very simplified picture by neglecting the coupling
of the mode $a$ to the cavity output. The system is then described
by the following simple Hamiltonian in the rotating
frame (setting $\hbar = 1$):

$$H_{\mathrm{sim}} = \Omega(\mathbf{r},t)\,\sigma_{eg} + g(\mathbf{r})\,a\,\sigma_{es} + \mathrm{H.c.}, \qquad (4)$$

where $\sigma_{\mu\nu} = |\mu\rangle\langle\nu|$ ($\mu,\nu = g,e,s$) are the atomic transition
operators, and H.c. stands for the Hermitian conjugate. The
Hamiltonian $H_{\mathrm{sim}}$ has the well-known dark state $|D\rangle$ (the
instantaneous eigenstate with zero eigenvalue) of the
form [27]

$$|D\rangle = \frac{1}{\sqrt{|g(\mathbf{r})|^2 + |\Omega(\mathbf{r},t)|^2}}\left[g(\mathbf{r})\,|g\rangle|0\rangle - \Omega(\mathbf{r},t)\,|s\rangle|1\rangle\right]$$
$$\phantom{|D\rangle} = \frac{1}{\sqrt{1 + |r_0\alpha(t)|^2}}\left[|g\rangle|0\rangle - r_0\alpha(t)\,|s\rangle|1\rangle\right], \qquad (5)$$

where $|0\rangle$ and $|1\rangle$ represent the zero- and one-photon
states of the cavity mode $a$. Note that the dark state $|D\rangle$
depends only on the ratio between the parameters $g(\mathbf{r})$
and $\Omega(\mathbf{r},t)$, so it becomes independent of the random atom
position $\mathbf{r}$ with the interaction configuration specified above.
If we start with the atom in the ground state $|g\rangle$ and gradually
increase the Rabi frequency $\Omega(\mathbf{r},t)$, then under the adiabatic
approximation the system remains in the dark state $|D\rangle$,
which gradually evolves into the final state $|s\rangle|1\rangle$. Because
the state $|D\rangle$ is independent of the variable $\mathbf{r}$, the relevant
dynamics of this adiabatic evolution also become independent
of the random atom site $\mathbf{r}$. This is the basic idea of
the method to eliminate the influence of the randomness in
the coupling coefficient $g(\mathbf{r})$.
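As an illustrative check of this argument (not part of the original analysis), the following minimal Python sketch builds $H_{\mathrm{sim}}$ of Eq. (4) in the three-state basis $(|g\rangle|0\rangle, |e\rangle|0\rangle, |s\rangle|1\rangle)$ and confirms numerically that the state of Eq. (5) has zero eigenvalue and is the same vector for different values of the mode function $\chi(\mathbf{r})$. The values of `g0`, `r0_alpha`, and `chi` are hypothetical and chosen only for illustration.

```python
import numpy as np

def h_sim(g, omega):
    """H_sim of Eq. (4) in the basis (|g,0>, |e,0>, |s,1>), with hbar = 1."""
    h = np.zeros((3, 3), dtype=complex)
    h[1, 0] = omega          # Omega * sigma_eg   : |g,0> -> |e,0>
    h[1, 2] = g              # g * a * sigma_es   : |s,1> -> |e,0>
    return h + h.conj().T    # add the Hermitian conjugate

def dark_state(g, omega):
    """Normalized dark state of Eq. (5) in the same basis."""
    d = np.array([g, 0.0, -omega], dtype=complex)
    return d / np.linalg.norm(d)

g0, r0_alpha = 3.0, 2.0                  # illustrative values (assumptions)
dark_states = []
for chi in (0.3, 0.6, 1.0):              # mode function at three atom positions
    g, omega = g0 * chi, r0_alpha * g0 * chi
    d = dark_state(g, omega)
    residual = np.linalg.norm(h_sim(g, omega) @ d)   # should vanish
    gap = np.linalg.eigvalsh(h_sim(g, omega))[2]     # sqrt(g^2 + Omega^2)
    dark_states.append(d)
    print(f"chi={chi}: |H D|={residual:.1e}, gap={gap:.3f}")
```

In this sketch the dark-state vector is identical for all three positions, while the gap to the other two eigenstates scales with $\chi(\mathbf{r})$, which is why the adiabatic condition (but not the dark state itself) retains a position dependence.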

Note that to make the dark state and the relevant dynamics
independent of the random atom position $\mathbf{r}$, the driving
pulse and the cavity mode need to have the same spatial
mode structure. This is why the classical driving pulse is
matched to the spatial mode of the cavity field, both along
the cavity axis and transversely, which is routinely accomplished
by illumination through one side mirror of the
cavity. This configuration is different from the original proposals
for adiabatic dynamics in cavity QED [27], in which
the propagation direction of the driving pulse is perpendicular
to the cavity axis with uniform illumination intensity. It is
also distinct from the configuration employed in some recent
interesting experiments directed toward achieving a single-photon
source [17,18], which likewise employed uniform illumination
transverse to the cavity axis and for which the
atom is not localized axially. As a result, in these experiments
some of the dynamics, such as the output pulse shape
and phase, still depend on the unknown position of the atom
and are thus not fully controllable, as has been seen in the
experiments.

We would also like to mention that in this configuration
the driving field and the quantized cavity output are collinear,
and they need to be separated afterwards. The separation
can be done through either polarization or frequency selection,
and it is actually quite easy in the present
case. In typical experimental configurations, the classical
field drives one cavity mode through one cavity mirror (say 1)
with a large detuning, and the single-photon quantum field,
together with some transmitted driving field, is output from
the other cavity mirror (say 2, with the transmission rate
$t_2 \gg t_1$). Most of the driving field has already been filtered by
the high-finesse cavity itself. The ratio between the intensities
of the classical driving field and the quantum field output
from side 2 is the same as their ratio inside the cavity, which
does not need to be very large, since both of the atomic
transitions are enhanced by the cavity and thus have comparable
strength. As will be seen in the numerical simulations
in Sec. III, inside the cavity the driving field is typically
assumed to be about five times stronger than the single-photon
field, and it is easy to separate such a weak
field with a polarization beam splitter at output side 2.

To guarantee an adiabatic evolution, we need to fulfill the
adiabatic condition, which means that the evolution time $T$
should be significantly longer than the inverse of the frequency
gap $\delta$ between the dark state and the other eigenstates of the
Hamiltonian $H_{\mathrm{sim}}$. The error probability due to nonadiabaticity
is estimated by $p_{\mathrm{ad}} = 1/(\delta T)^2$. For the Hamiltonian $H_{\mathrm{sim}}$, the
frequency gap is given by $\delta = \sqrt{|g(\mathbf{r})|^2 + |\Omega(\mathbf{r},t)|^2}$. Thus,
the adiabatic condition $[|g(\mathbf{r})|^2 + |\Omega(\mathbf{r},t)|^2]\,T^2 \gg 1$ depends
on the atom position $\mathbf{r}$. If the coupling coefficient $g(\mathbf{r})$
changes by a factor of 2, the error probability $p_{\mathrm{ad}}$
changes by a factor of 4 for the same evolution time $T$. However,
if $T$ is sufficiently long, the error probability $p_{\mathrm{ad}}$ remains
small, and the relevant system dynamics are still
very insensitive to the randomness of the atom's position. To
estimate $p_{\mathrm{ad}}$, we can use the average value of the coupling
rate $g(\mathbf{r})$.
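The scaling quoted above can be made concrete with a short sketch. It assumes, as in Eq. (3), that $\Omega(\mathbf{r},t) = r_0\alpha(t)\,g(\mathbf{r})$, so that $\delta = g\sqrt{1 + |r_0\alpha|^2}$; the numerical values (couplings of $3\kappa$ and $6\kappa$, $r_0\alpha = 1$, $T = 20/\kappa$) are illustrative choices, and the function name is hypothetical.

```python
import math

def nonadiabatic_error(g, r0_alpha, T):
    """Estimate p_ad = 1/(delta*T)^2 with delta = sqrt(g^2 + Omega^2), Omega = r0_alpha*g."""
    delta = math.hypot(g, r0_alpha * g)
    return 1.0 / (delta * T) ** 2

kappa = 1.0                      # all rates in units of the cavity decay rate
T = 20.0 / kappa
p_weak = nonadiabatic_error(3.0 * kappa, 1.0, T)
p_strong = nonadiabatic_error(6.0 * kappa, 1.0, T)
print(p_weak, p_strong, p_weak / p_strong)
```

Doubling the coupling rate reduces the estimated error by a factor of 4, while both values stay far below the percent level for this $T$, in line with the statement that the dynamics remain insensitive to the atom position once $T$ is long enough.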

In the above simple picture, we neglected the coupling of
the mode $a$ to the cavity output. This picture is valid only
in the good-cavity limit with the evolution time $T \ll 1/\kappa$,
where $\kappa$ is the cavity decay rate. However, in practice, it is
better to operate the system in the limit $T \gg 1/\kappa$. There
are several advantages to operating the system in this limit:
first, without the requirement $T \ll 1/\kappa$, it is easier to satisfy
the adiabatic condition, for which $T$ should be sufficiently
long; second, in this limit it is easier to modulate the Rabi
frequency $\Omega(\mathbf{r},t)$ by changing the intensity of the driving
laser $\varepsilon(t)$ incident from one side mirror of the cavity. In this
way, one can efficiently control the pulse shape of the cavity
output by modulating the shape $\varepsilon(t)$ of the driving laser,
which is useful for many applications. In the limit $T \gg 1/\kappa$,
we need to take into account, from the beginning, the coupling
of the mode $a$ to the continuum cavity output, and the
whole system then has infinitely many levels. We describe
this more involved interaction configuration in the next section.
The above simple three-level picture, though it does not describe
the real experimental configuration, does help in understanding
the basic idea of the adiabatic method.

B.  Theoretical  model  and  its  approximate  analytical  solution 

Now we look at the more complicated theoretical model,
which includes the coupling of the mode $a$ to the continuum
cavity output. If we adiabatically apply a classical driving
pulse $\varepsilon(t)$ as shown in Fig. 1, one photon will be emitted
from the transition $|e\rangle \to |s\rangle$, and the cavity will output a
single-photon pulse. We want to show below that this single-photon
pulse has a definite pulse shape which is independent
of the randomness in the atom's position $\mathbf{r}$ and in the coupling
rate $g(\mathbf{r})$. In this way, although the atom's position and
the absolute value of the light-atom coupling rate are not
fully controlled, we can nevertheless fully control the properties
of the output single-photon pulse by modulating the
driving laser pulse $\varepsilon(t)$. This is an important feature for
many applications of this setup, which we will discuss in
Sec. IV. There are several equivalent ways to describe the
coupling of the mode $a$ to the continuum cavity output
[9,30,31]. Since we want to calculate the output pulse shape
within the adiabatic approximation, it is convenient to use
the Hamiltonian approach [30,31]. The derivation here is
similar to the calculation in Ref. [30] for the pulse shape
from an ensemble of atoms. The whole Hamiltonian, including
the coupling to the cavity output, has the following form
in the rotating frame [31]:

$$H = (\Delta - i\gamma_s/2)\,\sigma_{ee} + \left[\Omega(\mathbf{r},t)\,\sigma_{eg} + g(\mathbf{r})\,a\,\sigma_{es} + \mathrm{H.c.}\right]$$
$$\quad + i\sqrt{\kappa/2\pi}\int_{-\omega_b}^{+\omega_b} d\omega\,\left[a^\dagger b(\omega) - a\,b^\dagger(\omega)\right] + \int_{-\omega_b}^{+\omega_b} d\omega\,\left[\omega\,b^\dagger(\omega)\,b(\omega)\right], \qquad (6)$$

where $b(\omega)$, with the standard commutation relation
$[b(\omega), b^\dagger(\omega')] = \delta(\omega - \omega')$, denote the one-dimensional
free-space modes that couple to the cavity mode $a$. We only
need to consider the free-space modes within a finite bandwidth
$[\omega_{se} - \omega_b, \omega_{se} + \omega_b]$ around the carrier frequency $\omega_{se}$
($\omega_{se}$ is the frequency splitting between the levels $|s\rangle$ and
$|e\rangle$), since all the modes outside this bandwidth make
negligible contributions to the dynamics because of their large detuning
(larger than $\omega_b$). Within this bandwidth, the coupling
between $b(\omega)$ and the cavity mode $a$ is approximately
constant, and we denote it by $\sqrt{\kappa/2\pi}$ for convenience, where
$\kappa$ is the effective cavity decay rate, as we will see. The
bandwidth $\omega_b$ should be chosen to be much larger than $\kappa$,
but still much smaller than $\omega_{se}$.

We have assumed that the driving laser and the cavity
mode $a$ couple resonantly to the corresponding free-space
atomic transitions. However, we emphasize that our scheme
still works for the case of off-resonant coupling. The
off-resonant scheme brings no gain with respect to
losses due to atomic decay, since in this case the time
scale also slows down, so it suffices here to consider the
resonant coupling case. Nevertheless, in the Hamiltonian (6), it
is still helpful to include a single-photon-transition detuning
$\Delta$ to account for the difference in trapping potential for the
levels $|g\rangle$ and $|e\rangle$ induced by the FORT beam (this potential
is basically the same for the levels $|s\rangle$ and $|g\rangle$ for a FORT
beam with linear polarization, as in our current experiments).
The potential difference between the levels $|g\rangle$ and $|e\rangle$, in
general, also depends on the random atom position $\mathbf{r}$.


The imaginary part of the Hamiltonian (6) accounts for
the spontaneous emission loss, where $\gamma_s$ denotes the total
spontaneous emission rate of the upper level $|e\rangle$. In writing
this form, we have assumed that the spontaneously emitted
photon escapes and that the atom is not repumped after a
spontaneous emission. This is a good assumption for the
region of interest, where the spontaneous emission loss is
small and the atom thus has a very small probability of being
repumped after emitting a spontaneous emission photon. As
a result of this assumption, the spontaneous emission only
contributes to the leakage error, which is properly represented
by Eq. (6) [32].

We  treat  the  atom’s  position  r  in  the  Hamiltonian  (6)  as  a 
classical  stochastic  variable,  and  neglect  its  quantum  nature. 
This  is  a  good  approximation  for  the  current  experimental 
situation  where  the  atom  is  still  quite  hot.  There  have  been 
some  analyses  of  the  noise  from  quantum  motion  effects  in 
high-Q  cavities  with  very  cold  atoms  [33]. 

We start with the atom in the ground state $|g\rangle$, and then
apply a classical driving pulse $\varepsilon(t)$. This pulse can efficiently
control the time evolution of the Rabi frequency
$\Omega(\mathbf{r},t)$ in the Hamiltonian (6). To see this, we write the
input-output equation for the cavity mode $a'$ [31],

$$\dot{a}' = -i\omega_{se}\,a' - \frac{\kappa}{2}\,a' - \sqrt{\kappa}\,a'_{\mathrm{in}}(t), \qquad (7)$$

where $a'_{\mathrm{in}}(t)$ is the field operator for the input driving pulse
coupling to the mode $a'$, with $\langle a'_{\mathrm{in}}(t)\rangle = \tilde{\varepsilon}(t)$ and
$[a'_{\mathrm{in}}(t), a'^{\dagger}_{\mathrm{in}}(t')] = \delta(t - t')$. By assumption, the mode $a'$ has
the same frequency as the mode $a$, which is resonant with the
free-space atomic transition $|s\rangle \to |e\rangle$, so the eigenfrequency
of $a'$ is $\omega_{se}$. Such a situation corresponds, for example, to
the case of the $(a, a')$ modes of orthogonal polarization but
degenerate in frequency, although this is not an essential requirement.
In Eq. (7), we have neglected the small depletion
of $a'$ caused by the coupling to the atomic transition $\sigma_{eg}$,
since $a'$ is driven by a strong classical pulse, which
dominates its time evolution. We write the mean values of $a'$
and $a'_{\mathrm{in}}(t)$ as $\langle a'\rangle = \alpha(t)e^{-i\omega_{ge}t}$ and
$\langle a'_{\mathrm{in}}(t)\rangle = \tilde{\varepsilon}(t) = \varepsilon(t)e^{-i\omega_{ge}t}$, where $\varepsilon(t)$ is the
slowly varying amplitude of the driving laser. From Eq. (7), we get
a time evolution equation for the mean value $\alpha(t)$, which has
the following immediate solution:

$$\alpha(t) = -\sqrt{\kappa}\int_0^t \varepsilon(\tau)\,e^{(i\omega_{gs} - \kappa/2)(t - \tau)}\,d\tau. \qquad (8)$$

The rate of variation of $\varepsilon(\tau)$ is characterized by the inverse of
the operation time $T$ (the pulse duration), which is typically
much smaller than the hyperfine frequency splitting $\omega_{gs}$
(about 9 GHz for cesium atoms). Hence, a partial integration
of Eq. (8) yields

$$\alpha(t) = -\sqrt{\kappa}\,\frac{\varepsilon(t) - e^{(i\omega_{gs} - \kappa/2)t}\,\varepsilon(0)}{-i\omega_{gs} + \kappa/2}\left[1 + o\!\left(\frac{1}{\omega_{gs}T}\right)\right]. \qquad (9)$$

We assume that $\varepsilon(t)$ gradually increases from zero with
$\varepsilon(0) \approx 0$. Then, to a good approximation, we have
$\alpha(t) \propto \varepsilon(t)$ from Eq. (9). In the following, without loss of
generality, we assume $\alpha(t)$ to be real by choosing an appropriate
constant phase of $\varepsilon(t)$. The time dependence of the Rabi
frequency $\Omega(\mathbf{r},t)$ is completely determined by $\alpha(t)$ [note
that $\Omega(\mathbf{r},t) = r_0 g_0\,\alpha(t)\,\chi(\mathbf{r})$ from Eq. (3)], that is, by the
amplitude $\varepsilon(t)$ of the driving laser.
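The proportionality $\alpha(t) \propto \varepsilon(t)$ can be checked numerically with a minimal sketch. It propagates the mean-field equation $\dot\alpha = (i\omega_{gs} - \kappa/2)\alpha - \sqrt{\kappa}\,\varepsilon(t)$ that follows from Eq. (7) and compares the result with the leading term of Eq. (9). The Gaussian envelope and the ratio $\omega_{gs}/\kappa = 50$ are assumptions made only to keep the example small; in the experiments discussed here the ratio is far larger, which makes the approximation better still.

```python
import numpy as np

kappa, omega_gs = 1.0, 50.0                    # illustrative values (assumptions)
T, dt = 20.0, 1e-3
t = np.arange(0.0, T + dt / 2, dt)
eps = np.exp(-((t - T / 2) / (T / 5)) ** 2)    # assumed Gaussian envelope eps(t)

lam = 1j * omega_gs - kappa / 2
decay = np.exp(lam * dt)
alpha = np.zeros(t.size, dtype=complex)
for n in range(t.size - 1):
    # exact propagation of the homogeneous part, trapezoidal rule for the drive
    alpha[n + 1] = decay * alpha[n] - np.sqrt(kappa) * dt / 2 * (decay * eps[n] + eps[n + 1])

# leading term of Eq. (9) with eps(0) ~ 0
alpha_adiabatic = -np.sqrt(kappa) * eps / (-lam)
rel_dev = np.max(np.abs(alpha - alpha_adiabatic)) / np.max(np.abs(alpha_adiabatic))
print(f"maximum deviation relative to the peak: {rel_dev:.2e}")
```

The residual deviation comes from the $o(1/\omega_{gs}T)$ correction and from the small but nonzero $\varepsilon(0)$ of the assumed Gaussian.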

The dark state (5) can be rewritten as $|D\rangle = \cos\theta\,|g\rangle|0\rangle - \sin\theta\,|s\rangle|1\rangle$,
with $\cos\theta = 1/\sqrt{1 + |r_0\alpha(t)|^2}$ independent of the
atom's position $\mathbf{r}$. The state $|B\rangle$ complementary to the dark
state is usually called the bright state, with
$|B\rangle = \sin\theta\,|g\rangle|0\rangle + \cos\theta\,|s\rangle|1\rangle$. To solve the dynamics
governed by the Hamiltonian (6), we can expand the state $|\Psi\rangle$ of the
whole system into the following superposition:

$$|\Psi\rangle = \left(c_d|D\rangle + c_b|B\rangle + c_e|e\rangle|0\rangle\right)\otimes|\mathrm{vac}\rangle + |s\rangle|0\rangle\otimes|\varphi_1\rangle, \qquad (10)$$

where $|\mathrm{vac}\rangle$ denotes the vacuum state of the free-space
modes $b(\omega)$, and

$$|\varphi_1\rangle = \int_{-\omega_b}^{+\omega_b} d\omega\,c_\omega\,b^\dagger(\omega)\,|\mathrm{vac}\rangle \qquad (11)$$

represents the state (not normalized) of the single-photon
output pulse. The coefficients $c_d$, $c_b$, $c_e$, and $c_\omega$ in Eq. (10)
are time dependent. At the time $t = 0$, we have $c_d = 1$,
$c_b = c_e = c_\omega = 0$, and $\cos\theta = 1$. After applying a classical driving
pulse $\varepsilon(t)$, $\cos\theta$ slowly changes with $\alpha(t)$, and we need
to compute the time evolution of all the coefficients
$c_d, c_b, c_e, c_\omega$ in Eq. (10) by substituting $|\Psi\rangle$ into the
Schrödinger equation $i\partial_t|\Psi\rangle = H|\Psi\rangle$.

To go on with this task, let us first take the adiabatic
approximation, which assumes the time derivative $\partial_t\cos\theta \approx 0$.
As a result, $\partial_t|D\rangle$ and $\partial_t|B\rangle$ become negligible. We will
check the validity of the adiabatic approximation and calculate
various nonadiabatic corrections in the following section
through numerical methods. In the adiabatic limit, the populations
in the bright state $|B\rangle$ and in the excited state $|e\rangle$ are
negligible, so we assume $c_b \approx c_e \approx 0$. The coefficients $c_d$ and
$c_\omega$ satisfy the following evolution equations:

$$\dot{c}_d = -\sqrt{\kappa/2\pi}\,\sin\theta\int_{-\omega_b}^{+\omega_b} c_\omega\,d\omega, \qquad (12)$$

$$\dot{c}_\omega = -i\omega\,c_\omega + \sqrt{\kappa/2\pi}\,c_d\sin\theta. \qquad (13)$$

Equation (13) has the solution

$$c_\omega(t) = \sqrt{\kappa/2\pi}\int_0^t e^{-i\omega(t-\tau)}\,c_d(\tau)\sin\theta(\tau)\,d\tau, \qquad (14)$$

which, substituted into Eq. (12), leads to

$$\dot{c}_d = -\frac{\kappa}{\pi}\,\sin\theta\int_0^t \frac{\sin[\omega_b(t-\tau)]}{t-\tau}\,c_d(\tau)\sin\theta(\tau)\,d\tau \approx -(\kappa/2)\,c_d\sin^2\theta. \qquad (15)$$

The approximation in Eq. (15) is valid since the bandwidth
$\omega_b$ satisfies $\omega_b T \gg 1$, where the operation time $T$ characterizes
the time scale for a significant change of $c_d$ and $\sin\theta$.
Therefore, the dark-state coefficient $c_d$ satisfies the cavity
free-decay equation, with the decay rate $\kappa$ replaced by the
effective rate $\kappa\sin^2\theta$. This can be easily understood since
$\sin^2\theta$ is the probability of the component $|s\rangle|1\rangle$ in the dark
state $|D\rangle$, and it is exactly this component that couples to the
cavity output. Equation (15) has the straightforward solution

$$c_d = \exp\left[-\frac{\kappa}{2}\int_0^t \sin^2\theta(\tau)\,d\tau\right]. \qquad (16)$$

We want to know the single-photon pulse shape $f(t)$ of the
cavity output state $|\varphi_1\rangle$. Suppose now that $T$ is the final time
of the interaction (i.e., the operation time determined by the
driving laser pulse is from 0 to $T$). The pulse shape $f(t)$ is
connected with the coefficients $c_\omega(T)$ of the frequency
components in $|\varphi_1\rangle$ by the Fourier transformation [31]

$$f(t) = \frac{1}{\sqrt{2\pi}}\int_{-\omega_b}^{+\omega_b} d\omega\,c_\omega(T)\,e^{-i\omega(t-T)}. \qquad (17)$$

From Eqs. (14), (16), and (17), we finally obtain

$$f(t) = \sqrt{\kappa}\,\sin\theta(t)\,\exp\left[-\frac{\kappa}{2}\int_0^t \sin^2\theta(\tau)\,d\tau\right]. \qquad (18)$$

Note that the single-photon pulse shape $f(t)$ is completely
determined by $\theta(t)$, i.e., by the driving pulse shape $\varepsilon(t)$, and
is independent of the random atom position $\mathbf{r}$ and the absolute
value of the coupling coefficient $g(\mathbf{r})$. As we have
mentioned before, this is the main advantage of this adiabatic
method compared with either the Raman scheme or prior
proposals based upon adiabatic passage with uniform illumination
[17,18,27], and this feature is essential for many
applications of this setup.
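To illustrate how the adiabatic result could be used to design a pulse, the following sketch evaluates $f(t) = \sqrt{\kappa}\,\sin\theta(t)\exp[-(\kappa/2)\int_0^t \sin^2\theta(\tau)\,d\tau]$ on a time grid for a given $r_0\alpha(t)$, using $\sin^2\theta = |r_0\alpha|^2/(1+|r_0\alpha|^2)$. The Gaussian form of $r_0\alpha(t)$ and its peak value of 5 are assumptions for illustration only; the function name is hypothetical.

```python
import numpy as np

def ideal_pulse(t, r0_alpha, kappa=1.0):
    """Adiabatic pulse shape f(t) on a uniform time grid, for a real r0*alpha(t) >= 0."""
    sin2 = r0_alpha**2 / (1.0 + r0_alpha**2)      # sin^2(theta)
    dt = t[1] - t[0]
    # cumulative trapezoidal integral of sin^2(theta) from 0 to t
    integral = np.concatenate(([0.0], np.cumsum((sin2[1:] + sin2[:-1]) * dt / 2)))
    return np.sqrt(kappa * sin2) * np.exp(-kappa * integral / 2)

kappa = 1.0
T = 20.0 / kappa
t = np.linspace(0.0, T, 4001)
r0_alpha = 5.0 * np.exp(-((t - T / 2) / (T / 5)) ** 2)   # assumed Gaussian drive
f = ideal_pulse(t, r0_alpha, kappa)

# integral of |f|^2 over [0, T]: the emitted-photon probability in this limit
emitted = np.sum((f[1:] ** 2 + f[:-1] ** 2) / 2) * (t[1] - t[0])
print(f"emitted probability: {emitted:.5f}")
```

Since $\int_0^T |f|^2 dt = 1 - \exp[-\kappa\int_0^T \sin^2\theta\,d\tau]$, the emitted probability approaches 1 when the drive is strong and long enough. The coupling $g(\mathbf{r})$ does not appear anywhere in the function, which is the position independence discussed above.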

The above result is obtained within the adiabatic approximation,
and in the adiabatic limit the solution is independent
of the atomic spontaneous emission rate $\gamma_s$ and the detuning
$\Delta$. This is only a rough picture. In the following, we
solve exactly the dynamics governed by the Hamiltonian (6)
without the use of the adiabatic approximation. The exact
solution is necessary for two reasons. First, we
need to verify the above ideal picture and to find out under
what conditions it is approximately valid. Although in
the three-level case we have a simple estimate of the
condition for adiabatic following, it is not easy to figure
out the exact adiabatic following condition for the more realistic
situation of a continuum of external modes, where
the argument based on the level spacing is not valid.
We need to know how long the operation time $T$ should be to
satisfy the adiabatic following condition. We also expect that
the atomic spontaneous emission cannot be made negligible
simply by increasing the operation time $T$. Its rate $\gamma_s$ should
be small enough to satisfy the strong-coupling condition
$\kappa\gamma_s \ll \bar{g}^2$, where $\bar{g}$ denotes the average of the coupling rate
$g(\mathbf{r})$ [34]. Second, in real experiments, the operation time $T$
is not infinitely long, and the coupling rate $\bar{g}$ cannot be arbitrarily
larger than the decay rates $\kappa$ and $\gamma_s$ because of technological
limitations (for instance, in the Caltech experiments, typically,
$\bar{g}/2\pi$ is around 20 MHz, and $\kappa/2\pi \approx \gamma_s/2\pi \approx 6$ MHz).
In this case, there are various nonadiabatic
corrections to the above ideal picture: for instance, the atom
may go down from the level $|e\rangle$ to $|s\rangle$ through a spontaneous
emission, so that we lose the emitted photon and have
no output from the cavity; or we have a single-photon output,
but with a wrong and unknown pulse shape because of its
sensitivity to the random atom position induced by the nonadiabatic
contributions. It is important to calculate
quantitatively the magnitudes of these noise sources to make
predictions for real experiments. The exact solution of the system
dynamics is only available through numerical methods, which
is the main task of the following section.

III.  EXACT  NUMERICAL  SIMULATIONS 

A.  The  numerical  calculation  method 

In this section, we solve exactly the system dynamics
governed by the Hamiltonian (6) through numerical simulations,
and calculate various nonadiabatic corrections and
noise magnitudes. For numerical simulations of the Hamiltonian (6),
we need to discretize the free-space field $b(\omega)$ by
introducing a finite but small frequency interval $\delta\omega$ between
two adjacent modes. Then, in total we have about
$N \approx 2\omega_b/\delta\omega$ free-space modes, with the $j$th mode denoted by
$b_j$. The frequency detuning $\omega_j$ of the $j$th mode is given by
$\omega_j = (j - N/2)\,\delta\omega$. To ensure that the discretization does not
change the physical result, we should choose the
frequency interval $\delta\omega$ much smaller than the inverse of the
operation time $T$, and the bandwidth $\omega_b$ much larger than the
cavity decay rate $\kappa$.

For the numerical simulation, we can similarly expand the
state $|\Psi\rangle$ of the whole system in the form of Eq. (10), with the
single-photon pulse state replaced by

$$|\varphi_1\rangle = \sum_{j=1}^{N} c_j\,b_j^\dagger\,|\mathrm{vac}\rangle. \qquad (19)$$

From the Hamiltonian (6), we get the following complete set
of equations for the coefficients $c_d$, $c_b$, $c_e$, and $c_j$:

$$\dot{c}_d = -\dot{\theta}\,c_b - \kappa'\sin\theta\sum_{j=1}^{N} c_j, \qquad (20)$$

$$\dot{c}_b = \dot{\theta}\,c_d - i\sqrt{\Omega^2(\mathbf{r},t) + g^2(\mathbf{r})}\,c_e + \kappa'\cos\theta\sum_{j=1}^{N} c_j, \qquad (21)$$

$$\dot{c}_e = (-i\Delta - \gamma_s/2)\,c_e - i\sqrt{\Omega^2(\mathbf{r},t) + g^2(\mathbf{r})}\,c_b, \qquad (22)$$

$$\dot{c}_j = -i(j - N/2)\,\delta\omega\,c_j + \kappa'\sin\theta\,c_d - \kappa'\cos\theta\,c_b, \qquad (23)$$

where the effective decay rate $\kappa' = \sqrt{\kappa\,\delta\omega/2\pi}$. We obtain the
solutions of these coefficients by numerically integrating
Eqs. (20)-(23) from the time $t = 0$ to $t = T$, where $T$ is the
duration of the driving pulse $\varepsilon(t)$.

FIG. 2. The shape of the output single-photon pulse described
by the amplitude $|f(t)|$ vs the time $t$ for the coupling rates
$g(\mathbf{r}) = 3\kappa$ (solid curve) and $g(\mathbf{r}) = 6\kappa$ (dotted curve). The dashed curve
represents the pulse shape in the ideal adiabatic limit calculated
from Eq. (18). In this figure, we have taken $\gamma_s = \kappa$, $\Delta = 0$, and
$T = 20/\kappa$. The driving pulse $\varepsilon(t)$ is in a Gaussian shape with the peak
at $t = T/2$ and a width $t_w = T/5$.

We assume that $\varepsilon(t)$ is a
Gaussian pulse so that $\alpha(t)$ is a Gaussian function of the
time $t$, with its peak value at $T/2$ and a width $t_w$ significantly
smaller than $T/2$. All the functions of $\theta$ in Eqs. (20)-(23) are
determined from $\cos\theta = 1/\sqrt{1 + |r_0\alpha(t)|^2}$ and
$\sqrt{\Omega^2(\mathbf{r},t) + g^2(\mathbf{r})} = g(\mathbf{r})/\cos\theta$. To simulate the randomness of the
atom position $\mathbf{r}$, we vary the value of $g(\mathbf{r})$ in the simulation and check
whether the final result changes with this variation.
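A minimal sketch of such a simulation is given below; it is an illustration under stated assumptions, not the code used for the figures. It integrates the coupled equations for $c_d$, $c_b$, $c_e$, and the discretized output amplitudes $c_j$ with a fixed-step fourth-order Runge-Kutta scheme, in units where $\kappa = 1$. The Gaussian form $r_0\alpha(t) = A\exp[-(t - T/2)^2/t_w^2]$ with peak $A = 5$, the bandwidth $\omega_b = 10\kappa$, the mode spacing $\delta\omega = 0.02\kappa$, and the time step are all assumptions chosen for illustration, so the numbers it produces need not coincide with those quoted in the text.

```python
import numpy as np

def simulate(g=3.0, gamma_s=1.0, delta=0.0, kappa=1.0, T=20.0,
             omega_b=10.0, d_omega=0.02, peak=5.0, dt=0.01):
    """RK4 integration of the amplitudes; returns (atomic, output) populations at t = T."""
    n = int(round(2 * omega_b / d_omega))                # number of free-space modes N
    omega = (np.arange(1, n + 1) - n / 2) * d_omega      # detunings omega_j = (j - N/2) d_omega
    kp = np.sqrt(kappa * d_omega / (2 * np.pi))          # kappa' = sqrt(kappa d_omega / 2 pi)
    t_w = T / 5

    def rhs(t, y):
        x = peak * np.exp(-((t - T / 2) / t_w) ** 2)     # r0*alpha(t) = tan(theta), assumed Gaussian
        dtheta = -2 * (t - T / 2) / t_w**2 * x / (1 + x**2)
        c, s = 1 / np.sqrt(1 + x**2), x / np.sqrt(1 + x**2)
        gap = g / c                                      # sqrt(Omega^2 + g^2)
        cd, cb, ce, cj = y[0], y[1], y[2], y[3:]
        out = np.empty_like(y)
        out[0] = -dtheta * cb - kp * s * cj.sum()
        out[1] = dtheta * cd - 1j * gap * ce + kp * c * cj.sum()
        out[2] = (-1j * delta - gamma_s / 2) * ce - 1j * gap * cb
        out[3:] = -1j * omega * cj + kp * s * cd - kp * c * cb
        return out

    y = np.zeros(3 + n, dtype=complex)
    y[0] = 1.0                                           # start in the dark state
    for k in range(int(round(T / dt))):
        t = k * dt
        k1 = rhs(t, y)
        k2 = rhs(t + dt / 2, y + dt / 2 * k1)
        k3 = rhs(t + dt / 2, y + dt / 2 * k2)
        k4 = rhs(t + dt, y + dt * k3)
        y = y + dt / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
    return float(np.sum(np.abs(y[:3]) ** 2)), float(np.sum(np.abs(y[3:]) ** 2))

atom_pop, out_pop = simulate()
lost = 1.0 - atom_pop - out_pop      # norm lost through the non-Hermitian gamma_s term
print(f"atomic: {atom_pop:.4f}, output: {out_pop:.4f}, lost: {lost:.4f}")
```

Calling `simulate(g=6.0)` repeats the run for a different atom position. With `gamma_s=0.0` the total norm should be conserved up to integration error, which is a convenient sanity check on the discretization and the time step.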

B.  Shape  of  the  output  single-photon  pulse 

The output single-photon pulse shape $f(t)$ can be easily
constructed from the solution for the coefficients $c_j$ through a
discrete version of Eq. (17). The result is shown in Fig. 2 for
$g(\mathbf{r}) = 3\kappa$ and $g(\mathbf{r}) = 6\kappa$. Although we have not made definitive
measurements, we estimate that $g(\mathbf{r})$ varies within a
factor of roughly 2 in the current Caltech experiment
[2,3,16]. Here and in the following, the pulse-shape function
$f(t)$ is always renormalized according to $\int |f(t)|^2 dt = 1$ for
convenience of comparison. We see that the two curves overlap
very well, which confirms the prediction that the output
pulse shape is very insensitive to the randomness of the coupling
coefficient $g(\mathbf{r})$ when the adiabatic condition is satisfied
(we take $T = 20/\kappa$ for this figure). We also draw in this
figure the pulse shape $f(t)$ given by Eq. (18) derived in the
ideal adiabatic limit, which agrees well with the exact numerical
results. Therefore, within the adiabatic condition, we
can use the analytical result (18) to design the shape of the
output single-photon pulse by modulating the driving pulse
shape $\varepsilon(t)$.

C.  Noise  magnitudes  and  the  adiabatic  condition 

To quantify the noise magnitudes in this setup, we can
define several error probabilities. First, we have the leakage
error due to the atomic spontaneous emission. A photon may
be emitted to modes other than the principal cavity mode
through spontaneous emission with the rate $\gamma_s$. As a
result, the norm $|c_d|^2 + |c_b|^2 + |c_e|^2 + \sum_{j=1}^{N}|c_j|^2$ of the state
(10) decays with the time $t$, and we can use

$$P_{\mathrm{spon}} = 1 - |c_d(T)|^2 - |c_b(T)|^2 - |c_e(T)|^2 - \sum_{j=1}^{N}|c_j(T)|^2 \qquad (24)$$

at the final time $T$ to quantify the total probability of the
spontaneous emission loss. Second, due to the finiteness of
the operation time $T$ and the pumping field amplitude $\varepsilon(t)$,
the initial excitation in the dark state is not necessarily fully
transferred to the output quantum signal at the final time, and
we can use

$$P_{\mathrm{tran}} = |c_d(T)|^2 + |c_b(T)|^2 + |c_e(T)|^2 \qquad (25)$$

at the time $T$ to quantify the transmission inefficiency. In
principle, we can arbitrarily decrease the transmission inefficiency
by increasing the duration $T$ or the amplitude $\varepsilon(t)$ of
the pumping field. Finally, even if a photon is emitted into
the cavity output field, it is not necessarily in the right pulse
shape as given by Eq. (18), because of the nonadiabatic correction.
This nonadiabatic correction depends on the random
atom position and is unknown, so it is also a source of noise.
To quantify this noise, we denote the ideal pulse shape given
in Eq. (18) as $f_{\mathrm{id}}(t)$, and the real pulse shape calculated from
the numerical simulation as $f_{\mathrm{real}}(t)$; the shape mismatching
error can then be described by

$$P_{\mathrm{mis}} = 1 - \frac{\left|\int_0^T f_{\mathrm{real}}^*(t)\,f_{\mathrm{id}}(t)\,dt\right|}{\left[\int_0^T |f_{\mathrm{real}}(t)|^2 dt \int_0^T |f_{\mathrm{id}}(t)|^2 dt\right]^{1/2}}. \qquad (26)$$

This quantity is directly related to the visibility of the fringes
if we interfere two single-photon pulses from two such setups.
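As a small illustration of how a shape mismatch of this kind can be evaluated from sampled data, the sketch below computes one minus the normalized modulus of the overlap between two pulse shapes on a uniform time grid. The Gaussian test shapes, the phase values, and the function name are hypothetical and serve only to show that a global phase leaves the mismatch at zero, while a time-dependent phase error does not.

```python
import numpy as np

def shape_mismatch(f_real, f_ideal, dt):
    """One minus the normalized overlap modulus of two pulse shapes sampled with spacing dt."""
    overlap = np.abs(np.sum(np.conj(f_real) * f_ideal) * dt)
    norm = np.sqrt(np.sum(np.abs(f_real) ** 2) * dt * np.sum(np.abs(f_ideal) ** 2) * dt)
    return 1.0 - overlap / norm

t = np.linspace(0.0, 20.0, 2001)
dt = t[1] - t[0]
f_id = np.exp(-((t - 10.0) / 3.0) ** 2)                 # hypothetical ideal shape
f_phase = f_id * np.exp(1j * 0.3)                       # same shape with a global phase
f_chirp = f_id * np.exp(1j * 0.05 * (t - 10.0) ** 2)    # time-dependent phase error

print(shape_mismatch(f_phase, f_id, dt))   # global phase: no mismatch
print(shape_mismatch(f_chirp, f_id, dt))   # phase variation across the pulse: nonzero
```

Because the measure depends on the complex overlap, it is sensitive to phase variations across the pulse as well as to amplitude differences, which is what matters for the two-photon interference visibility mentioned above.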

For the example shown in Fig. 2, with $g(\mathbf{r}) = 3\kappa = 3\gamma_s$
(the other parameters are given in the figure caption), we
have $P_{\mathrm{spon}} \approx 4.0\%$, $P_{\mathrm{tran}} \approx 0.04\%$, $P_{\mathrm{mis}} \approx 0.18\%$. The dominant
source of noise is the leakage error $P_{\mathrm{spon}}$ induced by the
spontaneous emission. If we increase the operation time $T$ so
that the adiabatic condition is better satisfied, the above-defined
noise magnitudes are reduced slightly, but not
by much. For instance, with the above example but with
$T = 30/\kappa$, we have $P_{\mathrm{spon}} \approx 3.33\%$ and $P_{\mathrm{mis}} \approx 0.15\%$. On the
other hand, if $T$ is reduced so that the adiabatic condition is
not well satisfied, the error probabilities can increase significantly.
Figure 3 shows the output pulse shapes for $g(\mathbf{r}) = 3\kappa$
and $g(\mathbf{r}) = 6\kappa$ with $T = 5/\kappa$. The two curves are obviously
different from each other and are also different from
the ideal shape as given by Eq. (18). For the example with
$g(\mathbf{r}) = 3\kappa = 3\gamma_s$ and $T = 5/\kappa$, we have $P_{\mathrm{spon}} \approx 36\%$,
$P_{\mathrm{tran}} \approx 3.2\%$, $P_{\mathrm{mis}} \approx 2.7\%$. All the noise magnitudes increase
significantly. In particular, the spontaneous emission loss becomes
very large. This can be easily understood since, without
the adiabatic condition, the excited state $|e\rangle$ is populated
during the operation, and thus we have a correspondingly
larger spontaneous emission loss.

FIG. 3. The shape $|f(t)|$ of the output single-photon pulse for
the coupling rates $g(\mathbf{r}) = 3\kappa$ (solid curve), $g(\mathbf{r}) = 6\kappa$ (dotted curve),
and in the ideal adiabatic limit (dashed curve). We assumed the
same conditions as in Fig. 2, except that $T = 5/\kappa$, which does not
satisfy the adiabatic condition well.


D.  The  strong-coupling  condition 

Next we look at the requirement of the strong-coupling
condition. Let $\bar{g}$ denote the average value of the coupling
rate $g(\mathbf{r})$. Normally one requires $\bar{g}^2 \gg \kappa\gamma_s$ to satisfy the
strong-coupling condition. We can define the strong-coupling
parameter $d_{sc}$ as $d_{sc} = \bar{g}^2/\kappa\gamma_s$, and calculate the above-defined
noise magnitudes $P_{\mathrm{spon}}$, $P_{\mathrm{tran}}$, $P_{\mathrm{mis}}$ under different
values of the parameter $d_{sc}$. We assumed $T = 30/\kappa$ and $\Delta = 0$
in the calculation so that the adiabatic condition is well
satisfied. It turns out that the spontaneous emission loss $P_{\mathrm{spon}}$
is always the dominant loss (about ten times larger than other
sources of noise). Thus, in Fig. 4, we only show the calculation
result for $P_{\mathrm{spon}}$ under different values of $d_{sc}$. The result
is well approximated by the empirical curve
$P_{\mathrm{spon}} \approx 1/(4d_{sc})$.

We  can  use  this  simple  formula  to  estimate  the  spontane¬ 
ous  emission  loss  under  different  experimental  conditions. 
Actually,  in  current  experiments,  the  strong-coupling  condi¬ 
tion  is  only  marginally  satisfied.  For  instance,  for  the  cesium 
atom in the Caltech group, (κ, γ_s)/2π ≈ (8, 5.2) MHz (note that κ and γ_s here denote the energy decay rates, which are two times the corresponding amplitude decay rates) [2,3], and g/2π is expected to be ≈ 15 MHz for the transition (6S_{1/2}, F=4, m=+4) → (6P_{3/2}, F=4, m=+4). [Note that the transition (6S_{1/2}, F=4, m=+4) → (6P_{3/2}, F=5, m=+5) cannot be used as a Λ configuration though it has a slightly larger coupling rate g.] These values lead to d_sc = g²/κγ_s ≈ 5.4 and a resulting spontaneous emission loss around 4.6%, which is quite accessible with the present technology. As another example, in the recent experiment [18], one has (κ, γ_s)/2π ≈ (1.25, 6.0) MHz and g/2π ≈ 2.5 MHz according to the estimation there. With these parameters, g²/κγ_s ≈ 0.83 and we estimate that the spontaneous emission loss is about P_spon ≈ 30% if one uses the scheme here. If the usual adiabatic scheme is adopted with a uniform driving pulse perpendicular to the cavity axis, the spontaneous emission loss should be significantly larger still, as will be seen from the simulation in the last section.

FIG. 4. The spontaneous emission loss P_spon vs the average coupling rate g (in units of the cavity decay rate κ). We assumed Δ = 0 and γ_s = κ, so the strong-coupling parameter d_sc is simply (g/κ)². The circles represent the results from the numerical calculation, and the dashed curve is from the empirical formula P_spon ≈ 1/(4d_sc), which reproduces the numerical results well.

FIG. 5. The amplitude (the modulus) and the phase (divided by π/2) of the real pulse shape f_real(t) (two solid curves) and the ideal pulse shape f_id(t) (two dashed curves) vs the time t with the single-photon transition detuning Δ = κ. We assumed g(r) = 3κ, γ_s = κ, and T = 30/κ. In this case, the main difference between f_real(t) and f_id(t) lies in the phase.
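The two loss estimates quoted in this passage can be reproduced with a few lines of arithmetic. The sketch below is only an illustration: it applies the empirical fit P_spon ≈ 1/(4 d_sc) from the caption of Fig. 4 to the two parameter sets given in the text. The function names are hypothetical, and the fit was stated for Δ = 0 and γ_s = κ, so using it for other parameter ratios is an assumption.

```python
def strong_coupling_parameter(g, kappa, gamma_s):
    """d_sc = g^2 / (kappa * gamma_s); all rates in the same units (here MHz, i.e. rate / 2*pi)."""
    return g**2 / (kappa * gamma_s)


def spontaneous_loss_estimate(d_sc):
    """Empirical fit P_spon ~ 1 / (4 * d_sc) taken from the caption of Fig. 4."""
    return 1.0 / (4.0 * d_sc)


# Parameter set quoted for the Caltech setup: (kappa, gamma_s, g) / 2*pi in MHz.
d_sc_caltech = strong_coupling_parameter(g=15.0, kappa=8.0, gamma_s=5.2)
loss_caltech = spontaneous_loss_estimate(d_sc_caltech)

# Parameter set quoted for the experiment of Ref. [18].
d_sc_ref18 = strong_coupling_parameter(g=2.5, kappa=1.25, gamma_s=6.0)
loss_ref18 = spontaneous_loss_estimate(d_sc_ref18)

print(f"Caltech:   d_sc = {d_sc_caltech:.2f}, P_spon = {100 * loss_caltech:.1f}%")
print(f"Ref. [18]: d_sc = {d_sc_ref18:.2f}, P_spon = {100 * loss_ref18:.1f}%")
```

Both results agree with the values stated in the text (d_sc ≈ 5.4 with a loss near 4.6%, and d_sc ≈ 0.83 with a loss near 30%).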

E.  The  influence  of  the  single-photon  transition  detuning 

In the above calculations, we assumed Δ = 0. Finally, we discuss the influence of a nonzero single-photon detuning Δ. In Fig. 5, we show the calculated exact pulse-shape function f_real(t) with a significant detuning Δ = κ, and compare it with the ideal pulse-shape function f_id(t) given by Eq. (18) for both the amplitude and the phase. The other parameters for this example are given in the figure caption. From the figure, we see that the two amplitudes |f_real(t)| and |f_id(t)| still overlap very well, but their phases differ somewhat because of the detuning.

This phase difference is determined by the detuning Δ, which in turn depends on the differential level shift between the ground and excited states, and hence varies with the atom's position within the FORT beam. In the case of the simple level scheme depicted in Fig. 1, the states |g⟩ and |e⟩ would have spatially dependent level shifts of opposite sign, which would lead to variations in Δ comparable to the trap depth. Fortunately, there is a simple way to mitigate this difficulty by considering the multiple levels involved for the FORT beam, as described in Ref. [35], so that the trapping potentials for the states |g⟩ and |e⟩ are very nearly the same. For example, for the experiment of Ref. [16], the difference in trap depth for |g⟩ and |e⟩ is roughly 10% of the trap depth. Relative to the current analysis, there is then a variation in Δ as the atom moves in the FORT potential, which is unknown when the adiabatic protocol is implemented. The curve in Fig. 5 is an attempt to estimate the impact of such random detunings by setting Δ = κ, which exceeds the actual magnitude of any spatially dependent detunings for FORT depths up to about 50 MHz. The phase difference in the pulse-shape function caused by the unknown detunings is a source of noise, which contributes to the shape mismatching error defined in Eq. (26). For this example with g(r) = 3κ, we have P_spon ≈ 3.33% and P_tran ≈ 10⁻⁴, which are basically the same as in the corresponding case without detuning, but P_mis ≈ 3.33%, which becomes significantly larger due to the contribution of the phase difference.

F.  Comparison  with  the  usual  adiabatic  scheme 

In our scheme, the driving pulse is matched to a cavity mode which has basically the same spatial mode structure as the cavity-QED light. In usual adiabatic schemes [18,27], the driving laser is assumed to be perpendicular to the cavity axis with uniform illumination intensity. We expect that with the present interaction configuration, our scheme is less sensitive to the randomness in the atom position. To compare the two configurations more quantitatively, we have calculated the output pulse shapes and noise magnitudes for both schemes.

First, let us assume that the atom has been trapped in one potential well, but the coupling rate g(r) may vary within a factor of 2 because the atom's position is unknown. In Fig. 6, we show the calculated output pulse shapes. The solid curve shows the pulse-shape function |f(t)| when g(r) = 3κ and Ω_m(r) = 15κ, where Ω_m(r) is the maximum of Ω(r,t) with respect to time t [Ω(r,t) is assumed to be a Gaussian function of t as specified in the caption of Fig. 2].

FIG. 6. The shape |f(t)| of the output single-photon pulse for the following pairs of coupling rates and driving Rabi frequencies: first, g(r) = 3κ and Ω_m(r) = 15κ (solid curve); second, g(r) = 6κ and Ω_m(r) = 30κ (dashed curve); and finally, g(r) = 6κ and Ω_m(r) = 15κ (dotted curve). The other parameters are the same as in Fig. 2.


FIG. 7. The real part of the pulse-shape function Re[f(t)] as g(r) varies with time in the form g(r) = 6κ sin(4πt/T + φ_0) in the usual adiabatic scheme with φ_0 = 0 (solid curve) and φ_0 = π/2 (dash-dotted curve), respectively. The dashed curve shows the ideal pulse shape calculated from Eq. (18). The other parameters in this figure are the same as those in Fig. 2.


Now, if g(r) varies by a factor of 2 due to a change of the atom's position, in our scheme the Rabi frequency will correspondingly change by the same ratio. The dashed curve shows the pulse shape for g(r) = 6κ and Ω_m(r) = 30κ. One can see that the two curves overlap very well, with the mode mismatching noise smaller than 0.2%. In contrast, in usual adiabatic schemes with uniform illumination intensity, Ω_m(r) does not change as g(r) varies with the atom position, so we have the same Ω_m(r) = 15κ. The dotted curve in Fig. 6 shows the pulse shape for g(r) = 6κ and Ω_m(r) = 15κ. It is significantly different from the above two curves, with a notable mode mismatching noise P_mis ≈ 6.9%. The improvement offered by this scheme would become more impressive if g(r) has a larger variation, both in its magnitude and in its sign. If g(r) gets a random sign as the atom goes through different axial positions of the cavity, in the usual adiabatic scheme the pulse-shape function f(t) will also pick up a random sign. However, in the present scheme, this random sign in f(t) can be eliminated. Therefore, with this interaction configuration, the scheme is more robust to the random variation of the atom's position.
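The reason the matched configuration is robust can be made concrete with the dark-state mixing angle. The sketch below assumes the standard relation tan θ = Ω/g for a Λ system (equivalently sin θ = Ω/√(Ω² + g²)), which is an assumption here because Eq. (18) and the definition of θ lie outside this section. The function name `sin_theta` is hypothetical. It evaluates the peak mixing angle for the three parameter pairs of Fig. 6, with rates in units of κ.

```python
import math


def sin_theta(omega, g):
    """Peak mixing angle, assuming tan(theta) = Omega / g for the dark state."""
    return omega / math.hypot(omega, g)


# Rates in units of the cavity decay rate kappa (parameter pairs of Fig. 6).
matched_a = sin_theta(omega=15.0, g=3.0)   # reference case
matched_b = sin_theta(omega=30.0, g=6.0)   # present scheme: Omega scales with g
uniform = sin_theta(omega=15.0, g=6.0)     # usual scheme: Omega fixed while g doubles

print(f"matched, g = 3: sin(theta) = {matched_a:.4f}")
print(f"matched, g = 6: sin(theta) = {matched_b:.4f}")
print(f"uniform, g = 6: sin(theta) = {uniform:.4f}")
```

Under this assumption the two matched cases give the same mixing angle because only the ratio Ω/g enters, while the uniformly illuminated case gives a different one. This is consistent with the overlap of the solid and dashed curves and the deviation of the dotted curve described in the text.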

The improvement offered by this protocol is also very remarkable if we consider the case where the atom is not fixed in one potential well and may move from well to well in the axial direction. The variation of the atom's position in the axial direction is typically fast compared with the operation time T, so we have a time-varying atom position r and coupling rate g(r). Here, we consider an explicit form of the time variation of g(r) by assuming g(r(t)) = 6κ sin(4πt/T + φ_0), where the phase φ_0 is randomly chosen, corresponding to the randomness in the atom's initial position. This special example is enough to illustrate the general result. First, let us calculate the output pulse shape f(t) for the usual adiabatic scheme, where Ω_m(r) is fixed as a constant [17,18,27]. The solid and the dash-dotted curves in Fig. 7 show the real parts of f(t) with initial phase φ_0 = 0 and φ_0 = π/2, respectively [the imaginary parts of f(t) are actually small and negligible]. The two curves do not overlap at all. Neither the magnitude nor the phase of the pulse shape f(t) can be controlled with this scheme. We also calculate the spontaneous emission loss P_spon for this example. The average spontaneous emission loss is about P_spon ≈ 25%.

Similarly, we can calculate the pulse shape for the same example with the present scheme. In this case, due to the atomic motion, Ω_m(r) varies with time in the same way as g(r), but the ratio Ω_m(r)/g(r) is kept constant. Figure 8 shows the real part of the shape function f(t) in this case, with the solid and the dash-dotted curves corresponding to the initial phase φ_0 = 0 and φ_0 = π/2, respectively. Although the two curves do not overlap very well, they still look similar and have the same phase. They also roughly agree with the ideal shape function given by Eq. (18), which is shown as the dashed curve in Fig. 8. The average mode mismatching noise for these two curves is given by …, and the average spontaneous emission loss is P_spon ≈ 9.4%. The spontaneous emission loss is also significantly reduced with the present scheme. This can be understood as follows: if one has a constant Ω_m(r), as in the usual adiabatic scheme, then when the atom moves to a place where g(r) is near zero, the adiabatic condition is not well satisfied, and as a result one has a considerably large spontaneous emission loss; however, in the present scheme, in the place where g(r) is near zero, Ω_m(r) is also near zero. The excitation probability of the atom is then reduced, and the adiabatic condition is better satisfied. Consequently, one has a smaller spontaneous emission loss.

FIG. 8. The real part of the pulse-shape function Re[f(t)] calculated for the same example as in Fig. 7, but now for the present adiabatic scheme, where the driving Rabi frequency varies in the same way as the coupling rate when the atom moves.

IV.  APPLICATIONS 

There have been many proposals to use the setup with single atoms in high-Q cavities for various applications, such as a single-photon or entangled-photon source [6,7], quantum communication between different cavities [9], atomic quantum teleportation [11,12], and quantum computation [13]. In these proposals, one always assumed that the atom is well localized so that the Lamb-Dicke condition is satisfied. However, one can apply the method here to all of the schemes mentioned above, to eliminate the challenging Lamb-Dicke condition. Basically, what one needs to do is to replace the Raman scheme with the adiabatic scheme, and to keep the pumping laser collinear with the cavity axis, so that the driving pulse and the cavity mode have the same spatial mode structure. All the calculation results (for the noise magnitudes, pulse shape, etc.) in this paper apply to these schemes. After the improvement, it becomes considerably easier to implement these schemes with the current technology. Here, we briefly review these schemes and discuss how to incorporate the present method into them.

A.  Controllable  single-photon  or  entangled-photon  source 

It is desirable to have a single-photon source with all its properties fully controllable, including its emission direction, emission time, and pulse shape. This kind of source has important applications in some recent quantum-information processing schemes [36], which are normally based on the interference of different single-photon pulses. To get interference between different single-photon pulses, one requires all the pulses to be directional and to have the same temporal shape. Recently, there have been significant experimental advances in the realization of the single-photon source [17,18,37-39]. In the experiments based on solid-state materials [37-39], the single-photon emitter has a fixed position, and one can, in principle, control the pulse shape well. However, the emitted pulse is typically not directional. On the other hand, in current experiments [17,18] with high-Q cavities, the emitted pulse is directional, but its shape is not well controlled since, with uniform illumination by a perpendicular driving pulse, the waveform f(t) depends on the time history of the coupling rate g(r), which in turn depends on the atom's position. As the atom falls through the cavity, it has basically a random trajectory, leading to unknown variations in g(r) both in magnitude and sign. It is a challenging experimental endeavor to demonstrate a single-photon source with all the properties mentioned above fully controllable.

The method in this paper shows that the single atom trapped in a high-Q cavity is a good candidate for the realization of the fully controllable single-photon source. Though the coupling rate g(r) is not completely fixed in current setups due to the difficulty in fully localizing the atom, the emitted single-photon pulse has a well-controlled temporal shape and emission direction with an appropriate design of the interaction configuration, as has been shown above.

As shown in Ref. [7], with a more involved atomic level structure, it is possible to engineer entanglement between different single-photon pulses. It is straightforward to combine the method here with that scheme to eliminate the requirement of the Lamb-Dicke condition in Ref. [7], so that one can get an entangled single-photon source with the "hot" trapped atom as well.

B.  Quantum  communication  between  different  cavities 

The dynamics governed by the Hamiltonian (6) is reversible if we neglect the atomic spontaneous emission γ_s. Therefore, if one directs the emitted single-photon pulse back to the cavity, and at the same time reverses the temporal shapes of both the single-photon pulse and the driving pulse, the single-photon pulse will be completely absorbed as long as the noise effects are negligible. It was first proposed in Ref. [9] that one can use this kind of phenomenon to achieve quantum communication between different cavities, that is, to transfer quantum states of a trapped atom from one cavity to another cavity. For this purpose, one can require that the emitted single-photon pulse has a time-symmetric shape by modulating the driving pulse shape. For a time-symmetric pulse, its time reversal is itself, so we can directly input this pulse to another cavity with the same configuration but with a time-reversed driving pulse; the single-photon pulse will then be completely absorbed by this cavity, which transfers the atomic state from one cavity to the other. The scheme in Ref. [9] is based on the Raman configuration, but it is straightforward to transfer it to the adiabatic configuration discussed in this paper so that it works with a hot trapped atom. Note that the same setup can also be used for storage of a single-photon pulse with a known shape [30,40,41].

To get a time-symmetric single-photon pulse for complete absorption by the second cavity, Ref. [9] gives a numerical solution for the shape of the driving pulse. For the adiabatic configuration, one has the analytic expression (18), which connects the shape of the output single-photon pulse to the shape of the driving pulse, and this expression has been verified in Sec. III through exact numerical calculations to be a good approximation under reasonable experimental parameters. With this analytical expression, it becomes easier to design the shape ε(t) of the driving pulse. The form of sin θ(t) can be easily solved from Eq. (18) (see also Ref. [40]) with the expression

The form of sin θ(t) is immediately available from this equation for any desirable output pulse shape f(t) (which has been assumed to be real and positive for simplicity). Then, the shape of the driving pulse can be easily determined from sin θ(t) = r_0 ε(t)/√(1 + |r_0 ε(t)|²), where r_0 is the ratio of the Clebsch-Gordan coefficients. For instance, if we want to have a time-symmetric f(t) in the period 0 ≤ t ≤ T with the form f(t) = √(β/2) sech[β(t − T/2)], where we have assumed sech(−βT/2) ≪ 1, sin θ(t) should be of the form sin θ(t) = √(β/κ) √(1 + tanh[β(t − T/2)]). Note that we only have a solution for θ(t) when the rate β ≤ κ/2, which is consistent with the observation that any pulse from the decay of a cavity cannot vary with time faster than the cavity decay rate. From sin θ(t), we see that the shape ε(t) of the driving pulse should be chosen according to


\[
\varepsilon(t) \propto \sqrt{\frac{1+\tanh[\beta(t-T/2)]}{(\kappa/\beta-1)-\tanh[\beta(t-T/2)]}}. \qquad (28)
\]


As a special case, if β = κ/2, ε(t) ∝ e^{β(t − T/2)}, which grows exponentially with the time t over the operation period 0 ≤ t ≤ T. Therefore, we have a simple solution for the driving pulse shape for quantum communication between two different cavities: for the first cavity, we apply an exponentially increasing pulse with ε(t) = ε(0)e^{κt/2}, and for the second cavity we apply its time reversal, that is, an exponentially decreasing pulse with the decay rate κ/2. The pulse duration T should satisfy κT ≫ 1, and the initial value ε(0) is determined by the requirement r_0 ε(T/2) = 1. The single-photon pulse connecting the two cavities then has a time-symmetric shape with f(t) ∝ sech[κ(t − T/2)/2].
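The pulse-design recipe of this subsection can be checked numerically. The sketch below assumes that Eq. (18), which lies outside this section, has the form f(t) = √κ sin θ(t) exp[−(κ/2)∫₀ᵗ sin²θ(τ)dτ]; this form is inferred from the emission probability 1 − exp[−κ∫ sin²θ dτ] quoted in Sec. IV C and is not taken verbatim from the paper. Under that assumption, sin²θ(t) = f²(t)/{κ[1 − ∫₀ᵗ f²(τ)dτ]}. The code inverts a sech pulse this way and compares the result with the closed forms quoted in the text, including Eq. (28). The values β = 0.4κ and T = 40/κ are illustrative choices.

```python
import numpy as np

kappa = 1.0            # cavity decay rate (sets the unit of time)
beta = 0.4 * kappa     # pulse bandwidth, chosen below kappa / 2
T = 40.0 / kappa       # operation time, long enough that sech(beta * T / 2) << 1

t = np.linspace(0.0, T, 4001)
x = beta * (t - T / 2)

# Target output pulse f(t) = sqrt(beta / 2) * sech[beta * (t - T / 2)].
f = np.sqrt(beta / 2) / np.cosh(x)

# Cumulative trapezoid integral of f^2: the photon probability emitted up to time t.
f2 = f**2
dt = t[1] - t[0]
emitted = np.concatenate(([0.0], np.cumsum(0.5 * (f2[1:] + f2[:-1]) * dt)))

# Compare on a central window, where the long-pulse approximation is accurate.
window = np.abs(x) <= 3.0
sin_theta_num = f[window] / np.sqrt(kappa * (1.0 - emitted[window]))
sin_theta_ana = np.sqrt(beta / kappa) * np.sqrt(1.0 + np.tanh(x[window]))

# Driving pulse r0 * eps(t) = tan(theta), compared with the right-hand side of Eq. (28).
drive_num = sin_theta_num / np.sqrt(1.0 - sin_theta_num**2)
drive_ana = np.sqrt((1.0 + np.tanh(x[window])) / ((kappa / beta - 1.0) - np.tanh(x[window])))

max_err_theta = np.max(np.abs(sin_theta_num - sin_theta_ana))
max_err_drive = np.max(np.abs(drive_num - drive_ana))
print(f"max |sin(theta) error| = {max_err_theta:.2e}")
print(f"max |drive error|      = {max_err_drive:.2e}")
```

The comparison is restricted to |β(t − T/2)| ≤ 3 because the closed forms rely on sech(βT/2) ≪ 1 and lose accuracy near the ends of the interval, where almost all of the photon has already been emitted.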


C.  Entanglement  generation  and  atomic  quantum 
teleportation 

If one has two cavities, each with an atom inside, one can maximally entangle these two atoms 1 and 2 by the following method: The two atoms are initially prepared in the state |g⟩, and then we excite them to the state |s⟩ with a small probability p_0 = 1 − exp[−κ∫_0^T sin²θ(τ)dτ] through an incomplete adiabatic passage. The output pulses from the two cavities, each with a mean photon number p_0, have a definite pulse shape as we have shown before, so that they can interfere with each other at a 50%-50% beam splitter. The outputs of the beam splitter are detected by two single-photon detectors, and if we register a photon from one of the detectors, due to the interference, we do not know from which cavity the registered photon comes. The two atoms 1 and 2 are thus projected to a quantum superposition state (|g⟩_1|s⟩_2 ± |s⟩_1|g⟩_2)/√2, which is maximally entangled. The method described here is just an adiabatic passage version of the scheme in Refs. [11,12]. By transformation from the Raman version to the adiabatic passage version, the output pulse shapes become insensitive to the random position of the atom, as is required for interference, which is important for the scheme to work with hot atoms.

After entanglement has been generated, one can use it for atomic Bell inequality detection, for quantum teleportation of atomic states [12], or even for realization of quantum repeaters [42]. To realize quantum repeaters, one simply needs to replace the atomic ensemble in the scheme of Ref. [42] by the setup of a single atom in a high-Q cavity.

For the above applications, in addition to the entanglement generation, we also need to do some single-bit operations. These single-bit operations should also be performed in a suitable way so that they are insensitive to the random atom position r. One way is to still use adiabatic passages. It is possible to realize any single-bit operation with adiabatic passages [43,44], but for this purpose one needs to use a four-level scheme instead of the Λ configuration. There is actually a simpler way of getting robust single-bit operations based on Raman transitions. Note that for single-bit operations, we do not need to use any cavity mode or cavity effect. We can shine two traveling-wave beams on the atom, coupling to the transitions |g⟩ → |e⟩ and |s⟩ → |e⟩. They are assumed to be collinear and propagating along the x axis, which is perpendicular to the cavity axis z. The two traveling-wave beams are broad, with the beam radius much larger than the typical variation length of the atom's position. With this condition, the two Rabi frequencies for the transitions |g⟩ → |e⟩ and |s⟩ → |e⟩ are given by Ω_1(r) = Ω_10 e^{iω_ge x/c} and Ω_2(r) = Ω_20 e^{iω_se x/c}, respectively, where Ω_10 and Ω_20 are basically independent of the atom position r. Under a large detuning Δ, the effective Raman coupling rate Ω_R ~ Ω_1(r)Ω_2*(r)/Δ ∝ e^{iω_gs x/c} is very insensitive to the random atom position r, since c/ω_gs is typically much larger than the variation length of the position. Therefore, as long as we do not need to use the cavity effect, a Raman scheme with two broad collinearly propagating beams suffices to eliminate the sensitivity to the random atom position.
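To get a feel for how weak the position dependence of the collinear Raman coupling is, the sketch below compares the phase ω_gs δx/c accumulated over a position uncertainty δx with the phase a single optical beam would accumulate over the same distance. The numbers are assumptions chosen for illustration: ω_gs/2π is taken as the cesium ground-state hyperfine splitting (9.192631770 GHz), the optical wavelength as the Cs D2 line near 852 nm, and δx = 15 µm as the radial position uncertainty quoted in Sec. V.

```python
import math

C = 299_792_458.0             # speed of light (m/s)
F_HFS_CS = 9.192631770e9      # Cs ground-state hyperfine splitting (Hz), assumed to set omega_gs
LAMBDA_OPTICAL = 852e-9       # approximate Cs D2 wavelength (m), assumed for comparison
DELTA_X = 15e-6               # position uncertainty along the beams (m), value quoted in Sec. V

omega_gs = 2 * math.pi * F_HFS_CS

# Phase variation of the effective Raman coupling, proportional to exp(i * omega_gs * x / c).
phase_raman = omega_gs * DELTA_X / C

# Phase variation a single optical field would pick up over the same distance.
phase_optical = 2 * math.pi * DELTA_X / LAMBDA_OPTICAL

print(f"c / omega_gs       = {1e3 * C / omega_gs:.2f} mm")
print(f"Raman phase spread = {phase_raman:.2e} rad")
print(f"optical phase      = {phase_optical:.1f} rad")
```

With these assumed values the Raman phase spread is a few milliradians, whereas a single optical beam would accumulate on the order of a hundred radians, which illustrates why only the difference frequency matters for collinear beams.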


D.  Quantum  computation 

In principle, we can also use this setup for quantum computation [13], and eliminate the requirement of the Lamb-Dicke condition by performing all the quantum gates using adiabatic passages [43,45] with appropriate configurations. However, the requirements for universal quantum computation are more challenging than those of the applications mentioned above, and this is somewhat of a long-term goal, so we do not discuss the details of this possibility here.

V.  DISCUSSION  OF  THE  EXPERIMENTAL  SITUATION 

Finally, let us mention the current experimental situation related to this work in the Caltech group. In the Caltech experiment, a single cesium atom is trapped inside the high-finesse cavity with a FORT beam. The atomic states |g⟩, |s⟩, and |e⟩ correspond to the hyperfine levels (6S_{1/2}, F=3, m=+3), (6S_{1/2}, F=4, m=+4), and (6P_{3/2}, F=4, m=+4), respectively. The FORT beam is incident on one of the cavity mirrors and resonant with a longitudinal mode of the cavity. Presently, the FORT wavelength λ_FORT is 936 nm. This wavelength was chosen because with such a beam, the trapping potentials for the ground 6S_{1/2} manifold and the excited 6P_{3/2} manifold are nearly identical. Considering only this reduced manifold of states, we find that the expression for the FORT potential of the ground states |g⟩ and |s⟩ is given [46] by


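\[
U_{\mathrm{FORT}}(\mathbf{r}) = -\frac{\pi c^{2}\gamma_s}{2\omega_0^{3}}\left(\frac{1}{\Delta_1}+\frac{2}{\Delta_2}\right) I(\mathbf{r}). \qquad (29)
\]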


Here, Δ_1 (Δ_2) is the detuning of the FORT light of frequency ω_FORT = 2πc/λ_FORT from the P_{1/2} (P_{3/2}) level, and γ_s/2π ≈ 5.2 MHz is the spontaneous decay rate of the level 6P_{3/2}. The intensity I(r) of the standing-wave mode inside the cavity is given by


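\[
I(\mathbf{r}) = \frac{8P}{\pi w_0^{2}}\,\sin^{2}\!\left(\frac{2\pi z}{\lambda_{\mathrm{FORT}}}\right)\exp\!\left(-2\,\frac{x^{2}+y^{2}}{w_0^{2}}\right), \qquad (30)
\]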


where w_0 ≈ 25 µm is the waist of the Gaussian mode, and P is the power of the FORT beam inside the cavity. The trap frequencies ν_axial and ν_radial in the axial and radial directions follow from these expressions as


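\[
(\nu_{\mathrm{axial}},\ \nu_{\mathrm{radial}}) = \sqrt{\frac{2U_0}{m}}\left(\frac{1}{\lambda_{\mathrm{FORT}}},\ \frac{1}{\sqrt{2}\,\pi w_0}\right), \qquad (31)
\]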


where U_0 is the trap depth (the peak magnitude of U_FORT). The typical power of the FORT beam measured outside the cavity is about 1 mW, and the power P inside the cavity is enhanced by a factor of the cavity finesse, which is about 2200 at the wavelength of the FORT beam. With this number, the typical values for the trap depth and frequencies are given by U_0/h ≈ 38 MHz, ν_axial ≈ 310 kHz, and ν_radial ≈ 4.3 kHz, respectively. The currently achievable temperature T_tem of the trapped atom is a significant fraction of the trap depth U_0 (such as a half). With such a temperature, the spatial extents of the atomic motion in the axial and radial directions are estimated, respectively, by

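\[
\delta z/\lambda_{\mathrm{FORT}} \approx \frac{1}{2\pi}\arcsin\sqrt{k_B T_{\mathrm{tem}}/U_0}, \qquad (32)
\]

\[
\delta r_{\perp} \approx w_0\sqrt{-\tfrac{1}{2}\ln\!\left(1-k_B T_{\mathrm{tem}}/U_0\right)}, \qquad (33)
\]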


which will induce significant variation of the coupling rate g(r) given by Eq. (1). For example, for a temperature of half of the trap depth, the axial uncertainty is 120 nm, while the radial one is 15 µm. These uncertainties cause variations in g of 30% due to the radial motion and 35% due to the axial one. Therefore, with the current experimental technique, it is important to use the method given in this paper to make the application schemes insensitive to the variation of g(r). The time scale for the variation of g(r) is estimated by the inverse of the trap frequencies ν_axial and ν_radial in the axial and radial directions, respectively. The operation time T is typically significantly shorter than 1/ν_radial, but longer than or comparable to 1/ν_axial. So, we can take the static average of g(r) in the radial direction, and the dynamical average of g(r) in the axial direction, as discussed in Ref. [34].
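The quoted position uncertainties can be reproduced from Eqs. (32) and (33). The sketch below uses λ_FORT = 936 nm, w_0 = 25 µm, and k_B T_tem/U_0 = 1/2 from the text. The radial expression assumes a Gaussian intensity profile ∝ exp(−2r²/w_0²), which puts a factor 1/2 under the square root; that factor is an assumption made here because it reproduces the quoted 15 µm. The function names are hypothetical.

```python
import math

LAMBDA_FORT = 936e-9   # FORT wavelength (m), from the text
W0 = 25e-6             # waist of the Gaussian mode (m), from the text


def axial_extent(temperature_ratio):
    """Eq. (32): delta_z = (lambda_FORT / 2 pi) * arcsin(sqrt(k_B T / U_0))."""
    return LAMBDA_FORT / (2 * math.pi) * math.asin(math.sqrt(temperature_ratio))


def radial_extent(temperature_ratio):
    """Eq. (33), assuming U(r) ~ exp(-2 r^2 / w0^2): delta_r = w0 * sqrt(-0.5 * ln(1 - k_B T / U_0))."""
    return W0 * math.sqrt(-0.5 * math.log(1.0 - temperature_ratio))


ratio = 0.5  # k_B * T_tem / U_0: temperature equal to half of the trap depth
dz = axial_extent(ratio)
dr = radial_extent(ratio)

print(f"axial extent  = {1e9 * dz:.0f} nm")
print(f"radial extent = {1e6 * dr:.1f} um")
```

For a temperature of half the trap depth this gives an axial extent of λ_FORT/8 ≈ 117 nm and a radial extent of about 14.7 µm, in line with the rounded values of 120 nm and 15 µm quoted in the text.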

We would also like to note that although the method in this paper shows that many application schemes of the cavity-QED setup can be demonstrated before the achievement of efficient cooling of the trapped atom inside the cavity, cooling is still an important and desirable technology, yet to be achieved, for significantly increasing the trapping time of the atom. In addition, a combination of the cooling technology and the method here could further improve the performance of various application schemes.


VI.  SUMMARY 

In summary, we have shown that the setup with a single trapped atom in a high-Q cavity can be used to realize many cavity-QED and quantum-information processing schemes even if the atom is still hot and not fully localized in space (the Lamb-Dicke condition is not yet satisfied). This could significantly simplify the ongoing experiments, since it means many interesting schemes can be demonstrated with the present technology before the achievement of efficient cooling inside the cavity. Even with further advances in atomic localization in cavity QED, our scheme should lead to a greater robustness against certain experimental nonidealities. The basic idea of this method is to design an appropriate adiabatic passage so that the relevant dynamics only depend on the ratio of two coupling rates. Though each of the coupling rates is sensitive to the unknown or time-varying atom position, their ratio is fixed and controllable, as the two rates depend on the random atom position in the same way with the appropriate interaction configuration that we have described. We confirm the validity of this method by solving the complete model, which describes the realistic setup. The approximate analytical solution and the exact numerical simulations agree with each other. From the numerical simulations, we also calculate quantitatively various noise magnitudes in this setup, and show that one can achieve reasonably good performance with parameter values based on the present technology. Finally, we show that this method can be incorporated into many previous schemes, allowing the demonstration of these application schemes without the requirement of the full localization of
the atom.

Conventional lasers (from table-top systems to microscopic devices) typically operate in the so-called weak-coupling regime, involving large numbers of atoms and photons; individual quanta have a negligible impact on the system dynamics. However, this is no longer the case when the system approaches the regime of strong coupling for which the number of atoms and photons can become quite small. Indeed, the lasing properties of a single atom in a resonant cavity have been extensively investigated theoretically¹⁻¹¹. Here we report the experimental realization of a one-atom laser operated in the regime of strong coupling. We exploit recent advances¹² in cavity quantum electrodynamics that allow one atom to be isolated in an optical cavity in a regime for which one photon is sufficient to saturate the atomic transition. The observed characteristics of the atom-cavity system are qualitatively different from those of the familiar many-atom case. Specifically, our measurements of the intracavity photon number versus pump intensity indicate that there is no threshold for lasing, and we infer that the output flux from the cavity mode exceeds that from atomic fluorescence by more than tenfold. Observations of the second-order intensity correlation function demonstrate that our one-atom laser generates manifestly quantum (nonclassical) light, typified by photon antibunching and sub-poissonian photon statistics.

The usual laser theories rely on system-size expansions in inverse powers of critical atom and photon numbers (N_0, n_0) ≫ 1, and arrive at a consistent form for the laser characteristics¹³⁻¹⁷. By contrast, over the past twenty years, technical advances on various fronts have pushed laser operation to regimes of ever smaller atom and photon number, pressing toward the limit of 'strong coupling' for which (N_0, n_0) ≪ 1 (ref. 18). Significant milestones include the realization of one- and two-photon micromasers¹⁹⁻²¹, as well as microlasers in atomic and condensed matter systems²²⁻²⁴.

As illustrated in Fig. 1, our experiment consists of a single caesium atom trapped in a far-off-resonance trap (FORT) within a high-finesse optical cavity¹²,²⁵. The lasing transition 6P_{3/2}, F′ = 3′ → 6S_{1/2}, F = 4 is nearly resonant with and strongly coupled to a single mode of this cavity. The coupling is parameterized by the Rabi frequency 2g_0 for a single quantum of excitation, and the atom and field have amplitude decay rates γ and κ, respectively. The upper level F′ = 3′ is pumped by the external drive Ω_3, while effective decay of the lower level F = 4 takes place via the combination of the drive Ω_4 and decay γ_34, 4 → 4′ → 3. In essential character this system is analogous to a Raman scheme with pumping 3 → 3′, lasing 3′ → 4, and decay 4 → 3. Of particular relevance to our work are detailed treatments of the ion-trap laser⁶⁻⁹.

We emphasize that a ‘one-and-the-same’ atom laser as illustrated
in Fig. 1 is quite distinct from ‘single-atom’ micromasers (refs 19-21) and
lasers (ref. 22) for which steady state is reached through the incremental
contributions of many atoms that transit the cavity, even if one by
one (refs 19, 20) or few by few (ref. 22). By contrast, in our experiment steady state
is reached with one-and-the-same atom over a time interval
$\delta t \sim 10^{-7}$ s that is much shorter than the trap lifetime
$\Delta t \sim 0.05$ s. Our pumped atom-cavity system provides a continuous
source of nonclassical light as a gaussian beam for the entire
duration that an atom is trapped.


Because conventional lasers operate in the limit $(N_0, n_0) \gg 1$,
there is a generic form associated with the laser threshold in the
transition from nonlasing to lasing action that is independent of the
model system (refs 13, 26). However, as the system size is reduced, the
sharpness of the laser ‘turn on’ is lost, with then no clear consensus
about how to define the lasing threshold (ref. 26). Well into the regime of
strong coupling $(N_0, n_0) \ll 1$, even the familiar qualitative characteristics
of a laser (for example, the statistical properties of the
output light) are profoundly altered, leaving open the question of
how to recognize a laser in this new regime.

To address this question, we have carried out extensive theoretical
analyses for a four-state model based upon Fig. 1b for parameters
relevant to our experiment. A synopsis of relevant results from this
work is given in the Supplementary Information, with the full
treatment presented in ref. 27. In brief, the steady-state solutions
obtained from a semiclassical theory exhibit familiar characteristics
of conventional lasers, including a clearly defined laser threshold
and population inversion. The condition $C_1 \gg 1$ is required to
observe threshold behaviour for one atom pumped inside the
resonator, where for our experiment the cooperativity parameter
$C_1 = 1/N_0 \approx 12$. By contrast, the fully quantum analysis for the
four-state model results in qualitatively different characteristics. In
particular, the input-output relationship for the mean intracavity
photon number $\bar{n}$ versus the pump intensity $I_3 = (\Omega_3/2\gamma)^2$
has several key features to be compared with experimental
results presented below, namely the immediate onset of emission
(‘thresholdless’ behaviour), and the saturation and eventual
quenching of the output.

Our actual experiment is more complex than indicated by the
simple drawing in Fig. 1, with many of the technical aspects
described in more detail in refs 12 and 25. In brief, the principal
cavity QED (cQED) parameters of our system are $g_0/2\pi = 16$ MHz,
$\kappa/2\pi = 4.2$ MHz, and $\gamma/2\pi = 2.6$ MHz, where $g_0$ is based upon the
reduced dipole moment for the $6S_{1/2}, F = 4 \leftrightarrow 6P_{3/2}, F' = 3'$
transition in atomic caesium. Strong coupling is thereby achieved
($g_0 \gg (\kappa, \gamma)$), resulting in critical photon and atom numbers
$n_0 = \gamma^2/(2g_0^2) = 0.013$, $N_0 = 2\kappa\gamma/g_0^2 = 0.084$.
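
As a quick arithmetic check of the quoted cQED figures, the sketch below evaluates the critical photon number, critical atom number and single-atom cooperativity from the three rates given above. The function name and return layout are illustrative choices, not part of the original work; all rates are taken in MHz (divided by 2π), which cancels in the ratios.

```python
def critical_numbers(g0, kappa, gamma):
    """Return (n0, N0, C1) for coupling g0 and amplitude decay rates kappa, gamma.

    All three rates must share the same units (here MHz, i.e. rate / 2*pi).
    """
    n0 = gamma**2 / (2.0 * g0**2)        # critical photon number
    N0 = 2.0 * kappa * gamma / g0**2     # critical atom number
    C1 = 1.0 / N0                        # single-atom cooperativity
    return n0, N0, C1


# Values quoted in the text for the F = 4 <-> F' = 3' transition.
n0, N0, C1 = critical_numbers(g0=16.0, kappa=4.2, gamma=2.6)
print(f"n0 = {n0:.3f}, N0 = {N0:.3f}, C1 = {C1:.1f}")
```

With these inputs the results agree with the quoted n0 = 0.013, N0 = 0.084 and C1 ≈ 12 to within rounding of the input rates.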


Figure 1 A simplified schematic of the experiment. a, A caesium atom (black dot) is
trapped inside a high-finesse optical cavity formed by the curved, reflective surfaces of
mirrors M1 and M2. Light generated by the atom’s interaction with the resonant cavity
mode propagates as a gaussian beam to single-photon detectors D1 and D2. b, The
relevant transitions involve the $6S_{1/2}, F = 3, 4 \leftrightarrow 6P_{3/2}, F' = 3', 4'$ levels of the D2 line
at 852.4 nm in atomic caesium. Strong coupling at rate $g$ is achieved for the lasing
transition $F' = 3' \to F = 4$ near a cavity resonance. Pumping of the upper level
$F' = 3'$ is provided by the field $\Omega_3$, while recycling of the lower level $F = 4$ is achieved by
way of the field $\Omega_4$ ($4 \to 4'$) and spontaneous decay back to $F = 3$. Decay
$(3', 4') \to (3, 4)$ is also included in our model. Relevant cavity parameters are length
$l_0 = 42.2$ μm, waist $w_0 = 23.6$ μm, and finesse $\mathcal{F} = 4.2 \times 10^5$ at $\lambda_{D2} = 852$ nm.

Atoms are trapped in the cavity by means of a FORT (ref. 28) with
wavelength $\lambda_F = 935.6$ nm, which is matched to a TEM00 mode
along the cavity axis. For all experiments herein, the trap depth is
$U_0/k_B = 2.3$ mK (47 MHz) where $k_B$ is the Boltzmann constant.
The FORT has the important feature that the potential for the
atomic centre-of-mass motion is only weakly dependent on the
atom’s internal state (ref. 12).

After the trap-loading stage (as described in the section on
Methods), the transverse $\Omega_{3,4}$ fields are switched to pump and
recycle the atomic population in the fashion depicted in Fig. 1b. Two
examples of the resulting output counts versus time are shown in
Fig. 2. By averaging traces such as these, we arrive at an average
signal level versus time, as shown in the inset to Fig. 2a. Typical
lifetimes for a trapped atom in the presence of the driving $\Omega_{3,4}$ fields
are 50-100 ms, which should be compared to the lifetimes of 2-3 s
recorded in the absence of these fields (ref. 12). Significantly, the
approximately exponential decay of the signal with time does not result
from a time-dependent diminution of the flux from single trapped
atoms, but rather from the average of many events each of a variable
duration. That is, for a given set of external control parameters, each
atom gives a reasonably well-defined output flux over the time that
it is trapped.

For a fixed set of operating conditions, we collect a set of 60-300
traces as in Fig. 2, determine the average output flux for each trace,
and find the mean and variance, as well as the trap lifetime for the
set. Figure 3 displays a collection of such measurements for the
mean intracavity photon number $\bar{n}$ as a function of the dimensionless
pump intensity x, scaled in units of the fixed recycling intensity
(see section on Methods). More precisely, the parameter x is the
ratio of measured intensities, and can be written as $x = (7/9)(I_3/I_4)$,
where $I_{3,4} = (\Omega_{3,4}/2\gamma)^2$. The factor of (7/9) is needed because the


Figure 2 Total counting rate R recorded by detectors D1,2 is displayed as a function of
time for two separate trapped atoms, with the counts summed over 5-ms bins. At t = 0,
the $\Omega_{3,4}$ fields are switched to predetermined values of intensity and detuning. In a, the
atom is trapped for $t \approx 90$ ms before escaping, with the background level due to
scattered light from the fields and detector dark counts evident as the residual output
at later times. In b, the atom (atypically) remains trapped for the entire observation cycle
($\approx 270$ ms) and then is dumped. The inset in a displays R versus time obtained by
averaging 400 such traces. Two cases are shown; in one, the number of atoms delivered
to the cavity mode has been diminished by about twofold. The curves are nearly identical,
so we conclude that cases with N > 1 atom play a negligible role. The overall detection
efficiency $\xi = 0.05$ from intracavity photon to a detection event at D1 or D2 is made up of
the following factors: $\eta = 0.60$ cavity escape efficiency, $T = 0.50$ for only mirror M2
output, $T = 0.33$ propagation efficiency from M2 to D1,2, and $\alpha = 0.5$ detection
quantum efficiency at D1,2.


two transitions have different dipole moments. For these measurements,
we estimate that the incoherent sum of intensities of the four
$\Omega_4$ beams is about 50 mW cm$^{-2}$, which corresponds to $I_4 \approx 13$. The
output count rate at detectors D1,2 is converted to intracavity
photon number using the known propagation and detection
efficiency $\xi = 0.05$.
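
The overall efficiency of 0.05 can be checked against the four factors listed in the caption of Fig. 2. The sketch below simply multiplies them; the dictionary keys are descriptive labels chosen here for readability, not symbols from the original.

```python
from functools import reduce

# Efficiency factors quoted in the Fig. 2 caption (labels are illustrative).
EFFICIENCY_FACTORS = {
    "cavity escape": 0.60,
    "output through mirror M2 only": 0.50,
    "propagation from M2 to detectors": 0.33,
    "detector quantum efficiency": 0.5,
}


def overall_efficiency(factors):
    """Multiply independent efficiency factors into one overall efficiency."""
    return reduce(lambda acc, f: acc * f, factors, 1.0)


xi = overall_efficiency(EFFICIENCY_FACTORS.values())
print(f"overall detection efficiency = {xi:.4f}")
```

The product is about 0.0495, consistent with the quoted value of 0.05.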

Important features of the data shown in Fig. 3 include the prompt
onset of output flux $\kappa\bar{n}$ emerging through the cavity mirrors M1,2 as
the pump intensity $I_3$ is increased from zero. In a regime of strong
coupling, the atom-cavity system behaves as a ‘thresholdless’ device.
With further increases in pump intensity $I_3$, the output flux
saturates at a maximum value $\kappa\bar{n}_{\max}$ around $x \approx 0.1$. We attribute
this behaviour to a bottleneck associated with the recycling of
population $4 \to 4' \to 3$, with the rate-limiting step in the recycling
process being spontaneous decay $4' \to 3$ at rate $\gamma_{34}$ in the limit of
large Rabi frequency $\Omega_4 \gg \gamma$. For a single intracavity atom, quanta
can be deposited into the cavity mode no faster than the maximum
recycling rate. As the pump level $I_3$ is increased beyond $x \sim 1$, the
output flux $\kappa\bar{n}$ gradually drops, presumably owing to splitting of the
pumped excited state $F' = 3'$ by the Autler-Townes effect, although
this is still under investigation. Heating of the atomic motion at
higher pump levels is certainly a concern as well; however, our
simulations, which do not incorporate atomic motion, show the
same trend as in Fig. 3 (ref. 27).

Beyond these considerations, we have also undertaken extensive
theoretical analyses based both upon the four-state model shown in
Fig. 1, as well as on the full set of Zeeman states for each of the levels
$F = 3, 4$ and $F' = 3', 4'$ and two cavity modes, one for each of two
orthogonal polarizations (ref. 27). These analyses are in reasonable accord
with the principal features of the data in Fig. 3. Moreover, our
quantum simulations support the conclusion that the range of
coupling values $g$ that contribute to our results is restricted roughly
to $0.5 g_0 \lesssim g \lesssim g_0$. Furthermore, the simulations yield information
about the atomic populations, from which we deduce that the rate
of emission from the cavity $\kappa\bar{n}$ exceeds that by way of fluorescent
decay $3' \to 4$, $\gamma_{43'}\langle\sigma_{3'3'}\rangle$, by roughly tenfold over the range of
pump intensity $I_3$ shown in Fig. 3, where $\langle\sigma_{3'3'}\rangle$ is the steady-state
population in level $3'$.


Figure 3 The intracavity photon number $\bar{n} \pm \sigma_n$, inferred from measurements as in Fig. 2,
is plotted as a function of dimensionless pump intensity $x = (7/9)(I_3/I_4)$ for fixed $I_4 = 13$
over two ranges of pump level x. a, $\bar{n}$ versus x is shown over the entire range x = 0 to
2.33 recorded in our measurements. b, An expanded scale displays $\bar{n}$ for small x. The
immediate onset of emission supports the conclusion of ‘thresholdless’ lasing. The two
independent sets of measurements (red and blue points) agree reasonably well.

To investigate the quantum-statistical characteristics of the light
emerging in the TEM00 mode of the cavity output, we probe the
photon statistics of the light by way of the two single-photon
detectors D1,2 illustrated in Fig. 1. From the cross-correlation of
the resulting binned photon arrival times and the mean counting
rates of the signals and the background, we construct the normalized
intensity correlation function (see the Supplementary Information)

$$ g^{(2)}(\tau) = \frac{\langle : \hat{I}(t)\,\hat{I}(t+\tau) : \rangle}{\langle : \hat{I}(t) : \rangle^{2}} $$

where the colons denote normal and time ordering for the intensity
operators $\hat{I}$ (ref. 15). Over the duration of the trapping events, we
find no evidence that $\langle : \hat{I}(t) : \rangle$ is a function of t, although we do not
have sufficient data to confirm quantitatively stationarity of the
underlying processes.
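
The construction of a normalized intensity correlation function from the cross-correlation of two detectors' arrival times can be sketched as follows. This is a minimal illustration rather than the authors' analysis: it assumes two sorted lists of time stamps in the same units, ignores background subtraction and edge effects, and normalizes by the coincidence count expected for uncorrelated streams with the same mean rates. The function and argument names are hypothetical.

```python
from bisect import bisect_left


def g2_cross_correlation(t1, t2, bin_width, max_lag, duration):
    """Estimate g2(tau) from sorted detection times t1 (D1) and t2 (D2).

    Delays tau = t2 - t1 in [-max_lag, +max_lag) are histogrammed in bins of
    bin_width; each bin is divided by the coincidences expected for two
    uncorrelated streams, R1 * R2 * bin_width * duration.
    """
    n_bins = int(round(2 * max_lag / bin_width))
    counts = [0] * n_bins
    for t in t1:
        lo = bisect_left(t2, t - max_lag)   # first D2 click inside the window
        hi = bisect_left(t2, t + max_lag)   # first D2 click past the window
        for s in t2[lo:hi]:
            k = int((s - t + max_lag) // bin_width)
            if 0 <= k < n_bins:             # guard against float rounding
                counts[k] += 1
    rate1 = len(t1) / duration
    rate2 = len(t2) / duration
    expected = rate1 * rate2 * bin_width * duration
    taus = [-max_lag + (k + 0.5) * bin_width for k in range(n_bins)]
    return taus, [c / expected for c in counts]


# Toy data: every D1 click is followed one time unit later by a D2 click.
taus, g2 = g2_cross_correlation([0.0, 10.0], [1.0, 11.0],
                                bin_width=2.0, max_lag=4.0, duration=20.0)
```

In the toy data all coincidences fall in the bin centred at τ = 1, so that bin rises well above unity while the others are empty; for antibunched light the bin nearest τ = 0 would instead fall below the neighbouring bins.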

Examples of two measurements for $g^{(2)}(\tau)$ are given in Fig. 4. In
Fig. 4a and b, we again have $I_4 \approx 13$ and the pump intensity $I_3$ is set
for operation with $x \approx 0.83$ well beyond the ‘knee’ in $\bar{n}$ versus x,
while in Fig. 4c and d, the pump level is decreased to $x \approx 0.17$ near
the peak in $\bar{n}$. Significantly, in each case these measurements
demonstrate that the light from the atom-cavity system is manifestly
quantum (that is, nonclassical) and exhibits photon antibunching
$g^{(2)}(0) < g^{(2)}(\tau)$ and sub-poissonian photon statistics
$g^{(2)}(0) < 1$ (ref. 15). The actual coincidence data $n(\tau)$ used to
obtain $g^{(2)}(\tau)$ are presented in the Supplementary Information.
Significantly, these data directly provide evidence of the nonclassical
character of the emitted light, with relatively minor corrections for
background light required for the determination of $g^{(2)}(\tau)$.

Beyond the nonclassical features around $\tau \approx 0$, $g^{(2)}(\tau)$ also
exhibits excess fluctuations extending over $\tau \approx \pm 1$ μs, with
$g^{(2)}_{\max}(\tau) \approx 1.7$. Fluctuations in the intensity of the intracavity light
over these timescales are presumably related to the stochastic
character of the pumping $3 \to 3'$ and recycling $4 \to 4' \to 3$ processes
for a single, multi-state atom. Also of significance is the
interplay of atomic motion and optical pumping into dark states by
the $\Omega_{3,4}$ fields (which is responsible for cooling; ref. 29 and
references therein), as well as Larmor precession that arises from
residual ellipticity in polarization of the intracavity FORT (refs 12, 30).
Indeed, in Fig. 4a, c there is a hint of an oscillatory variation in
$g^{(2)}(\tau)$ with period $\tau \approx \pm 2$ μs. Fourier transformation of the
associated  coincidence  data  leads  to  a  small  peak  at  about 
500  kHz,  which  is  near  to  the  predicted  frequency  for  axial  motion 


Figure 4 The intensity correlation function $g^{(2)}(\tau)$ is given for two values of the pump
intensity in a-d, where b and d show the central features of a and c over a smaller range
of $\tau$. Panels a and b are for pump intensity parameter x = 0.83, whereas for c and d,
x = 0.17. Note that the light exhibits sub-poissonian photon statistics and antibunching.
All traces have been ‘smoothed’ by convolution with a gaussian function of width
$\sigma = 5$ ns.


of a trapped caesium atom at the bottom of the FORT potential, as
well as to the Larmor frequency inferred from other measurements.
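
The caption of Fig. 4 notes that the traces were smoothed by convolution with a gaussian of width 5 ns. A minimal sketch of such a smoothing step is given below. It rests on several assumptions that the text does not specify: the data are uniformly binned, the quoted width is read as the standard deviation, the kernel is truncated at four standard deviations, and the weights are renormalized near the ends of the trace. The function name is hypothetical.

```python
import math


def gaussian_smooth(values, bin_width, sigma):
    """Smooth uniformly binned data by discrete convolution with a gaussian.

    bin_width and sigma must share units (for example ns). Near the edges
    only the part of the kernel overlapping the data is used, and the
    weights are renormalized so that a constant trace stays constant.
    """
    half = int(math.ceil(4.0 * sigma / bin_width))   # truncate at 4 sigma
    offsets = range(-half, half + 1)
    kernel = [math.exp(-0.5 * (k * bin_width / sigma) ** 2) for k in offsets]
    n = len(values)
    smoothed = []
    for i in range(n):
        acc = 0.0
        norm = 0.0
        for k, w in zip(offsets, kernel):
            j = i + k
            if 0 <= j < n:
                acc += w * values[j]
                norm += w
        smoothed.append(acc / norm)
    return smoothed


# A flat trace with a single empty bin, smoothed with sigma = 5 bins.
trace = [1.0] * 10 + [0.0] + [1.0] * 10
smooth_trace = gaussian_smooth(trace, bin_width=1.0, sigma=5.0)
```

Smoothing of this kind broadens and fills in a narrow dip, so the minimum of a smoothed trace is an upper bound on how deep a sharp antibunching feature really is.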

In agreement with the trend predicted by the four-state model
discussed in the Supplementary Information, $g^{(2)}(0)$ increases with
increasing pump intensity, with a concomitant decrease in these
nonclassical effects. Moreover, our experimental observations of
$g^{(2)}(\tau)$ are described reasonably well by the results obtained from
more detailed quantum simulations based upon the entire manifold
of Zeeman states for the caesium atom, two cavity modes with
orthogonal polarizations, and a simple model to describe the
polarization gradients of the $\Omega_{3,4}$ fields (ref. 27).

The realization of this strongly coupled one-atom laser is significant
on several fronts. From the perspective of the dynamics of open
quantum systems, our system demonstrates the radical departures
from conventional laser operation wrought by strong coupling for
the quantized light-matter interaction. On a more practical level,
throughout the interval when an atom is trapped (which is determined
in real time), our system provides an approximately stationary
source of nonclassical light in a collimated, gaussian beam, as
has been anticipated in the literature on one-atom lasers (refs 1, 3-6, 8-11), and
which has diverse applications. Some remaining technical issues in
our work are to improve the modelling and measurements related to
atomic motion, both within the FORT potential and through the
polarization gradients of the $\Omega_{3,4}$ fields. We have employed our
quantum simulations to calculate the optical spectrum of the light
output, and have devised a scheme for its measurement.

Methods 

While the atom is trapped in a standing-wave FORT along the cavity axis, another set of
fields (designated by $\Omega_{3,4}$ in Fig. 1) propagate in the plane transverse to the cavity axis and
illuminate the region between the cavity mirrors. These fields are used not only for the
pumping scheme described in association with the operation of the one-atom laser with
strong coupling, but also for cooling in the trap-loading phase. Each $\Omega_{3,4}$ field consists of
two orthogonal pairs of counter-propagating beams in a $\sigma^{+}-\sigma^{-}$ configuration.

Unfortunately, it is difficult to calibrate accurately the intensities $I_{3,4}$ for the $\Omega_{3,4}$ beams
at the location of the atom in the region between the cavity mirrors. We estimate that our
knowledge of either intensity is uncertain by an overall scale factor of about 2. However, we
do know the ratio of intensities much more accurately than either intensity individually,
and therefore plot the data in Fig. 3 as a function of this ratio.

In the pumping stage of the experiment, the fields are tuned 10 MHz blue of
$F = 3 \to F' = 3'$ in the case of the $\Omega_3$ beams and 17 MHz blue of $F = 4 \to F' = 4'$ in
the case of the $\Omega_4$ fields. The detuning between the $3' \to 4$ transition at $\omega_{43}$ and the cavity
resonance $\omega_C$ is $\Delta_{CA} = \omega_C - \omega_{43} = 2\pi \times 9$ MHz. These detunings are chosen
operationally in a trade-off between achieving a large cavity output flux from the $3' \to 4$
transition while maintaining a reasonable lifetime for the trapped atom despite heating
from the various fields (ref. 29). The cavity length itself is actively stabilized with an auxiliary laser
at wavelength $\lambda_C = 835.8$ nm that does not interfere with the trapping or the cQED
interactions.

Our experimental protocol begins with the formation of a magneto-optical trap
(MOT) above the cavity. After a stage of sub-Doppler cooling, the cloud of atoms is
released. The $\Omega_{3,4}$ beams are then used as cooling beams (with independent settings of
intensity and detuning) to load an atom into the FORT (ref. 12). About ten atoms transit the
cavity mode after each MOT drop, and the loading efficiency is set such that an atom is
loaded into the FORT once every 3-10 drops. We then switch the intensities and detunings
of the transverse fields $\Omega_{3,4}$ to the pumping configuration and record the cavity output by
way of the single-photon detectors D1,2 shown in Fig. 1. Each photoelectric pulse from D1,2
is stamped with its time of detection (1-ns resolution) and then stored for later analysis,
with examples of the record of output counts versus time displayed in Fig. 2.

large number of electrons. In materials with strong electron
correlations  a  quantum  phase  transition  at  zero  temperature  can 
occur,  and  a  quantum  critical  state  has  been  predicted1,2,  which 
manifests  itself  through  universal  power-law  behaviours  of 
the  response  functions.  Candidates  have  been  found  both  in 
heavy-fermion  systems3  and  in  the  high-transition  temperature 
(high-Tc)  copper  oxide  superconductors4,  but  the  reality  and  the 
physical  nature  of  such  a  phase  transition  are  still  debated5-7. 
Here  we  report  a  universal  behaviour  that  is  characteristic  of  the 
quantum  critical  region.  We  demonstrate  that  the  experimentally 
measured  phase  angle  agrees  precisely  with  the  exponent  of  the 
optical  conductivity.  This  points  towards  a  quantum  phase 
transition  of  an  unconventional  kind  in  the  high-Tc 
superconductors. 

In  the  quantum  theory  of  collective  fields  one  anticipates  order  at 
small  coupling  constant,  and  for  increasing  coupling  one  expects  at 
some  point  a  phase  transition  to  a  quantum-disordered  state. 
Quantum  criticality  in  the  copper  oxides,  if  it  exists,  occurs  as  a 
function  of  charge  carrier  doping  x,  at  a  particular  doping  level  xc 
close  to  where  the  superconducting  phase  transition  temperature 
reaches its maximum value. When this phase transition is continuous,
a critical state is realized right at the transition, which is
characterized by scale invariance resulting in the above-mentioned
power-law response up to some (non-universal) high-energy cut-off
$\Omega$.

The optical conductivity, $\sigma(\omega) = \sigma_1(\omega) + i\sigma_2(\omega)$, is the absorptive
($\sigma_1$) and reactive ($\sigma_2$) current response to a time-varying external
electrical field of frequency $\omega$, and is usually expressed as the
correlation function of the currents $j(t_1)$ and $j(t_2)$ at times $t_1$
and $t_2$, which is $\chi_{jj}(t_1, t_2) = \langle j(t_1), j(t_2) \rangle$, by the Kubo formula. In
Fig. 1 we present the experimental optical conductivity function
$\sigma_1(\omega)$ of an optimally doped Bi$_2$Sr$_2$Ca$_{0.92}$Y$_{0.08}$Cu$_2$O$_{8+\delta}$ single crystal


Competing  interests  statement  The  authors  declare  that  they  have  no  competing  financial 
interests. 

Correspondence  and  requests  for  materials  should  be  addressed  to  H.J.K  (hjkimble@caltech.edu). 


Quantum critical behaviour in
a high-Tc superconductor

D. van der Marel1*, H. J. A. Molegraaf, J. Zaanen, Z. Nussinov,
F. Carbone, A. Damascelli, H. Eisaki, M. Greven3, P. H. Kes2 & M. Li2

1 Materials Science Centre, University of Groningen, 9747 AG Groningen,
The Netherlands

2 Leiden Institute of Physics, Leiden University, 2300 RA Leiden, The Netherlands

3 Department of Applied Physics and Stanford Synchrotron Radiation Laboratory,
Stanford University, California 94305, USA

* Present addresses: Département de Physique de la Matière Condensée, Université de Genève, CH-1211
Genève 4, Switzerland (D.v.d.M., H.J.A.M., F.C.); Los Alamos National Laboratories, Los Alamos, New
Mexico 87545, USA (Z.N.); Department of Physics and Astronomy, University of British Columbia,
Vancouver, British Columbia, V6T 1Z1, Canada (A.D.); Low-Temperature Physics Group, National
Institute of Advanced Industrial Science and Technology, Umezono, Tsukuba, 305-8568, Japan (H.E.)


Quantum  criticality  is  associated  with  a  system  composed  of 
a  nearly  infinite  number  of  interacting  quantum  degrees  of 
freedom  at  zero  temperature,  and  it  implies  that  the  system 
looks  on  average  the  same  regardless  of  the  time-  and  length  scale 
on  which  it  is  observed.  Electrons  on  the  atomic  scale  do  not 
exhibit  such  symmetry,  which  can  only  be  generated  as  a 
collective  phenomenon  through  the  interactions  between  a 

Figure 1 Optical properties along the copper-oxygen planes of Bi$_2$Sr$_2$Ca$_{0.92}$Y$_{0.08}$Cu$_2$O$_{8+\delta}$
for a selected number of temperatures. a, Optical conductivity and b, the frequency
dependent scattering rate defined as $1/\tau(\omega) = \mathrm{Re}\{\omega_p^2/4\pi\sigma(\omega)\}$ (see Methods). The
relatively high transition temperature ($T_c = 96$ K) of this crystal compared to previous
reports on Bi-2212 is caused by the partial substitution of yttrium on the calcium sites.
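
The frequency-dependent scattering rate in the caption can be illustrated numerically. The sketch below assumes the definition reads 1/τ(ω) = Re{ω_p² / 4πσ(ω)} in Gaussian units, with ω_p the plasma frequency (the subscript is an assumption, since the symbol is damaged in the source). The helper `drude_sigma` is a hypothetical test input, not data from the paper: for a simple Drude conductivity the extracted rate should be constant and equal to the damping rate.

```python
import math


def scattering_rate(omega_p, sigma):
    """Return 1/tau = Re{ omega_p^2 / (4 pi sigma) } for complex sigma (Gaussian units)."""
    return (omega_p**2 / (4.0 * math.pi * sigma)).real


def drude_sigma(omega, omega_p, gamma):
    """Drude conductivity sigma = omega_p^2 / (4 pi (gamma - i omega)); test input only."""
    return omega_p**2 / (4.0 * math.pi * (gamma - 1j * omega))


# For a Drude response the extracted rate is frequency independent.
omega_p, gamma = 2.0, 0.1   # arbitrary illustrative units
rates = [scattering_rate(omega_p, drude_sigma(w, omega_p, gamma))
         for w in (0.0, 0.5, 1.0, 5.0)]
```

Any frequency dependence of the rate extracted from measured conductivity therefore signals a departure from simple Drude behaviour.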

State-Insensitive Cooling and Trapping of Single Atoms in an Optical Cavity

Single cesium atoms are cooled and trapped inside a small optical cavity by way of a novel
far-off-resonance dipole-force trap, with observed lifetimes of 2-3 s. Trapped atoms are observed continuously
via transmission of a strongly coupled probe beam, with individual events lasting $\simeq 1$ s. The loss of
successive atoms from the trap $N \geq 3 \to 2 \to 1 \to 0$ is thereby monitored in real time. Trapping,
cooling, and interactions with strong coupling are enabled by the trap potential, for which the
center-of-mass motion is only weakly dependent on the atom’s internal state.

DOI: 10.1103/PhysRevLett.90.133602  PACS numbers: 42.50.Vk, 03.67.-a, 32.80.Pj


A long-standing ambition in the field of cavity quantum
electrodynamics (QED) has been to trap single atoms
inside high-Q cavities in a regime of strong coupling [1].
Diverse avenues have been pursued for creating the trapping
potential for atom confinement, including additional
far off-resonant trapping beams [2], near-resonant light
with $\bar{n} \simeq 1$ intracavity photons [3,4], and single trapped
ions in high-finesse optical cavities [5,6], although strong
coupling has yet to be achieved for trapped ions. A
critical aspect of this research is the development of
techniques for atom localization that are compatible
with strong coupling, as required for quantum computation
and communication [7-12].

In this Letter we present experiments to enable quantum
information processing in cavity QED by (1) achieving
extended trapping times for single atoms in a cavity
while still maintaining strong coupling, (2) realizing a
trapping potential for the center-of-mass motion that is
largely independent of the internal atomic state, and (3)
demonstrating a scheme that allows continuous observation
of trapped atoms by way of the atom-field coupling.
More specifically, we have recorded trapping times up to
3 s for single Cs atoms stored in an intracavity far-off
resonance trap (FORT) [13], which represents an improvement
by a factor of $10^2$ beyond the first realization
of trapping in cavity QED [2], and by roughly $10^4$ beyond
prior results for atomic trapping [3] and localization [4]
with $\bar{n} \simeq 1$ photon. We have also continuously monitored
trapped atoms by way of strong coupling to a probe beam,
including observations of trap loss atom by atom over
intervals $\simeq 1$ s. These measurements incorporate auxiliary
cooling beams, and provide the first realization of
cooling for trapped atoms strongly coupled to a cavity.
Our protocols are facilitated by the choice of a “magic”
wavelength for the FORT [14-16], for which the relevant
atomic levels are shifted almost equally, thereby providing
significant advantages for coherent state manipulation
of the atom-cavity system.

A major obstacle to the integration of a conventional
red-detuned FORT within the setting of cavity QED is
that excited electronic states generally experience a positive
ac-Stark shift of comparable magnitude to the negative
(trapping) shift of the ground state [13]. This
effectively introduces a spatially dependent detuning between
the atom and cavity resonances, an unfortunate
additional complication [16]. However, due to the specific
multilevel structure of cesium, the wavelength $\lambda_F$ of the
trapping laser can be tuned to a region where both of these
problems are eliminated for the $6S_{1/2} \to 6P_{3/2}$ transition,
as illustrated in Fig. 1 [14-16]. Around the magic wavelength
$\lambda_F = 935$ nm, the sum of ac-Stark shifts coming
from different allowed optical transitions results in the
ground $6S_{1/2}$ and excited $6P_{3/2}$ states both being shifted
downwards by comparable amounts, $\delta_{6S_{1/2}} \simeq \delta_{6P_{3/2}}$, albeit
with small dependence on $(F', m_{F'})$ for the shifts $\delta_{6P_{3/2}}$.

The task then is to achieve state-independent trapping
while still maintaining strong coupling for the $6S_{1/2} \to 6P_{3/2}$
transition. Our experimental setup to achieve this
end is schematically depicted in Fig. 2 [2]. Significantly,
the cavity has a TEM00 longitudinal mode located nine


FIG. 1 (color online). ac-Stark shifts $(\hat{\delta}_{6S_{1/2}}, \hat{\delta}_{6P_{3/2}})$ for the
$(6S_{1/2}, 6P_{3/2})$ levels in Cs for a linearly polarized FORT. The
inset shows $(\hat{\delta}_{6S_{1/2}}, \hat{\delta}_{6P_{3/2}, F'=4})$ as functions of wavelength $\lambda_F$.
The full plot gives $\hat{\delta}_{6P_{3/2}}$ versus $m_{F'}$ for each of the levels $6P_{3/2}$,
$F' = 2, 3, 4, 5$ for $\lambda_F = 935.6$ nm. In each case, the normalization
is $\hat{\delta} = \delta/[\delta_{6S_{1/2}}(\lambda_F = 935.6\ \mathrm{nm})]$.

FIG. 2 (color online). Schematic of experiment for trapping
single atoms in an optical cavity in a regime of strong coupling.
Relevant cavity parameters are length $l = 43.0$ μm, waist
$w_0 = 23.9$ μm, and finesse $\mathcal{F} = 4.2 \times 10^5$ at 852 nm. The inset
illustrates transverse beams used for cooling and repumping.


mode orders below the mode employed for cavity QED at
852 nm, at the wavelength $\lambda_F = 935.6$ nm, allowing the
implementation of a FORT with $\delta_{6S_{1/2}} \simeq \delta_{6P_{3/2}}$. The field
to excite this cavity mode is provided by a laser at $\lambda_F$,
which is independently locked to the cavity. The finesse of
the cavity at $\lambda_F$ is $\mathcal{F} \simeq 2200$ [11], so that a mode-matched
input power of 1.2 mW gives a peak ac-Stark
shift $\delta_{6S_{1/2}}/2\pi = -47$ MHz for all states in the $6S_{1/2}$
ground manifold, corresponding to a trap depth $U_0/k_B = 2.3$ mK,
which was used for all experiments.
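
The correspondence between the 47 MHz ac-Stark shift and the 2.3 mK trap depth follows from U0 = h × 47 MHz and T = U0 / kB. The short check below uses the exact SI values of the Planck and Boltzmann constants; the function name is an illustrative choice.

```python
PLANCK_H = 6.62607015e-34      # Planck constant, J s (exact SI value)
BOLTZMANN_KB = 1.380649e-23    # Boltzmann constant, J / K (exact SI value)


def frequency_to_millikelvin(frequency_hz):
    """Convert an energy expressed as a frequency (Hz) to a temperature in mK."""
    return PLANCK_H * frequency_hz / BOLTZMANN_KB * 1e3


depth_mK = frequency_to_millikelvin(47e6)
print(f"trap depth = {depth_mK:.2f} mK")
```

The result is about 2.26 mK, which rounds to the quoted 2.3 mK.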

Principal parameters relevant to cavity QED with the
system in Fig. 2 are the Rabi frequency $2g_0$ for a single
quantum of excitation and the amplitude decay rates
$(\kappa, \gamma)$ due to cavity losses and atomic spontaneous emission.
For our system, $g_0/2\pi = 24$ MHz, $\kappa/2\pi =
4.2$ MHz, and $\gamma/2\pi = 2.6$ MHz, where $g_0$ is for the
$(6S_{1/2}, F = 4, m_F = 4) \rightarrow (6P_{3/2}, F' = 5, m'_F = 4)$ transition
in atomic Cs at $\lambda_0 = 852.4$ nm. Strong coupling
is thereby achieved [$g_0 \gg (\kappa, \gamma)$], resulting in critical
photon and atom numbers $n_0 \equiv \gamma^2/(2g_0^2) \approx 0.006$, $N_0 \equiv
2\kappa\gamma/g_0^2 \approx 0.04$. The small transition shifts for our FORT
mean that $g_0$ is considerably larger than the spatially
dependent shift $\delta_0$ of the bare atomic frequency employed
for cavity QED, $g_0 \gg \delta_0 \equiv |\delta_{6P_{3/2}} - \delta_{6S_{1/2}}|$,
whereas in a conventional FORT, $\delta_0 \sim 2|\delta_{6S_{1/2}}| \gg g_0$.
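
As a quick consistency check on the critical photon and atom numbers, the following sketch evaluates the two expressions above with the quoted rates. The function name is hypothetical; all rates are entered as frequencies in MHz, which is sufficient because only ratios appear.

```python
def critical_numbers(g0, kappa, gamma):
    """Return (n0, N0) = (gamma^2 / (2 g0^2), 2 kappa gamma / g0^2).

    All three rates must share the same units; only their ratios matter.
    """
    n0 = gamma**2 / (2 * g0**2)
    N0 = 2 * kappa * gamma / g0**2
    return n0, N0

# Values quoted in the text, in MHz (divided by 2*pi): g0 = 24, kappa = 4.2, gamma = 2.6
n0, N0 = critical_numbers(24.0, 4.2, 2.6)
print(f"n0 = {n0:.4f}, N0 = {N0:.3f}")
```

Both values are far below unity, which is the quantitative content of the strong-coupling condition.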

In  addition  to  the  FORT  field,  the  input  to  the  cavity 
consists  of  probe  and  locking  beams,  all  of  which  are 
directed  to  separate  detectors  at  the  output.  The  trans¬ 
mitted  probe  beam  is  monitored  using  heterodyne  detec¬ 
tion,  allowing  real-time  detection  of  individual  cold 
atoms  within  the  cavity  mode  [18].  The  cavity  length  is 
actively controlled using a cavity resonance at $\lambda_C =
835.8$ nm, so the length is stabilized and tunable independently
of all other intracavity fields [2]. The probe as
well as the FORT beam are linearly polarized along a
direction $\hat{l}_+$ orthogonal to the $x$ axis of the cavity [17,19].

Cold atoms are collected in a magneto-optical trap
(MOT) roughly 5 mm above the cavity mirrors and then
released after a stage of sub-Doppler polarization-gradient
cooling [13]. Freely falling atoms arrive at the
cavity mode over an interval of about 10 ms, with kinetic
energy $E_K/k_B \approx 0.8$ mK, velocity $v \approx 0.30$ m/s, and
transit time $\Delta t = 2w_0/v \approx 150\ \mu$s. Two additional orthogonal
pairs of counterpropagating beams in a $\sigma^+ - \sigma^-$
configuration illuminate the region between the cavity
mirrors along directions at $\pm 45°$ relative to $\hat{y}, \hat{z}$ (the
"$y - z$ beams") and contain cooling light tuned red of
$F = 4 \rightarrow F' = 5$ and repumping light near the $F = 3 \rightarrow
F' = 3$ transition [20]. These beams eliminate the free-fall
velocity to capture atoms in the FORT and provide for
subsequent cooling of trapped atoms.
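
The quoted arrival velocity, kinetic energy, and transit time follow from simple free-fall kinematics. The sketch below assumes the atoms start essentially at rest exactly 5 mm above the mode (the text says "roughly 5 mm") and uses the standard atomic mass of Cs-133; both the assumption and the names are ours.

```python
import math

G_ACCEL = 9.81                           # m/s^2, standard gravity (rounded)
CS_MASS = 132.905 * 1.66053907e-27       # kg, mass of one Cs-133 atom
BOLTZMANN_K = 1.380649e-23               # J/K

def free_fall_speed(height_m):
    """Speed after falling height_m from rest, neglecting any initial velocity."""
    return math.sqrt(2 * G_ACCEL * height_m)

v = free_fall_speed(5e-3)                            # drop height from the text
transit_us = 2 * 23.9e-6 / v * 1e6                   # 2 w0 / v, with w0 = 23.9 um
ek_mK = 0.5 * CS_MASS * v**2 / BOLTZMANN_K * 1e3     # kinetic energy in mK

print(f"v = {v:.2f} m/s, transit = {transit_us:.0f} us, E_K/k_B = {ek_mK:.2f} mK")
```

Under these assumptions the numbers agree with the approximate values quoted in the text to within the stated precision.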

We  employed  two  distinct  protocols  to  study  the  life¬ 
time  for  single  trapped  atoms  in  our  FORT:  (1)  Trapping 
in the dark with the atom illuminated only by the FORT
laser at $\lambda_F$ and the cavity-locking laser at $\lambda_C$. For this
protocol,  strong  coupling  enables  real-time  monitoring  of 
single  atoms  within  the  cavity  for  initial  triggering  of 
cooling  light  and  for  final  detection.  (2)  Trapping  with 
continuous  observation  of  single  atoms  with  cavity  probe 
and  cooling  light  during  the  trapping  interval.  In  this 
case,  atoms  in  the  cavity  mode  are  monitored  by  way  of 
the  cavity  probe  beam,  with  cooling  provided  by  the 
auxiliary  y  —  z  beams. 

(1) In our first protocol, the $F = 4 \rightarrow F' = 5$ transition
is strongly coupled to the cavity field, with zero detuning
of the cavity from the bare atomic resonance, $\Delta_C \equiv
\omega_C - \omega_{4\rightarrow 5} = 0$. In contrast to Ref. [2], here the FORT
is ON continuously without switching, which makes a
cooling mechanism necessary to load atoms into the trap.
The initial detection of a single atom falling into the
cavity mode is performed with the probe beam tuned to
the lower sideband of the vacuum-Rabi spectrum ($\Delta_p =
\omega_p - \omega_{4\rightarrow 5} = -2\pi \times 20$ MHz). The resulting increase
in transmitted probe power when an atom approaches a
region of optimal coupling [21,22] triggers ON a pulse of
transverse cooling light from the $y - z$ beams, detuned
41 MHz red of $\omega_{4\rightarrow 5}$. During the subsequent trapping
interval, all near-resonant fields are turned OFF (including
the transverse cooling light). After a variable delay $t_T$,
the probe field is switched back ON to detect whether the
atom is still trapped, now with $\Delta_p = 0$.

Data collected in this manner are shown in Fig. 3(a),
which displays the conditional probability $P$ to detect an
atom given an initial single-atom triggering event versus
the time delay $t_T$. The two data sets shown in Fig. 3(a)
yield comparable lifetimes, the upper acquired with mean
intracavity atom number $\bar{N} \approx 0.30$ atoms and the lower
with $\bar{N} \approx 0.019$ [23]. The offset in $P$ between these two
curves arises primarily from a reduction in duration $\delta t$ of
the cooling pulses, from 100 to 5 ms, which results in a
reduced capture probability. Measurements with constant
$\delta t$ but with $\bar{N}$ varied by adjusting the MOT parameters
allow us to investigate the probability of trapping an atom
other than the "trigger" atom and of capturing more than
one atom. For example, with $\delta t = 5$ ms as in the lower set,
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FIG.  3  (color  online),  (a)  Detection  probability  P  as  a  func¬ 
tion  of  trapping  time  tT.  The  upper  data  set  is  for  mean  intra¬ 
cavity  atom  number  N  ~  0.30,  while  the  lower  set  is  for 
$\bar{N} \approx 0.019$ atoms. Exponential fits (solid lines) yield lifetimes
$\tau_{\rm upper} = (2.4 \pm 0.2)$ s and $\tau_{\rm lower} = (2.0 \pm 0.3)$ s. (b) The fractional
population $f_4(t_D)$ in $F = 4$ following depletion of this
level at $t_D = 0$. An exponential fit (solid line) gives $\tau_R =
(0.11 \pm 0.02)$ s.


we have varied $0.011 \lesssim \bar{N} \lesssim 0.20$ with no observable
change in either $P_T$ or the trap lifetime $\tau$. Since a conservative
upper bound on the relative probability of trapping
a second atom is just $\bar{N}/2$ (when $\bar{N} \ll 1$), these data
strongly support the conclusion that our measurements
are for single trapped atoms. We routinely observe lifetimes
2 s $< \tau <$ 3 s depending upon the parameters
chosen for trap loading and cooling.
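
Two numbers in this discussion are easy to evaluate directly: the fraction of atoms surviving a given delay under a single-exponential decay, and the $\bar{N}/2$ bound on capturing a second atom. The sketch below is a minimal illustration; the single-exponential form matches the fits described in the Fig. 3 caption, while the function names are hypothetical.

```python
import math

def survival_fraction(t_s, lifetime_s):
    """Fraction of trapped atoms remaining after t_s for exponential decay."""
    return math.exp(-t_s / lifetime_s)

def second_atom_bound(mean_atom_number):
    """Upper bound N/2 on the relative probability of trapping a second atom (N << 1)."""
    return mean_atom_number / 2

for n_bar in (0.011, 0.019, 0.20):
    print(f"N = {n_bar}: second-atom bound = {second_atom_bound(n_bar):.4f}")
print(f"survival after 1 s with tau = 2.4 s: {survival_fraction(1.0, 2.4):.2f}")
```

Even at the largest mean atom number in the scan, the bound stays at the 10% level, and it is below 1% for the lower data set.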

Figure 3(b) explores scattering processes within the
FORT that transfer population between the $6S_{1/2}$, $F =
(3, 4)$ ground-state hyperfine levels. For these measurements,
the $F = 4$ level is initially depleted, and then the
population in $F = 4$ as well as the total $3 + 4$ population
are monitored as functions of time $t_D$ to yield the fractional
population $f_4(t_D)$ in $F = 4$. The measured time
$\tau_R = (0.11 \pm 0.02)$ s for reequilibration of populations
between $F = (3, 4)$ agrees with a numerical simulation
based upon scattering rates in our FORT, which predicts
$\tau_R = 0.10$ s for atoms trapped at the peak FORT intensity
in an initially unpolarized state in the $F = 3$ level.

Turning next to the question of the mechanisms that
limit our FORT lifetime, we recall that parametric heating
can be caused by intensity fluctuations of the trapping
field [2,24]. From measurements of intensity fluctuations
for our FORT around twice the relevant harmonic frequencies
$(\nu_{\rm axial} = 570, \nu_{\rm radial} = 4.8)$ kHz, we estimate a
lower bound to the FORT lifetime of $\tau_{\rm axial} > 1.6$ s [25].
Although this estimate suggests that parametric heating
could be a limiting factor in Fig. 3, we were later able to
reduce the noise, giving a new lower bound $\tau_{\rm axial} > 9$ s.
Unfortunately, the measured FORT lifetime increased
only to $\tau = (3.1 \pm 0.4)$ s, indicating that other mechanisms
are partially responsible for the observed decay.

A second suspect is a heating process described by
Corwin et al. [26] associated with inelastic Raman scattering
in an elliptically polarized FORT field [19]. We
calculate rates $\Gamma_s$ for spontaneous Raman scattering in
our FORT to be 2.5 to 7 s$^{-1}$ for transitions that change the
hyperfine quantum number $F$, and between 0.8 and
2.5 s$^{-1}$ when only $m_F$ changes [27]. Based on Eq. 3 in
Ref. [26] (a two-state model), we estimate an upper limit
to the heating rate from this mechanism, $\Gamma_{IR} \lesssim 0.2\,\Gamma_s$,
giving heating times as short as 0.7 s for the fastest
calculated scattering rate. However, we have also undertaken
a full multilevel simulation of the optical pumping
processes, which indicates much slower heating, $\Gamma_{IR} \sim
0.02$ s$^{-1}$. We are working to resolve this discrepancy.
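
The "0.7 s" figure follows directly from the two-state upper limit $\Gamma_{IR} \lesssim 0.2\,\Gamma_s$ applied to the fastest quoted scattering rate. A minimal sketch of that arithmetic, with a hypothetical function name and the factor 0.2 taken from the text:

```python
def heating_times(scatter_rates_per_s, factor=0.2):
    """Shortest heating time 1 / (factor * Gamma_s) for each scattering rate.

    factor = 0.2 is the two-state-model upper limit quoted in the text.
    """
    return [1.0 / (factor * rate) for rate in scatter_rates_per_s]

# F-changing Raman scattering rates quoted in the text: 2.5 to 7 per second
slow_s, fast_s = heating_times([2.5, 7.0])
print(f"heating time between {fast_s:.2f} s and {slow_s:.2f} s")

# The multilevel simulation instead gives Gamma_IR ~ 0.02 per second:
print(f"multilevel estimate: {1 / 0.02:.0f} s")
```

The comparison makes the size of the discrepancy explicit: the two-state bound allows heating times under a second, while the multilevel simulation corresponds to roughly 50 s.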

A third suspect that cannot be discounted is the presence
of stray light, which we have endeavored to eliminate.
For lifetimes as in Fig. 3, we require intracavity
photon number $\bar{n} \ll 10^{-5}$, which is not trivial to diagnose.
A final concern is the background pressure in the
region of the FORT. Although the chamber pressure is
$3 \times 10^{-10}$ Torr (leading to $\tau \approx 30$ s), we have no direct
measurement of the residual gas density in the narrow
cylinder between the mirror substrates (diameter 1 mm
and length 43 $\mu$m), except for the trap lifetime itself.

(2) Toward the goals of continuous observation of single
trapped atoms [3,4] and of implementing $\Lambda$ schemes
in cavity QED [7-9,28], we next present results from our
second protocol. Here the $F = 4 \rightarrow F' = 4$ transition
is strongly coupled to the cavity field, with $\Delta'_C \equiv \omega_C -
\omega_{4\rightarrow 4} = 0$. In contrast to our protocol (1), the FORT and
the transverse $y - z$ beams are left ON continuously, with
the latter containing only light near the $F = 3 \rightarrow F' = 3$
resonance, with detuning $\Delta_3$. Significantly, we observe
trap loading with no cooling light near the $F = 4 \rightarrow
F' = 5$ transition.

An example of the resulting probe transmission is
shown in Fig. 4, which displays two separate records of
the continuous observation of trapped atoms. Here the
probe detuning $\Delta'_p = \omega_p - \omega_{4\rightarrow 4} = 0$ and the probe
strength is given in terms of $\bar{m} = |\langle \hat{a} \rangle|^2$ deduced from
the heterodyne current, with $\hat{a}$ as the annihilation operator
for the intracavity field. We believe that the $y - z$
repumping beams (which excite $F = 3 \rightarrow F' = 3$) provide
cooling, since without them the atoms would
"roll" in and out of the near-conservative FORT potential
(indeed no trapping occurs in their absence). In addition,
this is a continuous cooling and loading scheme, so that
we routinely load multiple atoms into the trap.

The most striking characteristic of the data collected in
this manner is that $\bar{m}$ versus $t$ always reaches its deepest
level within the $\approx 10$ ms window when the falling atoms
arrive, subsequently increasing in a discontinuous "staircase"
of steps. As indicated in Fig. 4, our interpretation is
that there is a different level for $\bar{m}$ associated with each
value $N$ of the number of trapped atoms (with the level
decreasing for higher $N$), and that each step is due to the
loss of an atom from the cavity mode. In addition, we
observe a strong dependence both of the initial trapping
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FIG.  4  (color  online).  Two  traces  of  the  continuous  observa¬ 
tion  of  trapped  atoms  inside  a  cavity  in  a  regime  of  strong 
coupling.  After  an  initial  sharp  reduction  around  t  =  0  as 
atoms  are  cooled  into  the  cavity  mode,  the  intracavity  field 
strength $\bar{m}$ increases in a discontinuous fashion as trapped
atoms escape from the cavity mode one by one. rf detection
bandwidth = 1 kHz, $\Delta'_C = 0 = \Delta'_p$, and $\Delta_3/2\pi = 25$ MHz.

probability and of the continuous observation time on the
detuning of the transverse beams, with an optimal value
$\Delta_3 \approx 25$ MHz to the blue of the $3 \rightarrow 3$ transition, which
strongly suggests blue Sisyphus cooling [29].

We stress that observations as in Fig. 4 are made
possible by strong coupling in cavity QED, for which
individual intracavity atoms cause the displayed changes
in probe transmission. While $\bar{m}$ in Fig. 4 is only $\approx 0.01$, it
represents an output flux $\approx 5 \times 10^5$ photons per second.
The probe is also critical to the cooling, although it is not
clear whether this beam is acting as a simple "repumper"
[29] or is functioning in a more complex fashion due to
strong coupling. We have not seen such striking phenomena
under similar conditions for cavity QED with the $F =
4 \rightarrow F' = 5$ transition. Note that our ability to monitor
the atom as well as to cool its motion are enabled by the
state-insensitive character of the trap, since the net transition
shifts are small, $(g_0, \Delta_3) \gg \delta_0$.
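
The quoted output flux can be reproduced from the intracavity photon number and the cavity decay rate. The sketch below assumes the total photon escape rate is $2\kappa\bar{m}$, with $\kappa$ the amplitude decay rate given earlier ($\kappa/2\pi = 4.2$ MHz), and it does not separate mirror transmission from other losses; the function name is ours.

```python
import math

def output_photon_flux(mean_photon_number, kappa_over_2pi_hz):
    """Photon escape rate 2 * kappa * m, with kappa the field (amplitude) decay rate.

    Assumption: every photon lost from the cavity is counted as output flux.
    """
    kappa = 2 * math.pi * kappa_over_2pi_hz
    return 2 * kappa * mean_photon_number

flux = output_photon_flux(0.01, 4.2e6)
print(f"flux = {flux:.2e} photons/s")
```

Under that assumption the estimate lands at about $5 \times 10^5$ photons per second, in line with the value in the text.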

In summary, we have demonstrated a new set of
ideas within the setting of cavity QED, including state-insensitive
trapping suitable for strong coupling. Trapping
of single atoms with $g_0 \gg (\delta_0, \kappa, \gamma)$ has been achieved
with lifetimes $\tau \approx 2$-$3$ s. Since intrinsic heating in the
FORT is quite low ($\sim 11\ \mu$K/s due to photon recoil),
we anticipate extensions to much longer lifetimes.
Continuous observations of multiple atoms in a cavity
have been reported, and involve an interplay of a strongly
coupled probe field for monitoring and a set of $y - z$
cooling beams. Our measurements represent the first
demonstration of cooling for trapped atoms strongly
coupled to a cavity. Beyond its critical role here, state-insensitive
trapping should allow the application of
diverse laser cooling schemes, leading to further advances
in quantum information science.
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While  all  bipartite  pure  entangled  states  violate  some  Bell  inequality,  the  relationship  between 
entanglement and nonlocality for mixed quantum states is not well understood. We introduce a simple
and  efficient  algorithmic  approach  for  the  problem  of  constructing  local  hidden  variable  theories  for 
quantum  states.  The  method  is  based  on  constructing  a  so-called  symmetric  quasiextension  of  the 
quantum  state  that  gives  rise  to  a  local  hidden  variable  model  with  a  certain  number  of  settings  for  the 
observers  Alice  and  Bob. 
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It  was  Bell  [1]  who  quantified  how  measurements  on 
entangled  quantum  mechanical  systems  can  invalidate 
local  classical  models  of  reality.  His  original  inequality 
has  generated  a  field  of  research  devoted  to  general  Bell 
inequalities  and  experimentally  observed  violations  of 
such  inequalities. 

Perhaps  surprisingly,  the  nature  of  the  set  of  states  that 
violate  local  realism  is  poorly  understood,  although  it  is 
known  from  the  seminal  work  of  Werner  [2]  that  not  all 
entangled  states  violate  a  Bell  inequality.  Recent  results 
in  quantum  information  theory  have  revealed  the  com¬ 
plex  structure  of  the  set  of  entangled  states  but  have  as  yet 
shed  little  light  on  the  relation  between  this  structure  and 
violation  of  Bell  inequalities.  For  example,  it  has  been 
conjectured by Peres [3] that so-called bound entangled
states which satisfy the Peres-Horodecki "partial transposition"
criterion [4] (i.e., they have positive partial
transpose), do not violate any Bell inequalities. There
are  various  results  that  support  this  conjecture  both  in 
the  bipartite  and  multipartite  case,  see  Ref.  [5],  but  none 
of  the  results  is  conclusive. 

What  has  been  lacking  in  the  literature  so  far  is  a 
systematic  way  of  deciding  whether  a  quantum  state 
does  or  does  not  violate  some  Bell  inequality.  The  diffi¬ 
culty  is  that  the  possible  types  of  local  measurements  and 
the  number  of  measurements  that  observers  can  perform 
is  in  principle  unbounded  and  the  enumeration  of  Bell 
inequalities  is  computationally  hard  [6]. 

In  this  Letter  we  present  the  first  systematic  approach 
for  constructing  local  hidden  variable  theories  for  quan¬ 
tum  states,  depending  only  on  the  number  of  local  mea¬ 
surement  settings  for  each  observer.  Our  approach  has 
yielded  both  numerically  constructed  local  hidden  vari¬ 
able  theories  for  a  variety  of  quantum  states  as  well  as 
analytical  results  for  Werner  states  [2]  and  a  class  of 
bound  entangled  states  based  on  real  UPBs  [7]. 

Before  we  can  state  our  main  result,  we  recapitulate  the 
mathematics  of  local  hidden  variable  (LHV)  models  and 
Bell  inequalities  for  bipartite  systems.  We  refer  the  reader 
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to  Refs.  [3,6,8]  for  some  literature  on  the  theoretical 
formulation of general Bell inequalities. Each of the observers,
Alice and Bob, has a set of local measurements.
Let $i = 1, \ldots, s_a$ index the measurements for
Alice and let each measurement have $o_a(i)$ outcomes.
Let $k = 1, \ldots, s_b$ index the measurements for
Bob and $o_b(k)$ be the number of outcomes per measurement.
The probability $P_{ij,kl}$ denotes the probability that
Alice's $i$th measurement has outcome $j$ and Bob's $k$th
measurement has outcome $l$. A local hidden variable
model assumes the existence of a shared random variable
between Alice and Bob that is used to locally generate a
measurement outcome depending only on the choice of
the local measurement (and not on the choice of the other,
remote, measurement). The local hidden variable model
generates the probability vector $\vec{P}$ with entries $P_{ij,kl}$ when
it generates measurement outcomes in accordance with
these probabilities. Mathematically one defines a convex
set $S(s_a, s_b, o_a, o_b)$ which is the set of probability vectors
$\vec{P}$ that can be generated by LHV models. It is known
that $S$ is a polytope and that the extremal vectors $\vec{B}$ of $S$
are vectors with 0, 1 entries [8]. These extremal vectors
$\vec{B}$ correspond to the situation in which the outcomes
of the measurements are determined with certainty and
can be labeled by two sets of indices $\mathbf{m} = (m_1, \ldots, m_{s_a})$
where $m_i = 1, \ldots, o_a(i)$ and $\mathbf{n} = (n_1, \ldots, n_{s_b})$ where
$n_k = 1, \ldots, o_b(k)$. A brief expression for these extremal
vectors is

$$B^{\mathbf{m},\mathbf{n}}_{ij,kl} = \delta_{j m_i}\,\delta_{l n_k}. \qquad (1)$$

In  words,  each  extremal  vector  specifies  a  single  outcome 
with  probability  one  for  each  local  measurement,  inde¬ 
pendently  of  the  measurement  made  by  the  other  parties. 
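
To make Eq. (1) concrete, the following sketch enumerates the extremal vectors for a small scenario. It is an illustration only: indices are 0-based rather than 1-based as in the text, each vector is stored as a dictionary keyed by $(i, j, k, l)$, and the function name is hypothetical.

```python
from itertools import product

def extremal_vectors(outcomes_a, outcomes_b):
    """Yield (m, n, B) for every deterministic local strategy.

    outcomes_a[i] is the number of outcomes of Alice's i-th measurement
    (likewise outcomes_b[k] for Bob). B maps (i, j, k, l) to 0 or 1 and
    equals delta(j, m[i]) * delta(l, n[k]), following Eq. (1).
    """
    for m in product(*(range(o) for o in outcomes_a)):
        for n in product(*(range(o) for o in outcomes_b)):
            B = {
                (i, j, k, l): int(j == m[i] and l == n[k])
                for i, oa in enumerate(outcomes_a) for j in range(oa)
                for k, ob in enumerate(outcomes_b) for l in range(ob)
            }
            yield m, n, B

# Two settings per party, two outcomes per setting (the CHSH scenario).
vertices = list(extremal_vectors([2, 2], [2, 2]))
print(len(vertices), "extremal vectors")
```

For this scenario there is one vertex for each of the $2^2 \times 2^2$ assignments of outcomes to settings, and each vertex has exactly one nonzero entry per pair of settings $(i, k)$.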

For a quantum mechanical system $\rho$ in $\mathcal{H}_{d_A} \otimes \mathcal{H}_{d_B}$
the probability $P_{ij,kl}$ is given by $P_{ij,kl}(\rho) = \mathrm{Tr}\, E^A_{ij} \otimes E^B_{kl}\, \rho$.
Here $\{E^A_{ij} \geq 0 : \sum_j E^A_{ij} = I_A\}$ are the POVM (positive
operator-valued measure) elements for Alice's $i$th measurement
and $\{E^B_{kl}\}$ are the POVM elements for Bob's $k$th
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measurement.  There  is  a  violation  of  a  Bell  inequality 
if and only if $P_{ij,kl}$ cannot be generated by a LHV model,
or $\vec{P} \notin S$.

In  this  Letter  we  prove  the  first  necessary  condition  for 
a  state  to  violate  a  Bell  inequality  depending  only  on  the 
number  of  settings  for  Alice  and  Bob.  We  explicitly  con¬ 
struct  a  LHV  model  in  a  sa  =  2  and  arbitrary  sb  setting 
(and  vice  versa)  for  any  bound  entangled  state  based  on  a 
real  unextendible  product  basis  (UPB)  [4].  Then  we  dis¬ 
cuss  numerical  work  that  shows  that  many  of  the  known 
bipartite  bound  entangled  states  cannot  violate  a  Bell  in¬ 
equality  with  two  settings  either  for  Alice  or  Bob.  Finally, 
we  partially  reproduce  and  extend  some  of  Werner’s  origi¬ 
nal  results  by  showing  that  it  is  possible  to  use  our 
procedure  to  analytically  construct  LHV  theories  for 
Werner states. It is worth mentioning that our methods
(Theorem 1 and Theorem 2) generalize straightforwardly
to multipartite states, even though we have not
explored this direction.

We connect violations of Bell inequalities to the existence
of a symmetric (quasi-) extension of a quantum
state [9]. An extension of a quantum state $\rho$ on, say, a
system $AB$, is another quantum state defined on a system
$ABC$ such that when we trace over $C$ we obtain the
original quantum state $\rho$. We are interested in the situation
where the system $C = \mathcal{H}_A^{\otimes (s_a - 1)} \otimes \mathcal{H}_B^{\otimes (s_b - 1)}$ and we
demand that the extension be invariant under all permutations
of the $s_a$ copies of system $A$ among each other and
similarly invariant under any permutation of the $B$ systems.
It is clear that if the quantum state $\rho$ is separable,
i.e., $\rho = \sum_i p_i (|\psi_i\rangle\langle\psi_i|)_A \otimes (|\phi_i\rangle\langle\phi_i|)_B$, such an extension
always exists: we just copy the individual product states
onto the other spaces:

$$\rho_{\rm ext} = \sum_i p_i\, (|\psi_i\rangle\langle\psi_i|)^{\otimes s_a} \otimes (|\phi_i\rangle\langle\phi_i|)^{\otimes s_b}. \qquad (2)$$

If  the  state  p  is  a  pure  entangled  state,  then  it  is  also  clear 
that  such  a  symmetric  extension  cannot  exist.  The  sym¬ 
metry  requirement  implies  that  the  pure  entangled  state 
$\rho_{AB}$ must equal $\rho_{A'B}$, where $A'$ is another $A$ system, which
is  impossible.  In  popular  terms  we  may  say  that  pure 
entanglement  is  “monogamous”:  B  cannot  be  entangled 
with  A  and  A'  at  the  same  time.  In  some  sense  what  we 
show  in  this  Letter  is  that  (i)  a  violation  of  a  Bell  inequal¬ 
ity  indicates  that  the  entanglement  in  the  quantum  state  is 
monogamous  and  (ii)  there  are  many  mixed  entangled 
states  whose  entanglement  is  not  monogamous. 
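
As a small worked illustration of the separable case (our own example, not one taken from the Letter), consider the classically correlated two-qubit state and its $(s_a, s_b) = (2, 1)$ extension built by copying Alice's product states onto a second system $A'$:

```latex
\rho = \tfrac{1}{2}\bigl( |0\rangle\langle 0|_A \otimes |0\rangle\langle 0|_B
                        + |1\rangle\langle 1|_A \otimes |1\rangle\langle 1|_B \bigr),
\qquad
\rho_{\rm ext} = \tfrac{1}{2}\bigl( |00\rangle\langle 00|_{AA'} \otimes |0\rangle\langle 0|_B
                                  + |11\rangle\langle 11|_{AA'} \otimes |1\rangle\langle 1|_B \bigr).
```

Tracing out $A'$ returns $\rho$, and exchanging $A$ with $A'$ leaves $\rho_{\rm ext}$ unchanged, so both requirements are met. For a pure entangled state such as $(|00\rangle + |11\rangle)/\sqrt{2}$ no analogous operator exists, which is the monogamy statement made above.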

Thus  the  existence  of  a  symmetric  extension  can  be 
viewed  as  a  separability  criterion  (see  Ref.  [11]  for  a 
similar  but  stronger  separability  criterion  where  one  de¬ 
mands  that  the  symmetric  extension  has  positive  partial 
transposes).  For  considering  Bell  inequality  violations 
we  generalize  our  criterion  slightly  and  ask  whether  a 
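state has a symmetric quasiextension $H_\rho$ which is not
necessarily positive. In order to define this notion we need
the definition of a multipartite entanglement witness,
which is an entanglement witness which can detect any
multipartite entanglement in a state. Such a witness $W$ has the property
that for all states $\psi_1, \ldots, \psi_{s_a}, \phi_1, \ldots, \phi_{s_b}$,
$\langle \psi_1, \ldots, \psi_{s_a}, \phi_1, \ldots, \phi_{s_b} | W | \psi_1, \ldots, \psi_{s_a}, \phi_1, \ldots, \phi_{s_b} \rangle \geq 0$.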

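Definition (symmetric quasiextension): Let $\pi :
\mathcal{H}^{\otimes s} \rightarrow \mathcal{H}^{\otimes s}$ be a permutation of spaces $\mathcal{H}$ in $\mathcal{H}^{\otimes s}$.
We define

$$\mathrm{Sym}(\rho) = \frac{1}{s!} \sum_{\pi} \pi \rho\, \pi^{\dagger}. \qquad (3)$$

We say that $\rho$ on $\mathcal{H}_A \otimes \mathcal{H}_B$ has a $(s_a, s_b)$-symmetric
quasiextension when there exists a multipartite entanglement
witness $H_\rho$ on $\mathcal{H}_A^{\otimes s_a} \otimes \mathcal{H}_B^{\otimes s_b}$ such that
$\mathrm{Tr}_{\mathcal{H}_A^{\otimes (s_a - 1)} \otimes \mathcal{H}_B^{\otimes (s_b - 1)}} H_\rho = \rho$ and $H_\rho = \mathrm{Sym}_A \otimes \mathrm{Sym}_B(H_\rho)$.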
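
The symmetrization map of Eq. (3) is straightforward to implement for small systems. The sketch below is a minimal pure-Python version, assuming the operator is stored as a nested list in the computational product basis of $s$ systems of dimension $d$; the function name and storage convention are ours.

```python
from itertools import permutations, product
from math import factorial

def symmetrize(op, d, s):
    """Average op over all permutations of the s tensor factors, as in Eq. (3).

    op is a (d**s) x (d**s) nested list in the product basis |i1, ..., is>.
    """
    basis = list(product(range(d), repeat=s))          # multi-indices (i1, ..., is)
    index = {b: n for n, b in enumerate(basis)}
    dim = len(basis)
    out = [[0.0] * dim for _ in range(dim)]
    for perm in permutations(range(s)):
        for r, row in enumerate(basis):
            pr = index[tuple(row[p] for p in perm)]    # permuted row index
            for c, col in enumerate(basis):
                pc = index[tuple(col[p] for p in perm)]
                out[r][c] += op[pr][pc] / factorial(s)
    return out

# Example: the projector |01><01| on two qubits is not permutation invariant.
proj_01 = [[0.0] * 4 for _ in range(4)]
proj_01[1][1] = 1.0
sym_01 = symmetrize(proj_01, d=2, s=2)   # equal mixture of |01><01| and |10><10|
```

Applying the map twice gives the same result as applying it once, which is the invariance $H_\rho = \mathrm{Sym}(H_\rho)$ required in the definition.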

The  reason  for  considering  such  quasiextensions  is 
clear  from  the  following  theorems  which  are  the  main 
results  of  this  Letter. 

Theorem 1: If $\rho$ has a $(s_a, s_b)$-symmetric quasiextension,
then $\rho$ does not violate a Bell inequality with $(s_a, s_b)$
settings.

Before  proving  this  theorem,  it  is  important  to  note  the 
generality  of  the  result;  it  holds  for  all  possible  choices  of 
measurements  which  includes  POVM  measurements  with 
an  unbounded  number  of  measurement  outcomes.  We 
show below that the quasiextension of $\rho$ effectively creates
a LHV model for $\rho$ when Alice and Bob have $s_a$ and
$s_b$ arbitrary measurements.

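Proof: We prove our theorem by extracting a LHV
model from the quasiextension. The LHV model for $\rho$
for $(s_a, s_b)$ settings should reproduce the vector $P_{ij,kl}(\rho) =
\mathrm{Tr}\, E^A_{ij} \otimes E^B_{kl}\, \rho$ for all possible choices of POVM measurements
$\{E^A_{ij}, E^B_{kl}\}$, as a convex combination of the extremal
$\vec{B}$ vectors, i.e.,

$$P_{ij,kl}(\rho) = \sum_{\mathbf{m},\mathbf{n}} p_{\mathbf{m},\mathbf{n}}(\cdot)\, B^{\mathbf{m},\mathbf{n}}_{ij,kl}, \qquad (4)$$

where $p_{\mathbf{m},\mathbf{n}}(\cdot) \geq 0$. If a symmetric quasiextension exists
for $\rho$, then $\mathrm{Tr}\, E^A_{ij} \otimes E^B_{kl}\, \rho = \mathrm{Tr}\, (E^A_{ij} \otimes E^B_{kl} \otimes I) H_\rho$. Using the
definition of the $\vec{B}$ vectors, the properties of the POVMs
($\sum_j E^A_{ij} = I_A$ and $\sum_l E^B_{kl} = I_B$), and the symmetry properties of $H_\rho$ it is
not hard to verify that

$$P_{ij,kl}(\rho) = \mathrm{Tr}\, E^A_{ij} \otimes E^B_{kl}\, \rho = \sum_{\mathbf{m},\mathbf{n}} (\mathrm{Tr}\, E^A_{\mathbf{m}} \otimes E^B_{\mathbf{n}}\, H_\rho)\, B^{\mathbf{m},\mathbf{n}}_{ij,kl}. \qquad (5)$$

Here $E^A_{\mathbf{m}} = E^A_{1 m_1} \otimes E^A_{2 m_2} \otimes \cdots \otimes E^A_{s_a m_{s_a}}$ and similarly for $E^B_{\mathbf{n}}$.

Since $H_\rho$ is a quasiextension, $p_{\mathbf{m},\mathbf{n}}(\{E^A_{ij}, E^B_{kl}\}, \rho) = \mathrm{Tr}\, E^A_{\mathbf{m}} \otimes
E^B_{\mathbf{n}}\, H_\rho \geq 0$, and we have obtained a LHV model. □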

One  way  of  looking  at  this  result  is  the  following  [12]. 
If $\rho$ has a symmetric extension $\tilde{\rho}$, then instead of measurement
on $\rho$, Alice and Bob can do measurements on $\tilde{\rho}$.
Because  of  the  symmetry  Alice  can  do  the  first  measure¬ 
ment  on  the  first  Alice  space  and  the  second  measurement 
on  the  second  Alice  space,  etc.  But  now  these  measure¬ 
ments  are  all  commuting  and  can  be  considered  as  one  big 
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measurement.  But  we  know  that  when  Alice  and  Bob  each 
have  only  a  single  measurement  a  LHV  model  for  their 
measurements  exists  and  thus  we  have  a  LHV  model  for 
the  measurements  on  p.  With  this  picture  in  mind,  it  is  not 
hard  to  understand  the  following  strengthening  of  our 
results  (see  also  Ref.  [10]): 

Theorem 2: If $\rho$ has a $(1, s_b)$-symmetric quasiextension,
then $\rho$ does not violate a Bell inequality with $s_b$
settings for Bob and any number of settings for Alice.

Remark:  The  theorem  also  holds  when  Alice  and  Bob 
are  interchanged. 

Proof: The intuition behind this theorem relies on the
fact that there are no violations of Bell inequalities when
one party has only one measurement setting, thus suggesting
that it is unnecessary to extend to copies of Alice's
space as well as Bob's. Here is the local hidden variable
model that we construct from a quasiextension $H_\rho$ on
$\mathcal{H}_A \otimes \mathcal{H}_B^{\otimes s_b}$. We set

$$p_{\mathbf{m},\mathbf{n}}(\{E^A_{ij}, E^B_{kl}\}, \rho) = \frac{\prod_{i=1}^{s_a} \mathrm{Tr}\,(E^A_{i m_i} \otimes E^B_{\mathbf{n}}\, H_\rho)}{(\mathrm{Tr}\, I_A \otimes E^B_{\mathbf{n}}\, H_\rho)^{s_a - 1}}. \qquad (6)$$

Each $p_{\mathbf{m},\mathbf{n}}$ is non-negative since $H_\rho$ is an entanglement
witness. We can substitute this expression in Eq. (4) and
verify that we obtain the correct probabilities $P_{ij,kl}(\rho)$ by
using the definition of the $\vec{B}$ vectors, the normalization of
the POVMs, and the symmetry of $H_\rho$ as before. □
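
The verification mentioned at the end of the proof can be spelled out as follows. This is our own sketch of the omitted steps, assuming the denominators $\mathrm{Tr}\, I_A \otimes E^B_{\mathbf{n}} H_\rho$ are nonzero (terms where they vanish contribute nothing) and writing $E^B_{kl}$ in the $k$th Bob slot of the tensor product:

```latex
\sum_{\mathbf{m},\mathbf{n}} p_{\mathbf{m},\mathbf{n}}\, \delta_{j m_i}\, \delta_{l n_k}
  = \sum_{\mathbf{n}:\, n_k = l}
    \frac{\mathrm{Tr}\,(E^A_{ij} \otimes E^B_{\mathbf{n}} H_\rho)\,
          \prod_{i' \neq i} \sum_{m_{i'}} \mathrm{Tr}\,(E^A_{i' m_{i'}} \otimes E^B_{\mathbf{n}} H_\rho)}
         {(\mathrm{Tr}\, I_A \otimes E^B_{\mathbf{n}} H_\rho)^{s_a - 1}}
  = \sum_{\mathbf{n}:\, n_k = l} \mathrm{Tr}\,(E^A_{ij} \otimes E^B_{\mathbf{n}} H_\rho)
  = \mathrm{Tr}\,\bigl(E^A_{ij} \otimes I \otimes \cdots \otimes E^B_{kl} \otimes \cdots \otimes I \; H_\rho\bigr)
  = \mathrm{Tr}\, E^A_{ij} \otimes E^B_{kl}\, \rho .
```

The second equality uses $\sum_{m_{i'}} E^A_{i' m_{i'}} = I_A$, so each of the $s_a - 1$ factors with $i' \neq i$ cancels one power of the denominator. The third uses $\sum_{n_{k'}} E^B_{k' n_{k'}} = I_B$ for $k' \neq k$. The last uses the permutation symmetry of $H_\rho$ on the Bob copies together with the partial-trace condition.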

This method for constructing LHV theories may
be implemented both numerically and analytically.
Let us first show a simple analytic construction of a
(2, 2)-symmetric extension for any bound entangled state
based on a real unextendible product basis [7]. Let $P_{BE} =
I - \sum_i |a_i, b_i\rangle\langle a_i, b_i|$ be the projector onto such a bound
entangled state, where $\{|a_i, b_i\rangle = |a_i^*, b_i^*\rangle\}$ is the real unextendible
product basis. Our (unnormalized) extension
will be $|\Psi\rangle\langle\Psi|_{A_2 A_1} \otimes |\Psi\rangle\langle\Psi|_{B_1 B_2} - \sum_i |a_i, a_i, b_i, b_i\rangle\langle a_i, a_i, b_i, b_i|_{A_2 A_1 B_1 B_2}$,
where $|\Psi\rangle = \sum_i |ii\rangle$. It is evident that this extension has
the desired symmetry property. It is not hard to verify
that by tracing over the systems $A_2$ and $B_2$ we obtain
$P_{BE}$. The existence of a symmetric (2, 2) extension
implies the existence of both (2, 1) and (1, 2) symmetric
extensions for the state by tracing out copies of $A$ or $B$, so
any Bell inequality violation for this class of states must
involve more than two measurement settings for both
parties.

We have implemented numerical tests for the conditions
of these two theorems. First, we look for the existence
of a symmetric extension with $H_\rho \geq 0$. If such an
extension does not exist, there is still the possibility that
some other kind of quasiextension does exist. We have
focused on the existence of a decomposable entanglement
witness $H_\rho$ because in both these cases the numerical
problem corresponds to a semidefinite program [13]. We
label the partitions of $\mathcal{H}_A^{\otimes s_a} \otimes \mathcal{H}_B^{\otimes s_b}$ into bipartite systems
by $p$ and we denote partial transposition with respect
to one of the two subsystems as $T_p$. A decomposable
entanglement witness may then be written as $H_\rho = P +
\sum_p Q_p^{T_p}$, where $P \geq 0$, $Q_p \geq 0$ for all $p$.

Semidefinite programs correspond to optimizations of
linear functions on positive matrices subject to trace constraints.
They are convex optimizations and are particularly
tractable both analytically and numerically. We
show how to numerically construct symmetric extensions;
the decomposable quasiextension case is very similar.
The condition that the partial trace of $H_\rho$ is $\rho$ is
equivalent to requiring that $\mathrm{Tr}\,(X \otimes I) H_\rho = \mathrm{Tr}\, X\rho$ for all
operators $X$ on $\mathcal{H}_A \otimes \mathcal{H}_B$. If we write $X$ in terms of a
basis $\{\sigma_i\}$ for the real vector space of Hermitian operators,
then by linearity it is enough to check that this trace
constraint holds for each element of the basis. We assume
that the basis is orthogonal in the trace inner product,
$\mathrm{Tr}\, \sigma_i \sigma_j = \delta_{ij}$, and that $\sigma_0 = I_{d_A} \otimes I_{d_B} / \sqrt{d_A d_B}$. The index
$i$ ranges from zero to $(d_A d_B)^2 - 1$. Consider then this
semidefinite program

minimize $\mathrm{Tr}\, K$,

subject to $\mathrm{Tr}\, \mathrm{Sym}_A \otimes \mathrm{Sym}_B(\sigma_i \otimes I)\, K = r_i$, $i > 0$,

$K \geq 0$,

where $r_i = \mathrm{Tr}\, \sigma_i \rho$. If the optimum is less than or equal to
one, then, by adding a multiple of the identity to the
optimal $K$, we obtain some $K_\rho$ that satisfies $\mathrm{Tr}\, K_\rho = 1$ as
well as the other constraints. If we define $H_\rho = \mathrm{Sym}_A \otimes
\mathrm{Sym}_B(K_\rho)$ it is clear that $H_\rho$ is a $(s_a, s_b)$-symmetric extension
of $\rho$. Duality properties of semidefinite programs
imply that an optimum greater than one precludes the
existence of a $(s_a, s_b)$-symmetric extension [13].
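
The semidefinite program itself needs a dedicated solver (the authors used SeDuMi), but its inputs are easy to prepare. The sketch below builds one possible trace-orthonormal Hermitian basis with $\sigma_0 \propto I$ (a normalized generalized Gell-Mann basis, which is our choice and not necessarily the authors') and computes the constraint values $r_i = \mathrm{Tr}\, \sigma_i \rho$. All function names are hypothetical.

```python
import math

def hermitian_basis(dim):
    """Hermitian matrices s_i with Tr(s_i s_j) = delta_ij; s_0 = identity / sqrt(dim)."""
    def zeros():
        return [[0j] * dim for _ in range(dim)]
    basis = []
    ident = zeros()
    for j in range(dim):
        ident[j][j] = 1 / math.sqrt(dim)
    basis.append(ident)
    for j in range(dim):                       # off-diagonal real and imaginary parts
        for k in range(j + 1, dim):
            sym_part, asym_part = zeros(), zeros()
            sym_part[j][k] = sym_part[k][j] = 1 / math.sqrt(2)
            asym_part[j][k], asym_part[k][j] = -1j / math.sqrt(2), 1j / math.sqrt(2)
            basis += [sym_part, asym_part]
    for l in range(1, dim):                    # traceless diagonal elements
        diag = zeros()
        norm = math.sqrt(l * (l + 1))
        for j in range(l):
            diag[j][j] = 1 / norm
        diag[l][l] = -l / norm
        basis.append(diag)
    return basis

def trace_product(a, b):
    """Tr(a b) for square matrices stored as nested lists."""
    n = len(a)
    return sum(a[r][c] * b[c][r] for r in range(n) for c in range(n))

def constraint_values(rho, basis):
    """r_i = Tr(sigma_i rho), the right-hand sides of the trace constraints."""
    return [trace_product(s, rho).real for s in basis]

sigma = hermitian_basis(4)                     # d_A = d_B = 2, so d_A * d_B = 4
rho_mixed = [[0.25 if r == c else 0.0 for c in range(4)] for r in range(4)]
r_values = constraint_values(rho_mixed, sigma)
```

For the maximally mixed two-qubit state only $r_0$ is nonzero, as expected, since every other basis element is traceless.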

We have implemented this semidefinite program using
SeDuMi [14] for several examples of bound entangled
states with $d_A = d_B = 3$. The results are summarized in
Table I. For example, the Choi-Horodecki (C-H) states
considered in Ref. [15] depend on a parameter $\alpha$ and
include separable ($\alpha \in [2, 3]$), bound entangled ($\alpha \in
(3, 4]$), and nonpositive partial transpose (NPT) states
for $\alpha > 4$. They turn out to have (2, 1)-symmetric extensions
well into the range for which the states are entangled
and even NPT. Over the range $\alpha \in [4.34, 4.84]$
they have decomposable symmetric quasiextensions but
no symmetric extensions, showing that the former property
provides a strictly stronger sufficient condition for
the existence of a LHV theory. The bound entangled states
of Ref. [16] do not even have (1, 2) extensions. These and many
of the Bruß-Peres states [17] are therefore the strongest
candidates for bound entangled states with Bell inequality
violations. However, we have looked for but did not
find Bell inequality violations for $(s_a = 2, s_b = 2)$ settings
and three-outcome projective measurements and for
(3, 3) settings and two-outcome projective measurements.
Finally we considered Werner states [2] defined in dimensions
$d = d_A = d_B \ge 2$ as
$\rho_W = \frac{1}{d^3 - d}\,[(d - \Phi) I + (d\Phi - 1) V]$,
where $V$ is the flip operator. Werner [2]
TABLE I. Numerical results on the existence of symmetric
extensions (ext) and decomposable quasiextensions (q-ext) for
$(s_a = 1, s_b = 2)$, $(s_a = 2, s_b = 1)$, $(s_a = 1, s_b = 3)$, and
$(s_a = 3, s_b = 1)$. For the one and two dimensional families of
states [2,15,16] we performed a systematic search of the parameter
space. For the high dimensional families [17,18] we make
qualitative statements based on randomly chosen examples.
Note that states may have an (s, 1) extension and no (1, s)
extension. We have performed both tests in all cases, but the
general results are unaffected, although we do find examples of
Bruß-Peres states with (2, 1) extensions, say, but no (1, 2)
extensions.

                                 (2, 1), (1, 2)             (3, 1), (1, 3)
State family                     ext          q-ext         ext
C-H [15]: $\alpha \in$           [2, 4.33]    [2, 4.84]     [2, 4.00]
Complex UPB [18]                 Yes          Yes           Few
H-F [16]                         No           No            No
Bruß-Peres [17]                  Few          Few           No
Werner [2]                       $d \ge 3$    $d \ge 3$     $d \ge 4$
Werner $d = 2$, $\Phi \ge$       -1/2         -1/2          -1/3

showed that for $\Phi \ge -1 + \frac{d+1}{d^2}$ these states do not
violate any Bell inequality with an arbitrary number $s_a$, $s_b$ of
von Neumann measurements (in Ref. [19] the author constructs LHV
models for arbitrary POVM settings for a more restricted range of
$\Phi$). We found that using symmetry techniques similar to those in
Ref. [20] it is possible to analytically solve the dual optimization
problem to the semidefinite program described above; see [21]. The
value of the optimum establishes that all Werner states have
symmetric extensions so long as $s_a + s_b \le d$. Hence these states
have LHV theories for all Bell experiments where the minimum number
of settings $s = \min(s_a, s_b)$ satisfies $s + 1 \le d$. This result
is more general than Werner's in the sense that, as in Ref. [19], it
holds for general POVM elements. It is weaker in the sense that the
number of settings is bounded by the dimension of the space.
Numerical and analytical results (see Table I and [21]) show that
Werner states for $d = 2$ actually have symmetric (quasi-)extensions
beyond this analytically derived bound.

Even  though  our  method  is  the  most  powerful  tool  to 
date  for  constructing  local  hidden  variable  theories,  we 
believe  that  it  is  unlikely  that  every  LHV  model  can  be 
constructed  from  a  symmetric  quasiextension.  It  has  been 
proven that only separable states have $(s_a = 1, s_b \to \infty)$
extensions;  see  Refs.  [10,22].  This  result  can  be  extended 
to  quasiextensions  [23]  even  though  LHV  theories  for 
entangled  states  with  an  arbitrary  number  of  settings  do 
exist  [2,19].  Our  work  is  only  the  starting  point  for  a 
more  thorough  exploration  of  the  existence  of  LHV  mod¬ 
els  and  (quasi-)  extensions  for  entangled  quantum  states. 
"If two separated bodies, each by itself
known  maximally,  enter  a  situation  in 
which  they  influence  each  other,  and 
separate  again,  then  there  occurs  regularly 
that  which  I  have  [just]  called  entanglement 
of  our  knowledge  of  the  two  bodies." 

- Erwin Schrödinger (translation by J. D.
Trimmer)

Erwin Schrödinger coined the word entanglement in
1935 in a three-part paper [1] on the "present situation in
quantum  mechanics."  His  article  was  prompted  by 
Albert  Einstein,  Boris  Podolsky,  and  Nathan  Rosen's 
now  celebrated  EPR  paper  that  had  raised  fundamental 
questions  about  quantum  mechanics  earlier  that  year. 
Einstein and his coauthors had recognized that quantum
theory  allows  very  particular  correlations  to  exist 
between  two  physically  distant  parts  of  a  quantum 
system;  those  correlations  make  it  possible  to  predict 
the  result  of  a  measurement  on  one  part  of  a  system  by 
looking  at  the  distant  part.  On  that  basis,  the  EPR  paper 
argued  that  the  distant  predicted  quantity  should  have  a 
definite  value  even  before  being  measured  if  the  theory 
were  to  claim  completeness  and  respect  locality. 
However,  because  quantum  mechanics  disallows  such 
definite  values  prior  to  measuring,  the  EPR  authors 
concluded  that,  from  a  classical  perspective,  quantum 
theory  must  be  incomplete. 


Schrödinger's 1935 perspective comes closer to the
modern view: The wavefunction or state vector gives us
all the information that we can have about a quantum
system. About entangled quantum states, he wrote, "The
whole is in a definite state, the parts taken individually
are not," which we now understand as the essence of
pure-state entanglement. In that same 1935 article,
Schrödinger also introduced his famous cat as an
extreme illustration of entanglement: A cat physically
isolated in a box with a decaying atom and vial of
cyanide represents a quantum state having macroscopic
degrees of freedom. If the atom were to decay and
trigger the release of cyanide, the cat would die. The
quantum-mechanical description of the system is a
coherent superposition of one state in which the atom is
still excited and the cat alive, and another state in which
the atom has decayed and the cat is dead:
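
In ket notation, and assuming equal amplitudes purely for
illustration (the state labels are ours, chosen to match the
description above), such a superposition can be sketched as

$$|\Psi\rangle = \frac{1}{\sqrt{2}}\left(|\text{excited}\rangle\,|\text{alive}\rangle + |\text{decayed}\rangle\,|\text{dead}\rangle\right).$$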


The  isolated  cat-trigger-atom-cyanide  system  as  a  whole 
is  in  a  definite  entangled  state,  even  though  the  cat  itself 
exists  as  a  probabilistic  mixture  of  being  alive  or  dead. 


For the three decades following the 1935 articles, the debate
about entanglement and the "EPR dilemma" (how to make sense of the
presumably nonlocal effect one particle's measurement has on
another) was philosophical in nature, and for many physicists it
was nothing more than that. The 1964 publication by John Bell
(pictured in figure 1) changed that situation dramatically. Bell
derived correlation inequalities that can be violated in
quantum mechanics but have to be satisfied within every
model that is local and complete, so-called local
hidden-variable models. Bell's work made it possible to test
whether local hidden-variable models can account for
observed physical phenomena. Early and ongoing recent
experiments [3] showing violations of such Bell
inequalities have invalidated local hidden-variable
models and lend support to the quantum-mechanical
view of nature. In particular, an observed violation of a
Bell inequality demonstrates the presence of
entanglement in a quantum system.

Figure 1

In 1994, Peter Shor at AT&T Research discovered that,
for certain problems, computation with quantum states
instead of classical bits can result in tremendous savings
in computation time [4]. He found a polynomial-time
quantum algorithm that solves the problem of finding
prime factors of a large integer. To date, no classical
polynomial-time algorithm for this problem is known.

Shor's  breakthrough  generated  an  avalanche  of  interest 
in  quantum  computation  and  quantum  information 
theory.  In  this  context,  a  modern  theory  of 
entanglement  has  begun  to  emerge:  Researchers  now 
treat  entanglement  not  simply  as  a  paradoxical  feature 
of  quantum  mechanics,  but  as  a  physical  resource  for 
quantum-information  processing  and  computation.  A 
whole zoo of various kinds of pure and mixed entangled
states may be prepared, well beyond the simple pure-state
superpositions that Schrödinger envisioned. And
those  mixed  entangled  states  may  be  measured, 
distilled,  concentrated,  diluted,  and  manipulated.  A 
surprisingly  rich  picture  of  entanglement  is  now  taking 
shape. 

Entanglement  for  the  21st  century 

The discovery of quantum teleportation by IBM
researcher Charles Bennett and five collaborators in
1993 marks the starting point of the modern view. In
quantum teleportation (see the article by Charles
Bennett in Physics Today, October 1993, page 24), an
experimentalist, Alice, wishes to send an unknown state
$|s\rangle = \alpha|0\rangle + \beta|1\rangle$ of a two-level quantum
system to another experimentalist, Bob, in a distant laboratory.
The two-level system could refer, for example, to the
polarization of a single photon, the electronic excitation
of an effective two-level atom, or the nuclear magnetic
spin of a hydrogen atom. Alice and Bob do not have the
means of directly transmitting the quantum system from
one place to another (for photons, this could be the case
when using a high-loss optical fiber), but let us imagine
that they do share an entangled state. Consider the case
in which Alice and Bob each have one spin of a shared
singlet state of two spin-1/2 particles,
$|\Psi^-\rangle = \frac{1}{\sqrt{2}}(|\uparrow\downarrow\rangle - |\downarrow\uparrow\rangle)$,
also called an EPR pair. Alice can transmit her spin
$|s\rangle$ to Bob by performing a certain joint measurement on
her spin state $|s\rangle$ and her half of the EPR pair. She tells
Bob the result of her measurement and, depending on
her information, Bob rotates his half of the EPR pair to
obtain the state $|s\rangle$. The teleportation protocol
demonstrates that the resources of classical
communication and the sharing of prior EPR
entanglement are sufficient to transmit an unknown
spin state $|s\rangle$. (For the experimental realization, see
Physics Today, February 1998, page 18.)
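
The protocol just described can be checked numerically. The
following is a minimal sketch, not taken from the article: it
assumes the convention 0 = spin up, 1 = spin down, takes Alice's
joint measurement to be a projection onto the four Bell states, and
uses the hypothetical helper names `correct` and `teleport`. For
each outcome it applies a Pauli rotation on Bob's side and reports
the outcome probability and the fidelity with the input state.

```python
import math

R = 1 / math.sqrt(2)

# Two-qubit amplitudes in the order |00>, |01>, |10>, |11> (0 = up, 1 = down).
BELL = {
    "Phi+": [R, 0, 0, R],
    "Phi-": [R, 0, 0, -R],
    "Psi+": [0, R, R, 0],
    "Psi-": [0, R, -R, 0],   # the singlet, i.e. the shared EPR pair
}

def correct(name, bob):
    """Pauli rotation Bob applies after hearing Alice's outcome."""
    a, b = bob
    if name == "Phi+":   # apply X, then Z
        return [b, -a]
    if name == "Phi-":   # apply X
        return [b, a]
    if name == "Psi+":   # apply Z
        return [a, -b]
    return [a, b]        # "Psi-": nothing to do

def teleport(s):
    """Return {outcome: (probability, fidelity)} for an input spin s = [alpha, beta]."""
    singlet = BELL["Psi-"]
    # psi[i][k]: i labels (Alice's unknown spin, Alice's EPR half), k labels Bob's spin.
    psi = [[0j, 0j] for _ in range(4)]
    for q0 in range(2):
        for q1 in range(2):
            for q2 in range(2):
                psi[2 * q0 + q1][q2] = s[q0] * singlet[2 * q1 + q2]
    results = {}
    for name, bell in BELL.items():
        # Project Alice's two spins onto a Bell state (real amplitudes, so no
        # conjugation is needed); what remains is Bob's unnormalized spin.
        bob = [sum(bell[i] * psi[i][k] for i in range(4)) for k in range(2)]
        prob = sum(abs(c) ** 2 for c in bob)
        bob = [c / math.sqrt(prob) for c in correct(name, bob)]
        overlap = sum(x.conjugate() * y for x, y in zip(s, bob))
        results[name] = (prob, abs(overlap) ** 2)
    return results
```

Calling `teleport([0.6, 0.8j])` should give each of the four
outcomes with probability 1/4 and unit fidelity; the fidelity
ignores a global phase, which has no physical meaning.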

The spin-singlet EPR state that Alice and Bob share in
quantum teleportation is called a maximally entangled
state. Even though the two spins together constitute a
definite pure state, each spin state is maximally
undetermined or mixed when considered separately. In
mathematical terms, Alice's local density matrix, obtained
by tracing over Bob's spin degrees of freedom,
$\mathrm{Tr}_B(|\Psi^-\rangle\langle\Psi^-|)$, has equal probability
for spin up and spin down. In keeping with Schrödinger's
understanding of entanglement, one measures the amount of
entanglement in a general pure state $|\phi\rangle$ in terms of the
lack of information about its local parts. The von
Neumann entropy $S(\rho) = -\mathrm{Tr}(\rho \log \rho)$ is used as a
measure of that information. In other words, the
entropy of entanglement $E$ of the pure state $|\phi\rangle$ is equal
to the von Neumann entropy of, say, Alice's density matrix
$\rho = \mathrm{Tr}_B |\phi\rangle\langle\phi|$.
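
As a small worked illustration of this definition, the sketch below
computes the entropy of entanglement for a pure state of two
two-level systems. It is an assumption-laden example rather than
anything from the article: the function name `entanglement_entropy`
is ours, the logarithm is taken to base 2 so that an EPR pair
carries one unit, and the restriction to two qubits lets the
eigenvalues of Alice's 2x2 density matrix follow from its
determinant.

```python
import math

def entanglement_entropy(phi):
    """Entropy of entanglement (in bits) of a two-qubit pure state.

    phi lists the amplitudes of |00>, |01>, |10>, |11>; the first label is Alice's.
    """
    norm = sum(abs(c) ** 2 for c in phi)
    c00, c01, c10, c11 = (c / math.sqrt(norm) for c in phi)
    # Alice's density matrix rho = Tr_B |phi><phi| has unit trace, and its
    # determinant equals |c00*c11 - c01*c10|^2, which fixes both eigenvalues.
    det = abs(c00 * c11 - c01 * c10) ** 2
    disc = math.sqrt(max(0.0, 1.0 - 4.0 * det))
    eigenvalues = [(1 + disc) / 2, (1 - disc) / 2]
    # S(rho) = -Tr(rho log rho); zero eigenvalues contribute nothing.
    return sum(-p * math.log2(p) for p in eigenvalues if p > 1e-15)
```

With these conventions the singlet `[0, 1, -1, 0]` (normalized
inside the function) gives one unit of entanglement, and a product
state such as `[1, 0, 0, 0]` gives zero.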

Mixed  entanglement 

In  the  quantum  teleportation  scenario,  we  imagined, 
unrealistically,  that  Alice  and  Bob  shared  an  EPR  pair 
free  of  noise  or  decoherence.  More  generally,  Alice  and 
Bob have quantum systems that interact directly or
through another mediating quantum system, like
Rydberg atoms in a laser cavity that interact via
photons, or two ions in an ion trap that interact through
phonon modes of the trap [5]. A related example of
interest in quantum computation is an array of
interconnected ion traps, each holding a small number
of ions that are coupled by traveling photons or by ions
that are moved between the traps. The interaction, or
"quantum  link,"  between  a  pair  of  systems  is  subject  to 
noise  or  decoherence  through  photon  loss  or  heating  of 
the  phonons,  for  instance.  For  simplicity,  assume  that 
Alice  and  Bob's  local  operations  on  the  quantum 
systems— operations  on  the  ions  in  a  single  trap,  say— are 
perfect,  and  their  exchange  of  classical  information  is 
also perfectly noise free. That idealization enables one to
measure the strength of the quantum link between the
systems.

An  essential  question  is,  Given  unavoidable  noise  levels, 
is  it  possible  to  establish  a  strong  quantum  link— a  set  of 
pure  EPR  pairs,  in  other  words— between  two  systems? 

If  it  is,  then  the  noise  is  weak  enough  to  permit  the 
error-free  exchange  of  quantum  information  between 
the  systems,  since  the  teleportation  through  the 
generated  EPR  pairs  will  be  error  free.  That  capability 
may  come  at  a  certain  cost,  determined  by  the  amount 
of  noisy  interaction  required  to  generate  an  EPR  pair.  If 
it is not possible to generate EPR pairs, then
decoherence  in  the  system  imposes  a  fundamental 
limitation  on  our  ability  to  perform  quantum 
information  processing. 

The  possibility  of  generating  shared  EPR  entanglement 
in  noisy  environments  is  not  only  of  interest  in 
entanglement theory, but is crucial for the realization of
long-distance quantum communication and possibly
large-scale quantum computation. For example, it was
recently shown that fault-tolerant quantum
computation  can  be  achieved  in  the  presence  of  very 
high  noise  levels  in  the  interaction  link— a  link  can  have 
an  error  rate  of  two-thirds— between  quantum  systems 
that  are  "small"  in  a  particular  sense,  if  one  assumes  that 
local  quantum  processing  on  each  end  is  (almost)  error 
free. 


Pure quantum states have their entanglement quantified fairly
intuitively by considering the degree of local "mixedness" or
entropy. However, mixtures of entangled and unentangled states are
murkier: Recognizing which mixtures are still entangled may be
difficult. So, just what physical systems can we call "entangled"?
An operational description, expressing entanglement in terms of its
negation, is helpful. Suppose that Alice and Bob,
working in their distant labs, each receive the same
random number over the phone. Depending on the
random number, each of them locally prepares a certain
quantum state. The physical state of their whole system,
expressed as a density matrix, typically exhibits
correlations between the two systems. However, those
correlations would be classical, since they arise from
classical random numbers. A quantum state that can be
prepared in this way over the phone is called
"unentangled" or separable, and such a state can be
mathematically expressed as a mixture of unentangled
pure states (see figure 2). Conversely, a state is
"entangled" if it cannot be prepared over the phone, but
requires coherent interaction between the two systems
or the transmission of superpositions of quantum states.

Figure 2

Measures  of  noisy  entanglement 

For  mixed  states,  it  is  harder  to  establish  a  good 
measure  of  entanglement,  since  such  a  measure  has  to 
distinguish  between  entropy  arising  from  classical 
correlations  in  the  state— a  state  of  thermal  equilibrium, 
for  example— and  local  entropy  due  to  purely  quantum 
correlations.  Two  measures  of  entanglement  that  have 
explicit  physical  meaning  in  the  processing  of  quantum 
information  have  emerged  from  the  quantum-link 
notion just described: the entanglement cost $E(\rho)$ of a
quantum state and the distillable entanglement $D(\rho)$ of
a  quantum  state,  first  defined  in  reference  Q. 

Assume that Alice and Bob have created, using their noisy link,
many ($n$) shared copies of an entangled quantum state $\rho$; we
denote such a collection as $\rho^{\otimes n}$. To distill some EPR
pairs from those copies, Alice and Bob perform
several rounds of local, error-free operations on their
parts of the copies and communicate their
measurements (or other classical data) to each other.
Such a protocol is called entanglement distillation;
figure 3 illustrates one round of such a scheme. The aim
is to produce fewer states that are, however, more
entangled than the initial ones. Ideally, the protocol
produces nearly perfect maximally entangled EPR pairs
in the limit of a large number of input states
$\rho^{\otimes n}$ with $n \to \infty$. The distillable entanglement
$D(\rho)$ is then the number of such EPR pairs that can be extracted
per copy of $\rho$ in this asymptotic limit.

Figure 3


The reverse process also has physical meaning. What is
the smallest number $k$ of EPR pairs that Alice and Bob
initially need to create a set of $n$ copies of $\rho$ for
$n \to \infty$ by local error-free operations? This asymptotic ratio
$k/n$ is the second measure of entanglement, the entanglement
cost $E(\rho)$.
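
The two asymptotic rates can be written side by side. In the
sketch below, $m_n$ and $k_n$ are our own labels for the number of
EPR pairs obtained from, and needed to prepare, $n$ copies of
$\rho$; the optimization over protocols and the requirement that
errors vanish as $n$ grows are suppressed for brevity.

$$D(\rho) = \lim_{n \to \infty} \frac{m_n}{n}, \qquad E(\rho) = \lim_{n \to \infty} \frac{k_n}{n}.$$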

Reversible  and  irreversible  manipulation 

Attentive  readers  may  have  noticed  a  quirk  in  our 
notation:  The  formalism  uses  the  same  symbol  E  to 
denote  both  the  entanglement  cost  for  general  states  and 
the  entropy  of  entanglement  for  pure  states.  The 
notation  coincidence  is  harmless  since  the  creation  cost 
of  a  pure  state  equals  the  local  entropy  of  entanglement 
E. Furthermore, for a pure state $\phi$, it turns out that
$E(\phi) = D(\phi)$ (see box 1 on page 50). Physically, this means
that the process of entanglement dilution (converting
EPR pairs into lesser entangled pure states $\phi$) can be
reversed without loss of entanglement. The reverse
process is called entanglement concentration and it
produces $D(\phi)\,n = E(\phi)\,n$ EPR pairs from an initial supply
of $n$ states $\phi$.

For  mixed  states,  D  is  believed  to  be  generically  less 
than  E,  which  implies  that  the  preparation  of  mixed 
states  from  EPR  pairs  is  a  process  involving  an 
irreversible loss of entanglement. Curiously, the $D < E$
conjecture has only been proven for some special classes
of mixed states.

In  1998,  the  Horodecki  family  of  Gdansk,  Poland  (father 
Ryszard  and  sons  Pawel  and  Michal),  identified  a  class 
of  entangled  states  that  exhibit  an  extreme  form  of 
irreversibility. They proved that no entanglement can be
distilled ($D = 0$) from these "bound entangled states."
And for a large set of states from that class,
irreversibility was established by proving that
entanglement is required to prepare the states ($E > 0$).

Consider the metaphor illustrated in figure 4. If EPR pairs were
nodes connected by lines or strands that represent quantum
correlations between particles, then one could think of mixed
entanglement as entanglement in which the strands are simply
mixed up. The mixing may make it hard to reconstruct which
particle of Alice is entangled with which particle of Bob. Cutting a
few strands reduces the clutter, but every line cut
represents an EPR pair lost (compare this process with
the distillation protocol in figure 3). Bound entangled
states are those mixtures that are so thoroughly mixed
up that every single line has to be cut to remove the
noise or clutter from the system. But, when every line is
cut, no entanglement remains to be distilled.

"Black  holes"  of  quantum  information 

Because  the  modern  theory  of  entanglement  treats 
quantum  states  as  physical  resources  for  processing 
information,  one  might  consider  them  hierarchically.  A 
simple  and  ideal  world  would  have  only  two  classes  of 
quantum  states:  unentangled,  classically  correlated 
states  that  are  useless  as  a  resource  in  quantum 
teleportation  and  don't  violate  any  Bell  inequalities,  and 
entangled  states  whose  distillation  rate  D  measures  their 
usefulness  in  quantum  teleportation.  If  the  distillation 
rate  D  is  nonzero,  one  can  distill  from  such  states  some 
EPR  pairs,  known  to  violate  Bell  inequalities. 

Bound  entanglement  tells  us  that  life  is  not  so  simple. 
Bound entangled states are costly ($E > 0$), but useless in
various  quantum-information-processing  protocols  like 
teleportation.  Furthermore,  there  is  evidence  that  bound 
entangled  states  do  not  violate  any  Bell  inequalities. 

In  those  two  senses,  bound  entangled  states  are  the 
"black  holes"  of  quantum  information  theory. 
Entanglement  goes  in  but  is  impossible  to  recover.  And 
like  black  holes  in  the  theory  of  gravitation,  bound 
entangled  states  test  the  limits  of  our  understanding  and 
puzzle  us  by  their  intrinsic  irreversibility. 

Bound  entanglement  and  partial  transposition 


In  what  sense  are  bound  states  so  thoroughly  mixed  up 
that  no  entanglement  at  all  can  be  extracted?  Bound 
entangled  states  behave  intrinsically  differently  from 
every  other  entangled  state:  They  remain  physical  under 
the  unphysical  operation  of  partial  transposition. 

Researchers realized that they could characterize
entanglement in terms of how states behave under
certain unphysical operations. In 1996, Asher Peres at
the Technion-Israel Institute of Technology in Haifa,
Israel, noted that matrix transposition is just such an
unphysical operation when applied to entangled states.
Taking the transpose of a system's density matrix
produces another density matrix, a physically valid
result. And taking the transpose of, say, Bob's part of an
unentangled state $\psi_A \otimes \psi_B$ yields another physically
valid quantum state, since each part of the quantum state can
transform separately; $\psi_A$ is not changed, and the density
matrix of $\psi_B$ is transposed. But when applied to part of a
pure entangled state, matrix transposition produces an
unphysical result. (For details, see box 2 on page 51.)

Peres  conjectured  that  partial  transposition  was  the 
defining  criterion  for  entanglement.  In  other  words,  all 
entangled  states— pure  or  mixed— should  map  onto 
unphysical  states  by  partial  matrix  transposition,  and  all 
unentangled  states  will  remain  physical  under  the  same 
operation. 
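
The criterion can be tried out on two spin-1/2 particles. The
sketch below is illustrative only and uses names of our own
choosing (`partial_transpose_B`, `expectation`, `outer`). It
transposes Bob's indices of a 4x4 density matrix and then exhibits
a vector with negative expectation value for the partially
transposed singlet, which shows that the result is not a valid
density matrix. A classically correlated mixture is left unchanged
by the same operation.

```python
import math

R = 1 / math.sqrt(2)

def outer(v):
    """Projector |v><v| for a real or complex amplitude list v."""
    return [[v[r] * v[c].conjugate() for c in range(4)] for r in range(4)]

def partial_transpose_B(rho):
    """Transpose Bob's indices of a two-qubit density matrix.

    Rows and columns are ordered |00>, |01>, |10>, |11> (Alice's label first).
    """
    pt = [[0.0] * 4 for _ in range(4)]
    for a in range(2):
        for b in range(2):
            for a2 in range(2):
                for b2 in range(2):
                    pt[2 * a + b2][2 * a2 + b] = rho[2 * a + b][2 * a2 + b2]
    return pt

def expectation(m, v):
    """Real part of <v|m|v>."""
    total = sum(v[r].conjugate() * m[r][c] * v[c]
                for r in range(4) for c in range(4))
    return total.real

SINGLET = [0.0, R, -R, 0.0]
PHI_PLUS = [R, 0.0, 0.0, R]
RHO_SINGLET = outer(SINGLET)
# Equal mixture of |00><00| and |11><11|: correlated, but separable.
SEPARABLE = [[0.5 if r == c and r in (0, 3) else 0.0 for c in range(4)]
             for r in range(4)]
```

Here `expectation(partial_transpose_B(RHO_SINGLET), PHI_PLUS)`
comes out negative (about -0.5), whereas a physical state can never
give a negative expectation value, and
`partial_transpose_B(SEPARABLE)` equals `SEPARABLE`.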

Remarkably,  the  truth  of  that  conjecture  depends  on  the 
dimension  of  the  underlying  Hilbert  spaces  or  phase 
spaces. If one considers the state of two spin-1/2
particles, the polarization degrees of freedom of two
laser beams, or two modes of a light field having a
Gaussian Wigner function, then, indeed, all entangled
states map onto unphysical states by partial
transposition. However, for two spin-one (or
higher-dimensional system) particles or a Gaussian light field
with  at  least  two  modes  for  both  Alice  and  Bob,  that  is 
no  longer  true  in  general;  there  exist  entangled  mixed 
states  that  pass  the  "partial  transpose"  test  and  have 
therefore  lost  an  essential  property  of  entanglement. 

The  loss  of  that  property  is  precisely  what  the  Horodecki 
family  showed  would  lead  to  a  zero  distillation  rate  D. 
Entangled  states  that  pass  the  partial  transpose  test  are 
the  bound  entangled  states  in  which  the  entanglement  is 
forever  locked  or  "bound"  inside. 


Entanglement  witnesses 


Given  that  entanglement  can  be  such  a  subtle  property 
of  quantum  states,  just  how  can  one  distinguish  between 
entangled  and  unentangled  states?  A  violation  of  a  Bell 
inequality  has  been  the  traditional  telltale  sign  of 
entanglement in a quantum system. Examples of such
experiments used pairs of entangled photons created
from  nonlinear  optical  processes,  especially  parametric 
down-conversion;  the  polarization  degrees  of  freedom  of 
the  emitted  photons  carried  entanglement.  Alice  and 
Bob  checked  for  a  Bell  inequality  violation  by  using  local 
analyzers  to  measure  the  polarization  of  the  photons 
along  various  angles. 

Unfortunately,  many  quantum  states,  including  the  set 
of  bound  entangled  states,  are  not  known  to  violate  any 
Bell  inequality.  And  considering  the  existing  limitations 
on  experimental  control  of  quantum  systems, 
experimentalists  prefer  to  check  for  entanglement  using 
the  fewest  possible  local  measurements.  The  theoretical 
framework of an entanglement witness, of which a Bell
inequality is a particular example, addresses those two
issues. The defining property of an entanglement
witness $W$ is that its expectation value with respect to
any unentangled state $\rho$ is always nonnegative,
$\mathrm{Tr}(W\rho) \ge 0$. At the same time, there exist entangled
states $\sigma$ for which $\mathrm{Tr}(W\sigma) < 0$. Measuring $W$
on a quantum state $\sigma$ and finding a negative expectation value
thus establishes the entanglement of $\sigma$. The good news is that
there is an entanglement witness for every entangled state; given an
experimental means, any entanglement, bound or
otherwise, can be detected. The bad news is that
entanglement witnesses are nonlocal observables.
Nevertheless, one can measure the expectation value of
$W$ by measuring the expectation value of a number of
local observables $W_i$, such that $W = \sum_i W_i$. Research is
under way to determine the minimal number of local
measurements for a given witness.
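
A concrete two-qubit instance may help. The sketch below is our own
example, not one given in the article: it takes the standard witness
$W = I/2 - |\Psi^-\rangle\langle\Psi^-|$ for the singlet, checks that
it equals the sum of four local (product) observables,
$W = \frac{1}{4}(I \otimes I + X \otimes X + Y \otimes Y + Z \otimes Z)$,
and evaluates its expectation value on the singlet and on a few
product states. All names (`kron`, `W_LOCAL`, `W_PROJ`,
`expectation`, `product_state`) are hypothetical helpers.

```python
import math

R = 1 / math.sqrt(2)

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Y = [[0, -1j], [1j, 0]]
Z = [[1, 0], [0, -1]]

def kron(a, b):
    """Tensor product of two 2x2 matrices; rows and columns ordered 00, 01, 10, 11."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]

# The witness as a sum of local observables, each measurable by Alice and Bob
# separately with their outcomes multiplied afterwards.
W_LOCAL = [[sum(kron(p, p)[r][c] for p in (I2, X, Y, Z)) / 4
            for c in range(4)] for r in range(4)]

# The same witness written as I/2 minus the projector onto the singlet.
SINGLET = [0.0, R, -R, 0.0]
W_PROJ = [[(0.5 if r == c else 0.0) - SINGLET[r] * SINGLET[c]
           for c in range(4)] for r in range(4)]

def expectation(w, psi):
    """Real part of <psi|w|psi> for a pure state psi."""
    total = sum(psi[r].conjugate() * w[r][c] * psi[c]
                for r in range(4) for c in range(4))
    return total.real

def product_state(a, b):
    """Amplitudes of the unentangled state |a> tensor |b>."""
    return [x * y for x in a for y in b]
```

The expectation value on the singlet is -1/2, signalling
entanglement, while for any product state it equals half the squared
overlap of the two local states and is therefore never negative.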

Bell's  communication  advantages 

Given  the  framework  of  entanglement  witnesses,  what  is 
special  about  Bell  inequalities?  Although  they  can  be 
considered  a  type  of  entanglement  witness,  Bell 
inequalities  do  not,  strictly  speaking,  test  for 
entanglement  but  for  a  departure  from  local  hidden 
variable  theories.  Interpreted  as  such,  Bell  inequalities 


have  taken  on  a  whole  new  life  in  quantum- 
communication  science.  Researchers  consider  remote 
parties  who  have  to  carry  out  a  certain  task  with 
minimal  communication  between  them.  One  compares 
the  amount  of  communication  necessary  if  those  parties 
are  given  shared  random  bits  (that  can  be  viewed  as 
local  hidden  variables)  or  an  entangled  quantum  state. 
Sharing entangled states leads to savings in
communication precisely because the correlations in
quantum states cannot always be adequately described
by local hidden variable theories [15] (see the article by
Andrew M. Steane and Wim van Dam in Physics Today,
February 2000, page 35).

What  lies  beyond 

The  efforts  of  the  quantum  information  theorists  over 
the  past  eight  years  would  come  to  little  if  the  theory 
were  not  supplemented  by  an  ability  to  create  and 
manipulate  entanglement  in  the  lab.  There  is  a  rapidly 
growing  list  of  physical  systems— optical  and  atomic 
systems  especially— in  which  it  is  possible  to  prepare 
various kinds of entangled states. As discussed
previously, the use of photonic degrees of freedom, such
as polarization or momentum, has been a long-time
favorite way to create entanglement [3]. Entangled states
consisting of the quadrature observables of different
modes of light have been prepared in optical parametric
oscillators and optical fibers. Entanglement in the
states of motion of the valence electrons [5] of trapped ions
or of Rydberg atoms in cavity quantum electrodynamics
has involved up to four different atoms. Another
promising avenue is the recently observed entanglement
of large ensembles of atoms [12].

This  short  review  showcases  just  a  few  striking  facets  of 
the  modern  theory  of  entanglement.  Most  notably, 
entanglement  shared  between  more  than  two 
subsystems  is  outside  our  scope  here.  The  broader  study 
of  entanglement  between  many  subsystems  may  lead  the 
field  to  better  understand  the  role  of  large-scale 
entanglement  in  quantum  computation  or  quantum 
many-body  systems. 

We  have  focused  on  the  role  of  entanglement  in  the 
transmission  of  quantum  information.  Entanglement 
also  proves  useful,  however,  when  the  goal  is  to  transmit 
classical  information  as  efficiently  as  possible. 


Researchers  are  studying  many  measures  of  mixed 
entanglement  beyond  the  two  most  prominent  measures 
discussed  in  this  review.  As  for  bound  entanglement, 
there  is  some  evidence  that  it  may  have  a  role  to  play  as 
"helper"  entanglement,  useless  by  itself,  but  useful  when 
combined  with  other  sources  of  entanglement.  For 
entanglement-theory  overview  articles  that  highlight  the 
field, see volume 1 of Quantum Information and
Computation  (July  2001). 
We describe the fabrication of an atom mirror by etching of a common hard drive, and we report the
observation of specular retroreflection of 11 μK cesium atoms using this mirror. The atoms were
trapped  and  cooled  above  the  hard  drive  using  the  mirror  magneto-optical  trap  technique,  and  upon 
release,  two  full  bounces  were  detected.  The  hard  drive  atom  mirror  will  be  a  useful  tool  for  both 
atom  optics  and  quantum  computation.  ©  2003  American  Institute  of  Physics. 
Laser cooling and trapping techniques have made possible
the preparation of extremely cold samples of atoms.
Atom optics employs elements such as mirrors, lenses,
gratings, and beam splitters to manipulate these cold atoms in a
fashion similar to the familiar photon optics. The advent of
the Bose-Einstein condensation of neutral atoms has enhanced
the importance of developing atom optical elements.
In particular, atom mirrors (surfaces that reflect atoms)
play a crucial role in the field of atom optics, and it is of keen
interest to develop mirrors that are simple to fabricate yet
highly specular. In this letter we demonstrate a straightforward
technique to produce large area, high resolution
permanent-magnetic structures on flat, rigid, and inexpensive
substrates.

Several  types  of  atom  mirrors  have  been  fabricated  using 
evanescent  light  fields,1  dynamic  magnetic  fields,2  and  static 
magnetic  fields.3  Evanescent  mirrors  repulse  atoms  from  a 
prism  surface  using  a  potential  created  by  a  blue-detuned 
light  field.  Although  magnetic  mirrors  cannot  generally  be 
modulated  as  easily  as  evanescent  mirrors,  they  do  offer 
many  advantages:  passive  operation,  compactness  (no  laser 
access  is  needed),  and  much  larger  repulsive  areas. 

Magnetic mirrors employ a sheet of alternating current or magnetization to create
an exponentially increasing potential near the mirror surface.4 To lowest order,
this potential is proportional to $B_0 e^{-ky}$. The surface field, $B_0$, sets the
maximum atom energy that can be reflected, and the spatial period of the current or
magnetization, $a = 2\pi/k$, determines the amount of time the atoms interact with
the mirror. The magnetic mirror approximates a perfectly flat mirror as $B_0$
increases and $a$ decreases. For example, if $B_0 = 1$ kG and $a = 1$ μm, a cesium
atom in the $6^2S_{1/2}$, $F = 4$, $m_F = 4$ state will be reflected when dropped
from a height of 0.4 m, and will only interact with the mirror for 5 μs if dropped
from 2 cm.
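The 0.4 m figure can be checked with a short energy-balance sketch. It assumes (this is not stated explicitly in the text) that the stretched $F = 4$, $m_F = 4$ state has a magnetic moment of one Bohr magneton and that the atom is reflected when its gravitational energy $mgh$ does not exceed $\mu_B B_0$. The function name `max_drop_height` is illustrative only.

```python
# Energy-balance estimate of the largest drop height a magnetic mirror can reflect.
# Assumption: the F=4, m_F=4 cesium state has magnetic moment ~ one Bohr magneton.

BOHR_MAGNETON = 9.2740e-24      # J/T
ATOMIC_MASS_UNIT = 1.66054e-27  # kg
CESIUM_MASS = 132.905 * ATOMIC_MASS_UNIT  # kg
GRAVITY = 9.80665               # m/s^2


def max_drop_height(surface_field_tesla):
    """Height h (in m) at which m*g*h equals mu_B * B0."""
    return BOHR_MAGNETON * surface_field_tesla / (CESIUM_MASS * GRAVITY)


if __name__ == "__main__":
    # B0 = 1 kG = 0.1 T
    print(f"max drop height: {max_drop_height(0.1):.2f} m")
```

Under these assumptions the estimate comes out a little above 0.4 m, in line with the value quoted in the text.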

Mirrors made from serpentine patterns of wires can produce time-dependent
reflection potentials. However, they have not been fabricated with periods smaller
than 10 μm, and the power dissipated by the small wires requires cooling by liquid
nitrogen and pulsed operation.5 Sinusoidal magnetization of audio-tape, floppy
disks, and videotape can produce magnetic mirrors with magnetization periods down
to 12 μm.3 Mirrors made from millimeter-sized arrays of permanent magnets have been
demonstrated, as have mirrors produced by 1-4 μm periodic structures fabricated by
sputtering ferromagnetic material onto a grooved substrate patterned by
electron-beam lithography.6

We recently fabricated a magnetic mirror by etching a common hard drive, and we
have used this mirror to retroreflect a cold cloud of $10^6$ cesium atoms. Hard
drives offer several advantages for making and using atom mirrors. The common hard
drive provides a large surface area of thin magnetic film whose surface is
specifically designed to be very flat, smooth, and rigid. Furthermore, the film's
remanent magnetic field and coercivity can be as large as 7 and 3 kG,
respectively.7 An atom mirror could in principle be fabricated with a 2 μm
periodicity over the entire surface of the hard drive. Old or discarded hard drives
may be used: an Apple hard drive from the mid-1990s was used for the experiment
presented here.

We fabricate the mirror by etching 2 μm wide, ~100 nm deep trenches into a
1 cm$^2$ section of the surface of the hard drive. These 100 nm trenches extend
past the magnetic layer to form a periodic array of 1 μm wide, 30 nm thick, and
1 cm long stripes of cobalt alloy (see Fig. 1). The cobalt alloy is granular, which
enhances the coercivity and allows us to magnetize the material in plane and
parallel to the short axis of the magnetic strips. The typical grain size is
20-50 nm,7 and we expect the magnetization to be uniform for our much larger
features. We do not know the exact materials and thicknesses of the layers of the
proprietary hard drive. However, etching ~100 nm is sufficient to remove the
magnetic layer.

Standard photolithography is used to create the etch mask. After cutting the hard
drive into 2-3 cm$^2$ sections, positive photoresist (TSMR-8900, Tokyo Ohka Kogyo
Co.) is spun onto the cleaned hard drive surface for 40 s at 4200 rpm. A 15 s
exposure followed by 65 s in the developer (NMD-W 2.38%) maps the photomask lines
into the resist. The sample is ion milled with argon in an inductively coupled
plasma (ICP) system. We etch for 8 minutes at a forward power of 100 W, ICP power
of 400 W, and an argon flow of 40 sccm. The remaining photoresist is removed with
acetone and, if necessary, a soft swab. To erase the hard drive's bits and
magnetize it as a mirror, we insert the hard drive section into the field of an
8 kG electromagnet whose field is parallel to the surface and perpendicular to the
magnetic stripes.

FIG. 1. A cross section of the etched hard drive. The magnetization is in-plane.
See Ref. 7 for a description of the hard drive layers. [Layer labels: sublayer
(Ni-P), 10 μm; substrate (Al-Mg), 0.6 mm.]

FIG. 2. 20 μm wide (a) AFM scan, (b) MFM scan, and (c) MFM cross section of the
etched hard drive surface.

The magnetic field from the etched hard drive, with in-plane magnetization, $M_0$,
parallel to the short axis of the magnetic stripes, is analogous to a periodic
sheet of alternating in-plane magnetization $+M_0/2$ and $-M_0/2$. In the infinite
array limit, the magnetic field above the surface is

$$B^2 = B_1^2 e^{-2ky} + 2 B_1 B_3 \cos(2kx)\, e^{-4ky} + B_3^2 e^{-6ky} + \cdots, \qquad (1)$$

where $B_1 = \mu_0 M_0 (1 - e^{-kb})/\pi$, $B_3 = \mu_0 M_0 (1 - e^{-3kb})/3\pi$,
and $b = 30$ nm is the thickness of the magnetic layer. The field has no components
in the z direction, and rotates with a period equal to $a$ in the x-y plane. Cesium
atoms in the $F = 4$, $m_F = 4$ state, which has the largest weak-field seeking
magnetic moment, would have to be dropped from a height of 25 cm to penetrate to a
height at which the second term in the expansion equals the first, so to a good
approximation the field may be written as

$$B \approx B_1 e^{-ky} + B_3 e^{-3ky} \cos(2kx). \qquad (2)$$

For our hard drive mirror, $B_1$ is equal to 2-4 kG depending on the specific
cobalt alloy. When $a = 1$ μm, the ratio of the first harmonic term to the purely
exponential term for a cesium atom dropped from a height of 2 mm (20 mm) is
$1 \times 10^{-6}$ ($1 \times 10^{-3}$) at the turning point $y = 0.8$ μm
($y = 0.4$ μm).

The etched hard drive used for the experiment has $a \approx 3$ μm and
$c \approx 1$ μm, resulting in a ratio of magnetic layer to gap that is
approximately 1:2. Figures 2(a) and 2(b) show 20 μm wide atomic force microscopy
(AFM) and magnetic force microscopy (MFM) scans of the hard drive surface. The
trenches in the AFM scan are dark, and the light to dark variation of magnetic
strips shows the north and south poles of the magnetization. Figure 2(c) shows a
20 μm cross section of the MFM scan: peaks represent the north and south poles. To
describe the field above our etched hard drive, Eq. (2) can be modified to account
for the deviation from a 1:1 width ratio by multiplying $B_1$ by $\sin(\pi c/a)$
and $B_3$ by $\sin(3\pi c/a)$. In our device, the ratio $c/a \approx 1/3$ scales
the $B_1$ term by a factor of about 0.9, but causes the corrugation term to nearly
vanish.
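The effect of the width ratio is easy to tabulate. The sketch below simply evaluates the two correction factors $\sin(\pi c/a)$ and $\sin(3\pi c/a)$ named in the text; the function name `width_ratio_factors` is an illustrative choice, not something from the original work.

```python
import math


def width_ratio_factors(c_over_a):
    """Return the multipliers applied to B_1 and B_3 for a stripe-width ratio c/a."""
    b1_factor = math.sin(math.pi * c_over_a)
    b3_factor = math.sin(3.0 * math.pi * c_over_a)
    return b1_factor, b3_factor


if __name__ == "__main__":
    for ratio in (1.0 / 2.0, 1.0 / 3.0):
        f1, f3 = width_ratio_factors(ratio)
        print(f"c/a = {ratio:.3f}: B1 factor = {f1:.3f}, B3 factor = {f3:.3f}")
```

For $c/a = 1/3$ the first factor is $\sin(\pi/3) \approx 0.87$ and the second is $\sin(\pi) = 0$ up to rounding, which is why the corrugation term drops out.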

The atom mirror is placed facing upwards in a vacuum chamber pumped to
$5 \times 10^{-9}$ Torr. In contrast to the experiments that use a standard
magneto-optical trap (MOT) to trap and cool the atoms ~2 cm above the mirror, we
use the mirror MOT technique to collect the atoms 1.5-4 mm above the surface.8 A
MOT requires the zero of a magnetic quadrupole field to be centered at the
intersection of six circularly polarized laser beams coming from all cardinal
directions. To satisfy this configuration near the hard drive surface, two 1 cm
diameter beams of opposite circular polarization reflect at 45° from the 1 cm$^2$
etched region. A retroreflected beam is positioned perpendicular to the 45° beams
and grazes the surface of the hard drive. Aligning the axis of the quadrupole field
with one of the 45° beams completes the mirror MOT configuration. The trapping
lasers, each with an intensity of 4 mW/cm$^2$ and 1 cm wide, are detuned by 10 MHz
from cesium's $F = 4$, $F' = 5$ cycling transition. A repumping beam tuned to the
$F = 3$, $F' = 4$ transition is superimposed onto both the grazing beam and a 45°
beam. The atoms are loaded from a thermal vapor.

In previous experiments using a perfectly reflecting gold mirror, we have been able
to trap $2 \times 10^6$ cesium atoms in a mirror MOT and cool them to 3 μK. One
might expect trapping and cooling to be much less effective with the etched hard
drive due to its poor qualities as an optical mirror: the reflectivity is only
~50%, it is a good optical grating, and the magneto-optical Kerr effect degrades
the circularity of the reflected 45° beams. Nevertheless, we have been able to
collect $1 \times 10^6$ atoms and sub-Doppler cool them to 11 μK. Achieving this
low temperature is crucial because the atoms released directly from the mirror MOT,
at a temperature of ~120 μK, expand too quickly and become too diffuse to detect by
the time they reach the hard drive surface.

The poor optical reflectivity of the mirror does slightly complicate the
sub-Doppler cooling procedure; however, with careful zeroing of the magnetic field
it is still possible to achieve polarization-gradient cooling to 11 μK in a
(downwards) moving reference frame. The atoms are optically pumped into the
$F = 4$, $m_F = 4$ Zeeman substate just before being dropped, and we apply a 100 mG
bias field parallel to the magnetic stripes in order to maintain alignment of the
atomic spins while they are falling and bouncing.

We have been able to detect two full bounces of the atoms from the hard drive atom
mirror. Figures 3(a) and 3(b) show data from five runs of the experiment. The top
panel shows the mean position of the atoms above the hard drive surface as a
function of time. Superimposed is a curve depicting the expected trajectory of a
particle falling under gravity and bouncing from a hard wall. The slope of a line
fit to the lateral expansion of the falling atom cloud provides a measure of the
atoms' rms velocity. A nonspecular mirror would heat and diffusely scatter the
reflected atoms as they bounce, resulting in a sharp increase of the cloud
expansion rate. We made a linear fit to prereflection ($t < 15$ ms) data in each of
the data sets, and deviation from this line, postreflection, would be evidence of
nonspecularity. The dashed segment demarcates the region of unfitted data, and we
do not see any increase or offset of the residuals in this postreflection region:
to within the experimental resolution, we do not detect any deviation from specular
reflection.

FIG. 3. Panel (a) shows the mean height of the atoms above the hard drive surface
during the first bounce. Panel (b) shows the residuals from a linear fit to the
width of the expanding atom cloud.
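The hard-wall reference trajectory mentioned above can be sketched in a few lines. This is only an idealized model: it assumes the particle is released at rest from height `h0` and that each bounce is perfectly elastic, whereas the experiment launches the atoms from a downward-moving frame. The function name `bounce_height` and the 2 mm release height are illustrative choices.

```python
import math

GRAVITY = 9.80665  # m/s^2


def bounce_height(t, h0, g=GRAVITY):
    """Height at time t of a particle dropped from rest at h0 onto an elastic hard wall."""
    t_fall = math.sqrt(2.0 * h0 / g)   # time from apex to wall
    period = 2.0 * t_fall              # apex-to-apex time
    tau = t % period                   # time since the most recent apex
    # Time separation from the nearest apex (falling half or rising half of the cycle).
    dt = tau if tau <= t_fall else period - tau
    return h0 - 0.5 * g * dt * dt


if __name__ == "__main__":
    h0 = 2e-3  # illustrative release height in metres
    for t_ms in (0, 5, 10, 15, 20, 25, 30, 35, 40):
        print(f"t = {t_ms:2d} ms  height = {bounce_height(t_ms * 1e-3, h0) * 1e3:.3f} mm")
```

Comparing measured mean heights against such a curve only tests the reflection timing; the specularity test in the text relies on the cloud width residuals instead.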

We have realized a specular atom mirror built by etching a common hard drive.
Magnetization periodicity of 3 μm has been achieved, and we believe it would be
straightforward to reduce this to 2 μm with photolithography and to ~1 μm using a
large area electron-beam writer. The hard drive atom mirror is compact, passive,
relatively simple to fabricate, and possesses a large remanent magnetic field.
Moreover, it has several desirable properties for applications beyond simple
reflection of atoms. The hard drive's large coercivity should allow one to use
wires fabricated directly on its surface to augment the mirror's ability to
manipulate atoms. Likewise, electric pads could be printed on the surface. These
pads would allow state-independent forces to act in concert with the
state-dependent forces from the mirror's magnetic field to perform quantum logic
gates necessary for quantum computation.9 The mirror can trap cold atom gases in
two dimensions, and can act as an adjustable grating when used in conjunction with
a magnetic bias field.10,11 Large area mirrors can be fabricated, and it seems
possible that these mirrors could be useful for guiding or confining cold
neutrons.12 As hard drive platters are expected to have good surface flatness and
substrate rigidity, it may be possible to create two-dimensional waveguides and
other devices by holding an opposing pair of atom mirrors a few microns apart.
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Magnetometry  is  fundamentally  a  parameter  estima¬ 
tion  process  because,  like  all  fields,  magnetism  cannot  be 
directly  observed.  Rather,  the  strength  of  a  magnetic  field 
must  be  inferred  from  its  influence  on  a  probe  such  as  an 
atomic spin ensemble [1]. In a canonical atomic magnetometer, such an ensemble
would be prepared into a coherent spin state with its bulk magnetization polarized
along the x axis, $\langle \mathbf{J}(0) \rangle = (J, 0, 0)$ (such as by optical
pumping). Then, a magnetic field along the y axis with magnitude $B$ would induce
the atomic Bloch vector, $\langle \mathbf{J}(t) \rangle$, to precess in the x-z
plane with frequency $\omega_L = \gamma B$. Thus, the magnetic field could be
estimated from the free induction decay of the atomic magnetization by monitoring
the z component of the Bloch vector,
$\langle J_z(t) \rangle = J \exp(-t/T_2) \sin(\omega_L t)$, where $T_2$ is the
transverse spin coherence time.

In practice, current atomic magnetometers operate by continuously pumping the
atomic sample while a $\langle J_z \rangle$-dependent optical property of the
ensemble is monitored [2-4]. Because of pumping, the atoms are constantly
repolarized as they Larmor precess. For small fields (the relevant case when
discussing detection limits), the ensemble rapidly achieves an equilibrium that is
nearly polarized along the x axis, but with a steady-state offset,
$\langle J_z \rangle_{ss} \propto \gamma B J$. The uncertainty in measuring $J_z$
is due to projection noise [5],
$\langle \Delta J_z^2 \rangle = \langle J_z^2 \rangle - \langle J_z \rangle^2$,
which has a value of $J/2$ for a coherent spin state. Averaging a sequence of
independent measurements of $\langle J_z \rangle_{ss}$ with this variance leads to
the conventional shot-noise detection limit for a total measurement time of
$t_{\mathrm{tot}}$ [2,3],

$$\delta B \simeq \frac{1}{\gamma \sqrt{J T_2 t_{\mathrm{tot}}}}. \qquad (1)$$


Since $\langle \Delta J_z^2 \rangle$ sets an intrinsic limit on the field
sensitivity, reducing the projection noise below its standard quantum limit would
improve the precision. This naturally leads one to consider spin-squeezed states
[6] where uncertainty in $\langle J_z \rangle$ is reduced by redistributing it into
the orthogonal spin component so that $\langle \Delta J_y^2 \rangle > J/2$. Since
$\langle J_y \rangle$ does not directly affect the field estimation, spin squeezing
should enable one to surpass the conventional shot-noise magnetometry limit.

An improved magnetometry protocol would ideally be implemented by utilizing the
conditional spin squeezing that is automatically generated by continuous
observation of an atomic sample [7-9]. This dynamically generated squeezing does
not occur in steady-state (narrow-band) magnetometers because of the continuous
optical pumping. However, it should be possible to enable sub-shot-noise
magnetometry by turning off the optical pumping once a coherent spin state has been
prepared, followed by continuous observation of the atoms.

But the nature of conditional spin squeezing gives rise to potential complications
that make it initially unclear how to exploit the reduced uncertainty for improved
magnetometry. Figure 1 shows simulated data (generated according to a quantum
trajectory model described below [10]) of a spin ensemble under continuous
measurement with no external field, $B = 0$. As $\langle \Delta J_z^2 \rangle$
decreases [shaded region in Fig. 1(b)] with the onset of spin squeezing, there is
no apparent change in the noise of the associated $\langle J_z(t) \rangle$
measurement, $y(t)$ [Fig. 1(a)], which is due to constant optical shot noise.

FIG. 1 (color online). (a) Simulated single-shot atomic magnetometry photocurrent
low pass filtered at $F_c = 2\pi\sqrt{J}/t_{\mathrm{tot}}$. (b) Corresponding
diffusion of the atomic Bloch vector as conditional squeezing is produced by
continuous quantum nondemolition observation.

The dynamical generation of spin squeezing starting from an initial coherent state
involves a stochastic transient at early times. As suggested by the error-ellipse
diagrams of Fig. 1(b), conditional evolution gradually localizes the quantum spin
state around a constant, but random, value of $\langle J_z \rangle_c$. In an
ensemble of continuous measurement trajectories, this constant value would be
distributed with a variance of $J/2$ corresponding to
$\langle \Delta J_z^2 \rangle$ of the initial coherent state. Therefore, the mean
value of $\langle J_z \rangle_c$ assumes a nonzero value even in the absence of an
applied magnetic field, producing a stochastic offset in the photocurrent that must
be distinguished from Larmor precession in a magnetometry experiment.

Fortunately, with appropriate filtering, Larmor precession of the spin state can be
distinguished from the projection noise in such a way that the field estimation
benefits from spin squeezing. In this Letter, we demonstrate that quantum
trajectory theory [11,12] allows one to construct a Kalman filter [13-16] that
optimally estimates the field magnitude from continuously observed conditional
atomic dynamics. This filtering procedure enables Heisenberg limited magnetometry
despite the optical shot noise and the transient effects of spin state estimation.
Furthermore, we show that for time-invariant fields, our optimal strategy
approximately reduces to the simple and intuitive data analysis procedure of linear
regression, which is a potentially simpler experimental approach to sub-shot-noise
magnetometry.

We propose a magnetometer in which the atomic ensemble undergoes a continuous
quantum nondemolition (QND) observation of $J_z$. It has been shown that such a
measurement can be implemented by detecting $J_z$-dependent changes in the phase of
an off-resonant cavity mode coupled to the atomic ensemble [9] or by the Faraday
rotation of a far-detuned traveling mode [17,18] that passes through the ensemble.
In both cases the magnetometer photocurrent is given by

$$y(t)\,dt = 2\eta\sqrt{M}\,\langle J_z(t) \rangle_c\,dt + \sqrt{\eta}\,dW(t), \qquad (2)$$

where $\langle J_z(t) \rangle_c$ is the conditional expectation value of $J_z$,
$\eta$ is the detector efficiency, and $M$ (in units of frequency) is an
implementation-dependent constant referred to as the measurement strength. The
optical shot noise is reflected by stochastic increments, $dW(t)$, that obey
Gaussian white-noise statistics, $E[dW] = 0$ and $dW^2 = dt$.

Conditional evolution of the atomic ensemble subjected to a magnetic field along
the y axis and a QND measurement of $J_z$ is described by the stochastic master
equation,

$$d\rho_c(t) = -i\gamma B[J_y, \rho_c]\,dt + M\mathcal{D}[J_z]\rho_c\,dt + \sqrt{M\eta}\,\mathcal{H}[J_z]\rho_c\,dW(t), \qquad (3)$$

where $\rho_c(t)$ is the reduced atomic density operator conditioned on the
measurement record [12]. The superoperators, $\mathcal{D}$ and $\mathcal{H}$, are
given by $\mathcal{D}[r]\rho = r\rho r^\dagger - (r^\dagger r\rho + \rho r^\dagger r)/2$
and $\mathcal{H}[r]\rho = r\rho + \rho r^\dagger - \mathrm{tr}[(r + r^\dagger)\rho]\rho$,
and the initial condition is an optically pumped coherent spin state along the x
axis, $\rho(0) = |J\rangle_x\langle J|$.

Each term in Eq. (3) has a physical implication for magnetometry. First, the
Hamiltonian, $H(B) = \gamma B J_y$, generates the desired Larmor precession signal
used to detect the magnetic field. The second term reflects measurement-induced
atomic decoherence that results from coupling the ensemble to the optical shot
noise on the probe laser. As a result the length of the Bloch vector decays over
time, $J(t) = J\exp(-Mt/2)$, and $M$ can be related to a bound on the transverse
spin relaxation, $T_2 \le 2M^{-1}$. The significance of the third term in Eq. (3)
is best seen by employing an approximation that holds for a large net magnetization
and small field ($\omega_L t \ll 1$). For an ensemble polarized along the x axis,
quantum fluctuations in $\langle J_x \rangle$ are at least second order and the
operator, $J_x$, is well approximated by the length of the Bloch vector,
$J_x \to J$. Physically, this assumption capitalizes on the large value of $J$ to
treat the Bloch sphere as a locally flat phase space. This approximation is
extremely good for both coherent and squeezed states with $J \gg 1$.

In a Gaussian approximation, the first and second moments of $J_z$ are sufficient
to completely characterize the atomic state. Therefore, equations of motion for the
mean and variance,

$$d\langle J_z \rangle_c = \gamma B J e^{-Mt/2}\,dt + 2\sqrt{M\eta}\,\langle \Delta J_z^2 \rangle\,dW(t), \qquad (4)$$

$$d\langle \Delta J_z^2 \rangle = -4M\eta\,\langle \Delta J_z^2 \rangle^2\,dt, \qquad (5)$$

provide a closed representation of the magnetometer's conditional quantum dynamics
(in the $\omega_L t \ll 1$ and $t \le M^{-1}$ limits). The physical significance of
Eq. (4) is that the atomic Bloch vector experiences two types of motion:
deterministic Larmor precession and stochastic diffusion. Equation (5) reflects the
deterministic reduction of $\langle \Delta J_z^2 \rangle$ as the atomic state is
localized by the observation process, i.e., conditional spin squeezing.
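Equation (5) can be integrated in closed form: starting from the coherent-state value $J/2$, one finds $\langle \Delta J_z^2 \rangle(t) = (J/2)/(1 + 2\eta J M t)$, so the variance begins to drop on the time scale $(JM)^{-1}$. The sketch below compares this closed form with a simple Euler integration of Eq. (5). The function names and the parameter values ($J = 100$, $M = 1$ in arbitrary units) are illustrative assumptions, not values from the text.

```python
def variance_closed_form(t, J, M, eta=1.0):
    """Solution of d<dJz^2> = -4*M*eta*<dJz^2>^2 dt with initial value J/2."""
    return (J / 2.0) / (1.0 + 2.0 * eta * J * M * t)


def variance_euler(t_end, J, M, eta=1.0, steps=100_000):
    """Forward-Euler integration of Eq. (5) from the coherent-state value J/2."""
    variance = J / 2.0
    dt = t_end / steps
    for _ in range(steps):
        variance -= 4.0 * M * eta * variance * variance * dt
    return variance


if __name__ == "__main__":
    J, M = 100.0, 1.0  # arbitrary illustrative units
    for t in (0.0, 0.01, 0.1, 1.0):
        print(t, variance_closed_form(t, J, M), variance_euler(t, J, M) if t > 0 else J / 2)
```

The factor $(1 + 2\eta J M t)$ that appears here is the same one that enters the analytic detection threshold later in the text.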

Equations (4) and (5) can be used to implement an optimal estimation procedure that
capitalizes on squeezing without mistaking measurement-induced Bloch vector
rotations for true Larmor precession. Since the atomic dynamics are stochastic, the
estimator must be described probabilistically: we desire a conditional probability
distribution, $p(B|\Xi_{[0,t]})$, which measures the likelihood that the field has
magnitude $B$ given the measurement record, $\Xi$, defined in terms of the
photocurrent, $d\Xi_t = y(t)\,dt/(2\eta\sqrt{M})$. The estimated magnitude,
$\tilde{B}$, and its uncertainty, $\Delta\tilde{B}^2$, are obtained from the
moments

$$\tilde{B} = \int B\,p(B|\Xi_{[0,t]})\,dB, \qquad (6)$$

$$\Delta\tilde{B}^2 = \int (B - \tilde{B})^2\,p(B|\Xi_{[0,t]})\,dB \qquad (7)$$

of the conditional distribution, $p(B|\Xi_{[0,t]})$.

Constructing a maximum-likelihood estimator is accomplished by defining an update
rule that iteratively improves $p(B|\Xi_{[0,t]})$ as the measurement record is
acquired. Prior knowledge of the distribution of magnetic field values is encoded
in $p(B|\Xi_0)$, which may be assigned infinite variance in order to assure an
unbiased estimate. Optimality requires that the conditional probability be updated
according to Bayes's rule,

$$dp(B|d\Xi_t, \Xi_{[0,t)}) = dq(d\Xi_t|B, \Xi_{[0,t)})\,p(B|\Xi_{[0,t)}), \qquad (8)$$

where $dq(d\Xi_t|B, \Xi_{[0,t)})$ is an infinitesimal conditional probability that
describes the likelihood of the evolving measurement record, $d\Xi_t$, given a
field with magnitude $B$ and past history, $\Xi_{[0,t)}$. The utility of Bayes's
rule is that $q(d\Xi_t|B, \Xi_{[0,t)})$ can be computed using quantum trajectory
theory, Eqs. (4) and (5).

Implementing this parameter estimator is best accomplished by a (recursive) Kalman
filter [13-15]. It can be shown that the filtering equations,

$$d\tilde{x} = A\tilde{x}\,dt + D^{-2}(\mathbf{B} + VC^T)(d\Xi_t - C\tilde{x}\,dt) \qquad (9)$$

with $\tilde{x} = (\tilde{J}_z \;\; \tilde{B})^T$ ($\tilde{J}_z$ is the estimate of
$\langle J_z \rangle_c$),

$$A = \gamma J e^{-Mt/2}\begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \qquad C = (1 \;\; 0),$$

$D = 1/(2\sqrt{M\eta})$, and $\tilde{x}(0) = 0$ implement Eq. (8). We note that it
is possible to extend the Kalman filter to account for time-varying or stochastic
fields [16] as well as to implement quantum feedback control [9,16].

The conditional quantum dynamics, particularly spin squeezing and the exponential
decay of the Bloch vector, enter the estimation process via the covariance matrix,

$$V(t) = \begin{pmatrix} \Delta J_z^2(t) & \Delta(J_z B)(t) \\ \Delta(B J_z)(t) & \Delta\tilde{B}^2(t) \end{pmatrix}, \qquad (10)$$

which describes the uncertainty in the parameter estimations of $J_z$ and $B$.
$V(t)$ evolves deterministically according to the matrix Riccati equation,

$$\dot{V} = (A - D^{-2}\mathbf{B}C)V + V(A - D^{-2}\mathbf{B}C)^T + D^{-2}V(C^TC)V, \qquad (11)$$

subject to the initial conditions $\Delta J_z^2(0) = 0$ and
$\Delta(J_z B)(0) = 0$, with $\Delta\tilde{B}^2(0)$ chosen to reflect prior
knowledge on the distribution of magnetic field values. Lacking any such knowledge,
one can set $\Delta\tilde{B}^2(0) \to \infty$.

The smallest detectable magnetic field as a function of $J$ and the measurement
duration, $t$, is determined by the estimator variance, $\Delta\tilde{B}^2$.
Solving the matrix Riccati equation [which is analytically soluble for the Kalman
filter in Eq. (9)] provides the time-dependent magnetic field detection threshold,
$\delta B = \sqrt{\Delta\tilde{B}^2(t)}$,

$$\delta B(t) = \frac{M}{4\gamma J}\sqrt{\frac{1 + 2\eta JMt}{\eta\left[a e^{-Mt} + 4e^{-Mt/2}(4\eta J + 1) + b\right]}}, \qquad (12)$$

with $a$ and $b$ given by

$$a = -[2\eta J(Mt + 4) + 1], \qquad b = Mt + 2\eta J(Mt - 4) - 3.$$

Expanding Eq. (12) to leading order in $t$ provides an expression for the detection
threshold,

$$\delta B(t) \approx \frac{1}{\gamma J}\sqrt{\frac{3}{M\eta t^3}}, \qquad t \gg (JM)^{-1}, \qquad (13)$$

that is directly comparable to Eq. (1) when the measurement strength is chosen to
be $M \sim T_2^{-1}$ such that maximal spin squeezing is achieved at time
$t = M^{-1}$. Such a choice for $M$ permits a superior $1/J$ (equivalently $1/N$)
scaling that is characteristic of the Heisenberg squeezing limit [6,9]. The optical
shot noise [of order unity in this model, see Eq. (2)] enters implicitly through
the signal to noise ratio, $\mathrm{SNR} = J\sqrt{M}$, which highlights the utility
of the Kalman filter as a whitening filter: it extracts the nonstationary spin
squeezing and Larmor precession dynamics despite the presence of Gaussian noise.

Figure 2 shows numerical results that demonstrate the performance of our quantum
Kalman filter (QKF). The simulations were performed for an atomic ensemble with
$J = 4 \times 10^6$, $\gamma = 1$ kHz/mG, $\eta = 1$, and $M = 100$ kHz in a
background magnetic field of $B = 1$ μG. These values nearly correspond to a
magnetometer constructed from $N = 10^6$ ground state Cs atoms coupled to a
high-finesse optical cavity with single photon Rabi frequency $g_0 = 10$ MHz and
decay rate, $\kappa = 1$ MHz. The QND measurement corresponds to a phase-quadrature
homodyne detection of the transmitted cavity light with a cavity mode
($P = 100$ μW) that is blue detuned by $\Delta = 1$ GHz from the Cs transition at
$\lambda \approx 852$ nm. The initial estimator variance was chosen to be
$\Delta\tilde{B}^2(0) = 100$ μG$^2$, which is the initial value that one would
select given prior knowledge that the magnetic field could be treated as a Gaussian
random variable with this variance. For the parameters we selected, Larmor
precession and spin projection noise have comparable magnitudes on time scales of
order $(JM)^{-1}$.

FIG. 2 (color online). Comparison of the estimation errors for a quantum Kalman
filter and a linear least squares magnetic field determination. The inset plot
highlights the $1/J$ (Heisenberg limited) scaling of both procedures for
$t \gg (JM)^{-1}$.
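As a numerical cross-check, the analytic threshold of Eq. (12) can be compared with its leading-order form, Eq. (13), in the window $(JM)^{-1} \ll t \ll M^{-1}$. The sketch below is a minimal illustration, assuming the forms $\delta B = \frac{M}{4\gamma J}\sqrt{(1 + 2\eta JMt)/\{\eta[a e^{-Mt} + 4e^{-Mt/2}(4\eta J + 1) + b]\}}$ and $\delta B \approx \frac{1}{\gamma J}\sqrt{3/(M\eta t^3)}$. The function names are hypothetical, and the parameter values ($J = 10^4$, $M = \gamma = 1$ in arbitrary units) are chosen for numerical convenience rather than taken from the simulations, because the bracket involves strong cancellation at very large $J$.

```python
import math


def threshold_exact(t, J, M, gamma, eta=1.0):
    """Detection threshold from the analytic Riccati solution, Eq. (12)."""
    x = M * t
    a = -(2.0 * eta * J * (x + 4.0) + 1.0)
    b = x + 2.0 * eta * J * (x - 4.0) - 3.0
    bracket = a * math.exp(-x) + 4.0 * math.exp(-x / 2.0) * (4.0 * eta * J + 1.0) + b
    return M / (4.0 * gamma * J) * math.sqrt((1.0 + 2.0 * eta * J * x) / (eta * bracket))


def threshold_leading_order(t, J, M, gamma, eta=1.0):
    """Leading-order threshold, Eq. (13), valid for (J*M)**-1 << t << 1/M."""
    return math.sqrt(3.0 / (M * eta * t ** 3)) / (gamma * J)


if __name__ == "__main__":
    J, M, gamma = 1.0e4, 1.0, 1.0   # arbitrary units; J*M*t >> 1 and M*t << 1 below
    t = 0.05
    exact = threshold_exact(t, J, M, gamma)
    leading = threshold_leading_order(t, J, M, gamma)
    print(f"exact / leading-order = {exact / leading:.4f}")
```

The two expressions agree to within a few percent in this window; the residual difference comes from the Bloch vector decay, which Eq. (13) neglects.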

The Kalman estimation error (crosses in Fig. 2) was computed from the ensemble
average, $E[(\tilde{B}_t - B)^2]$, for $10^5$ trajectories, and the solid line
shows the estimation uncertainty $\Delta\tilde{B}(t)$ obtained by integrating Eq.
(11). Since our simulations were performed with $B = 1$ μG, the empirical
performance of the QKF closely matches the solid line. The dotted line in Fig. 2
shows the analytic Riccati solution, given by Eq. (12), for
$\Delta\tilde{B}^2(0) \to \infty$, which would be the expected QKF performance in a
scenario with no prior knowledge of the magnetic field. Figure 2 also shows the
estimation error for simple linear regression of the measurement record (open
circles). Assuming that the Bloch vector has not decayed significantly, $B$ is
proportional to the slope of a line fit to the (filtered) photocurrent, $y(t)$. The
estimation error for linear regression was obtained by computing
$E[(\tilde{B}_t - B)^2]$ for $10^5$ trajectories. Although the QKF is clearly
superior for short times, the two estimation procedures converge for
$(JM)^{-1} \ll t < M^{-1}$ and provide a quantum parameter estimation with
$\delta B \sim 0.01$ nG in $t \sim 1$ ms. The inset of Fig. 2 highlights the
$\delta B \propto J^{-1}$ scaling that distinguishes both the QKF (crosses) and
regression (open circles) estimators from the conventional shot-noise limit, Eq.
(1). For sufficiently large times both estimators achieve the detection threshold
in Eq. (13) (solid lines).
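The regression result can be illustrated with a purely classical toy model. This is a sketch under stated assumptions, not the quantum trajectory simulation used for Fig. 2: each record is a straight line with a random constant offset (standing in for the localized $\langle J_z \rangle_c$) plus white noise of strength $D$, and the Bloch vector decay is ignored. For such a record the least-squares slope has variance $12D^2/t^3$, which with $D = 1/(2\sqrt{M\eta})$ and slope $\gamma B J$ reproduces the $t^{-3/2}$ threshold of Eq. (13). All names and parameter values below are illustrative.

```python
import random
import statistics


def regression_slopes(n_traj=1000, n_steps=100, t_total=1.0, slope=1.0,
                      noise_strength=0.5, offset_sd=1.0, seed=0):
    """Least-squares slope estimates for noisy linear records with random offsets."""
    rng = random.Random(seed)
    dt = t_total / n_steps
    times = [(i + 0.5) * dt for i in range(n_steps)]
    t_mean = sum(times) / n_steps
    s_tt = sum((t - t_mean) ** 2 for t in times)
    sample_sd = noise_strength / dt ** 0.5  # white noise averaged over one bin
    slopes = []
    for _ in range(n_traj):
        offset = rng.gauss(0.0, offset_sd)  # unknown constant absorbed by the intercept
        record = [offset + slope * t + rng.gauss(0.0, sample_sd) for t in times]
        y_mean = sum(record) / n_steps
        s_ty = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, record))
        slopes.append(s_ty / s_tt)
    return slopes


if __name__ == "__main__":
    slopes = regression_slopes()
    predicted = 12.0 * 0.5 ** 2 / 1.0 ** 3
    print("empirical slope variance:", statistics.pvariance(slopes))
    print("predicted 12 D^2 / t^3  :", predicted)
```

Because the random offset is absorbed into the fitted intercept, it does not bias the slope, which is the mechanism the text describes for linear regression at late times.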

The QKF and linear regression differ mainly in how
they treat the initial diffusive transient of $J_z$ [Fig. 1(b)].
Since the QKF is derived from a quantum trajectory
model, it is aware of the short-time diffusion and strategically
underweights the photocurrent at early times [via
the Kalman gain, $G = D^{-2}(B + VC^{T})$]. At late times the
regression analysis manages to absorb the initial diffusive
transient into the $y$ intercept of the linear fit. Although
$\langle J_z(0)\rangle = 0$, the $J_z$ localization process gradually determines
an effective offset in the photocurrent during the
interval $t \sim (JM)^{-1}$. Without explicit knowledge of the
conditional dynamics the linear regression equally
weights the photocurrent for $t < (JM)^{-1}$. This decreases
the quality of the fit, but the resulting error becomes
insignificant for $t \gg (JM)^{-1}$.
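As a rough illustration of the regression estimator discussed above, the sketch below fits a straight line to a synthetic record and reads the field estimate off the slope, while the intercept absorbs a constant offset. The linear model `offset + c * B * t`, the calibration constant `c`, the noise level and all numerical values are hypothetical stand-ins chosen for illustration; they are not the photocurrent model or parameters of this work.

```python
import random

def fit_line(ts, ys):
    """Ordinary least-squares fit y = intercept + slope * t."""
    n = len(ts)
    t_mean = sum(ts) / n
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in ts)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(ts, ys))
    slope = sxy / sxx
    return y_mean - slope * t_mean, slope

# Hypothetical record: a constant offset (standing in for the early transient)
# plus a ramp whose slope is proportional to the field B, plus white noise.
random.seed(0)
c, B_true, offset = 2.0, 0.5, 0.3
ts = [i * 1e-3 for i in range(1000)]
ys = [offset + c * B_true * t + random.gauss(0.0, 0.01) for t in ts]

intercept, slope = fit_line(ts, ys)
B_est = slope / c   # field estimate from the fitted slope
```

In this toy model the offset never biases `B_est`, because it is carried entirely by the fitted intercept.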

Our analysis suggests that estimation procedures based
on conditional quantum dynamics can play a crucial role
in optimizing both the sensitivity and the bandwidth in
atomic magnetometry. While conventional steady-state
magnetometers can only improve their detection capabilities
by increasing the number of atoms or the averaging
time, the quantum estimator can achieve greater precision
for the same value of $t$ and $N$ by improving the measurement
strength. The significance of the Kalman filter is the
optimality that is guaranteed by its derivation from
Bayes's rule. Our finding that linear regression closely
approximates the optimal procedure indicates a potentially
simpler experimental procedure for sub-shot-noise
magnetometry. Although Heisenberg-limited spin
squeezing should be possible using current techniques in
cavity quantum electrodynamics (a discussion is provided
in [9]), the experimental difficulty of achieving this limit
makes it desirable to have an optimal estimator such as
the QKF to fully exploit even a small amount of squeezing,
to treat fluctuating fields, and to achieve estimator
robustness [16]. In either case, estimation procedures that
allow and account for conditional quantum dynamics,
whether explicitly as in the QKF or implicitly as in linear
regression, offer substantial improvement over steady-state
procedures.

We describe novel purification protocols for bi-colorable graph states. The protocols scale efficiently
for large graph states. We introduce a method of analysis that allows us to derive simple
recursion relations characterizing their behavior as well as analytical expressions for their thresholds
and fixed point behavior. We introduce two purification protocols with high threshold. They can,
for graph degree four, tolerate 1% (3%) gate error or 20% (30%) local error.


I.  INTRODUCTION 

The known protocols in quantum information processing
require a certain degree of quantum mechanical
entanglement to achieve an advantage over their
classical counterparts. Often, this quantum-mechanical
'essence' is provided in the form of quantum states
prepared in advance. For example, Bell states are used in
a well-known protocol for quantum cryptography [1],
and schemes for multi-party cryptographic tasks using
Greenberger-Horne-Zeilinger (GHZ) states and other
Calderbank-Shor-Steane (CSS) states have been devised
[2]. Further, in quantum computation, multi-particle entangled
states can be used to streamline the execution
of gates and sub-circuits via gate teleportation [3], and
cluster states represent a universal resource for quantum
computation by local measurements [4].

In most realistic scenarios the quality of entangled resource
states is degraded by the effects of decoherence,
and methods of error detection or correction are required
to counteract this process. One such method is state purification,
where a (close to) perfect copy of a quantum
state is distilled out of many imperfect ones. Purification
was first described for Bell states [5, 6, 7] and subsequently
generalized to bi-colorable graph states/CSS
states [8, 9, 10]. Recently, a protocol for the purification
of W-states has been presented [11]. State purification is
used, for example, to establish a perfect quantum channel
between two parties [5], to efficiently create long-range
entanglement via quantum repeaters [12] or to render
certain schemes for topological fault-tolerant quantum
computation universal [13].

Imperfect initial states are not the only sources of error
for realistic state purification. With the exception
of certain schemes of topological quantum computation
such as [13], errors in the gates for purification also need
to be taken into account.

What can we expect to gain from an imperfect purification
procedure? In the process of purification the errors
of the initial state are replaced by the errors of the purifying
gates. Thus, the amount of error may be reduced
if the quality of the initial states is low compared to the
quality of the gates for purification (but above threshold).
Further, purification can be used to condition the error
of a quantum state. For example, imperfect Bell state
purification can be used to establish a perfectly private
if imperfect quantum channel [14]. In a multi-party scenario,
for some protocols the purification gates act locally
on each copy of the state to purify, resulting in a local or
close to local error model for the final state. This feature
attains relevance in the context of fault-tolerant quantum
computation. Threshold theorems have been established
for increasingly general types of error including coherent
and long-range errors [15, 16], but there are realistic scenarios
in which standard error-correction appears to fail
[17]. In such a situation, state purification may be used
to turn the error model into a more benign one.

The focus of this paper is purification of bi-colorable
graph states by imperfect means, a subject that has previously
been studied in [9, 18, 19]. We are interested in
the interplay between threshold and overhead. Specifically,
we seek protocols that (I) work with erroneous
purification gates; (II) have a high threshold and good
quality of the output state; (III) scale efficiently; and
(IV) are analytically tractable.

Hashing protocols have a high threshold in the error
of the initial state and require only a minimal resource
overhead, but they break down as soon as the purification
gates become slightly imperfect. Recursive protocols
such as [8] also have a high threshold for error in
the initial states and furthermore work with imperfect
purification gates, but they are exponentially inefficient
in the number of particles.

Our protocols are resistant to initial as well as purification
errors and are computationally efficient. As a
bonus, our protocols are analytically tractable for a wide
class of errors. Specifically, our base protocol described
in Section III can be analyzed for arbitrary input states
and general probabilistic Pauli errors in the purification
gates. This fact arises through a special locality property.
So far, the exponential increase of parameters in
the description of n-particle mixed states (even mixed
stabilizer states) has been found an obstacle to analytic
discussion, and only severely restricted error models have
been treated in the literature.

FIG. 1: Action of MCNOT in the graph basis. The arrows
represent the direction of syndrome (or Z error) flow (i.e. the
action of the MCNOT on the stabilizer).

This paper is organized as follows: in Section II we
briefly review the protocol [8] for purification of bi-colorable
graph states. In Sections III - IV C we describe
our purification protocols and characterize them in terms
of purification threshold, output quality and overhead.
We conclude with a discussion of our results in Section
V.


II.  A  BRIEF  REVIEW 

Consider a graph $G(V,E)$ with vertex set $V$ and edge
set $E$. $G(V,E)$ is bi-colorable if $V$ can be partitioned into
two disjoint subsets $A$ and $B$ such that every edge in $E$
connects a vertex in $A$ with a vertex in $B$. $E$ defines a
neighborhood relation on elements of $V$;
$\mathrm{neigh}(j) := \{i \in V : (i,j) \in E\}$. Define the correlation operators

$$K_j := X_j \prod_{i \in \mathrm{neigh}(j)} Z_i \qquad (1)$$

where $X, Y, Z$ are the Pauli matrices. A graph state is
a $|V|$-qubit state $|\mu\rangle$ ($\mu \in \{0,1\}^{|V|}$) that satisfies the
eigenvalue equations

$$K_j\,|\mu\rangle = (-1)^{\mu_j}\,|\mu\rangle, \quad \forall j = 1 \ldots |V|. \qquad (2)$$

The states $\{|\mu\rangle\}$ form a basis of the Hilbert space of $|V|$-qubit
states called the graph basis.

We now briefly discuss the post-selection protocol of
[8]. The protocol works by taking two identical copies
of a bi-colorable graph state and performing multiple
CNOTs (MCNOT) between them, in a definite pattern
as illustrated in Fig. 1. Relabeling states in the graph
basis to reflect the partition into colors A and B (i.e.
$|\mu\rangle = |\mu_A, \mu_B\rangle$), the effect of the MCNOT is [8]

$$|\mu_A, \mu_B\rangle\,|\nu_A, \nu_B\rangle \to |\mu_A, \mu_B + \nu_B\rangle\,|\nu_A + \mu_A, \nu_B\rangle \qquad (3)$$

where $+$ is element-wise addition modulo 2. Notice that
information about $\mu_A$ has been copied into state two and
information about $\nu_B$ has been copied into state one. We
then measure local observables $X$ and $Z$ on copy two, and
reconstruct from the measurement outcomes the eigenvalues
of all $K_j$ with $j \in A$. Suppose we get $-1$ at the $k$th
qubit. Then we know that either $\mu_k$ or $\nu_k$ was 1, but we
don't have enough information to decide which one, so
we throw away the states and start again. We keep doing
this until all measurements are clear. By this procedure
we correct, to lowest order, errors in the qubits of color
A. In the next round we interchange the roles of colors
A and B and so purify the B qubits. We can concatenate
this procedure to achieve desired levels of purity.
Because we are post-selecting states on the basis of a
global measurement outcome, this protocol is inefficient
for large states. This inefficiency can be addressed by
using error correction instead of post-selection, to which
we now turn.
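The bookkeeping of Eq. (3) and of the post-selection step can be sketched directly on graph-basis labels. This is only a classical caricature of a state that is diagonal in the graph basis; the function names are illustrative.

```python
def mcnot(mu, nu):
    """Eq. (3) on graph-basis labels.

    mu = (mu_A, mu_B) labels copy one and nu = (nu_A, nu_B) labels copy two;
    each entry is a tuple of bits, and addition is element-wise modulo 2.
    """
    mu_a, mu_b = mu
    nu_a, nu_b = nu

    def add(x, y):
        return tuple((p + q) % 2 for p, q in zip(x, y))

    return (mu_a, add(mu_b, nu_b)), (add(nu_a, mu_a), nu_b)

def keep_pair(mu, nu):
    """Post-selection: keep copy one only if every K_j, j in A, measured on copy two gives +1."""
    _, (syndrome_a, _) = mcnot(mu, nu)
    return not any(syndrome_a)
```

A pair with the same color-A error on both copies passes the test undetected, which is why the procedure corrects errors only to lowest order.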


III.  THE  3-COPY  PROTOCOL 


The simplest way to get enough information to perform
error correction is to do the MCNOT on three copies
instead of two. The 3-copy protocol consists of two sub-protocols.
We use three identical copies of the state in
each sub-protocol. The output of the first sub-protocol
is used as input for the next. Thus, we need nine copies
to run a single round. Let the three identical copies be
$\rho^{(0)}$, $\rho^{(1)}$, $\rho^{(2)}$. Sub-protocol 1 (P1):

i. Partition the graph into two colors A and B ($V =
V_A \cup V_B$ and $V_A \cap V_B = \emptyset$).

ii. Perform the MCNOT between copies $\rho^{(0)}$ and $\rho^{(1)}$
and between $\rho^{(0)}$ and $\rho^{(2)}$ such that information about
qubits of color A flows from $\rho^{(0)} \to \rho^{(1)}$ and
$\rho^{(0)} \to \rho^{(2)}$. As a side effect information about
B will flow from $\rho^{(1)}, \rho^{(2)} \to \rho^{(0)}$. See Fig. 3a.

iii. Measure qubits of color A in the X basis and qubits
of color B in the Z basis in states $\rho^{(1)}$ and $\rho^{(2)}$. This
is a measurement of $K_j$ for $j \in A$. If the measurement
of $K_j$ gives $+1$ ($-1$) we get a syndrome of 0 (1).
Thus, for each $j \in A$ we have two bits of syndrome,
$\sigma_j^{(1)}$ and $\sigma_j^{(2)}$.

iv. Apply the correction $\prod_{j \in A} Z_j^{\sigma_j^{(1)} \sigma_j^{(2)}}$ to $\rho^{(0)}$.

For sub-protocol P2 the roles of colors A and B are interchanged.
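Steps ii to iv of P1 can be traced on graph-basis labels. The sketch below assumes ideal gates and states that are diagonal in the graph basis, and it assumes the correction acts on $\rho^{(0)}$ exactly when both syndrome bits are 1. Under these assumptions the color-A label of the surviving copy is the majority vote of the three inputs, and the color-B label is their parity.

```python
def sub_protocol_p1(mu0, mu1, mu2):
    """One run of P1 on the labels (mu_A, mu_B) of three copies, ideal gates.

    Returns the label of the surviving copy rho^(0) after the MCNOTs, the
    measurement of the two ancilla copies and the Z correction.
    """
    (a0, b0), (a1, b1), (a2, b2) = mu0, mu1, mu2
    # Step ii: A-information flows 0 -> 1, 2; B-information flows 1, 2 -> 0.
    b0_out = tuple((x + y + z) % 2 for x, y, z in zip(b0, b1, b2))
    s1 = tuple((x + y) % 2 for x, y in zip(a1, a0))   # syndrome bits from copy 1
    s2 = tuple((x + y) % 2 for x, y in zip(a2, a0))   # syndrome bits from copy 2
    # Steps iii-iv: flip mu_A^(0) at site j only when both syndrome bits are 1.
    a0_out = tuple((x + p * q) % 2 for x, p, q in zip(a0, s1, s2))
    return a0_out, b0_out
```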

First, we will analyze this protocol with ideal CNOT
gates. This will allow us to derive simple closed form recursion
relations characterizing the behavior of the protocol,
as well as analytical estimates of the threshold and
efficiency. In Section III B we generalize to noisy gates.
The analysis is restricted to density matrices that are diagonal
in the graph basis (i.e. probabilistic mixtures of
graph states). At the end of Section III B, we will show
that our results are valid for arbitrary density matrices.

A. Ideal gates


Eq. (3) implies that the effect of the MCNOT on $\rho^{(0)}$,
$\rho^{(1)}$ and $\rho^{(2)}$ is

$$|\mu_A^{(0)}, \mu_B^{(0)}\rangle \to |\mu_A^{(0)}, \mu_B^{(0)} + \mu_B^{(1)} + \mu_B^{(2)}\rangle \qquad (4)$$

$$|\mu_A^{(1)}, \mu_B^{(1)}\rangle \to |\mu_A^{(1)} + \mu_A^{(0)}, \mu_B^{(1)}\rangle, \qquad |\mu_A^{(2)}, \mu_B^{(2)}\rangle \to |\mu_A^{(2)} + \mu_A^{(0)}, \mu_B^{(2)}\rangle \qquad (5)$$

Eq. (2) implies that the effect of the correction is

$$|\mu_A^{(0)}, \mu_B^{(0)}\rangle \to |\mu_A^{(0)} + \sigma, \mu_B^{(0)}\rangle \qquad (6)$$

where $\sigma_j := \sigma_j^{(1)} \cdot \sigma_j^{(2)}$. By measuring $\rho^{(1)}$ and $\rho^{(2)}$, we
get two bits of syndrome for each qubit of color A in $\rho^{(0)}$.
The syndrome is conclusive: it allows us to identify, to
lowest order in error probability, on which state the error
occurred. We can thus do error correction instead
of post-selection. This will make the protocol scale efficiently
in the size of the states. The price is a reduction
of the threshold value.

We now derive a recursion relation for the expectation
values $\langle K_j\rangle$, $j \in 1 \ldots N$. They yield a necessary and
sufficient condition for purification. For the moment we
assume that the initial state $\rho$ is diagonal in the cluster
basis, i.e., $\rho$ is a probabilistic mixture. It is then safe
to consider error probabilities. This assumption is not
necessary, however. It is removed in Section III B. Define
$P_j(\rho)$, the probability of finding the eigenvalue $-1$ in a
measurement of $K_j$ on $\rho$, as

$$P_j(\rho) := \mathrm{Tr}\left(\frac{1 - K_j}{2}\,\rho\right) = \frac{1 - \langle K_j\rangle}{2}. \qquad (7)$$

FIG. 2: Recurrence curves for the 3-copy protocol. These
simple curves fully encapsulate the behavior of the protocol
with ideal gates. The point of intersection with $\langle K_j\rangle' = \langle K_j\rangle$
gives the threshold. If the gates are too noisy the protocol
breaks down, as indicated by the lowest curve.


Consider sub-protocol P1. In order to analyze this protocol
we make use of the fact that the error correction
operation is local. It only uses information about $\langle K_j\rangle$ in
each copy to apply a correction to the $j$th qubit in $\rho^{(0)}$.
Thus, $\langle K_j\rangle$ should have nice decoupled recursion relations.
We will later derive the recursion relations for the
expectation value of arbitrary stabilizer elements, which
in general are more complex.

First consider qubits of color B. From Eq. (4),
$\mu_B^{(0)} \to \mu_B^{(0)} + \mu_B^{(1)} + \mu_B^{(2)}$. Since our copies are identical we have
$P_j(\rho^{(0)}) = P_j(\rho^{(1)}) = P_j(\rho^{(2)}) = P_j$. Then,
$P_j \to P_j^3 + 3P_j(1 - P_j)^2$. In terms of expectation values,

$$\langle K_j\rangle' = \langle K_j\rangle^3. \qquad (8)$$

Under concatenation of P1 with itself qubits of color B
are polluted with $\langle K_j\rangle_{\rho^{(0)}} \to \langle K_j\rangle_{\mathbb{1}} = 0$.

Turning our attention to qubits of color A we note that
error correction fails if $\mu_j = 1$ for more than one copy.
Thus, $P_j \to P_j^3 + 3P_j^2(1 - P_j)$. In terms of expectation
values

$$\langle K_j\rangle' = \tfrac{1}{2}\left(3 - \langle K_j\rangle^2\right)\langle K_j\rangle. \qquad (9)$$

Under concatenation of P1 with itself qubits of color A
are purified with $\langle K_j\rangle_{\rho^{(0)}} \to \langle K_j\rangle_{|0\rangle\langle 0|} = 1$.

Sub-protocol P2 is identical to P1 except that the roles
of A and B are interchanged and the three copies are
the output states from running P1 three times. The 3-copy
protocol is the composition of P2 with P1. Let
$P = P2 \circ P1$; then Eq. (8) and Eq. (9) imply that under
the action of $P$

$$\langle K_j\rangle' = \begin{cases} \tfrac{1}{8}\left(3 - \langle K_j\rangle^2\right)^3 \langle K_j\rangle^3 & \text{if } j \in A, \\ \tfrac{1}{2}\left(3 - \langle K_j\rangle^6\right) \langle K_j\rangle^3 & \text{if } j \in B. \end{cases} \qquad (10)$$


The recursion relations Eq. (10) have, for each color,
a unique repulsive fixed point in the interval (0, 1) which
separates the basins of attraction for the trivial fixed
point at 0 and the nontrivial fixed point at 1. The upper
fixed point corresponds to the perfect cluster state.
Thus, the stated protocol purifies a cluster state if and
only if

$$\langle K_j\rangle > 0.7297 \text{ for all } j \text{ in } A, \qquad \langle K_j\rangle > 0.9003 \text{ for all } j \text{ in } B. \qquad (11)$$
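The thresholds in Eq. (11) can be cross-checked numerically. The sketch below assumes the ideal-gate recursion takes the form $\frac{1}{8}(3 - \langle K_j\rangle^2)^3 \langle K_j\rangle^3$ for color A and $\frac{1}{2}(3 - \langle K_j\rangle^6)\langle K_j\rangle^3$ for color B, and locates the repulsive fixed point of each map by bisection. The bracketing interval is an illustrative choice.

```python
def step_a(k):
    """Eq. (10), j in A: purified by P1, then polluted by P2."""
    return (3 - k**2) ** 3 * k**3 / 8

def step_b(k):
    """Eq. (10), j in B: polluted by P1, then purified by P2."""
    return (3 - k**6) * k**3 / 2

def repulsive_fixed_point(step, lo, hi, iterations=60):
    """Bisection on step(k) - k, assuming step(lo) < lo and step(hi) > hi."""
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if step(mid) < mid:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

threshold_a = repulsive_fixed_point(step_a, 0.5, 0.95)
threshold_b = repulsive_fixed_point(step_b, 0.5, 0.95)
```

The two fixed points are related by `threshold_a == threshold_b**3` up to rounding, as expected from composing the same two maps in opposite order.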

These thresholds should be compared with the corresponding
numbers for the post-selection protocol [9]. A
direct comparison in terms of $\langle K_j\rangle$ is possible for the
particular case where only independent local phase flip
errors are assumed for the initial states in protocol [8].
Then, the P1 recursion relation for $\langle K_j\rangle$ with $j \in B$ is
$\langle K_j\rangle' = \langle K_j\rangle^2$ and for $j \in A$ is
$\langle K_j\rangle' = 2\langle K_j\rangle / (1 + \langle K_j\rangle^2)$. The
resulting threshold values are $\langle K_j\rangle_{\mathrm{th}} = 0.2956$ for $j \in A$
and $\langle K_j\rangle_{\mathrm{th}} = 0.5437$ for $j \in B$.
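The quoted post-selection thresholds can be reproduced with a short check. It assumes that, for independent phase-flip errors, the post-selected color obeys $P_j \to P_j^2 / (P_j^2 + (1 - P_j)^2)$, i.e. $\langle K_j\rangle' = 2\langle K_j\rangle/(1 + \langle K_j\rangle^2)$, while the other color obeys $\langle K_j\rangle' = \langle K_j\rangle^2$. The first map is a reconstruction of a formula lost in the source; that it recovers both quoted values is the evidence for it.

```python
def purify(k):
    """Post-selected color: keep only pairs whose labels agree."""
    return 2 * k / (1 + k**2)

def pollute(k):
    """Other color: the labels of the two copies add modulo 2."""
    return k**2

def repulsive_fixed_point(step, lo, hi, iterations=60):
    """Bisection on step(k) - k, assuming step(lo) < lo and step(hi) > hi."""
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if step(mid) < mid:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# Full round P = P2 o P1: color A is purified first, color B is polluted first.
post_threshold_a = repulsive_fixed_point(lambda k: pollute(purify(k)), 0.1, 0.9)
post_threshold_b = repulsive_fixed_point(lambda k: purify(pollute(k)), 0.1, 0.9)
```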
Returning to our protocol, it is possible to derive recursion
relations for the expectation values of arbitrary
stabilizer elements. They are not in general decoupled,
but there is still a notion of locality. The generalized
relation allows us to compute the recursion relations for
stabilizers with small support efficiently. Define

$$K_{a,b} := \prod_{i=1}^{|V_A|} K_i^{a_i} \prod_{j=1}^{|V_B|} K_j^{b_j} \qquad (12)$$

where $a \in \{0,1\}^{|V_A|}$ and $b \in \{0,1\}^{|V_B|}$. The terms in
the first product act on qubits of color A, while those in
the second product act on qubits of color B. Then (see
Appendix A) under the action of sub-protocol P1,

$$\langle K_{a,b}\rangle' = \frac{1}{2^{|a|}} \sum_{a_1, a_2 \ll a} (-1)^{a_1 \cdot a_2}\, \langle K_{a + a_1 + a_2,\, b}\rangle\, \langle K_{a_1, b}\rangle\, \langle K_{a_2, b}\rangle \qquad (13)$$

where $f \ll g$ iff $f_j = 0$ whenever $g_j = 0$. Equations
(9), (8) are special cases for $\langle K_{a,b}\rangle = \langle K_j\rangle$ with
$j \in A, B$ respectively. An interesting feature of this equation
is that it relates a correlator of weight $w = |a| + |b|$
to correlators of weight no more than $w$. This makes it
feasible to calculate the correlators of small weight.
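As a consistency check, the sketch below evaluates the general relation for an uncorrelated input, where every correlator factorizes into single-site values, and compares the result with the single-site relations of Eqs. (8) and (9). It assumes the relation has the form $\langle K_{a,b}\rangle' = 2^{-|a|} \sum_{a_1, a_2 \ll a} (-1)^{a_1 \cdot a_2} \langle K_{a+a_1+a_2,b}\rangle \langle K_{a_1,b}\rangle \langle K_{a_2,b}\rangle$; the prefactor $2^{-|a|}$ is the normalization required to recover Eq. (9). The product-state correlator and the numerical values are illustrative.

```python
from itertools import product

def submasks(a):
    """All binary vectors f with f << a, i.e. f_j = 0 whenever a_j = 0."""
    return list(product(*[(0, 1) if bit else (0,) for bit in a]))

def xor(*vectors):
    return tuple(sum(bits) % 2 for bits in zip(*vectors))

def recursion_13(a, b, corr):
    """Right-hand side of the general recursion for a correlator function corr(a, b)."""
    total = 0.0
    for a1 in submasks(a):
        for a2 in submasks(a):
            sign = (-1) ** sum(x * y for x, y in zip(a1, a2))
            total += sign * corr(xor(a, a1, a2), b) * corr(a1, b) * corr(a2, b)
    return total / 2 ** sum(a)

def product_state_corr(k_a, k_b):
    """Correlators of an uncorrelated state: <K_{a,b}> is a product of single-site values."""
    def corr(a, b):
        value = 1.0
        for bit, k in zip(a, k_a):
            value *= k if bit else 1.0
        for bit, k in zip(b, k_b):
            value *= k if bit else 1.0
        return value
    return corr

corr = product_state_corr(k_a=(0.8, 0.9), k_b=(0.7,))
single_a = recursion_13((1, 0), (0,), corr)   # should match Eq. (9)
single_b = recursion_13((0, 0), (1,), corr)   # should match Eq. (8)
```

The double sum runs over $4^{|a|}$ terms, so the cost depends on the weight of the correlator rather than on the size of the graph.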

In order to discuss the behavior of this protocol under
concatenation with itself, it is useful to switch back to
probability variables. Then Eq. (10) implies that if the
protocol is concatenated with itself $k$ times,

$$P_j(\rho^{(k)}) < \left(\frac{P_j(\rho^{(0)})}{P_{\mathrm{th}}}\right)^{2^k} \qquad (14)$$

where $P_{\mathrm{th}}$ is the threshold error probability. The $k$-concatenated
protocol requires $3^{2k}$ identical copies, thus
the protocol is exponentially efficient under concatenation
and the efficiency is independent of the size of the
graph. We conclude that under concatenation the protocol
reaches the reference state $|0\rangle$ with efficient use of
resources.
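The behavior under concatenation can be illustrated by iterating the ideal-gate recursion for one qubit of each color and counting the copies consumed. The sketch assumes the Eq. (10) maps $\frac{1}{8}(3 - \langle K_j\rangle^2)^3\langle K_j\rangle^3$ (color A) and $\frac{1}{2}(3 - \langle K_j\rangle^6)\langle K_j\rangle^3$ (color B); the starting values are illustrative.

```python
def protocol_round(k_a, k_b):
    """One application of P = P2 o P1 to <K_j> of a color-A and a color-B qubit."""
    return (3 - k_a**2) ** 3 * k_a**3 / 8, (3 - k_b**6) * k_b**3 / 2

def concatenate(k_a, k_b, rounds):
    """Iterate the protocol and count the identical input copies it needs."""
    copies = 1
    for _ in range(rounds):
        k_a, k_b = protocol_round(k_a, k_b)
        copies *= 9   # one round consumes 3 x 3 inputs per output state
    return k_a, k_b, copies

above = concatenate(0.95, 0.95, rounds=5)    # both colors start above threshold
below = concatenate(0.70, 0.85, rounds=10)   # both colors start below threshold
```

Starting above threshold, five rounds (using $9^5$ copies) bring both expectation values to within $10^{-6}$ of 1, while starting below threshold both decay to 0.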


B.  Noisy  gates 


Now we investigate what happens to this protocol
when the CNOT gates themselves are noisy. In the 3-copy
protocol CNOT gates act on the same qubit in two
states $\rho^{(m)}$ and $\rho^{(n)}$. We model a noisy two qubit gate
as an ideal gate followed by the two qubit depolarizing
channel (i.e. the SU(4) invariant channel)

$$T_k^{(m,n)} := (1 - p_2)\,[I] + \frac{p_2}{16} \sum_{i,j=1}^{4} \left[D_i^{(k,m)} \otimes D_j^{(k,n)}\right] \qquad (15)$$

where $D_i, D_j \in \{I, X, Y, Z\}$ and $k$ is the qubit index. $D_i^{(k,m)}$
acts on the $k$th qubit of $\rho^{(m)}$. The Z gates applied in
the error-correction steps and the measurement of the
syndrome are assumed to be noiseless. This is natural
since the Pauli phase flips Z may be omitted as physical
operations and instead accounted for in the classical
syndrome processing. We will include the effect of
measurement errors in the analysis when we consider the
more sophisticated protocols, which have higher thresholds
than the 3-copy protocol. If we consider the effect
of $T_k^{(0,n)}$ only on $\langle K_j\rangle$ in state $\rho^{(0)}$, then using Eq. (2)
we can reduce the noise to an effective error. For every
$k \in V : k \in \mathrm{neigh}(j) \cup \{j\}$

$$T_k^{(0,n)}(\rho^{(0)}) = \left(1 - \frac{p_2}{2}\right)[I] + \frac{p_2}{2}\,[Z_j]. \qquad (16)$$

If $k \notin \mathrm{neigh}(j) \cup \{j\}$ then $T_k^{(0,n)}$ is just the identity map.
Since every error channel commutes with every CNOT,
we can model the noisy MCNOT as the ideal MCNOT
followed by $|V|$ noise channels.
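The reduction to an effective phase-flip error can be checked by counting. The sketch assumes that the depolarizing channel of Eq. (15) spreads a total weight $p_2$ uniformly over all 16 products $D_i \otimes D_j$ with $D \in \{I, X, Y, Z\}$. On the qubit belonging to $\rho^{(0)}$, exactly half of these products anticommute with the local factor of $K_j$ (X at $k = j$, Z at $k \in \mathrm{neigh}(j)$), so the eigenvalue of $K_j$ is flipped with probability $p_2/2$ per channel.

```python
from itertools import product

PAULIS = "IXYZ"

def anticommutes(p, q):
    """Single-qubit Paulis anticommute iff both are non-identity and different."""
    return p != "I" and q != "I" and p != q

def flip_fraction(local_factor):
    """Fraction of the 16 two-qubit Paulis D_i (x) D_j whose action on the
    rho^(0) qubit anticommutes with the local factor of K_j."""
    hits = sum(anticommutes(d_i, local_factor)
               for d_i, d_j in product(PAULIS, repeat=2))
    return hits / 16

def effective_flip_probability(p2, local_factor):
    """Probability that one noise channel flips the eigenvalue of K_j."""
    return p2 * flip_fraction(local_factor)
```

A flip probability of $p_2/2$ multiplies $\langle K_j\rangle$ by $1 - p_2$, which is the per-channel damping factor under this assumption.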

The error channel Eq. (15) is local (i.e. it acts only
on qubit $k$ in $\rho^{(m)}$ and $\rho^{(n)}$). Also the error operators
are Pauli operators, which map graph states to graph
states keeping $\rho$ diagonal in the graph basis. Thus we
can expect the noisy recursion relations to have the same
form as Eq. (10). Considering only sub-protocol P1, the
$j$th qubit in $\rho^{(0)}$ is affected by $2(d+1)$ error channels.
The total probability that it is flipped by an error is
$\frac{1}{2}\left[1 - (1 - p_2)^{2(d+1)}\right]$. Thus, for qubits of color B

$$\langle K_j\rangle' = \alpha^2 \langle K_j\rangle^3, \qquad (17)$$

where $\alpha = (1 - p_2)^{d+1}$.

The situation is a little more complex for qubits of
color A as the error in the MCNOT between $\rho^{(0)}$ and
$\rho^{(1)}$ is propagated by the MCNOT between $\rho^{(0)}$ and $\rho^{(2)}$.
However, the form of the recursion relation remains the
same. We get

$$\langle K_j\rangle' = \frac{\alpha^2}{2}\left(2 + \alpha^{-1} - \langle K_j\rangle^2\right)\langle K_j\rangle. \qquad (18)$$

For a derivation see Appendix A 2. Composing sub-protocols
P1 and P2 we get the recursion relations for
the 3-copy protocol with noisy gates

$$\langle K_j\rangle' = \begin{cases} \frac{\alpha^8}{8}\left(2 + \alpha^{-1} - \langle K_j\rangle^2\right)^3 \langle K_j\rangle^3 & \text{if } j \in A, \\ \frac{\alpha^4}{2}\left(2 + \alpha^{-1} - \alpha^4 \langle K_j\rangle^6\right) \langle K_j\rangle^3 & \text{if } j \in B. \end{cases} \qquad (19)$$

FIG. 3: The MCNOT for sub-protocol P1 in a) The 3-copy protocol, b) The band-aid protocol and c) The conditional band-aid
protocol. The dotted lines in c) indicate that the band-aids are applied only if there is an ambiguous syndrome at that location.
Here we show graphs of degree 2, but these protocols can be applied to graphs of any degree.

Here, qubits of color A behave worse. Solving the recursion
relations for fixed points, we find that there are
two non-trivial positive fixed points (see Appendix B) for
$\alpha > 0.9902$. Consider the interval [0, 1]. It has at most
three fixed points $0 = f_0 < f_1 < f_2 < 1$. $f_0$ and $f_2$ are
attractive while $f_1$ is repulsive. Thus $f_2$ will be a stable
fixed point for $\alpha > 0.9902$ and $\langle K_j\rangle_{\mathrm{initial}} > f_1$. This gives
a threshold for the noise affecting the gates that scales
inversely proportional to the graph degree $d$,

$$p_{\mathrm{th}} \approx \frac{9.8 \times 10^{-3}}{d + 1}. \qquad (20)$$

Specifically for degrees 2 and 4 we obtain,

$$p_{\mathrm{th}} = \begin{cases} 0.328\,\% & \text{for } d = 2, \\ 0.197\,\% & \text{for } d = 4. \end{cases} \qquad (21)$$

This is a rather low value, but it will be substantially
improved when we consider more sophisticated protocols.

We now show that the recursion relations Eq. (19) are
valid regardless of whether or not the considered states
are diagonal in the cluster basis. To see this, let us define
a depolarization operator $\mathcal{D}$ which converts an arbitrary
$n$-qubit mixed state $\rho$ into an $n$-qubit mixed state $\rho^{D} =
\mathcal{D}\rho$ that is diagonal in the cluster basis. $\mathcal{D}$ takes the form

$$\mathcal{D} = \prod_{a} \frac{[I] + [K_{a,0}]}{2} \prod_{b} \frac{[I] + [K_{0,b}]}{2} \qquad (22)$$

where $a$ and $b$ are vectors in a basis of $\{0,1\}^{|V_A|}$ and
$\{0,1\}^{|V_B|}$ respectively.

We only consider P1, the first round of the protocol.
It is associated with a transformation $P1 : \rho \to \rho' =
R(\rho^{\otimes 3})$. $R$ and $\mathcal{D}$ commute, i.e.,

$$R\left((\mathcal{D}\rho)^{\otimes 3}\right) = \mathcal{D} \circ R\left(\rho^{\otimes 3}\right), \qquad (23)$$

for any $\rho$. For a proof see Appendix C.

Consider a recursion relation of the form

$$\langle K_{a,b}(\rho'^{D})\rangle = f_{a,b}\left(\{\langle K_{i,j}(\rho^{D})\rangle\}\right), \qquad (24)$$

with $f_{a,b}$ some function depending on $a, b$ as in Eq. (13).
Now,

$$\begin{aligned} \langle K_{a,b}(\rho'^{D})\rangle &= \mathrm{Tr}\left(K_{a,b}\, R\left((\mathcal{D}\rho)^{\otimes 3}\right)\right) \\ &= \mathrm{Tr}\left(K_{a,b}\, \mathcal{D} \circ R\left(\rho^{\otimes 3}\right)\right) && \text{(by Eq. (23))} \\ &= \mathrm{Tr}\left(\mathcal{D}^{\dagger}(K_{a,b})\, \rho'\right) && \text{(trace cyclicity)} \\ &= \langle K_{a,b}(\rho')\rangle. && (\mathcal{D}^{\dagger} = \mathcal{D}) \end{aligned}$$

Similarly, $\langle K_{i,j}(\rho^{D})\rangle = \langle K_{i,j}(\rho)\rangle$, such that

$$\langle K_{a,b}(\rho')\rangle = f_{a,b}\left(\{\langle K_{i,j}(\rho)\rangle\}\right). \qquad (25)$$

Thus, a recursion relation of the form of Eq. (24) such as
Eq. (19) holds for all states $\rho$ and not just for diagonal
states $\rho^{D} = \mathcal{D}\rho$.
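The noisy-gate thresholds quoted in this section can be cross-checked numerically. The sketch assumes the color-A recursion $\langle K_j\rangle' = \frac{\alpha^8}{8}(2 + \alpha^{-1} - \langle K_j\rangle^2)^3 \langle K_j\rangle^3$ with $\alpha = (1 - p_2)^{d+1}$, scans a grid of $\langle K_j\rangle$ values for a point that is not decreased by the map, and bisects on $\alpha$ to find where such a point first appears. The grid resolution and bisection bounds are illustrative choices.

```python
def step_a_noisy(k, alpha):
    """Noisy 3-copy recursion for j in A, with alpha = (1 - p2)**(d + 1)."""
    return alpha**8 * (2 + 1 / alpha - k**2) ** 3 * k**3 / 8

def purifies(alpha, grid=2000):
    """True if some <K_j> in (0, 1] is mapped to a value at least as large."""
    return any(step_a_noisy(i / grid, alpha) >= i / grid
               for i in range(1, grid + 1))

def critical_alpha(lo=0.9, hi=1.0, iterations=40):
    """Smallest alpha for which a non-trivial fixed point exists (bisection)."""
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if purifies(mid):
            hi = mid
        else:
            lo = mid
    return hi

alpha_c = critical_alpha()

def gate_threshold(d):
    """Largest p2 with (1 - p2)**(d + 1) >= alpha_c, for graph degree d."""
    return 1 - alpha_c ** (1 / (d + 1))
```

With these assumptions `alpha_c` comes out close to 0.9902, and `gate_threshold(2)` and `gate_threshold(4)` are close to 0.328% and 0.197%.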

IV.  IMPROVED  PROTOCOLS 
A.  The  error  model 

In the following, we consider a scenario where graph
states are created locally from product states, then distributed
to several parties and subsequently purified. Errors
occur in each of these steps. Specifically,

•  There is a two qubit error $T$, Eq. (15), associated
with each C-PHASE gate in the creation of the
graph state, with probability $p_2$.

•  A local depolarizing error with probability $p_1$ occurs
on each graph state qubit during transmission.

•  Every CNOT gate used in purification carries a 2-qubit
error, Eq. (15), with error probability $p_2$. Every
measurement is modeled by a one qubit depolarizing
channel with error probability $p_2$ followed
by a perfect measurement.


B.  The  band-aid  protocol 

In  order  to  raise  the  threshold  of  the  3-copy  protocol, 
we  will  try  to  combine  the  strategies  of  error-correction 
and  post-selection  (which  has  a  higher  threshold).  One 
way  to  do  this  is  to  use  small  highly  purified  GHZ 
states,  i.e.  band-aids,  to  purify  the  graph  one  vertex 
at  a  time.  The  usual  MCNOT  is  performed  between  the 
band-aid  and  the  large  graph  state  as  shown  in  Fig.  3b. 
This  copies  information  about  the  central  vertex  into  the 
band-aid  which  is  then  measured  to  give  a  syndrome. 
Since  the  band-aid  is  highly  purified  (for  example  by 
post-selection),  it  doesn’t  pollute  the  large  state  much.  It 
is  important  to  note  that  the  error  correction  is  still  local, 
and  we  expect  the  recursion  relations  to  be  de-coupled  as 
in  the  case  of  the  3-copy  protocol. 

The band-aid protocol also has two sub-protocols. The
first one, P1, is

i. Partition the graph into two colors A and B ($V =
V_A \cup V_B$ and $V_A \cap V_B = \emptyset$).

ii. The band-aids are placed over the large state such
that the central qubit of each band-aid lies over a
vertex of color A, for all qubits of color A. Perform
the MCNOT as shown in Fig. 3b.

iii. Measure the central qubit of each band-aid in the
X basis and the other qubits in the Z basis. For
each band-aid multiply the measured eigenvalues.
If the product is $+1$ ($-1$) then the syndrome bit $\sigma_j$ is
0 (1).

iv. Apply the correction $\prod_{j \in A} Z_j^{\sigma_j}$ to the large state.

P2 is the same as P1, with the roles of colors A and B
reversed.
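Steps iii and iv amount to a small piece of classical processing, sketched below. Measurement outcomes are represented as $\pm 1$ eigenvalues, and the data layout (a dictionary from vertex to outcomes) is an illustrative choice rather than anything prescribed by the protocol.

```python
def band_aid_syndrome(center_x, leaves_z):
    """Syndrome bit from one measured band-aid.

    center_x: +1/-1 outcome of X on the central qubit;
    leaves_z: +1/-1 outcomes of Z on the remaining qubits.
    """
    outcome_product = center_x
    for outcome in leaves_z:
        outcome_product *= outcome
    return 0 if outcome_product == 1 else 1

def correction_sites(outcomes):
    """Vertices j in A that receive a Z correction (syndrome bit 1).

    outcomes maps j -> (center_x, leaves_z) for the band-aid centred on j.
    """
    return sorted(j for j, (center, leaves) in outcomes.items()
                  if band_aid_syndrome(center, leaves) == 1)
```

Each syndrome bit depends only on the band-aid centred on its own vertex, which is the locality that the analysis relies on.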

Consider sub-protocol P1. For qubits of color B the
argument is very similar to the 3-copy protocol, except
that each qubit is affected by 2 gates from each of $d$
band-aids. Thus,

$$\langle K_j\rangle' = (1 - p_2)^{2d}\, \langle K_j\rangle\, \langle K_j\rangle_b^{d}, \qquad (26)$$

where $\langle K_j\rangle_b$ is the constant initial purity of the band-aid.

For qubits of color A, first suppose that the CNOT
gates are ideal. Then, a simple transfer of purity occurs.

$$\langle K_j\rangle' = \langle K_j\rangle_b. \qquad (27)$$

If the gates are noisy, Eq. (27) is multiplied by a noise
factor of the form $(1 - p_2)^{f(d)}$ as in the case of the 3-copy
protocol. There is a subtlety involving the temporal ordering
of the band-aids. The band-aids do not all commute
with each other. There are $1 + d(d-1)$ band-aids
that affect qubit $j$. One of them is the band-aid that is
used to purify the qubit. On average $k = d(d-1)/2$ of the
rest will be applied before the purifying one. Any effect
from the $k$ prior band-aids will be erased by the purifying
band-aid (see Eq. (27)). The purifying band-aid
has $d + 1$ noisy CNOTs affecting $\langle K_j\rangle$. Since the noisy
MCNOT is modeled as an ideal MCNOT followed by
noise, no information about the noise is propagated to
the band-aid. Thus, the noise will commute with the
error correction procedure. Since a measurement error
that flips the central qubit of the band-aid will cause us
to apply the wrong error correction operator, it can also
be reduced to an effective error as given by Eq. (16).
Thus, $f(d) = 2(d+1) + k$ and we have

$$\langle K_j\rangle' = (1 - p_2)^{f(d)}\, \langle K_j\rangle_b. \qquad (28)$$

Combining sub-protocols P1 and P2, we get the recursion
relations for the band-aid protocol with noisy gates
as well as noisy measurements

$$\langle K_j\rangle' = \begin{cases} (1 - p_2)^{\frac{d(d+7)+4}{2}}\, \langle K_j\rangle_b^{d+1} & \text{for } j \in A, \\ (1 - p_2)^{\frac{d(d+3)+4}{2}}\, \langle K_j\rangle_b & \text{for } j \in B. \end{cases} \qquad (29)$$

The behavior of qubits of color A is worse and we will
use their purity as the final purity of the large state.

[Plot omitted; horizontal axis: gate error $p_2$, from 0 to 0.03.]

FIG. 4: Trade-off curves for the band-aid and conditional
band-aid protocols ($d = 4$). The decreasing curves represent
the breakdown of the post-selection protocol, when there is
too much error. The increasing curves demarcate the region
where the final purity of the purified states is higher than the
purity of the unpurified states. It makes sense to purify in
the shaded regions.

As per our error model in Section IV A, the
noisy C-PHASE, C-NOT and measurement gates are
parametrized by $p_2$. The noisy transmission channel is
parametrized by $p_1$. For the final result, we need to know
the quality of the band-aids. We assume that these are
also created locally, then transmitted and purified. The
band-aids, however, are of fixed size and may thus be purified
by the post-selection protocol [9] with the higher
threshold. The output quality of the purified band-aids
is, to leading order in $p_2$,


$$\langle K_j\rangle_b = 1 - (d + 1)\,p_2, \qquad (30)$$

such that

$$\langle K_j\rangle = 1 - \frac{d(3d + 11) + 6}{2}\,p_2, \qquad (31)$$

for small $p_2$. As Eq. (31) shows, with increasing graph
degree the effect of errors in the purification process is
strongly enhanced. One may therefore ask the question
whether it is useful to purify at all or whether the transmitted
state should be used right away. To decide this
we compute $\langle K_j\rangle$ after graph state creation and transmission,

$$\langle K_j\rangle = (1 - p_2)^{\frac{d(d+1)}{2}}\,(1 - p_1)^{d+1}. \qquad (32)$$

See Appendix D for a derivation. We compare this expression
with Eq. (29) and find that there is indeed a
parameter region where it makes sense to purify. This
region is displayed for graphs of degree $d = 4$ in Fig. 4.
It is bounded from above and right by the curve which
indicates the breakdown of the band-aid purification according
to the post-selection protocol [9]. In general,
the threshold of the band-aid protocol for degree $d$ graph
states equals the threshold for purification of a $d+1$ qubit
GHZ state with the post-selection protocol [8]. However,
the output purity of the band-aid protocol is smaller.
Only above the ascending curve is it advantageous to
purify.


C.  The  conditional  band-aid  protocol 

In order to correct the $d^2$ dependence of the fixed point in the band-aid protocol, we combine it with the 3-copy protocol. The hybrid protocol, called the conditional band-aid protocol, sacrifices some threshold to improve the fixed point. The fixed point behavior, at least to linear order in gate noise, is almost as good as that of the post-selection protocol.

This protocol proceeds in the same fashion as the 3-copy protocol, except that two copies are used per round and wherever a measurement of $K_j$ yields eigenvalue $-1$ (i.e. an error), a post-selected band-aid is applied to purify qubit $j$ (see Fig. 3). For small gate noise, we expect to have to apply only a few band-aids per round. Nonetheless, the threshold is set by the qubits to which we have to apply band-aids. Locations where a measurement of $K_j$ yields $+1$ are error free to lowest order. Once again, we have two sub-protocols, $P_1$ and $P_2$, each purifying a different color.

The analysis is similar to that used in arriving at Eq. (19) for the 3-copy protocol. However, the situation is complicated by the fact that the band-aids are applied conditioned on the results of measuring $\rho^{(1)}$. As a result, the recursion relations for the one-point correlators are no longer completely decoupled. We can, however, find a simple lower bound on them.

Define $\langle K_b \rangle$ to be the minimum purity of the post-selected band-aid. It is a constant. For simplicity we assume that all qubits in the band-aid have this purity. As before, we assume that the graph of the large state is translationally invariant, i.e. all vertices have the same degree. The definition $\beta = (1-p_2)^2 \langle K_b \rangle$ will be useful. Consider qubits of color $A$ in sub-protocol $P_1$; then, by a derivation similar to Eq. (18),

$$\langle K_j \rangle' = \frac{1}{2}\left( 2\alpha \langle K_j \rangle + \langle K_b \rangle - \alpha \langle K_b \rangle \langle K_j \rangle^2 \right), \tag{33}$$

where $\alpha = (1-p_2)^{d+1}$ as before.

So far, we have been exact. Now consider sub-protocol $P_2$. Again focus on qubits of color $A$. Break $P_2$ down into two steps. In step one, we apply the MCNOT to $\rho^{(0)}$ and $\rho^{(1)}$. It can be readily verified that $\langle K_j \rangle \mapsto \alpha \langle K_j \rangle^2$. In step two, band-aids conditioned on the measurement outcome are applied to qubits of color $B$. Let $y \in \{0,1\}^d$ be the measurement results for the neighbors of qubit $j$. A measurement result of one means a band-aid must be applied at that location. If a band-aid is applied to a neighbor of $j$, $\langle K_j \rangle$ is affected by the errors on the band-aid, characterized by $\langle K_b \rangle$, and by two noisy CNOTs. Thus $\langle K_j \rangle \mapsto \beta \langle K_j \rangle$. Summing over measurement outcomes and including step one, we get

$$\langle K_j \rangle' = \alpha \left( \sum_{k=0}^{d} \Big( \sum_{|y|=k} q_y \Big) \beta^k \right) \langle K_j \rangle^2, \tag{34}$$

where $q_y$ is the probability of measurement outcome $y$. Unfortunately, $q_y$ is a function of the general stabilizer expectation values $\langle K_{a,b} \rangle$, so we will resort to finding a lower bound. Since $q_0 = 1 - \sum_{y \neq 0} q_y$, we can rewrite the above equation as

$$\langle K_j \rangle' = \alpha \left[ \Big( 1 - \sum_{y \neq 0} q_y \Big) + \sum_{k>0} \sum_{|y|=k} q_y \beta^k \right] \langle K_j \rangle^2 \;\geq\; \alpha \left( 1 - (1-\beta^d) \sum_{y \neq 0} q_y \right) \langle K_j \rangle^2,$$

using $\beta < 1$ to arrive at the inequality.

Now, $q_0$ is just the probability that no error is detected on any of the neighbors of $j$. Let $p_i$ be the probability of detecting an error on site $i$. Then by definition, $\langle K_i \rangle = 1 - 2p_i = 1 - 2 \sum_{y \,|\, y_i = 1} q_y$. This implies that $\sum_{y \neq 0} q_y \leq \sum_{i \in \mathrm{neigh}(j)} p_i = \frac{d}{2}\,(1 - \langle K_i \rangle)$. Putting this into the above inequality,

$$\langle K_j \rangle' \geq \alpha \left( 1 - \frac{d}{2}\,(1-\beta^d)\,(1 - \langle K_i \rangle) \right) \langle K_j \rangle^2, \tag{35}$$

where $\langle K_i \rangle$ is the purity of qubits of color $B$ from the previous round.
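The two bounding steps used here (replacing $\beta^{|y|}$ by $\beta^d$ for every non-zero outcome, then applying a union bound over the neighbors) can be checked numerically. The sketch below is only an illustration: the outcome distribution `q` is drawn at random rather than derived from any protocol, and the function names are hypothetical.

```python
import itertools
import random

def bounds(q, beta, d):
    """Return (exact bracket, beta^d bound, union-bound version)."""
    outcomes = list(itertools.product((0, 1), repeat=d))
    # Exact bracket of the recursion: sum over y of q_y * beta^{|y|}.
    exact = sum(q[y] * beta ** sum(y) for y in outcomes)
    # Step 1: replace beta^{|y|} by beta^d for every y != 0.
    p_any = sum(q[y] for y in outcomes if any(y))
    first = 1 - (1 - beta ** d) * p_any
    # Step 2: union bound, P(any error) <= sum of single-site probabilities.
    p_sites = [sum(q[y] for y in outcomes if y[i]) for i in range(d)]
    union = 1 - (1 - beta ** d) * sum(p_sites)
    return exact, first, union

def random_distribution(d, rng):
    """Random probability distribution over {0,1}^d (illustrative input only)."""
    outcomes = list(itertools.product((0, 1), repeat=d))
    weights = [rng.random() for _ in outcomes]
    total = sum(weights)
    return {y: w / total for y, w in zip(outcomes, weights)}

exact, first, union = bounds(random_distribution(4, random.Random(0)), 0.97, 4)
print(exact, first, union)
```

Each value should be no smaller than the one after it, which is the chain of inequalities leading to the lower bound on the recursion.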

Solving for the fixed point, we get, to leading order in gate noise $p_2$,

$$\langle K_j \rangle = 1 - 2(d+1)\,p_2. \tag{36}$$

Comparing this to Eq. (30), we see that the fixed point scaling with degree is almost as good as in the post-selection protocol. We now apply the conditional band-aid protocol to the same situation as for the band-aid protocol: a graph state being shared amongst widely separated parties. The results for a degree-four state are plotted in Fig. 4. We see that the threshold (upper) curve is worse, whereas the fixed-point (lower) curve is better for this protocol, as compared to the band-aid protocol. The total purifiable area is smaller, indicating that it breaks down faster. In some sense, we have traded threshold for fixed point. These conclusions hold for arbitrary degree and the curves are independent of the size of the state, making this protocol eminently suitable for the purification of large bi-colorable graph states.
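As a rough numerical illustration of the fixed point, one can iterate the exact map of Eq. (33) together with the lower bound of Eq. (35), treated here as an equality. Several things in this sketch are assumptions rather than statements from the text: the band-aid purity is taken from the leading-order expression of Eq. (30), the two colors are assumed to exchange roles between $P_1$ and $P_2$, and the function names are hypothetical.

```python
def conditional_band_aid_round(K_A, K_B, p2, d):
    """One round (P1 then P2), using Eq. (33) and the bound Eq. (35) as an equality."""
    alpha = (1 - p2) ** (d + 1)
    K_b = 1 - (d + 1) * p2            # band-aid purity, leading order (assumption)
    beta = (1 - p2) ** 2 * K_b

    def purified(K):                  # Eq. (33)
        return 0.5 * (2 * alpha * K + K_b - alpha * K_b * K ** 2)

    def other_color(K, K_neigh):      # Eq. (35), lower bound used as the update
        return alpha * (1 - 0.5 * d * (1 - beta ** d) * (1 - K_neigh)) * K ** 2

    # P1: color A follows Eq. (33); color B is assumed to follow Eq. (35).
    K_A, K_B = purified(K_A), other_color(K_B, K_A)
    # P2: the roles of the two colors are exchanged.
    K_A, K_B = other_color(K_A, K_B), purified(K_B)
    return K_A, K_B

def fixed_point(p2, d, K0=0.9, rounds=200):
    """Iterate from an initial purity K0 and return color A at the end of a round."""
    K_A = K_B = K0
    for _ in range(rounds):
        K_A, K_B = conditional_band_aid_round(K_A, K_B, p2, d)
    return K_A

def unpurified(p1, p2, d):
    """Purity after creation and transmission, (1-p2)^(d(d+1)/2) * (1-p1)^(d+1)."""
    return (1 - p2) ** (d * (d + 1) / 2) * (1 - p1) ** (d + 1)

d, p1, p2 = 4, 0.05, 1e-4
print(fixed_point(p2, d), 1 - 2 * (d + 1) * p2, unpurified(p1, p2, d))
```

Within a round the purity of each color alternates between a higher and a lower value. Under the schedule assumed here, the value returned for color A at the end of the round is the one to compare with the leading-order expression $1 - 2(d+1)p_2$.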

V.  CONCLUSION  AND  OUTLOOK 

We have described novel purification protocols for bi-colorable graph states and discussed their performance. The criteria for our protocols are that they do not break down in the presence of small amounts of noise in the purification process, that they have a high purification threshold and good output quality, scale efficiently and are analytically tractable.

Our final protocol can, for relevant graph states of degree 4, tolerate 1% gate or 20% local transmission error. These are about 1/3 and 2/3 of the respective values for the post-selection protocol [8, 9]. However, in contrast to this reference protocol, our protocol scales efficiently with the graph size.

All our protocols can be treated analytically. In particular, for the 3-copy protocol we derive closed, exact one-dimensional recursion relations in the appropriate observables, irrespective of the size of the state.

We would like to comment on the influence of the graph degree on the purification threshold. First note that for the 3-copy protocol of Section III, in the case of perfect purification gates, the recursion relations Eq. (10) are completely independent of the graph structure, and so are the thresholds Eq. (11). This behavior changes if noise is included in the purification. The critical noise level per purification gate, at which the protocol breaks down, scales inversely with the graph degree. The unfavorable dependence on the graph degree is present in all three protocols we discuss. Thus, the lesson we learn for the case of noisy purification is "Beware of large graph degrees". Large graph degrees occur, for example, in graph states corresponding to codewords of concatenated CSS codes.

We would also like to comment on the structure of the non-trivial fixed point in our protocols. In the case of erroneous purification gates, the non-trivial fixed point is not completely specified by the lowest order expectation values $\langle K_j \rangle$, and it remains to be discussed which error correlations are removed by the purification protocol. As a first result in this direction, for the 3-copy protocol discussed in Section III we have shown (in Appendix A 3) that correlations of stabilizer expectation values located on non-overlapping supports are not introduced by the purification procedure if they are absent initially. This implies that such correlations are absent in all purified states which end up at the same fixed point as the perfect state. We show in Appendix B that the fixed point for two-generator correlations with distinct support is unique, which is enough to establish the result that all states at the fixed point obey the relation $\langle K_i K_j \rangle = \langle K_i \rangle \langle K_j \rangle$ for such correlations.

A question of further interest is whether the non-trivial fixed point of the protocol is unique at all levels of correlations. This would imply $\langle K_{i+j} \rangle = \langle K_i \rangle \langle K_j \rangle$ for all correlations with distinct supports.

Another question of further interest is whether the described or related protocols may be used to boost the threshold value for fault-tolerant quantum computation [20, 21, 22, 23, 24] based on graph states.
APPENDIX A: GENERALIZED RECURSION
RELATIONS

We now derive the generalized recursion relations (Eq. (13)) for the 3-copy protocol. While the method used for this derivation is less intuitive, it yields recursion relations for arbitrary stabilizer elements and can handle noisy gates easily.

1.  Noiseless  Gates 

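In order to derive Eq. (13) we work in the stabilizer basis. Because $\rho$ is diagonal and the set $\{\langle K_{a,b} \rangle\}$, where $a \in \{0,1\}^{|V_A|}$ and $b \in \{0,1\}^{|V_B|}$, forms a complete set of observables, we can write the expansion

$$\rho = \frac{1}{2^{|V_A|+|V_B|}} \sum_{a,b} \langle K_{a,b} \rangle K_{a,b}.$$

Consider sub-protocol $P_1$, which purifies the $A$ sub-graph. The initial state is $\rho^{(0)} \otimes \rho^{(1)} \otimes \rho^{(2)}$, which can be re-written as a sum over $a$, $b$ of terms of the form

$$\langle K_{a^{(0)},b^{(0)}} \rangle \langle K_{a^{(1)},b^{(1)}} \rangle \langle K_{a^{(2)},b^{(2)}} \rangle \; K_{a^{(0)},b^{(0)}} \otimes K_{a^{(1)},b^{(1)}} \otimes K_{a^{(2)},b^{(2)}}. \tag{A1}$$

Performing step ii, this term becomes

$$\langle K_{a^{(0)},b^{(0)}} \rangle \langle K_{a^{(1)},b^{(1)}} \rangle \langle K_{a^{(2)},b^{(2)}} \rangle \; K_{a^{(0)}+a^{(1)}+a^{(2)},\,b^{(0)}} \otimes K_{a^{(1)},\,b^{(0)}+b^{(1)}} \otimes K_{a^{(2)},\,b^{(0)}+b^{(2)}}. \tag{A2}$$

Now consider step iii. Suppose we get measurement outcomes $\lambda^{(1)}$, $\lambda^{(2)}$ for the stabilizers in sub-graph $A$ on copies $\rho^{(1)}$, $\rho^{(2)}$. Then the resultant state is given by applying the projector

$$\prod_{k=1}^{2} \prod_{j=1}^{|V_A|} \frac{1}{2} \left( I + (-1)^{\lambda^{(k)}_j} K^{(k)}_j \right). \tag{A3}$$

This projector acts trivially on $\rho^{(0)}$, so in the following we track only what happens to the kets in the Hilbert spaces of $\rho^{(1)}$, $\rho^{(2)}$. After applying the projector Eq. (A3) to the term Eq. (A2), we have a sum over all binary vectors $j^{(1)}, j^{(2)} \in \{0,1\}^{|V_A|}$ of terms of the form

$$\frac{(-1)^{\lambda^{(1)} \cdot j^{(1)} + \lambda^{(2)} \cdot j^{(2)}}}{2^{2|V_A|}} \; K_{a^{(1)}+j^{(1)},\,b^{(0)}+b^{(1)}} \otimes K_{a^{(2)}+j^{(2)},\,b^{(0)}+b^{(2)}},$$

where $\cdot$ denotes the bitwise inner product. Discarding $\rho^{(1)}$, $\rho^{(2)}$, we perform a partial trace over these systems (recalling that the $K_{a,b}$ are all traceless except $K_{0,0} = I$). Inserting the matrix element from system 0 back in, we get the term

$$\frac{(-1)^{\lambda^{(1)} \cdot a^{(1)} + \lambda^{(2)} \cdot a^{(2)}}}{2^{2|V_A|}} \, \delta_{b^{(0)},b^{(1)}} \, \delta_{b^{(1)},b^{(2)}} \; K_{a^{(0)}+a^{(1)}+a^{(2)},\,b^{(1)}}, \tag{A4}$$

where $\delta_{p,q}$ is the Kronecker delta on each component of $p$, $q$. Note that we must have $b^{(0)} = b^{(1)} = b^{(2)}$ or the term goes to zero.

Now examine the action of the Pauli $[Z]$ operator in this basis: $[Z] K_{a,b} = Z K_{a,b} Z = (-1)^{k} K_{a,b}$, where $k = 0$ iff $Z$ and $K_{a,b}$ commute. Effectively, $Z$ is a diagonal matrix with entries $\pm 1$. Identical reasoning applies to $X$ and $Y$. This will make it very easy to add gate noise into the analysis. It also allows us to say that the net effect of the error-correction step iv is to multiply Eq. (A4) by a factor of $(-1)^{(\lambda^{(1)} \times \lambda^{(2)}) \cdot (a^{(0)}+a^{(1)}+a^{(2)})}$, where $(p \times q)_j = p_j q_j$.

Changing basis to $a = a^{(0)} + a^{(1)} + a^{(2)}$, $b = b^{(0)}$, the term becomes

$$\delta_{b,b^{(1)}} \, \delta_{b,b^{(2)}} \, \frac{(-1)^{\lambda^{(1)} \cdot a^{(1)} + \lambda^{(2)} \cdot a^{(2)} + (\lambda^{(1)} \times \lambda^{(2)}) \cdot a}}{2^{2|V_A|}} \; K_{a,b}.$$

Note that in this notation, and ignoring the delta functions, the original coefficient in Eq. (A1) is $\langle K_{a+a^{(1)}+a^{(2)},b} \rangle \langle K_{a^{(1)},b} \rangle \langle K_{a^{(2)},b} \rangle$.

Summing over measurement outcomes, the coefficient of $K_{a,b}$ is

$$\langle K_{a+a^{(1)}+a^{(2)},b} \rangle \langle K_{a^{(1)},b} \rangle \langle K_{a^{(2)},b} \rangle \times \sum_{\lambda^{(1)},\lambda^{(2)}} \frac{1}{4^{|V_A|}} (-1)^{\lambda^{(1)} \cdot a^{(1)} + \lambda^{(2)} \cdot a^{(2)} + (\lambda^{(1)} \times \lambda^{(2)}) \cdot a}.$$

The sum can be re-expressed as

$$\frac{1}{4^{|V_A|}} \prod_{j=1}^{|V_A|} \; \sum_{\lambda^{(1)}_j,\,\lambda^{(2)}_j \in \{0,1\}} (-1)^{\lambda^{(1)}_j a^{(1)}_j + \lambda^{(2)}_j a^{(2)}_j + \lambda^{(1)}_j \lambda^{(2)}_j a_j}.$$

If $a_j = 0$, then the $j$th term is zero unless $a^{(1)}_j = a^{(2)}_j = 0$, in which case it is 4. Hence for the term $\langle K_{a+a^{(1)}+a^{(2)},b} \rangle \langle K_{a^{(1)},b} \rangle \langle K_{a^{(2)},b} \rangle$ to survive the procedure, we must have $a^{(1)}, a^{(2)} \subseteq a$, i.e. both are supported within $a$. If this holds, then an overall factor of $4^{|V_A|-|a|}$ comes out. If $a_j = 1$, then a straightforward calculation shows that the $j$th term contributes a factor of $2(-1)^{a^{(1)}_j a^{(2)}_j}$. The overall numerical factor is thus $(-1)^{a^{(1)} \cdot a^{(2)}} / 2^{|a|}$. To get the new value of $\langle K_{a,b} \rangle$, we simply sum over $a^{(1)}$, $a^{(2)}$, since these and only these will contribute to the support of $K_{a,b}$ under $P_1$. This gives Eq. (13).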
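The per-site sums over measurement outcomes quoted in this derivation are small enough to enumerate directly. The sketch below (function names are hypothetical) evaluates $\sum_{\lambda_1,\lambda_2 \in \{0,1\}} (-1)^{\lambda_1 a_1 + \lambda_2 a_2 + \lambda_1 \lambda_2 a}$ for every bit combination and then assembles, under the assumption of a normalization of $1/4$ per site, the noiseless update for a single-vertex stabilizer ($|a| = 1$).

```python
import itertools

def site_sum(a, a1, a2):
    """Sum over outcomes l1, l2 in {0,1} of (-1)^(l1*a1 + l2*a2 + l1*l2*a)."""
    return sum((-1) ** (l1 * a1 + l2 * a2 + l1 * l2 * a)
               for l1, l2 in itertools.product((0, 1), repeat=2))

# Tabulate all eight cases: a = 0 gives 4 only if a1 = a2 = 0,
# while a = 1 gives 2 * (-1)^(a1 * a2).
for a, a1, a2 in itertools.product((0, 1), repeat=3):
    print(a, a1, a2, site_sum(a, a1, a2))

def single_vertex_recursion(K):
    """Noiseless update of <K_j> for |a| = 1, with a 1/4 normalization per site."""
    expectation = {0: 1.0, 1: K}      # <K_0> = 1 (identity), <K_j> = K
    total = 0.0
    for a1, a2 in itertools.product((0, 1), repeat=2):
        a0 = (1 + a1 + a2) % 2        # a^(0) = a + a^(1) + a^(2) with a = 1
        weight = site_sum(1, a1, a2) / 4
        total += weight * expectation[a0] * expectation[a1] * expectation[a2]
    return total

print(single_vertex_recursion(0.8))
```

With these conventions the update evaluates to $(3K - K^3)/2$, which has $K = 1$ as a fixed point, as expected for noiseless gates.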


2.  Noisy  Gates 

Adding noise to the gates requires very little additional work. We can rewrite the depolarizing channel on qubit $j$ of copy $k$ as

$$D^{(k)}_j[\rho] = \frac{1}{2}\left( [I] + [Z]^{(k)}_j \right) \frac{1}{2}\left( [I] + [X]^{(k)}_j \right) [\rho] = P^{(j,k)}_Z P^{(j,k)}_X [\rho].$$

We have written the noise channel in this form to illustrate how the noise components act as projectors $P^{(j,k)}_Z$, $P^{(j,k)}_X$, since the effective Pauli operators have $\pm 1$ on the diagonal. If a specific ket is affected by noise on site $j$ of copy $k$, it will be an eigenvector of $D^{(k)}_j$ with zero eigenvalue.
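The factorized form of the depolarizing channel can be checked in the Pauli basis, where conjugation by a Pauli only multiplies another Pauli by $\pm 1$. The following sketch uses the standard binary $(x, z)$ labelling of single-qubit Paulis; the helper names are hypothetical and the check covers a single qubit only.

```python
# Single-qubit Paulis as (x, z) bit pairs: I=(0,0), X=(1,0), Y=(1,1), Z=(0,1).
PAULIS = {"I": (0, 0), "X": (1, 0), "Y": (1, 1), "Z": (0, 1)}

def conj_sign(q, p):
    """Sign s in Q P Q = s P: +1 if Q and P commute, -1 if they anti-commute."""
    (qx, qz), (px, pz) = PAULIS[q], PAULIS[p]
    return -1 if (qx * pz + qz * px) % 2 else 1

def depolarize_eigenvalue(p):
    """Eigenvalue of the full depolarizing map (1/4) * sum_Q [Q] on the Pauli P."""
    return sum(conj_sign(q, p) for q in PAULIS) / 4

def projector_product_eigenvalue(p):
    """Eigenvalue of (1/2)([I] + [Z]) * (1/2)([I] + [X]) on the Pauli P."""
    return (1 + conj_sign("Z", p)) / 2 * (1 + conj_sign("X", p)) / 2

for p in PAULIS:
    print(p, depolarize_eigenvalue(p), projector_product_eigenvalue(p))
```

Both maps keep the identity and annihilate X, Y and Z, which is the sense in which an affected term is an eigenvector with zero eigenvalue.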
The noise from a CNOT at site $j$ between copies $i$ and $k$ is

$$\mathcal{E}^{(i,k)}_j = (1-p_2) + p_2 \left( P^{(j,i)}_Z P^{(j,i)}_X \right) \left( P^{(j,k)}_Z P^{(j,k)}_X \right). \tag{A5}$$

If a ket $K_{a,b}$ is affected by any of these noise terms (that is, if the noise anti-commutes with $K_{a,b}$), it will be projected to zero and thus acquire a $(1-p_2)$ multiplier overall.

The noise from the first MCNOT is $\mathcal{E}_{01} = \prod_j \mathcal{E}^{(0,1)}_j$ and from the second MCNOT is $\mathcal{E}_{02} = \prod_j \mathcal{E}^{(0,2)}_j$. Clearly the overall multiplier is independent of the measurement outcomes, so the analysis for Eq. (A3) still holds. The recursion relations are then similar in structure to Eq. (13), except that coefficients dependent on $(1-p_2)$ are inserted before each term.

We illustrate this by calculating the recursion relations for $\langle K_j \rangle$. Specializing to single point stabilizers, let $d_j$ be the degree of $j$. If $j \in B$, there is no sum, and $\langle K_j \rangle$ only acquires a multiplicative noise factor. The only noise terms that anti-commute with $K_j$ (and hence give factors of $(1-p_2)$) are those in $j \cup N_j$. There are $2(d+1)$ of these (since there are two sets of noisy gates), so $\langle K_j \rangle \to (1-p_2)^{2(d+1)} \langle K_j \rangle^3$, which is Eq. (17).

Now suppose $j \in A$. Let $\mathbf{j} = (0, \dots, 0, 1, 0, \dots, 0)$, with the 1 in position $j$. Our sum is over $a^{(1)}, a^{(2)} \in \{0, \mathbf{j}\}$ and $b = 0$. Since we are interested only in $\langle K_j \rangle$, our effective noise model is $[X]_k \mapsto [Z]_j$ for $k \in N_j$ and $[X]_j \mapsto [I]$. All other noise terms do not affect the state. Then

$$\mathcal{E}_{01} \to \left( (1-p_2) + p_2\, P^{(j,0)}_Z P^{(j,1)}_Z \right)^{d+1}. \tag{A6}$$

A similar replacement holds for $\mathcal{E}_{02}$. $\mathcal{E}_{01}$ acts on terms $K_{\mathbf{j}+a^{(1)}+a^{(2)},0} \otimes K_{a^{(1)},0}$, and gives a factor of 1 iff $\mathbf{j} + a^{(1)} + a^{(2)} = 0$ and $a^{(1)} = 0$, i.e. $\mathbf{j} = a^{(2)}$, $a^{(1)} = 0$, and a factor of $(1-p_2)^{d+1}$ otherwise.

Performing the MCNOT between $\rho^{(0)}$ and $\rho^{(2)}$, the noise channel $\mathcal{E}_{02}$ acts on the kets $K_{\mathbf{j}+a^{(1)},0} \otimes K_{a^{(2)},0}$, which gives a factor of 1 iff $\mathbf{j} + a^{(1)} = 0$ and $a^{(2)} = 0$, and $(1-p_2)^{d+1}$ otherwise. Putting in each of the four cases $a^{(1)}, a^{(2)} \in \{0, \mathbf{j}\}$ gives us Eq. (18).


3.  Behavior  of  Correlations 

If we take two qubits $j$, $k$ such that $\mathrm{neigh}(j) \cap \mathrm{neigh}(k) = \emptyset$, then the noise terms on sites in $\mathrm{neigh}(k) \cup k$ do not affect terms involving $j$ and vice versa. Hence the sum over terms in the recursion relation for $\langle K_j K_k \rangle$ will factor into those for $\langle K_j \rangle$ and $\langle K_k \rangle$. If initially $\langle K_j K_k \rangle = \langle K_j \rangle \langle K_k \rangle$, then the 3-copy protocol will not generate any new correlations between these regions.


APPENDIX  B:  UNIQUENESS  OF  THE  FIXED 
POINT 

Here we show that the 3-copy protocol has a unique fixed point for stabilizer elements $\langle K_{a,b} \rangle$ with weight $w = |a| + |b| \leq 2$. The recursion relations for stabilizer elements of weight $w > 1$ (see Eq. (12)) depend only on stabilizer elements whose weight is at most $w$. Thus, we can use an inductive argument. If all the stabilizer elements of weight less than $w$ have reached a fixed point, they become constants and then the recursion relation for elements of weight $w$ will have the same form as those for weight one (i.e. they will depend only on stabilizer elements of weight $w$). First consider the case when $|a|, |b| \leq 1$. The recursion relations have the form

$$f(z) = az + bz^3, \qquad g(z) = cz + dz^3,$$

with $a, c > 0$ and $bd < 0$. The presence of noise doesn't change the form of the recursion relations; it only multiplies each term by a number between zero and one (see Section A 2). Let $y = z^2$ and $x = dy + c$. Define

$$p(x) := d\left[ f(g(z))/z - 1 \right] = bx^4 - bcx^3 + adx - d.$$

Without loss of generality, assume $b < 0$. The signature of $p(x)$ is

$p(x)$: $-\;+\;+\;-$
$p(-x)$: $-\;-\;-\;-$

Then by Descartes' Rule of Signs [25], $p(x)$ has at most two positive roots and no negative roots, and hence at least two complex roots. Thus the recursion relation $f(g(z)) = z$ has at most two positive fixed points. The recursion relation $g(f(z)) = z$ can be analyzed identically. It was already argued in Section III B that this means that there is a unique attractive fixed point.
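The algebra behind the sign argument is easy to verify numerically. The sketch below checks, for randomly chosen coefficients with $a, c, d > 0$ and $b < 0$, that the quartic $bx^4 - bcx^3 + adx - d$ coincides with $f(g(z))/z - 1$ multiplied through by $d$ (with $x = dz^2 + c$), and counts the sign changes that enter Descartes' rule. The coefficient values are arbitrary and the function names are hypothetical.

```python
import random

def p_direct(z, a, b, c, d):
    """d * (f(g(z))/z - 1), evaluated directly from f and g."""
    f = lambda t: a * t + b * t ** 3
    g = lambda t: c * t + d * t ** 3
    return d * (f(g(z)) / z - 1)

def p_poly(x, a, b, c, d):
    """The quartic b x^4 - b c x^3 + a d x - d."""
    return b * x ** 4 - b * c * x ** 3 + a * d * x - d

def sign_changes(coeffs):
    """Number of sign changes in a coefficient list, ignoring zeros."""
    signs = [coef > 0 for coef in coeffs if coef != 0]
    return sum(s != t for s, t in zip(signs, signs[1:]))

rng = random.Random(1)
a, c, d = (rng.uniform(0.1, 2.0) for _ in range(3))
b = -rng.uniform(0.1, 2.0)
z = 0.7
x = d * z ** 2 + c

print(abs(p_direct(z, a, b, c, d) - p_poly(x, a, b, c, d)))
print(sign_changes([b, -b * c, 0, a * d, -d]))   # coefficients of p(x)
print(sign_changes([b, b * c, 0, -a * d, -d]))   # coefficients of p(-x)
```

Two sign changes for $p(x)$ and none for $p(-x)$ bound the number of positive and negative real roots, respectively.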

Now consider the case $|a| = 2$ and $|b| = 0$. The recursion relations now have the form

$$f(z) = az^3 + bz + c, \qquad g(z) = dz^3.$$

It is easily checked that $a$, $c$ and $d$ are positive. The sign of $b$ is harder to fix, but we note that for there to be a fixed point at all, $b$ must be negative. The case $f(g(z)) = z$ is easily analyzed as above, to show that there are at most two positive roots. Let $p(z) = g(f(z))$. To conclude the proof we need two technical results. 1) If the smallest support expectation value $\langle K_a \rangle$ has reached its fixed point value $\langle K_a \rangle_{fp}$, then the physically allowed values for $\langle K_{a+a'} \rangle$ form the interval $I = [2\langle K_a \rangle_{fp} - 1, 1]$. 2) $f(z) \geq 0$ for all $z \in I$.

Proof of 1. a) $z$ allowed $\Rightarrow$ $z \in I$: $P = \frac{1-K_a}{2}\,\frac{1-K_{a'}}{2}$, with $a \neq a'$, is a projector, hence $\langle P \rangle \geq 0$. Thus $z = \langle K_{a+a'} \rangle \geq \langle K_a \rangle + \langle K_{a'} \rangle - 1$ (*). Evaluate (*) at the fixed point $\langle K_a \rangle_{fp}$. $z \leq 1$ is obvious. b) $z \in I$ $\Rightarrow$ $z$ allowed: For an initial state of the protocol, interpolate between $\rho_1 = \langle K_a \rangle_{fp}\, \rho_{++} + (1 - \langle K_a \rangle_{fp})/2\; (\rho_{+-} + \rho_{-+})$ and $\rho_2 = \langle K_a \rangle_{fp}\, \rho_{++} + (1 - \langle K_a \rangle_{fp})\, \rho_{--}$. (The signs "$\pm$" refer to the eigenvalues of $K_a$ and $K_{a'}$, respectively.)

Proof of 2. Let $\langle K_a \rangle_{fp}, \langle K_b \rangle_{fp} > 0$ and $z \in I$. Assume as a hypothesis $f(z) < 0$. Apply (*) to the state after application of $P_1$, at the fixed point $\langle K_a \rangle_{fp}$, $\forall a \in A$. Hence $0 > f(z) \geq 2\langle K_b \rangle_{fp} - 1$. (Under $P_1$ the fixed point value $\langle K_a \rangle_{fp}$ for $a \in A$ is mapped to $\langle K_b \rangle_{fp}$ for $b \in B$, assuming all vertices have the same degree.) Thus, $\langle K_b \rangle_{fp} < 1/2$. But then $\langle K_b \rangle_{fp} = 0$. Contradiction. Hence $f(z) \geq 0$.

Now, $p''(z) = g''(f(z))\, f'(z)^2 + g'(f(z))\, f''(z)$ such that, with 2), $p'' \geq 0$ for all $z \in I$. Thus, $p(z)$ is convex on $I$. With 1), $I$ is a single interval such that $p(z)$ and $z$ intersect at most twice in $I$. At most one of these fixed points is attractive.


FIG. 5: Creation of a degree $d = 4$ bi-colorable graph state. The figure will have the same local structure for other degrees/topologies as long as its edges are $d$-colorable and its vertices are bi-colorable.


APPENDIX C: THE DEPOLARIZING
OPERATOR

In order to prove that the depolarizing operator $\mathcal{D}$ defined in Eq. (22) commutes with the evolution operator $R = \mathrm{Tr}_{(1,2)} M \circ \mathcal{E} \circ \mathcal{U}$, we note that the protocol step $P_1$ consists of a unitary part $\mathcal{U}$, an error channel $\mathcal{E}$ comprising probabilistic Pauli errors and a measurement $\mathrm{Tr}_{(1,2)} M$, where $M$ is a projector. $\mathcal{U}$ consists of a set of transversal CNOT-gates and acts on the stabilizer as

[Eq. (C1): transformation rules for $K^{(0)}_{a,b}$, $K^{(1)}_{a,b}$ and $K^{(2)}_{a,b}$; display illegible in the source.]

Now note that

[Display equation illegible in the source.]

etc, such that

$$\mathcal{U} \circ \mathcal{D}^{(0)} \mathcal{D}^{(1)} \mathcal{D}^{(2)} = \mathcal{D}^{(0)} \mathcal{D}^{(1)} \mathcal{D}^{(2)} \circ \mathcal{U}. \tag{C2}$$

The operations $\mathcal{D}^{(0)} \mathcal{D}^{(1)} \mathcal{D}^{(2)}$ and $\mathcal{E}$ commute because both are linear combinations of Pauli super-operators,

$$\mathcal{E} \circ \mathcal{D}^{(0)} \mathcal{D}^{(1)} \mathcal{D}^{(2)} = \mathcal{D}^{(0)} \mathcal{D}^{(1)} \mathcal{D}^{(2)} \circ \mathcal{E}. \tag{C3}$$

The measurements comprising $\mathrm{Tr}_{(1,2)} M$ are of stabilizer operators $K^{(1)}_a$, $K^{(2)}_a$ on the states $\rho^{(1)}$, $\rho^{(2)}$, respectively. They are performed via one-qubit measurements and classical post-processing. $K^{(1)}_a$, $K^{(2)}_a$ commute with the Kraus operators in (22), such that

$$\mathrm{Tr}_{(1,2)}\, M \circ \mathcal{D}^{(0)} \mathcal{D}^{(1)} \mathcal{D}^{(2)} = \mathrm{Tr}_{(1,2)}\, \mathcal{D}^{(0)} \mathcal{D}^{(1)} \mathcal{D}^{(2)} \circ M = \mathcal{D}^{(0)} \circ \mathrm{Tr}_{(1,2)}\, M. \tag{C4}$$

Eqs. (C2), (C3) and (C4) yield Eq. (23).


APPENDIX D: CREATION OF A
BI-COLORABLE GRAPH STATE

Here we discuss the noise structure of a bi-colorable graph that is created using noisy C-PHASE gates. The noisy gates are modeled as the ideal gate followed by two-qubit depolarizing noise as defined in Eq. (15). The graph state is created by performing C-PHASE gates between qubits in the $|+\rangle$ state. The noise structure of the final state depends on the temporal ordering of these gates. If we assume that the underlying graph has constant degree $d$ and that its edges are $d$-colorable, then the $N$ qubit graph state can be created in $d$ timesteps with $Nd/2$ C-PHASE gates. At each timestep all the gates corresponding to edges of a particular color are performed. Thus, at every timestep $t \in \{1, \dots, d\}$, each qubit is affected by an error channel of the form of Eq. (15).

We are interested in the value of $\langle K_j \rangle$, so we focus on the neighborhood of qubit $j$ in the larger graph. Since the graph is bi-colorable, it contains no three-cycles, and one can draw a diagram of the form of Fig. 5. The gates are represented by both solid as well as dashed lines. The noise channels corresponding to the solid lines each contribute an effective error $T_{\mathrm{eff}}$ as defined in Eq. (16) to qubit $j$. Now consider the qubit $k$ which is a neighbor of the central qubit $j$. Each dashed line also contributes an effective error $T_{\mathrm{eff}}$ to qubit $j$, but only if the C-PHASE gate corresponding to the solid line between $k$ and $j$ was performed in a previous timestep. This is because $Z_k$ errors commute with $K_j$, and $X_k$ errors would be propagated by the C-PHASE to $X_k Z_j$ errors, which also commute with $K_j$. Thus there are a total of $\frac{d(d-1)}{2} + d = \frac{d(d+1)}{2}$ noise channels affecting the qubit $j$. This gives

$$\langle K_j \rangle = (1-p_2)^{d(d+1)/2}. \tag{D1}$$
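The counting of noise channels can be reproduced by enumerating timesteps. The sketch below makes one illustrative assumption that is not spelled out in the text: the solid edge between the central qubit and its $t$-th neighbor is applied at timestep $t$, and that neighbor's remaining $d - 1$ (dashed) edges occupy the other timesteps. A dashed gate counts only if it is applied after the solid gate of the same neighbor. Function names are hypothetical.

```python
def count_noise_channels(d):
    """Count gates whose noise affects <K_j> for a central qubit of degree d."""
    solid = d                         # every solid edge contributes
    dashed = 0
    for t_solid in range(1, d + 1):   # one neighbor per timestep (assumption)
        other_steps = [t for t in range(1, d + 1) if t != t_solid]
        # A dashed gate matters only if the solid gate came at an earlier timestep.
        dashed += sum(1 for t in other_steps if t > t_solid)
    return solid + dashed

def purity_after_creation(d, p2):
    """Leading factor (1 - p2) raised to the number of contributing channels."""
    return (1 - p2) ** count_noise_channels(d)

for d in range(1, 7):
    print(d, count_noise_channels(d), d * (d + 1) // 2)
```

The enumeration agrees with the closed form $d(d+1)/2$ for every degree, since the dashed contributions sum to $d(d-1)/2$.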